Could your ISO 4 abbreviation thing website handle a few thousand queries (~3500 ish) in the span of a few minutes to a few hours (User:JLaTondre would know the exact timing) a few times per month? Some details are in User talk:JL-Bot/Archive 4#Break, but the main idea would be that JL-Bot would take entries from WP:CRAPWATCH, shove them into your API, and use them to search for additional crap citations. Headbomb {t · c · p · b}23:01, 4 March 2019 (UTC)[reply]
Ah, I guess it somehow treats the second "A" as the indefinite article, removes it (with the space) and concludes the ampersand is part of an initialism... I'll try to fix that, but such individual cases are very low priority to me. Tokenzero (talk) 20:04, 18 March 2019 (UTC)[reply]
MathSciNet will have dotless variants. NLM should only be dotless so you don't need to look for anything that's not already in the infobox. Headbomb {t · c · p · b}19:34, 30 March 2019 (UTC)[reply]
In practice, we tag ISO 4 when it's ISO 4, and ignore the other cats. But they should probably be marked with multiple ones. And that MathSciNet list is very limited. A more complete list is here, but pretty impossible to machine-parse. The best would be ISSN matching probably (e.g. [2]) but that requires a subscription. Headbomb {t · c · p · b}13:54, 4 May 2019 (UTC)[reply]
@Headbomb: I've managed to parse that MathSciNet serials.pdf, though the process is a bit brittle and may make a tiny number of typos in foreign letters, disambiguation parentheses and missing or added spaces. (I currently do have access to MathSciNet, but scraping it page by page is more cumbersome). I've compared the lists with infobox params, the report is here. There are virtually no mismatches, only parameters that are not filled even though the abbrev differs non-trivially from the ISO-4 one. For now I don't plan to fill these params automatically (at least not MathSciNet ones).
For rcats from NLM/MathSciNet, there are currently no redirects to create and only 10 redirects which are not marked: so {{R from abbreviation}} will be replaced with {{R from NLM}}, say. (This is of course expected, given your banners to create them in one click, in the infobox template). I also tried to check, when the nlm= parameter is empty, if the abbreviation parsed from the NLM file matches the infobox ISO-4 abbreviation exactly, after removing dots (and similarly for MathSciNet) -- so the cases not listed on the new report page. Tagging these redirect titles would change 3926 redirects like e.g. Chem Senses, usually by replacing {{R from ISO 4}} with
{{Redirect shell |
{{R from ISO 4}}
{{R from NLM}}
}}
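The check described above (does the abbreviation from the NLM or MathSciNet file equal the infobox ISO-4 abbreviation once dots are removed?) could be sketched roughly as follows. This is only an illustration, not the bot's actual code: the function names are hypothetical, and the whitespace normalisation is an assumption.

```python
def strip_dots(abbrev: str) -> str:
    """Remove dots and collapse runs of whitespace (assumed normalisation)."""
    return " ".join(abbrev.replace(".", "").split())


def same_after_removing_dots(source_abbrev: str, iso4_abbrev: str) -> bool:
    """True if the NLM/MathSciNet abbreviation equals the dotless ISO-4 one."""
    return strip_dots(source_abbrev) == strip_dots(iso4_abbrev)


def redirect_shell(rcats: list) -> str:
    """Wrap several rcat template names in a {{Redirect shell}} block."""
    lines = ["{{Redirect shell |"]
    lines += ["{{" + name + "}}" for name in rcats]
    lines.append("}}")
    return "\n".join(lines)


# Hypothetical usage: an NLM-style title compared with a dotted ISO-4 abbreviation.
if same_after_removing_dots("Chem Senses", "Chem. Senses"):
    print(redirect_shell(["R from ISO 4", "R from NLM"]))
```

A redirect would only get the second rcat when the comparison succeeds; anything that differs non-trivially is left for the report.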
@Tokenzero: Could you just do a mini run, like 10 cases of each? Sorry I missed this, the ping didn't go through the first time, I think. Or at least I missed it. Headbomb {t · c · p · b}13:46, 12 May 2019 (UTC)[reply]
Note that things like Ann. of Math. (2) / Ann. Sci. Éc. Norm. Supér. (4) are referring to the 2nd/4th series of these publications. They should be created as MathSciNet redirects, but the infobox should have the (2)/(4) left out usually, I think. Headbomb {t · c · p · b}13:51, 12 May 2019 (UTC)[reply]
@Headbomb: Forgot about this one: can you check those contribs? To recall in short: 10 edits there add NLM/MathSciNet rcats to existing redirects when the |nlm=/|mathscinet= param matches; there are no more edits of this kind to do, except if new cases appear in regular bot runs. The remaining edits are adding rcats (with rcat shells) when |nlm=/|mathscinet= is empty but the abbrev obtained from NLM/MathSciNet files happens to be equal to the ISO-4 one. There are 3916 more such edits to be done. Tokenzero (talk) 11:51, 7 July 2019 (UTC)[reply]
Looks good. The only thing that could throw a wrench is if you have something like Le Carillon, Die Nahrung or The Review of Stuff, which should be redirected to #C, #N, #R sections. Headbomb {t · c · p · b}17:45, 7 July 2019 (UTC)[reply]
It would also be good for the ISO/variant redirects to also point to those sections, e.g. Active & Passive Electronic Components, Active Passive Electron Compon, Active Passive Electron. Compon. to all redirect to #A. Headbomb {t · c · p · b}17:47, 7 July 2019 (UTC)[reply]
Done (on all the above lists, with all the variant redirects, ~2500 edits). My initial code just read the list itself to know the anchor exactly (because of the the/a/an/.. exceptions and because alternate names of the list itself should have no anchor), but I figured I can just look at all existing redirects and hand-pick the exceptions easily. Tokenzero (talk) 18:20, 8 July 2019 (UTC)[reply]
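For illustration only, the naive way to guess a section anchor from a title (dropping a leading article and taking the first letter) might look like the sketch below. It is not what the bot ended up doing, since the exceptions were hand-picked from existing redirects; the article list and function name here are assumptions.

```python
# Hypothetical list of leading articles to skip; the real exceptions were hand-picked.
LEADING_ARTICLES = {"the", "a", "an", "le", "la", "les", "die", "der", "das"}


def section_anchor(title: str) -> str:
    """Guess the list section anchor (e.g. '#C') for a journal title."""
    words = title.split()
    # Only drop the article if something follows it.
    if len(words) > 1 and words[0].lower() in LEADING_ARTICLES:
        words = words[1:]
    return "#" + words[0][0].upper()


for title in ["Le Carillon", "Die Nahrung", "The Review of Stuff",
              "Active & Passive Electronic Components"]:
    print(title, "->", section_anchor(title))
```

Titles starting with digits or accented letters would need extra handling, which is one reason reading the anchors from existing redirects is more robust.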
Hello! Manifestation has given you some cookies. Cookies promote WikiLove and hopefully these have made your day better. Happy munching!
Oops, sorry, I thought I had some better language detection there. There's no way to reliably detect the language. Should I just remove the ISO-4 tags? I can then run again with 'all language rules' for everyone and remove by hand the ISO-4 tag from the few English ones (and a few other outliers). Or maybe it's better not to add the ISO-4 tag at all, there's no way to be sure 'all language rules' doesn't catch some Spanish and restricting it to German is not worth the effort. Tokenzero (talk) 21:04, 12 August 2019 (UTC)[reply]
Well, I figured there was a way for the bot to go "It's German, so use the German/Multi list". If that's not possible, then using a blind 'non-English' thing will be good enough (for the non-English titles at least), and I can cleanup the rest by hand.
I tagged my list with languages by hand: there's 94 ger, 40 eng, 23 lat, 1 fre. The problem is, even with the right language, the result is often going to be wrong. Just a few random examples I caught:
Akademie für Geowissenschaften und Geotechnologien, Veröffentlichungen — geotechnolog- is eng only, so the bot with 'ger,mul' rules does not abbreviate it; however, it matches technolog- via the compound word rule, which the bot can't apply.
Handbuch der Geophysik – geophys- eng/fre vs. physik- ger, same.
Sammlung Geographischer Führer – same.
PFG Photogrammetrie, Fernerkundung, Geoinformation – same, this time geoinformation is fre only (no eng).
Studienbücher der Geographie – again a compound, but this time even eng/fre rules would not help.
Zeitschrift für Geomorphologie, Supplementary Issues – geomorpholog- and morpholog- are both eng,fre only (so no compound rules apply); here also supplement- is eng,fre, so formally one should apply a different set of rules to the two parts.
In the other direction, there's also many examples where adding eng/fre rules (or assuming compounds everywhere) would be wrong.
?Palaeontographica Abteilung A – first word has no ger/mul/lat match, not even as a compound word, all the graph rules are eng,fre only. An obvious omission in the LTWA?
?Annales Universitatis Saraviensis – annal- is eng,fre only. Is it an obvious omission, or is the reasoning that ann. would be more ambiguous in Latin (even though English has words annex, annulus, etc.)?
?Nova Hedwigia – sounds Latin. (Hedwigia is, if I understand correctly, the Latin adjective form of the originally German proper name Hedwig.) Now how are we supposed to interpret the LTWA line
Hedwigia Hedwig. ger
It's probably been added exactly for this journal, heh. (But OK, the last three examples are arguably bugs in ISO-4/LTWA, garbage in garbage out). Tokenzero (talk)
Then let's do it dumbly (save for English), and just apply the whole language list to generate the abbreviations, and I'll inspect the results later. Headbomb {t · c · p · b}22:52, 13 August 2019 (UTC)[reply]
other titles for which at least one redirect has already been manually created (the bot would have to compare the targets, etc., so it assumes the human knew better).
all are related to this publisher except Mycologia Memoir whose abbrevs collide with Mycologia Memoirs (dunno if it's the same journal).
The rules I used in the end were 'ger+eng+fre+mul+lat' instead of all, so in some of the rows neither redirect exists. E.g. Reports of the Princeton University Expedition to Patagonia actually became Rep. Princet. Univ. Exped. Patagonia (because Patagonia is only abbreviated in spa rules), while Studienhefte zur angewandten Geophysik became Studienh. angew. Geophys. (with all rules the bot would prefer the Norwegian pattern Studi-, because it matches more letters than the German rule -heft).
Tokenzero (talk) 09:00, 15 August 2019 (UTC)[reply]
Cool, I'll review those today. Also looking forward to whatever improvements you can give the infobox script. There are some requests that are really low hanging fruits in there, and some that are a bit more complex. Headbomb {t · c · p · b}09:05, 15 August 2019 (UTC)[reply]
Seems to me these redirects would have no value for users (currently searching for any of those variants will immediately* give you what you want, and there's more variants in the wild, like ndashes etc.). If it's only needed for WP:JCW, I believe it would be much better to make the JL-Bot read subpages of User:JL-Bot/Publishers.cfg that could provide the same data and even more. I can then make such config files in whatever format is best: in fact it's much much easier for me, since I don't have to care about existing pages, unexpected vs fixable vs already fixed redirects, requesting deletes after misconfiguration accidents, etc. Another advantage is that IEE Proceedings - C could be together with its other variants, but separate from IEE Proceedings - B (at the bottom level).
It's for both. There's a fair amount of variety with :,.-() as separators [you'll find all of those in various databases and with editor usage]. I have no idea how much variety there is for abbreviations, but they're all likely search terms and WP:CHEAP applies. Headbomb {t · c · p · b}20:18, 26 August 2019 (UTC)[reply]
Since editors make all kinds of variants (like ndashes) and typos that you couldn't ever list, this is not a good solution anyway. Also per WP:CHEAP, this does not fall under "a few of them scattered around", see also "this does not mean we should preemptively create redirects for their own sake". It has negligible value for users (since the journals are not notable in the first place) and it's not infinitely cheap. It is a bit of a burden for me, for one, so I'll just refuse to do this one; sorry, I need to put some limits. I can create a subpage of User:JL-Bot/Publishers.cfg listing all kinds of abbreviations (I'm guessing JL-Bot might already ignore whitespace and punctuation). Tokenzero (talk) 20:34, 26 August 2019 (UTC)[reply]
I noticed that things with (journal) get ISO 4 abbreviations created with (journal) in them. I moved things around, but this should be updated in the future. Everything else looked good. Headbomb {t · c · p · b}20:54, 27 August 2019 (UTC)[reply]
Sorry, I don't know how I could have missed that when scanning the logs. Turns out I screwed up a change in code when handling the previous mistake, it's fixed now. Thanks a lot for the moves. Tokenzero (talk) 21:54, 27 August 2019 (UTC)[reply]
The Arbitration Committee is the panel of editors responsible for conducting the Wikipedia arbitration process. It has the authority to impose binding solutions to disputes between editors, primarily for serious conduct disputes the community has been unable to resolve. This includes the authority to impose site bans, topic bans, editing restrictions, and other measures needed to maintain our editing environment. The arbitration policy describes the Committee's roles and responsibilities in greater detail.
Google Code-In, a Google-organized contest in which the Wikimedia Foundation participates, starts in a few weeks. This contest is about introducing high school students to the world of open source. I'm sending you this message because you recently edited a documentation page at the English Wikipedia.
I would like to ask you to take part in Google Code-In as a mentor. That would mean preparing at least one task for the participants (it can be documentation related, or something else; the other categories are Code, Design, Quality Assurance and Outreach) and helping the student complete it. Please sign up at the contest page and send us your Google account address to google-code-in-admins@lists.wikimedia.org, so we can invite you in!
From my own experience, Google Code-In can be fun, you can make several new friends, attract new people to your wiki and make them part of your community.
If you have any questions, please let us know at google-code-in-admins@lists.wikimedia.org.
Your script User:Tokenzero/tinfoboxUtil.js is no longer functional because it attempts to get an editToken from mw.user.tokens. The script should instead get a csrfToken. editTokens were removed from mw.user.tokens on October 3, 2019 at Phabricator during this edit as they were redundant to csrfTokens.– BrandonXLF (talk)00:06, 25 November 2019 (UTC)[reply]
18:37, 21 November 2019 -15 Exp Clin Transplant (2) Marking standard abbrev rcat. Report problems
18:36, 21 November 2019 -15 J Aerosp Eng (2) Marking standard abbrev rcat. Report problems
18:35, 21 November 2019 -15 Front Biogeogr (2) Marking standard abbrev rcat. Report problems
18:35, 21 November 2019 -15 West J Emerg Med (2) Marking standard abbrev rcat. Report problems
18:34, 21 November 2019 -15 Open Quat (2) Marking standard abbrev rcat. Report problems
18:33, 21 November 2019 -15 BJPsych Bull (2) Marking standard abbrev rcat. Report problems
{
  "status": "ok",
  "message-type": "prefix",
  "message-version": "1.0.0",
  "message": {
    "member": "http://id.crossref.org/member/78",
    "name": "Elsevier BV",
    "prefix": "http://id.crossref.org/prefix/10.1016"
  }
}
The name of the registrant (i.e. the publisher/imprint) can be found in the name field of message.
This will only work with Crossref-assigned prefixes. Other DOI-assigning agencies have other APIs, which I'll be investigating down the road, but Crossref is the big one here.
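As a rough sketch of how a bot might pull the registrant out of such a response: the snippet below parses a response of the shape shown above and returns the name field. The sample text is copied from the response above; the function names are hypothetical, and the URL pattern in `prefix_url` is an assumption about the Crossref REST API that should be checked against its documentation before use. No network request is made here.

```python
import json
from typing import Optional

# Sample response text, copied from the example above.
SAMPLE_RESPONSE = """
{"status": "ok", "message-type": "prefix", "message-version": "1.0.0",
 "message": {"member": "http://id.crossref.org/member/78",
             "name": "Elsevier BV",
             "prefix": "http://id.crossref.org/prefix/10.1016"}}
"""


def prefix_url(doi_prefix: str) -> str:
    """Build the lookup URL for a DOI prefix (assumed endpoint pattern)."""
    return "https://api.crossref.org/prefixes/" + doi_prefix


def registrant_name(response_text: str) -> Optional[str]:
    """Return the registrant name, or None if the response is not a prefix record."""
    data = json.loads(response_text)
    if data.get("status") != "ok" or data.get("message-type") != "prefix":
        return None
    return data.get("message", {}).get("name")


print(prefix_url("10.1016"), "->", registrant_name(SAMPLE_RESPONSE))
```

Prefixes assigned by other DOI agencies would presumably not return a record of this shape, so the `None` branch matters when crawling a mixed list.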
The request would be for the bot to crawl the following DOI prefixes
Since this would be a userspace bot, there wouldn't be any need to get approval for this. There would be a follow-up request to create the appropriate redirects down the road, but basically this would be after human review and massaging of the data to ready it for bot use. Headbomb {t · c · p · b}14:35, 30 December 2019 (UTC)[reply]
@Headbomb: Sorry for the delay in answering, I just finished a month of intense travels. Unfortunately I've also decided to retire from bot- and script-writing, at least for the foreseeable future, to prioritise other things. I'll keep maintaining the old stuff. I hope others will answer your bot requests soon enough (this one is nice and easy for someone else to get started). Tokenzero (talk) 21:37, 19 January 2020 (UTC)[reply]
Shame, you'd have been a great maintainer for this one, being familiar with the journal stuff. But life exists, so I hope you enjoy yourself a nice little break! Headbomb {t · c · p · b}02:58, 20 January 2020 (UTC)[reply]
Last run was 31st Jan, back then the article had no infobox. I could make it go over all Category:Periodicals (the BRFA mentioned categories as well as pages containing infobox journal/magazine), I don't remember any good reason not to do that (except that stubs with no infoboxes might get shortly moved/deleted). Tokenzero (talk) 18:35, 5 February 2020 (UTC)[reply]
I created the main redirects for a bunch of Nature journals, but if the bot could add the section redirects to them, that would be great (some of them already have section anchors). And also, if your bot could create the ISO redirects for them, that would be great.
During the regular ISO bot runs. I tried to initiate them now, but it runs a few different ones in sequence and each takes a lot of time. Tokenzero (talk) 17:05, 14 March 2020 (UTC)[reply]
Ay, I'm stupid, the bots ran but they're not supposed to do anything without an infobox. This shouldn't be done automatically at all. I did those semi-manually now. Tokenzero (talk) 17:17, 22 March 2020 (UTC)[reply]
Hi. Thank you for your recent edits. An automated process has detected that when you recently edited Square root of a matrix, you added a link pointing to the disambiguation page Positive semidefinite. Such links are usually incorrect, since a disambiguation page is merely a list of unrelated topics with similar titles. (Read the FAQ • Join us at the DPL WikiProject.)
TokenzeroBot created it with {{R from modification}}, even though that template says it should not be used when a more specific Rcat is available, in this case {{R from other capitalisation}}. (I realize that the task purports to be about ampersands and the creation of this particular redirect happened over a year ago.)
Hello! Voting in the 2020 Arbitration Committee elections is now open until 23:59 (UTC) on Monday, 7 December 2020. All eligible users are allowed to vote. Users with alternate accounts may only vote once.
Hmm, as far as I remember it's one of the things where the standard says one thing but you said it's better to have it the other way, though I may be wrong. Anyway, the issue is that there are journals with somewhat optional subtitles, from "FooBar: Journal of Bionics" to extremes like "FooBar: the fantastic journal with a useless and overlong subtitle", but there are also groups of journals where the subtitle is pretty crucial, like "Wiley Interdisciplinary Reviews: Cognitive Science" vs other "Wiley Interdisciplinary Reviews: ...". The bot can't tell the difference of course. The bot rule was to remove everything following those colons, minuses, n-dashes, and open parens that occurred after at least 6 characters. This was probably motivated by the 'optional subtitle' case, which seemed more common. The standard says one should keep (and abbreviate) "dependent titles", but the examples given there suggest they meant only the 'group of journals' case. Anyway, I turned off the bot rule and ran it to make reports again. As you can check in the diffs the difference is rather small, but it seems to me now that indeed turning off the rule looks better (i.e. the longer versions are almost always better or at least not worse). Tokenzero (talk) 22:43, 5 December 2020 (UTC)[reply]
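A minimal sketch of the (now disabled) truncation rule as described: cut the title at the first colon, minus, n-dash or open paren that has at least 6 characters before it. The function name and the exact separator set are assumptions based on the description, not the bot's actual code.

```python
import re

# Colon, hyphen-minus, en dash, open parenthesis (assumed separator set).
SEPARATORS = re.compile(r"[:\-\u2013(]")


def strip_subtitle(title: str, min_prefix: int = 6) -> str:
    """Drop everything from the first separator preceded by >= min_prefix characters."""
    for match in SEPARATORS.finditer(title):
        if match.start() >= min_prefix:
            return title[:match.start()].rstrip()
    return title


for title in ["FooBar: Journal of Bionics",
              "Wiley Interdisciplinary Reviews: Cognitive Science"]:
    print(repr(title), "->", repr(strip_subtitle(title)))
```

The second example shows why the rule was turned off: the part after the colon is exactly what distinguishes one journal of the group from another.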
I think it's mostly because |title= of infoboxes have been cleaned up significantly from before. Or at least that what's left are corner cases. Headbomb {t · c · p · b}01:33, 7 December 2020 (UTC)[reply]
p.s. I am an Italian engineer who mainly "fixes errors" or translates from English, but I am writing in Italian because I am not too confident in my English, and I must say I liked several of your works ...
The bot hasn't run in nearly two months. Mind giving it a kick in the old bucket of bolts / making it run weekly-ish at least? Headbomb {t · c · p · b}22:31, 31 March 2021 (UTC)[reply]
Ouch, sorry, almost identical problem again. Hopefully I fixed it better this time, and I set up a proper notification in case it fails again. Tokenzero (talk) 09:39, 4 April 2021 (UTC)[reply]
Great. I updated about 150 wrongly flagged ISO redirects (... Royal ... instead of ... R. ...) last night, so it'll be nice to have regular updates again. Headbomb {t · c · p · b}12:35, 4 April 2021 (UTC)[reply]
Yes, I saw a lot of mislabeled stuff, thanks for fixing it all! I ran it quite a few times to debug a tiny problem with reporting (turned out to be nothing important), and now I changed the schedule to once per week (originally it was twice per month). Tokenzero (talk) 09:49, 18 April 2021 (UTC)[reply]
Hello! Voting in the 2021 Arbitration Committee elections is now open until 23:59 (UTC) on Monday, 6 December 2021. All eligible users are allowed to vote. Users with alternate accounts may only vote once.
BRFA activity by month
Welcome to the eighth issue of the English Wikipedia's Bots Newsletter, your source for all things bot. Maintainers disappeared to parts unknown... bots awakening from the slumber of æons... hundreds of thousands of short descriptions... these stories, and more, are brought to you by Wikipedia's most distinguished newsletter about bots.
Our last issue was in August 2019, so there's quite a bit of catching up to do. Due to the vast quantity of things that have happened, the next few issues will only cover a few months at a time. This month, we'll go from September 2019 through the end of the year. I won't bore you with further introductions — instead, I'll bore you with a newsletter about bots.
Overall
Between September and December 2019, there were 33 BRFAs. Of these, 25 were approved, and 8 were unsuccessful (3 denied, 3 withdrawn, and 2 expired).
TParis goes away, UTRSBot goes kaput: Beeblebrox noted that the bot for maintaining on-wiki records of UTRS appeals stopped working a while ago. TParis, the semi-retired user who had previously run it, said they were "unlikely to return to actively editing Wikipedia", and the bot had been vanquished by trolls submitting bogus UTRS requests on behalf of real blocked users. While OAuth was a potential fix, neither maintainer had time to implement it. TParis offered access to the UTRS WMFLabs account to any admin identified with the WMF: "I miss you guys a whole lot [...] but I've also moved on with my life. Good luck, let me know how I can help". Ultimately, SQL ended up in charge. Some progress was made, and the bot continued to work another couple of months, but as of press time, UTRSBot has not edited since November 2019.
Curb Safe Charmer adopts reFill: TAnthony pointed out that reFill 2's bug reports were going unanswered; creator Zhaofeng Li had retired from Wikipedia, and a maintainer was needed. As of June 2021, Curb Safe Charmer had taken up the mantle, saying: "Not that I have all the skills needed but better me than nobody! 'Maintainer' might be too strong a term though. Volunteers welcome!"
BRFA activity by month
Welcome to the ninth issue of the English Wikipedia's Bots Newsletter, your source for all things bot. Vicious bot-on-bot edit warring... superseded tasks... policy proposals... these stories, and more, are brought to you by Wikipedia's most distinguished newsletter about bots.
After a long hiatus between August 2019 and December 2021, there's quite a bit of ground to cover. Due to the vastness, I decided in December to split the coverage up into a few installments that covered six months each. Some people thought this was a good idea, since covering an entire year in a single issue would make it unmanageably large. Others thought this was stupid, since they were getting talk page messages about crap from almost three years ago. Ultimately, the question of whether each issue covers six months or a year is only relevant for a couple more of them, and then the problem will be behind us forever.
Of course, you can also look on the bright side – we are making progress, and this issue will only be about crap from almost two years ago. Today we will pick up where we left off in December, and go through the first half of 2020.
Overall
In the first half of 2020, there were 71 BRFAs. Of these, 59 were approved, and 12 were unsuccessful (with 8 denied, 2 withdrawn, and 2 expired).
January 2020
Yeah, you're not gonna be able to get away with this anymore.
A new Pywikibot release dropped support for Python 3.4, and it was expected that support for Python 2.7 would be removed in coming updates. Toolforge itself planned to drop Python 2 support in 2022.
On February 1, some concerns were raised about ListeriaBot performing "nonsense" edits. Semi-active operator Magnus Manske (who originally coded the precursor of MediaWiki) was pinged. Meanwhile, the bot was temporarily blocked for several hours until the issue was diagnosed and resolved.
In March, a long discussion was started at Wikipedia talk:Bot policy by Skdb about the troubling trend of bots "expiring" without explanation after their owners became inactive. This can happen for a variety of reasons -- API changes break code, hosting providers' software updates break code, hosting accounts lapse, software changes make bots' edits unnecessary, and policy changes make bots' edits unwanted. The most promising solution seemed to be Toolforge hosting (although it has some problems of its own, like the occasional necessity of refactoring code).
A discussion on the bot noticeboard, "Re-examination of ListeriaBot", was started by Barkeep49, who pointed out repeated operation outside the scope of its BRFA (i.e. editing pages in mainspace, and adding non-free images to others). Some said it was doing good work, and others said it was operating beyond its remit. It was blocked on April 10; the next day it was unblocked, reblocked from article space, reblocked "for specified non-editing actions", unblocked, and indeffed. The next week, several safeguards were implemented in its code by Magnus; the bot was allowed to roam free once more on April 18.
Issues and enquiries are typically expected to be handled on the English Wikipedia. Pages reachable via unified login, like a talk page at Commons or at Italian Wikipedia could also be acceptable [...] External sites like Phabricator or GitHub (which require separate registration or do not allow for IP comments) and email (which can compromise anonymity) can supplement on-wiki communication, but do not replace it.
MajavahBot 3, an impressively meta bot task, was approved this month for maintaining a list of bots running on the English Wikipedia. The page, located at User:MajavahBot/Bot status report, is updated every 24 hours; it contains a list of all accounts with the bot flag, as well as their operator, edit count, last activity date, last edit date, last logged action date, user groups and block status.
In July 2017, Headbomb made a proposal that a section of the Wikipedia:Dashboard be devoted to bots and technical issues. In November 2019, Lua code was written superseding Legobot's tasks on that page, and operator Legoktm was asked to stop them so that the new code could be deployed. After no response to pings, a partial-block of Legobot for the dashboard was proposed. Some months later, on June 16, Headbomb said: "A full block serves nothing. A partial block solves all current issues [...] Just fucking do it. It's been 3 years now." The next day, however, Legoktm disabled the task, and the dashboard was successfully refactored.
On June 7, RexxS blocked Citation bot for disruptive editing, saying it was "still removing links after request to stop". A couple weeks later, a discussion on the bots noticeboard was opened, saying "it is a widely-used and useful bot, but it has one of the longest block logs for any recently-operating bot on Wikipedia". While its last BRFA approval was in 2011, its code and functionality had changed dramatically since then, and AntiCompositeNumber requested that BAG require a new BRFA. Maintainer AManWithNoPlan responded that most blocks were from years ago (when it lacked a proper test suite), and problems since then had mostly been one-off errors (like a June 2019 incident in which a LTA had "weaponized" the bot to harass editors).
David Tornheim opened a discussion about whether bots based on closed-source code should be permitted, and proposed that they not. He cited a recent case in which a maintainer had said "I can only suppose that the code that is available on GitHub is not the actual code that was running on [the bot]". Some disagreed: Naypta said that "I like free software as much as the next person, and I strongly believe that bot operators should make their bot code public, but I don't think it should be that they must do so".
@Headbomb: It was because of an extension that adds potentially common non-ISO 4 variants of abbreviations: if there is an ISO-4 redirect with the substring 'Special' it replaced it with 'Spec.' without checking if it was a complete word, so 'Specialized' in Biochim. Biophys. Acta Specialized Sect. Enzymol. Subj. yielded 'Spec.ized' and the dotless variant.
However, for quite a few of these variants we thought they were non-ISO 4 because of that 'languages' column in the LTWA. Now that we've clarified that actually all rules apply regardless of the language (except when two rules conflict), I believe 'Special' and 'Specialized' should be abbreviated to 'Spec.' in ISO-4 abbreviations (because of the rule special- | spéc. | rus, lit, fre). Similarly: animal- should be anim., atmospher- should be atmos., contribu- should be contrib., and also royal- should be r. (just like real sociedad is r. soc.). As far as I remember, this is mostly fixed for abbrevs of article/infobox titles, but not for redirects like Biochim. Biophys. Acta Specialized Sect. Enzymol. Subj. where the full title never appears in an infobox nor article title.
I don't think disabling (unless it's just those words) is particularly required, I just wanted to check if I G6'd them again, the bot wouldn't recreate them. AFAIA, all the redlinks in that page could be bot-created. Headbomb {t · c · p · b}13:38, 24 April 2022 (UTC)[reply]
Just those words (special-, animal-, atmospher-, contribu-, royal-). Ok, I'll run the bot as before, with these words (just avoiding the subword bug like in 'Spec.ized'), assuming categories aren't that important on those. Those wrongly categorized as ISO-4 will stay unchanged, and the created red ones will just be tagged R-from-abbreviation as before (even though most of them will be correct ISO-4). Tokenzero (talk) 19:12, 24 April 2022 (UTC)[reply]
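To make the subword bug concrete, here is a small sketch contrasting a plain substring replacement with a whole-word one. The function names are hypothetical and the bot's real variant-generating code is not shown on this page; this only illustrates the failure mode and one way of avoiding it.

```python
import re


def naive_variant(abbrev: str) -> str:
    """Buggy version: replaces the substring even inside a longer word."""
    return abbrev.replace("Special", "Spec.")


def whole_word_variant(abbrev: str) -> str:
    """Only replace 'Special' when it stands as a complete word."""
    return re.sub(r"\bSpecial\b", "Spec.", abbrev)


title = "Biochim. Biophys. Acta Specialized Sect. Enzymol. Subj."
print(naive_variant(title))       # contains the broken 'Spec.ized'
print(whole_word_variant(title))  # left unchanged
```

Whether 'Specialized' should itself become 'Spec.' under the special- rule is a separate question from this bug; the whole-word check only stops the malformed output.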
Hello! Voting in the 2022 Arbitration Committee elections is now open until 23:59 (UTC) on Monday, 12 December 2022. All eligible users are allowed to vote. Users with alternate accounts may only vote once.
@Headbomb Ok, thanks, I added "degli" to the shortwords list. I also updated the LTWA list now (they published a new list in 2021), but it doesn't seem to have changed anything substantial. Tokenzero (talk) 22:36, 2 September 2023 (UTC)[reply]
@Headbomb: Turns out the way the bot caches information about infoboxes and already computed abbreviations makes it see ones that have been deleted (which affects only reports, but probably all reports). I'll see if I can find some easy fix, for now I just reset the cache. Tokenzero (talk) 20:27, 5 August 2024 (UTC)[reply]
Hi, sorry about that. The bot will create one of Asterix & the Big Fight(movie), Asterix and the Big Fight(movie) only if the other already exists (and I've checked that it's consistent with the sequence of deletion logs). So removing both at the same time (like within a week or two) should resolve it. If this is a recurring issue, I could try to avoid creating pages that were ever deleted (which I should probably do anyway, but that might take a while before I find the time to implement that). Tokenzero (talk) 20:27, 28 August 2024 (UTC)[reply]
Hello! Voting in the 2024 Arbitration Committee elections is now open until 23:59 (UTC) on Monday, 2 December 2024. All eligible users are allowed to vote. Users with alternate accounts may only vote once.