Wikipedia:AutoWikiBrowser/Tasks
This page has archives. Sections older than 30 days may be automatically archived by Lowercase sigmabot III when more than 3 sections are present.
This page is for tasks that involve changing the same code in multiple articles. This is a great fit for editors with AutoWikiBrowser permissions.
Please note that Wikipedia:Bot requests sometimes has one-time tasks which can be done easily using AWB. For modifying URLs see WP:URLREQ.
See also Wikipedia:WikiProject Check Wikipedia.
False last and first names in references
Could someone run AWB to remove the name "Bureau, US Census" from "cite web" references? I see it as "last1=Bureau" and "first1=US Census", or maybe "last=Bureau" and "first=US Census". Kk.urban (talk) 01:20, 9 March 2024 (UTC)
- Working on some REGEX for this, will post here because page list looks huge. Geardona (talk to me?) 01:34, 9 March 2024 (UTC)
- REGEX (replace with nothing) [lL]ast=Bureau|[fF]irst=US Census|[lL]ast\d=Bureau|[fF]irst\d=US Census Geardona (talk to me?) 01:39, 9 March 2024 (UTC)
- Support Sounds fun! Chaotıċ Enby (talk · contribs) 01:50, 9 March 2024 (UTC)
- How big is huge, Geardona? If it's >500 pages it might be worth a bot. (please do not ping on reply) Primefac (talk) 12:26, 11 March 2024 (UTC)
- Could be about 7,000 - there are some false positives. -- GreenC 14:56, 11 March 2024 (UTC)
- @GreenC: Including an insource search for |first[0-9]?= shrinks the results to less than 4,500. GoingBatty (talk) 14:17, 12 March 2024 (UTC)
- I think there is some process already in place for removing "Writer, Staff", "Correspondent, Own" and other proforma names. Sadly, I can't recall exactly who or what does this and how often but, if someone can track them down, we may be able to add the prolific Mr. Bureau to an existing list of unwelcome authors. Certes (talk) 16:57, 9 March 2024 (UTC)
- @GoingBatty and BattyBot: I think. Neils51 (talk) 05:05, 12 March 2024 (UTC)
- @Certes: Another option could be to request that "Bureau" be added to the list of unwelcome authors in Category:CS1 errors: generic name. GoingBatty (talk) 14:22, 12 March 2024 (UTC)
- The |website=, |work=, |publisher= fields need normalization, but that is a very big job for census.gov, probably 100s of thousands of pages. They should probably be set to United States Census Bureau. It might need to be done incidentally, with other work, to avoid lighting up watchlists too fast, e.g. every small town in America. -- GreenC 15:14, 11 March 2024 (UTC)
- @GreenC: Maybe you could ask Ohconfucius to include this in their Fix SOURCES script. GoingBatty (talk) 14:11, 12 March 2024 (UTC)
- I was looking in JWB and it maxed out at 1500. Geardona (talk to me?) 23:31, 11 March 2024 (UTC)
- @Kk.urban: I think a bot should only remove the name "Bureau, US Census" if some form of "US Census Bureau" is included in one of the other parameters. What do you think? GoingBatty (talk) 14:10, 12 March 2024 (UTC)
- @Kk.urban and Geardona: - I thought Geardona's "Working.." icon above meant they were going to do this, but maybe they were just creating a regex to find the pages. In which case, it doesn't look like anyone has volunteered to do this. -- GreenC 01:22, 21 March 2024 (UTC)
- I am currently working on it, but have been stuck on mobile for a few days. I can keep going in a little bit. I also paused for the bot discussion. Geardona (talk to me?) 01:28, 21 March 2024 (UTC)
- Thanks. -- GreenC 01:31, 21 March 2024 (UTC)
- @Geardona: Did you really mean to remove 100k of text from Kentucky? I tried to restore it, but I can't because certain sources appear to be censored. @Kk.urban, GreenC, GoingBatty, and Certes: --David Tornheim (talk) 05:30, 21 March 2024 (UTC)
- I'm well aware of the issue, I don't think there's anything I can do, it's all reverted now (thanks discord people) but I have diff max size set to 150, so I'm not sure if this is possible. Geardona (talk to me?) 05:34, 21 March 2024 (UTC)
- Glad it's fixed. I hated to tag so many editors over this, but I wanted to be sure it got corrected ASAP, since I couldn't do it. --David Tornheim (talk) 05:36, 21 March 2024 (UTC)
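For anyone picking this up outside AWB, the find-and-replace above, combined with GoingBatty's suggestion to act only when some form of "US Census Bureau" appears in another parameter, could be sketched roughly as follows. This is a minimal illustration, not what anyone in this thread actually ran: the function name `strip_generic_name` and the three patterns are assumptions, and the simple template matcher does not handle citations that contain nested templates.

```python
import re

# Matches "|last=Bureau", "|last1=Bureau", "|first=US Census", "|first1=US Census"
# (optional spaces, optional single digit), including the leading pipe.
# The lookahead requires the value to end there, so "Bureau of Labor" is not matched.
GENERIC_NAME = re.compile(
    r"\|\s*(?:last\d?\s*=\s*Bureau|first\d?\s*=\s*US Census)\s*(?=[|}])",
    re.IGNORECASE,
)

# Assumed safety check: the bureau is credited somewhere else in the citation.
BUREAU_ELSEWHERE = re.compile(
    r"(?:US|U\.S\.|United States) Census Bureau", re.IGNORECASE
)

# Simplified: only matches cite web templates with no nested templates inside.
CITE_WEB = re.compile(r"\{\{\s*cite web\b[^{}]*\}\}", re.IGNORECASE)


def strip_generic_name(wikitext: str) -> str:
    """Remove the false author name from qualifying cite web templates."""

    def fix(match: re.Match) -> str:
        template = match.group(0)
        stripped = GENERIC_NAME.sub("", template)
        # Only accept the change if the bureau is still named in what remains.
        if stripped != template and BUREAU_ELSEWHERE.search(stripped):
            return stripped
        return template

    return CITE_WEB.sub(fix, wikitext)
```

Citations that have the false name but no other mention of the bureau are left untouched, so they can be reviewed by hand.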
Could AWB go through this list?
Wikipedia:Database reports/Long stubs OrdinaryGiraffe (talk) 01:35, 16 March 2024 (UTC)
Reasons: cricket is getting nearly 150 hits, almost all of which I had stub applied in a boilerplate way long ago. Same for 20, which turns up 20xx years. They are getting almost 500 hits. OrdinaryGiraffe (talk) 01:42, 16 March 2024 (UTC)
Note: Category:Cricket season stubs only has 300 articles, and Category:Cricket competitions only 150, so this is pretty significant. Alternatively, I could ask the WikiProject to look into this. Never mind, the list got updated, but it would still be nice for AWB to deal with it. OrdinaryGiraffe (talk) 01:49, 16 March 2024 (UTC)
- What are you looking for us to do; get rid of the stub templates? Geardona (talk to me?) 06:22, 16 March 2024 (UTC)
- As long as it's more than the AWB soft limit of 500 words excluding templates and tables, yes. OrdinaryGiraffe (talk) 15:52, 16 March 2024 (UTC)
- So remove stub templates from those articles, consider me Working on it. Geardona (talk to me?) 17:31, 16 March 2024 (UTC)
- P.S. Is there a policy discussion about rating articles like this one? Around 40 words, but 101KB size. OrdinaryGiraffe (talk) 18:04, 16 March 2024 (UTC)
- I would say that is list class. Geardona (talk to me?) 18:16, 16 March 2024 (UTC)
- Note that this is still being worked on. Geardona (talk to me?) 06:21, 21 March 2024 (UTC)
- Could AWB fix the list class (but tagged stub class in Talk) less than 500 words ones as well? Or would that require a bot? OrdinaryGiraffe (talk) 20:38, 23 March 2024 (UTC)
- I’m not sure that’s possible in any way but manual, I can try to spin up some regex. But it really does not sound possible. As for the bot question, I don’t think that’s possible either. Geardona (talk to me?) 20:47, 23 March 2024 (UTC)
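The rule discussed in this thread (drop the stub tag once an article has more than 500 words, not counting templates and tables) can be approximated as below. This is only a rough sketch under stated assumptions: the word count is a plain whitespace split, which is not necessarily how AWB counts, and the names `prose_word_count`, `untag_long_stub` and `WORD_LIMIT` are made up for illustration.

```python
import re

WORD_LIMIT = 500  # the soft limit mentioned in the thread

# Innermost templates only; applied repeatedly to peel nested ones.
TEMPLATE = re.compile(r"\{\{[^{}]*\}\}")
# A wikitable from a line starting "{|" to a line starting "|}".
TABLE = re.compile(r"^\{\|.*?^\|\}", re.DOTALL | re.MULTILINE)
# Any template whose name ends in "stub", e.g. {{stub}} or {{cricket-stub}}.
STUB_TAG = re.compile(r"\{\{[^{}|]*stub\}\}[ \t]*\n?", re.IGNORECASE)


def prose_word_count(wikitext: str) -> int:
    """Approximate word count with templates and tables removed."""
    text = wikitext
    while True:
        text, replaced = TEMPLATE.subn(" ", text)
        if replaced == 0:
            break
    text = TABLE.sub(" ", text)
    return len(text.split())


def untag_long_stub(wikitext: str) -> str:
    """Remove stub templates only when the prose exceeds the limit."""
    if prose_word_count(wikitext) > WORD_LIMIT:
        return STUB_TAG.sub("", wikitext)
    return wikitext
```

A page that is mostly one large table, like the 101KB example above, would stay tagged under this rule because its prose count is tiny, so the list-class cases still need manual review.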
Could someone replace all instances of "GRAMMY" or "GRAMMY®" in visible text with "Grammy"? Mach61 19:26, 18 March 2024 (UTC)
- Searching for "GRAMMY" returns 4,964 results before timing out. That probably needs a bot to fix, if it can be accurate enough (unsure of that). Working on "GRAMMY®". Snowmanonahoe (talk · contribs · typos) 19:33, 18 March 2024 (UTC)
- Some may be accurate quotes, e.g. {{Cite whatever|title=Smith wins GRAMMY®}} or [https://example.com Smith wins GRAMMY®]. See also Emmy® and Award® (e.g. Academy Award®, but also many more obscure gongs). I just fixed a dozen which were similar but not quite regular enough for the proposed task/bot to catch (spaces before ®, ™ or © instead, etc.) I was surprised not to find many more; someone seems to have their eye on this ball. Certes (talk) 20:04, 18 March 2024 (UTC)
- Hrm, that might put it into CONTEXTBOT territory. Primefac (talk) 20:20, 18 March 2024 (UTC)
- This search is almost useful but fails to find articles which use GRAMMY® both in a legitimate quote and in running prose, where just the latter would require attention. It still includes false positives, so is fodder for AWB rather than a bot. Certes (talk) 20:31, 18 March 2024 (UTC)
- I'm pretty sure we still reduce it to title case and remove the trademark symbol when it's in a title. Snowmanonahoe (talk · contribs · typos) 20:33, 18 March 2024 (UTC)
- MOS:TM says "Do not use the ™ and ® symbols, or similar, in either article text or citations". I'm unsure exactly what that means but it may well cover title= and similar uses, in which case there are a lot to fix (as the original poster implies). Certes (talk) 20:47, 18 March 2024 (UTC)
- It means remove those symbols unless the sentence is specifically explaining the branding, like "Grammy is often branded as 'GRAMMY®', including the registered trademark symbol." We remove the symbol and the excessive capitalization otherwise, unless removing the symbol would break something technical; I'm seeing it in one dead URL, but I can't tell if it is valid. – Jonesey95 (talk) 00:04, 19 March 2024 (UTC)
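One conservative way to split the work, given the disagreement above about titles and quotes, is to fix running prose automatically and only count the hits inside templates and external links so a human can look at them. The sketch below assumes that split; it is not an agreed procedure, the names `fix_prose`, `PROTECTED` and `BRANDING` are illustrative, and the template matcher ignores nested templates.

```python
import re

# Spans left alone: simple (non-nested) templates and bracketed external links.
PROTECTED = re.compile(r"\{\{[^{}]*\}\}|\[https?://[^\]]*\]")
# "GRAMMY" as a whole word, optionally followed by a trademark-style symbol
# (with or without a space before it). Case-sensitive, so "Grammy" is untouched.
BRANDING = re.compile(r"\bGRAMMY\b(?:\s?[®™©])?")


def fix_prose(wikitext: str) -> tuple[str, int]:
    """Return the fixed text and the number of protected spans needing review."""
    pieces = []
    needs_review = 0
    position = 0
    for match in PROTECTED.finditer(wikitext):
        # Fix the prose before this protected span, then copy the span as-is.
        pieces.append(BRANDING.sub("Grammy", wikitext[position:match.start()]))
        pieces.append(match.group(0))
        if BRANDING.search(match.group(0)):
            needs_review += 1
        position = match.end()
    pieces.append(BRANDING.sub("Grammy", wikitext[position:]))
    return "".join(pieces), needs_review
```

If the reading of MOS:TM given by Jonesey95 is followed, the spans counted in `needs_review` would be edited too, apart from URLs where the symbol is part of the address.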
Remove external-links entry for druginfo (dead site)
Per Wikipedia talk:WikiProject Medicine#druginfo.nlm.nih.gov is dead (and antecedent discussions linked therein), 'druginfo.nlm.nih.gov' is permanently dead, but is listed in (quick guesstimate) several hundred External Links sections. The task is to remove those entries altogether from pages where the infobox already has a link to the replacement site. JWB is flaky for me and I can't use AWB, hence this request. The specific edit is to delete lines of this form:
* {{cite web | url = https://druginfo.nlm.nih.gov/drugportal/name/cocaine | publisher = U.S. National Library of Medicine | work = Drug Information Portal | title = Cocaine }}
where the underlined strings could vary, in pages that contain the regexp:
/\|\s*PubChem\s*=\s*\d/
that is, a |PubChem= template field that has a numerical value. DMacks (talk) 21:21, 18 March 2024 (UTC)
- @DMacks: wait, if it's external links sections that are the problem, why are we deleting citation templates? Snowmanonahoe (talk · contribs · typos) 00:37, 19 March 2024 (UTC)
- It indeed was noted in the discussion I linked that they should not have been formatted as citations in the first place. The items to remove are bullet-list, not footnote or any sort of named ref. Once these are gone, we plan to manually deal with the cases that are actually used as refs. DMacks (talk) 02:33, 19 March 2024 (UTC)
- I've made a regex, and that's all I have time for tonight.
\n\* ?\{\{[cC]ite web *\| *url *= *https?://druginfo.nlm.nih.gov/drugportal/name/(.+) *\| *publisher *= *U\.S\. National Library of Medicine *\| *work *= *Drug Information Portal *\| *title *= * (.+) *\}\}
If someone decides to finish this for me:
- Nest that in adv. settings under the pubchem regex. Remove the forward slashes at the beginning and end.
- Generate the list with insource:https://druginfo.nlm.nih.gov/drugportal/name/
- I'll have this done sometime tomorrow night probably. Snowmanonahoe (talk · contribs · typos) 04:12, 19 March 2024 (UTC)
- Done Snowmanonahoe (talk · contribs · typos) 20:04, 20 March 2024 (UTC)
- Thanks for working on that! Any idea why it didn't pick up Phencyclidine? DMacks (talk) 10:04, 23 March 2024 (UTC)
- It doesn't have the pubchem parameter filled in the infobox. Snowmanonahoe (talk · contribs · typos) 15:38, 23 March 2024 (UTC)
- Here is around line 50:
| ATC_prefix = None | PubChem = 6468 | DrugBank_Ref = {{drugbankcite|correct|drugbank}}
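For reference, the two-step logic requested in this section (first check that the page has a numeric |PubChem= field, then delete the bulleted druginfo entry) can be sketched outside AWB like this. It is a looser variant of the regex posted above, not a copy of it: it accepts the parameters in any order, and the names `remove_druginfo_entries`, `HAS_PUBCHEM` and `DRUGINFO_LINE` are assumptions for illustration.

```python
import re

# DMacks's page filter: a |PubChem= field whose value starts with a digit.
HAS_PUBCHEM = re.compile(r"\|\s*PubChem\s*=\s*\d")

# A whole bullet line holding a cite web template that points at the dead site.
# Lines that do not start with "*" (e.g. <ref> uses) are deliberately not matched.
DRUGINFO_LINE = re.compile(
    r"^\*[ \t]*\{\{\s*cite web\s*\|[^{}\n]*"
    r"druginfo\.nlm\.nih\.gov/drugportal/name/[^{}\n]*\}\}[ \t]*\n?",
    re.IGNORECASE | re.MULTILINE,
)


def remove_druginfo_entries(wikitext: str) -> str:
    """Delete bulleted druginfo entries, but only on pages with a PubChem ID."""
    if not HAS_PUBCHEM.search(wikitext):
        return wikitext
    return DRUGINFO_LINE.sub("", wikitext)
```

Running `HAS_PUBCHEM.search` on a page's infobox text is also a quick way to check why a particular page was or was not picked up by the filter.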
Removing category name containing mandal for petscan query result
The articles that are the result of petscan query are articles of villages, towns, or cities that should not be part of categories of mandals. Hence those categories need to be removed. As an example, for Pulivendula, Category:Mandals in Kadapa district should be removed and for P._Gannavaram, Category:Mandals in Konaseema district and Category:Mandals in Andhra Pradesh should be removed. Arjunaraoc (talk) 12:38, 22 March 2024 (UTC)
- @Arjunaraoc, are the articles in this query not about mandals? I'm asking because when I searched for Pulivendula on Google, I found many results mentioning Pulivendula Mandal. – DreamRimmer (talk) 12:57, 22 March 2024 (UTC)
- @DreamRimmer, There are mandals with same name, but separate mandal articles need to be created. So mandal categories are to be removed from these. Arjunaraoc (talk) 13:08, 22 March 2024 (UTC)
- That makes sense. – DreamRimmer (talk) 13:09, 22 March 2024 (UTC)
- @DreamRimmer, I found my bot account ( Arjunaraocbot ) originally used on tewiki working on enwiki. Hence I updated the articles myself. Arjunaraoc (talk) 01:41, 23 March 2024 (UTC)
- @Arjunaraoc, please don't run unapproved bot. You must first get it approved to run on the English Wikipedia. This type of small task can be done with your main account using AWB. – DreamRimmer (talk) 01:54, 23 March 2024 (UTC)
- @DreamRimmer As I use Linux as my desktop, I can't run AWB. Now I will file request for bot. Arjunaraoc (talk) 03:49, 23 March 2024 (UTC)
- @Arjunaraoc: You may be interested in reading Wikipedia:AutoWikiBrowser/Mono and Wine to see how to run AWB on Linux. GoingBatty (talk) 04:39, 23 March 2024 (UTC)
- @DreamRimmer, My bot request for info.--Arjunaraoc (talk) 04:17, 23 March 2024 (UTC)
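The edit requested in this section amounts to deleting category links whose name starts with "Mandals in" from the pages returned by the PetScan query. A minimal sketch of that replacement is shown here; the function name `remove_mandal_categories` is hypothetical, and the sketch assumes each category link sits on its own line, which is the usual layout but is not guaranteed.

```python
import re

# A category link on its own line whose name begins with "Mandals in",
# e.g. [[Category:Mandals in Kadapa district]]. Sort keys after "|" are allowed.
MANDAL_CATEGORY = re.compile(
    r"^\[\[\s*Category\s*:\s*Mandals in [^\]\n]*\]\][ \t]*\n?",
    re.MULTILINE,
)


def remove_mandal_categories(wikitext: str) -> str:
    """Drop every 'Mandals in ...' category link from the page text."""
    return MANDAL_CATEGORY.sub("", wikitext)
```

This should only be applied to the village, town and city articles in the query result, since genuine mandal articles are supposed to keep these categories.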
Events at the European Track Championships
Please could somebody tag and list Category:Events at the European Track Championships and its 25 subcats for speedy renaming to "UEC European Track Championships" under C2D? – Fayenatic London 11:57, 26 March 2024 (UTC)
- @Fayenatic london, I'm happy to assist with tagging, but I'm not quite sure how to go about listing them at WP:CFDS. Would you be able to take care of that part? – DreamRimmer (talk) 12:12, 26 March 2024 (UTC)
- Thank you – yes, I'll be happy to do that. I'd paste the current names into Excel and use text formulas to create the nomination lines. – Fayenatic London 12:15, 26 March 2024 (UTC)
- Tagging – DreamRimmer (talk) 12:29, 26 March 2024 (UTC)
- @Fayenatic london, done – DreamRimmer (talk) 12:50, 26 March 2024 (UTC)