User talk:FlickypediaBackfillrBot

From Wikimedia Commons, the free media repository
Jump to navigation Jump to search

Bot request[edit]

See Commons:Bots/Requests/FlickypediaBackfillrBot – I posted an initial draft of the proposal here for review by a few colleagues at the Flickr Foundation and Wikimedia Foundation, before posting it for community review.

Photos from Flickr which don’t have any SDC[edit]

As part of my work analysing the Wikimedia Commons snapshots, I've found ~950k files on Wikimedia Commons which:

  • Have a Flickr URL somewhere in their Wikitext
  • Don't have any Flickr information in their SDC

It would be beneficial to add SDC to these photos, but that's out-of-scope for FlickypediaBackfillrBot (at least for now).

It's comparatively difficult to tell the difference between, say:

  • This photo is from Flickr.com: {flickr.com/…} where we would want Flickr info in the SDC, and
  • For other photos of this same building, see {flickr.com/…}, where we wouldn’t want Flickr info in the SDC

I’d like to get to this at some point, but it’s out-of-scope for the initial version – it’ll need more manual intervention and care, for a comparatively small selection of photos. Alexwlchan (talk) 10:39, 2 November 2023 (UTC)[reply]

Blocked[edit]

Blocked Indefinitely
Blocked Indefinitely
You have been blocked indefinitely from editing Commons. If you believe this block is unjustified, you may add {{Unblock}} below this message explaining clearly why you should be unblocked. For more information, see Appealing a block.
See the block log for the reason that you have been blocked and the name of the administrator who blocked you.

azərbaycanca  català  čeština  Deutsch  English  español  français  hrvatski  Bahasa Indonesia  italiano  kurdî  la .lojban.  magyar  Nederlands  Plattdüütsch  polski  português  português do Brasil  sicilianu  suomi  svenska  Türkçe  Tiếng Việt  Zazaki  македонски  русский  українська  हिन्दी  বাংলা  മലയാളം  ไทย  မြန်မာဘာသာ  한국어  日本語  中文(简体)‎  中文(繁體)‎  עברית  العربية  فارسی  +/−

Please request the bot flag. Thanks, --Yann (talk) 16:21, 18 December 2023 (UTC)[reply]

Hi Yann – I did [ request the bot flag (Commons:Bots/Requests/FlickypediaBackfillrBot), and I had a conversation from Schlurcher about it after my test edits. I thought everything was a-okay to go, but it sounds like I was meant to wait for something else to happen first?
Sorry if I was a bit quick off the mark! Alexwlchan (talk) 16:23, 18 December 2023 (UTC)[reply]
Unblock request granted

This blocked user asked to be unblocked, and one or more administrators has reviewed and granted this request.

Request reason: "Flickypedia Backfillr Bot approved. See Commons:Bots/Requests/FlickypediaBackfillrBot – this was approved last week. Sorry, I got confused about how the approval process works, and thought it was approved before it was! Alexwlchan (talk) 12:12, 11 March 2024 (UTC)"[reply]
Unblock reason: "Unblocked by Krd on 6 March 2024. Yann (talk) 13:00, 11 March 2024 (UTC)"[reply]
This template should be archived normally.
(Block log)
(unblock)
(Change local status for a global block)
(contribs)

čeština  Deutsch  English  español  français  hrvatski  magyar  Plattdüütsch  português  suomi  हिन्दी  македонски  русский  slovenščina  Tiếng Việt  中文(简体)  中文(繁體)  中文(臺灣)  +/−

Notes from running the bot in practice[edit]

Now the bot is approved, I'm starting to run it in larger and larger numbers.

I thought it would be useful to note a couple of areas where it gets “confused” – i.e. where it can’t work out how to update the SDC, so it does nothing and just flags a warning for inspection.

  • creator (P170) when there’s a non-Flickr user in there. e.g. there are files where the Flickr user is also a WMC user, they uploaded their own photo to WMC, and they put their Wikimedia username in the creator field. The bot only expects to see a Flickr user statement on Flickr photos, so it gets confused.
    It might be useful to add the Flickr username as an additional statement.
  • creator (P170) when the Flickr user ID (P3267) field contains the Flickr "path alias" instead of the NSID.
    e.g. if the https://www.flickr.com/people/nasacommons account has the P3267 nasacommons instead of 44494372@N05.
    The P3267 field is already used quite inconsistently across Wikidata/WMC, and there’s an opportunity for more cleanup there.
  • inception (P571) when the precision on the "date taken" hasn’t been mapped correctly from Flickr.
    e.g. the date taken on Flickr is 2024 but it’s been mapped as 1 January 2024.
    It would be useful to fix these, but right now I don't know if this is widespread enough to be worth automating, or whether they can be fixed with manual edits.

Plus a certain amount of link rot on the Flickr side – photos and photographers whose pages no longer exist (or are private), so the bot can't get a full set of metadata. In this case it adds a ‎Flickr photo ID (P12120) statement and nothing else. Alexwlchan (talk) 21:12, 12 March 2024 (UTC)[reply]

BOT flag[edit]

Appears that the BOT flag has not been set as the edits are not been marked as BOT edits in the watchlist, thus the "ignore BOT" option is not excluding the edits from the watchlist. Keith D (talk) 17:49, 16 March 2024 (UTC)[reply]

I just ran into this; my watchlist is impossible. mr.choppers (talk)-en- 00:49, 17 March 2024 (UTC)[reply]
Hey both, thanks for the feedback, sorry for the disruption!
I'm a bit busy this week, so I've paused the bot until I can look at this properly. Alexwlchan (talk) 14:14, 18 March 2024 (UTC)[reply]
Hi both, thanks for the feedback!
I thought I was sending the API parameter to mark these edits as coming from a bot, but it doesn’t seem to be working correctly – I’ve popped a question on the village pump to find out what I need to change. Alexwlchan (talk) 11:56, 10 April 2024 (UTC)[reply]
Okay, I believe I’ve got this sorted and the bot is running again, setting the flag correctly. Thanks for letting me know! Alexwlchan (talk) 06:35, 9 May 2024 (UTC)[reply]