Commons:Bots/Requests/Cewbot 3

From Wikimedia Commons, the free media repository
Jump to navigation Jump to search

Cewbot 3 (talk · contribs)

Operator: Kanashimi (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)

Bot's tasks for which permission is being sought: Upload pictures under {{PD}} from http://www.publicdomainpictures.net/ with tags.

Automatic or manually assisted: automatic

Edit type (e.g. Continuous, daily, one time run): Weekly

Maximum edit rate (e.g. edits per minute): 12 per minute

Bot flag requested: (Y/N): No

Programming language(s): JavaScript (CeJS)

Kanashimi (talk) 01:33, 28 February 2017 (UTC)[reply]

Discussion

I find there are some useful pictures on http://www.publicdomainpictures.net/. Please give me some advice, thank you. --Kanashimi (talk) 01:33, 28 February 2017 (UTC)[reply]

Before the test edits, I think I should check some points first:
  • Is the license properly? For there is a license declaration following the picture in the page of publicdomainpictures, I think yes.
  • Are the pictures deserved? As mentioned above, there are some useful pictures associated with wiki articles, so I think yes.
  • Are there other bots doing the same work? I have searched wiki commons and it seems no.
So it's my pleasure if i could accept some advice. --Kanashimi (talk) 06:48, 28 February 2017 (UTC)[reply]
Addressing the above points:
  • The license is linked to http://creativecommons.org/publicdomain/zero/1.0/ => {{Cc-0}}
  • Assuming by "deserved" you mean in scope, it's hard to tell whether these images are in scope. Do you plan to check each image manually before upload or batch transfer all the images? And do you have a few example images where the images will be used in wiki articles?
  • And where do all these images come from? How can we be confident that they are not license laundering?
In any case, a test run is worth a thousand words. We can always delete the images if something goes wrong --Zhuyifei1999 (talk) 07:12, 28 February 2017 (UTC)[reply]
You don't need a bot account in order to batch upload images. publicdomainpictures.net looks like a mirror of pixabay or freepic and is just an aggregator site, possibly a reinvention of same site I uploaded from 4 years ago, Category:Public-domain-image.com. They probably are not license laundered, but it will be hard to say whether the releases to this site are not recycled from somewhere else, and it is likely that many of the images are already on Commons. The difficulty will be if the EXIF data has changed, as we will then have many hard to detect duplicates. -- (talk) 09:55, 28 February 2017 (UTC)[reply]
  • You say "I find there are some useful pictures" – which of them would you be going to upload, and which are the criteria for choosing? Single / few images could, as said, be copied without approval, for "all" I'd prefer to have some discussion about the raised duplicates issue, and perhaps also about scope. --Krd 08:50, 3 March 2017 (UTC)[reply]
  • The images appear to be available at the source in low-res for free and in high-res for registered users only. Which versions will be transferred? Has there been any contact with the source site operator? --Krd 08:50, 3 March 2017 (UTC)[reply]
I planned to upload the low-resolution version. However, thanks for 's explanation, since I have no way to detect if the images are only different in EXIF or sizes, so it is hard, yes. Does commons have any API to do the function? Or we should do it ourselves? --Kanashimi (talk) 03:46, 6 March 2017 (UTC)[reply]
When we're at it already, why should we use the low-res version? I tend to say this is all in all not necessarily a reasonable job. --Krd 14:13, 11 March 2017 (UTC)[reply]
There is no system for 'thumbprinting' images, or calculating the SHA1 for an image without its EXIF or other metadata. There is general expectation that at some future point, there will be image thumbprinting, but it's equally likely that this may take as many years to get to as Commons being made redundant by a better open knowledge resource. As per Krd, if this is done, then the high resolution images should be harvested. -- (talk) 09:02, 15 March 2017 (UTC)[reply]
Still, thanks for Fæ's advice. Well... Since I can not detect duplicate images with different information only, I should  withdrawn the task. --Kanashimi (talk) 10:16, 16 March 2017 (UTC)[reply]

Closing as withdrawn. Please feel free to reopen at any time. --Krd 07:52, 18 March 2017 (UTC)[reply]