Commons:Bots/Requests/SmartifyBot

From Wikimedia Commons, the free media repository
Jump to navigation Jump to search

SmartifyBot (talk · contribs)

Operator: Rob Lowe - Smartify (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)

Bot's tasks for which permission is being sought: Supply Smartify artwork content to Wikipedia, in particular Yale Center for British Art

Automatic or manually assisted: Automatic (supervised)

Edit type (e.g. Continuous, daily, one time run): Quarterly

Maximum edit rate (e.g. edits per minute): 2

Bot flag requested: (Y/N): N (initially)

Programming language(s): Python (pywikibot)

SmartifyBot (talk) 18:44, 13 December 2020 (UTC)[reply]

Discussion
@Rob Lowe - Smartify: this being Wikimedia Commons and not Wikipedia, can you elaborate a bit more what your plans are and maybe a link to some source code? Multichill (talk) 20:16, 13 December 2020 (UTC)[reply]
Smartify is an iPhone/Android App that uses recognition technology to identify artworks. We typically work in collaboration with galleries and museums to make their collections available in the App for recognition/searching/tours etc. This project is a result of a collaboration with the Yale Center for British Art who have ~60,000 open access artworks (paintings/drawings/prints/sculptures) that they want to make available in Smartify. They have also asked that we make the images available in Wikimedia Commons and if possible keep them up-to-date. Some YCBA content was loaded by Google Arts & Culture in 2012, about 5,000 works I think, those will not be duplicated. I have just started work on the project, so there is a lot to do. Rob Lowe - Smartify 12:17, 14 December 2020 (UTC)[reply]
@Rob Lowe - Smartify: Can you please keep your own account and your bot account separate? You shouldn't be replying here using your bot account. It's a role account that should only be used for automated edits.
Sounds like a fun project! I worked on that collection for the paintings some time ago, see d:Wikidata:WikiProject sum of all paintings/Collection/Yale Center for British Art, but their website looks much better these days.
I see you're planning to use Pywikibot. At https://phabricator.wikimedia.org/diffusion/PWBC/browse/master/scripts/data_ingestion.py and https://github.com/multichill/toollabs/tree/master/bot/commons you might find some useful code. Pywikibot will just take care of the upload part, the hard part is doing the data mapping. You need to map the relevant fields in your source data to {{Artwork}} or, if you feel adventurous, do it in structured data.
The release doesn't mention an explicit license. We like explicit releases a lot. Would be nice if they could switch to {{Cc-zero}}, that's how other institution like the Metropolitan Museum of Art (https://www.metmuseum.org/about-the-met/policies-and-documents/open-access ), Cleveland museum of Art (https://www.clevelandart.org/open-access) and Smithsonian (https://www.si.edu/openaccess) are doing it too. Not sure if that's an option. Multichill (talk) 17:58, 14 December 2020 (UTC)[reply]
Christmas being over, it's back to work. I'll certainly be talking to Yale about the licence and I'll see if they are happy with {{Cc-zero}}. In the mean time I've uploaded this test file, any comments/criticisms/suggestions gratefully received. In addition can I just ask, is there a bot running that automatically adds parallel entries into Wikidata from {{Artwork}} items? Or do I need to add the record myself at the same time as uploading to Commons? Rob Lowe - Smartify 08:52, 15 January 2021 (UTC)[reply]
How is ensured that only files are uploaded as PD-Art|PD-old-100 for which this really applies? Are all files showing historical artwork? --Krd 08:29, 19 January 2021 (UTC)[reply]
@Rob Lowe - Smartify:  ? --Krd 07:34, 16 March 2021 (UTC)[reply]
@Krd: Yale have recently decided to use CC0 rather than PD-Art|PD-old-100, and only those works in their collection that qualify are going to be uploaded. Rob Lowe - Smartify (talk) 19:48, 17 March 2021 (UTC)[reply]
CC-0 is a release made by the copyright holder, while PD-Art|PD-old-100 is as public domain status, which are two different things. How is ensured that images tagged by them with CC-0 have the rights of the original creator expired? --Krd 08:46, 21 March 2021 (UTC)[reply]
@Rob Lowe - Smartify:  ? --Krd 10:26, 17 April 2021 (UTC)[reply]
@Krd: The works I will be uploading on behalf of Yale are all marked as Public Domain, like this one: https://collections.britishart.yale.edu/catalog/tms:73253. I spoke with the image rights person at Yale last week and they are fairly adamant they want to use CC-0 for these. I mentioned your concerns and she said there was much debate in the community about the correct assignation, but that things had swung towards the use of CC-0 for such works, even though the author of the work couldn't be contacted - having been dead for century. She was going to send me some comments/arguments supporting that view, but I've not had them yet. I'm open to your guidance. I can do any necessary checks on the artist and artwork dates. On another matter, I've decided to upload the works to Wikidata first and have a standard bot populate Commons. Except that the bot doesn't seem to run very frequently and there maybe a big backlog, which adding a further 40,000 items won't improve. I left a message on Multichill's Wikidata talk page, who I think runs the Wikidata->Commons bot, but not had a response. Do you have any advice? Perhaps I should create both a Wikidata and Commons entry and link them myself Rob Lowe - Smartify (talk) 17:35, 17 April 2021 (UTC).[reply]
As said before, it appears of no practical relevance if an image in at cc-0 or pd-old, but in theory there is a big difference, as you can release anything under cc-0 only if you are the copyright holder in the first place, which up to my understanding they are not. I personally strong oppose to accept this, different opinions welcome. --Krd 08:38, 23 April 2021 (UTC)[reply]
Hi @Krd: , I forwarded your comments to Yale and this is the formal response of their intellectual property manager.
"After examining the range of copyright tags appropriate to reproductions of cultural objects in Wikimedia Commons and the Commons’ own rights documentation, contributions to Wikimedia Commons from other museums through recent open content projects, and YCBA’s open access approach, the YCBA has determined that the application of CC0 to its Wikimedia Commons contribution is the most appropriate choice.
One resource supporting our determination is Creative Commons’ current recommendation that institutions providing access to public domain works of cultural heritage use CC0 (“For Faithful Reproductions of Public Domain Works use CC0,” 2015). The Creative Commons Public Domain Mark (associated with Wikimedia Commons’ PD-Art template and variants) may be used by *anyone* to identify material believed to be in the public domain, and can be applied by third parties with no relationship to the creation of the digital file/reproduction or to the underlying object pictured. CC0, on the other hand, is appropriate for application in cases where the creator of the digital file does not wish to enforce any rights of its own it may have in the reproduction. Further, Wikimedia Commons’ own documentation on when to use the PD-Art template recommends against the use of the PD-art template in these cases, as well as in the case of reproductions of 3D works, which are also part of the YCBA Wikimedia Commons contribution.
Note that Yale Center for British Art is contributing image files to Wikimedia Commons with the application of CC0 ONLY in cases only where BOTH 1. YCBA is the owner/creator of the reproduction AND 2. the copyright status of the underlying art object pictured in the image file is believed to be in the public domain.
It may also be useful to note that the Metropolitan Museum of Art’s recent, extensive Wikimedia Commons contribution utilized CC0, for example File:Whalers_MET_DP169567.jpg." Rob Lowe - Smartify (talk) 15:03, 24 April 2021 (UTC)[reply]
That sounds quite reasonable to me. I think this should be approved. Thank you! --Krd 10:05, 30 April 2021 (UTC)[reply]

Approved. --Krd 16:54, 13 May 2021 (UTC)[reply]