Commons:Bots/Requests/PronunBot

From Wikimedia Commons, the free media repository
Jump to navigation Jump to search

PronunBot (talk · contribs)

Operator: Sascha (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)

Bot's tasks for which permission is being sought: Upload about 10000 audio files in FLAC format to Wikimedia commons. The files are spoken pronunciations in Sursilvan Romansh (IETF BCP47 language code: rm-sursilv). The audio has been recorded in March 2007 by Lia Rumantscha, a Swiss non-profit organization for promoting the Romansh language, as language training material; the speaker was Erwin Ardüser. In November 2018, Lia Rumantscha has released these pronunciation files under the Creative Commons Zero license. See https://github.com/brawer/PronunBot/blob/master/README.md for background and source code.

Automatic or manually assisted: automatic upload, but each uploaded audio file has been manually/aurally vetted in a separate quality assurance process.

Edit type (e.g. Continuous, daily, one time run): one time

Maximum edit rate (e.g. edits per minute): 6 edits per minute

Bot flag requested: (Y/N): Yes

Programming language(s): Python, pywikbot

Source code: https://github.com/brawer/PronunBot/blob/master/upload_to_commons.py

Sample edits:

Sascha (talk) 15:24, 29 November 2018 (UTC)[reply]

Discussion

  • I'd prefer to have the direct link to the file source in the source field, and some evidence that the file is CC-0 at the source. --Krd 09:56, 1 December 2018 (UTC)[reply]
    • The files haven’t been downloaded from the internet. Rather, the owner (Lia Rumantscha, a Swiss non-profit) has given me the sound files in person, asking to upload the pronunciations to Wikimedia Commons so they would become available online. To address your concern, I’ve now asked the source to sign a letter where they declare their ownership of the files, and where they put in writing that they’re releasing the recordings under the CC-0 public domain dedication. Once I’ve received the letter, I’ll put a scan (as PDF file) on Wikimedia Commons; and I’ll change the bot so it links to that letter from the description file of every uploaded pronunciation file. I’ll update this dicussion thread once that is done. — Sascha (talk) 13:44, 3 December 2018 (UTC)[reply]
      If you get a confirmation letter, this should be done via OTRS. It's also ok for me as it currently is, after having had a more detailed look. --Krd 14:08, 3 December 2018 (UTC)[reply]
 $ metaflac --list split/Rm-sursilv-calcogn.flac 
 METADATA block #1
   type: 4 (VORBIS_COMMENT)
   comments: 8
     comment[0]: TITLE=calcogn
     comment[1]: COPYRIGHT=2007 Lia Rumantscha
     comment[2]: PERFORMER=Erwin Ardüser
     comment[3]: LANGUAGE=rm-sursilv
     comment[4]: ORGANIZATION=Lia Rumantscha / Conradin Klaiss, 7001 Chur, Switzerland
     comment[5]: DATE=2007-03-09
     comment[6]: LICENSE=Creative Commons Zero v1.0 Universal
     comment[7]: encoder=Lavf57.83.100

Approved. --Krd 07:58, 11 December 2018 (UTC)[reply]