Commons:Bots/Requests/PronunBot
Operator: Sascha (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)
Bot's tasks for which permission is being sought: Upload about 10,000 audio files in FLAC format to Wikimedia Commons. The files are spoken pronunciations in Sursilvan Romansh (IETF BCP 47 language code: rm-sursilv). The audio was recorded in March 2007 by Lia Rumantscha, a Swiss non-profit organization that promotes the Romansh language, as language training material; the speaker was Erwin Ardüser. In November 2018, Lia Rumantscha released these pronunciation files under the Creative Commons Zero license. See https://github.com/brawer/PronunBot/blob/master/README.md for background and source code.
Automatic or manually assisted: automatic upload, but each uploaded audio file has been manually/aurally vetted in a separate quality assurance process.
Edit type (e.g. Continuous, daily, one time run): one time
Maximum edit rate (e.g. edits per minute): 6 edits per minute
Bot flag requested: (Y/N): Yes
Programming language(s): Python, pywikibot
Source code: https://github.com/brawer/PronunBot/blob/master/upload_to_commons.py
Sample edits:
- https://commons.wikimedia.org/w/index.php?title=File:Pronunciation_rm-sursilv_jeu_carezel_tei.flac&oldid=329778846
- https://commons.wikimedia.org/w/index.php?title=File:Pronunciation_rm-sursilv_calcogn.flac&oldid=329778831
- https://commons.wikimedia.org/w/index.php?title=File:Pronunciation_rm-sursilv_Gr%C3%B6nlanda.flac&oldid=329778632
Sascha (talk) 15:24, 29 November 2018 (UTC)
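For illustration, the requested maximum of 6 edits per minute corresponds to one upload every 10 seconds. The following is a minimal sketch of such a rate limit, not the code in upload_to_commons.py; pywikibot ships its own throttling, and the names `seconds_between_edits` and `Throttle` are hypothetical.

```python
import time


def seconds_between_edits(edits_per_minute: float) -> float:
    """Convert a maximum edit rate into the minimum gap between edits."""
    if edits_per_minute <= 0:
        raise ValueError("edits_per_minute must be positive")
    return 60.0 / edits_per_minute


class Throttle:
    """Hypothetical helper that blocks until the next edit is allowed."""

    def __init__(self, edits_per_minute, clock=time.monotonic, sleep=time.sleep):
        self.interval = seconds_between_edits(edits_per_minute)
        self._clock = clock
        self._sleep = sleep
        self._next_allowed = None  # no edit has been made yet

    def wait(self):
        """Sleep if the previous edit was less than one interval ago."""
        now = self._clock()
        if self._next_allowed is not None and now < self._next_allowed:
            self._sleep(self._next_allowed - now)
            now = self._clock()
        self._next_allowed = now + self.interval


if __name__ == "__main__":
    throttle = Throttle(edits_per_minute=6)
    print(throttle.interval)  # gap in seconds between two uploads
```

A caller would invoke `throttle.wait()` immediately before each upload; the clock and sleep functions are injectable so the behaviour can be checked without actually waiting.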
Discussion
- Please follow language code-text format for file names. See Category:French pronunciation as example. --EugeneZelenko (talk) 15:03, 30 November 2018 (UTC)
- Done. I’ve also improved the change summaries for the bot uploads. Sample changes: File:Rm-sursilv-gnanc_diesch_minutas.flac and File:Rm-sursilv-calcogn.flac. — Sascha (talk) 09:22, 1 December 2018 (UTC)
- I'd prefer to have the direct link to the file source in the source field, and some evidence that the file is CC-0 at the source. --Krd 09:56, 1 December 2018 (UTC)
- The files haven’t been downloaded from the internet. Rather, the owner (Lia Rumantscha, a Swiss non-profit) gave me the sound files in person and asked me to upload the pronunciations to Wikimedia Commons so that they would become available online. To address your concern, I’ve now asked the source to sign a letter in which they declare their ownership of the files and put in writing that they’re releasing the recordings under the CC-0 public domain dedication. Once I’ve received the letter, I’ll put a scan (as a PDF file) on Wikimedia Commons, and I’ll change the bot so that the description page of every uploaded pronunciation file links to that letter. I’ll update this discussion thread once that is done. -- Sascha (talk) 13:44, 3 December 2018 (UTC)
- If you get a confirmation letter, this should be done via OTRS. It's also ok for me as it currently is, after having had a more detailed look. --Krd 14:08, 3 December 2018 (UTC)
- The files contain the following metadata. I’m mentioning this only because the MediaWiki software doesn’t seem to display metadata for FLAC files yet. -- Sascha (talk) 09:48, 2 December 2018 (UTC)
$ metaflac --list split/Rm-sursilv-calcogn.flac
METADATA block #1
  type: 4 (VORBIS_COMMENT)
  comments: 8
    comment[0]: TITLE=calcogn
    comment[1]: COPYRIGHT=2007 Lia Rumantscha
    comment[2]: PERFORMER=Erwin Ardüser
    comment[3]: LANGUAGE=rm-sursilv
    comment[4]: ORGANIZATION=Lia Rumantscha / Conradin Klaiss, 7001 Chur, Switzerland
    comment[5]: DATE=2007-03-09
    comment[6]: LICENSE=Creative Commons Zero v1.0 Universal
    comment[7]: encoder=Lavf57.83.100
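Since MediaWiki does not show this metadata, a reviewer might want to read it from the `metaflac --list` output directly. The sketch below assumes the listing has already been captured as text; it is not part of PronunBot, and `parse_vorbis_comments` is a hypothetical helper name. It parses only the `comment[N]: NAME=value` lines shown above and upper-cases the field names, since Vorbis comment field names are case-insensitive.

```python
import re

# Matches lines such as "comment[3]: LANGUAGE=rm-sursilv".
_COMMENT = re.compile(r"comment\[(\d+)\]:\s*([^=]+)=(.*)")


def parse_vorbis_comments(listing: str) -> dict:
    """Return the Vorbis comments found in a metaflac listing as a dict."""
    tags = {}
    for line in listing.splitlines():
        match = _COMMENT.match(line.strip())
        if match:
            # Field names are case-insensitive, so normalise to upper case.
            tags[match.group(2).upper()] = match.group(3)
    return tags


# The listing quoted in the discussion above.
SAMPLE = """\
METADATA block #1
  type: 4 (VORBIS_COMMENT)
  comments: 8
    comment[0]: TITLE=calcogn
    comment[1]: COPYRIGHT=2007 Lia Rumantscha
    comment[2]: PERFORMER=Erwin Ardüser
    comment[3]: LANGUAGE=rm-sursilv
    comment[4]: ORGANIZATION=Lia Rumantscha / Conradin Klaiss, 7001 Chur, Switzerland
    comment[5]: DATE=2007-03-09
    comment[6]: LICENSE=Creative Commons Zero v1.0 Universal
    comment[7]: encoder=Lavf57.83.100
"""

TAGS = parse_vorbis_comments(SAMPLE)

if __name__ == "__main__":
    for name, value in TAGS.items():
        print(f"{name}: {value}")
```

With the tags in a dictionary, a check such as `TAGS["LICENSE"]` or `TAGS["LANGUAGE"]` can be run over many files before upload.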
Approved. --Krd 07:58, 11 December 2018 (UTC)