Commons:Requests and votes/User:Dvortybot

From Wikimedia Commons, the free media repository
Jump to navigation Jump to search

User:Dvortybot

Comments

Hi, I've thrown together a script thinggy to take a zip file from User:Dvortygirl and convert her .wav files to .ogg, then upload them here, with her "standard" text, for her. She plans to send me batches of about 100 files at a time. Special:Contributions/Dvortybot shows the test run. Let me know what throttling I should add between the python calls, and I'll happily add that back in. TIA! --Connel MacKenzie 07:47, 5 February 2007 (UTC)[reply]

I believe all the relevant information was already in my descriptions, but here is what I think the Information template would look like, if it would make everybody happier.
Pronunciation of the term in US English, recorded by [[User:Dvortygirl|Dvortygirl]], 
{{Information
|Description= Audio pronunciation of the term in United States English.  [[Category:English pronunciation|word]]
|Source= Self
|Date= {{subst:CURRENTDAY}} {{subst:CURRENTMONTHNAME}} {{subst:CURRENTYEAR}}
|Author= [[User:Dvortygirl|Dvortygirl]]
|Permission= {{self2|GFDL|cc-by-sa-2.5,2.0,1.0}}
|other_versions= (optional variable, can be left out)
}}
Yes, I had previously been using Audacity, and may yet do so for one-offs, such as requests or words of the day. I'm not sure that makes a whole lot of difference, but if for some reason we need to say "Source = Self (using Shtooka audio)" or some such, please suggest that. With 4000-plus words already done and probably 70,000 in the database, ready to go, I am eager to automate what parts of the process I can.
If anybody would like to divide audio pronunciations up by regions, I'm certainly open to having a different category. As far as I'm concerned, that's just template text. U.S. English varies, too, so we should think carefully about what to call that category, if we make one. That said, I think I'm generally in the vicinity of "GenAm", or General American. (As a rule, I don't do pronunciations for words that are clearly outside the American dialect. You will not, for instance, find U.S. audio for honour or bagsie.) All my files are already labeled En-us-word.ogg (for English, United States), and I have encouraged other interested Wiktionarians, at least, to follow suit with this Language-region-word naming convention. Our audio template in Wiktionary already has a field for region, too.
As a technical note, can templates nest, as I have done with the licensing, or should the GFDL/cc-by template go below with the licensing reading "see below", instead? Dvortygirl 05:23, 7 February 2007 (UTC)[reply]
I've seen it done both ways but I think we prefer if the template is below in the licensing section, it makes it marginally easier for bots to find. But it's not the same as if you embedded the license INTO a template rather than passed it in as a PARAMETER as is done here... the embedding into templates is something we are not so keen on (see recent VP discussions) That template looks good, what I would say would be an improvement is on source= say a bit more than "self"... some pointers to how it was done might be helpful. Since you are originating the content, you don't have to give provenance like you do for something you got somewhere else (like a PD-old pic). Looking good though. ++Lar: t/c 16:19, 7 February 2007 (UTC)[reply]
Lar, I'm sorry, but I'm quite a newbie when it comes to licensing format preferences on commons. Can you please repeat the example here, in the format you want, for me? --Connel MacKenzie 22:39, 7 February 2007 (UTC)[reply]
Well, I'm biased, but I quite like how this one came out: Image:M29 Weasel Arctic USArmyTransMuseum.jpg ... go into edit mode and you'll see the information box filled out. Of course that's an image not a sound. Here's another one: Image:Star in the east solfege.ogg by Makemi which shows the license at bottom as well. In both cases the permissions section describes verbally but the license itself is in ==Licensing== ... hope that helps rather than confuses. I'd err on the side of too much information. ++Lar: t/c 23:43, 7 February 2007 (UTC)[reply]
OK, all better now? --Connel MacKenzie 07:09, 11 February 2007 (UTC)[reply]

Comments