Commons:Batch uploading/Kurt Rasmussen

From Wikimedia Commons, the free media repository
Jump to navigation Jump to search

Kurt Rasmussen[edit]

  • Source to upload from: Kurt Rasmussen, bahnbilder.de
    • Did you observe an URL pattern
      • bahnbilder.de/xxxx/some-name.jpg where xxxx is a four-digit number
    • Do you know whether the site as an API
      • I think not.
    • What else can ease uploading (is the site valid XHTML, WCM they use…)?
      • Essentially the algorithm that needs to be done is:
        • for each page in the linked search results
          • for each div class="bildvorschau" in the search results
            • download the url given in the first a href=, use this URL as source in the final information template
            • in the now downloaded file, find div class="bildcontainer"
            • in it, from the p class="beschreibung", extract the description to be used in the final information template
            • from the img tag immediately following it, download the url in the src attribute
            • upload it to Commons
  • Describe the works to be uploaded in detail (audio files, images by …):
    • All images by Kurt Rasmussen.
  • Is there a template that could be used on the file description pages? Do you think a special template should be created?

I am also, parallelly, trying to coordinate a manual upload of this huge collection of extremely valuable photos. For that, see User:Darkweasel94/Rasmussen. darkweasel94 13:42, 13 December 2013 (UTC)[reply]

Opinions[edit]

Assigned to Progress Bot name Category
darkweasel94 finished coding, will upload in the next days will probably do this from my own user account Category:Files uploaded by darkweasel94 (cleanup) (also contains other stuff)