Commons:Batch uploading/US Army

From Wikimedia Commons, the free media repository
Jump to navigation Jump to search

US Army[edit]

The Fema request got me started. The US Army has a nice set of images at http://search.ahp.us.army.mil/search/images/?per=10&page=1&search= . Judging from the latest id it's around 50.000 images. The bot should probably consist of two parts

  1. Loop over the search pages and find the location of all images like http://www.army.mil/-images/2009/10/14/53021/ . All pages seem to be in the form http://www.army.mil/-images/YYYY/MM/DD/photo_id/
  2. Work on all these images

Shouldn't be to hard with some regular expressions for the first part and screen scraping with beautifulsoup for the second part. Multichill (talk) 22:07, 14 October 2009 (UTC)[reply]


I wrote a bot for this (source). It basicly works the same as the other USgov bots. The main difference is that I'm unable to extract category information. The title is based on the title field, and as a fallback, the description. The first images can be found in Category:Images from the US Army needing categories as of 23 October 2009. Multichill (talk) 14:01, 23 October 2009 (UTC)[reply]


No response so I slowly fired up the upload. Multichill (talk) 11:31, 25 October 2009 (UTC)[reply]

Opinions[edit]

Assigned to Progress Bot name
Multichill On hold (Commons is short on disk space). BotMultichillT