English subtitles for clip: File:OpenRefine Commons Extension - start project from categories.webm
1
00:00:07,355 --> 00:00:11,935
I want to start a project in OpenRefine based on a Wikimedia Commons category

2
00:00:13,187 --> 00:00:15,997
I have installed the Wikimedia Commons extension in OpenRefine

3
00:00:15,997 --> 00:00:20,569
So I am getting a Wikimedia Commons option here in the start screen

4
00:00:21,329 --> 00:00:27,000
So I click that, and then I get an option to type any category on Wikimedia Commons

5
00:00:28,000 --> 00:00:32,759
In this case I choose one for a heritage organization, an archive in the Netherlands

6
00:00:33,450 --> 00:00:41,214
And with each category you can also choose how deep you want to go in the category tree, so...

7
00:00:41,276 --> 00:00:46,153
... if you want to go two levels deep, or more or less, you can change that here

8
00:00:46,633 --> 00:00:49,438
And, if you want, you can also add other categories

9
00:00:49,438 --> 00:00:52,908
So you can do multiple categories at the same time

10
00:00:52,908 --> 00:00:57,200
OpenRefine will retrieve all the files from all these categories

11
00:00:57,200 --> 00:00:59,200
So it is a combination.

12
00:00:59,530 --> 00:01:02,832
In this case I'm only interested in this one with no depth

13
00:01:02,832 --> 00:01:07,852
So just the category on the level that is indicated here

14
00:01:08,382 --> 00:01:11,758
I click 'Next', and then OpenRefine will give me a preview

15
00:01:12,854 --> 00:01:18,653
Then you see at the bottom that OpenRefine also allows me to give some options

16
00:01:18,653 --> 00:01:22,065
or to specify some options

17
00:01:22,065 --> 00:01:26,176
One of them is to include a column with categories already

18
00:01:26,176 --> 00:01:28,932
Categories can have interesting information

19
00:01:28,932 --> 00:01:31,140
So in this case I will select that.

20
00:01:31,140 --> 00:01:34,624
And if I would be interested in that, I can also include a column

21
00:01:34,624 --> 00:01:39,265
With the M-ids, or the unique identifiers of the files

22
00:01:39,265 --> 00:01:42,404
In this case I only want the category column.

23
00:01:42,404 --> 00:01:45,437
I give my project a meaningful name

24
00:01:45,437 --> 00:01:46,914
And I click 'Create'.

25
00:01:46,914 --> 00:01:49,935
Then OpenRefine will load the files for me.

26
00:01:50,819 --> 00:01:52,819
I wait a few seconds...

27
00:01:55,198 --> 00:01:56,600
And the project is loaded.

28
00:01:56,600 --> 00:01:59,677
As you can see, there is a column with categories

29
00:01:59,677 --> 00:02:03,090
And - because I have the Wikimedia Commons extension installed -

30
00:02:03,090 --> 00:02:06,174
I also get to see thumbnails of the files.
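
Note: the depth setting and the combination of several categories described in the subtitles can be illustrated with a small sketch. This is not the extension's own code, and how OpenRefine retrieves the files internally is not stated in the clip. The sketch assumes the public MediaWiki Action API query `list=categorymembers` as the data source. The names `category_query_params`, `collect_files`, and `fetch_members` are hypothetical. `fetch_members` is any function that returns the members of one category as `(title, page_id, kind)` tuples, so the traversal can be tried without network access. The M-id is formed as the letter M followed by the file's page ID.

```python
from collections import deque
from typing import Callable, Dict, Iterable, List, Tuple

# Hypothetical member record: (title, page_id, kind), kind is "file" or "subcat".
Member = Tuple[str, int, str]


def category_query_params(category: str) -> Dict[str, str]:
    """Build parameters for a MediaWiki Action API categorymembers query.

    Assumption: results beyond the first batch would need the API's
    continuation mechanism, which this sketch leaves out.
    """
    return {
        "action": "query",
        "list": "categorymembers",
        "cmtitle": category,
        "cmtype": "file|subcat",
        "cmprop": "ids|title|type",
        "cmlimit": "500",
        "format": "json",
    }


def collect_files(
    selections: List[Tuple[str, int]],
    fetch_members: Callable[[str], Iterable[Member]],
) -> Dict[str, str]:
    """Return {file title: M-id} for all selected categories combined.

    Each selection is (category, depth). Depth 0 means only the files
    directly in that category; depth 1 also includes its subcategories.
    """
    files: Dict[str, str] = {}
    for root, max_depth in selections:
        queue = deque([(root, 0)])
        seen = {root}
        while queue:
            category, level = queue.popleft()
            for title, page_id, kind in fetch_members(category):
                if kind == "file":
                    # A file found in several categories is listed once.
                    files.setdefault(title, f"M{page_id}")
                elif kind == "subcat" and level < max_depth and title not in seen:
                    seen.add(title)
                    queue.append((title, level + 1))
    return files


# Usage with a made-up category tree standing in for the API.
if __name__ == "__main__":
    tree = {
        "Category:Archive": [("File:Map.jpg", 101, "file"),
                             ("Category:Archive photos", 900, "subcat")],
        "Category:Archive photos": [("File:Photo.jpg", 102, "file")],
    }
    print(collect_files([("Category:Archive", 0)], lambda c: tree.get(c, [])))
```

With depth 0 only the files directly in the chosen category are returned, which matches the "no depth" choice made in the clip.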