File:Automated metadata extraction (IA automatedmetadat109454057).pdf

From Wikimedia Commons, the free media repository
Jump to navigation Jump to search
Go to page
next page →
next page →
next page →

Original file(1,275 × 1,650 pixels, file size: 315 KB, MIME type: application/pdf, 84 pages)

Captions

Captions

Add a one-line explanation of what this file represents

Summary[edit]

Automated metadata extraction   (Wikidata search (Cirrus search) Wikidata query (SPARQL)  Create new Wikidata item based on this file)
Author
Migletz, James J.
Title
Automated metadata extraction
Publisher
Monterey, California. Naval Postgraduate School
Description

Metadata is data that describes data. There are many computer forensic uses of metadata and being able to extract metadata automatically provides positive forensic implications. This thesis presents a new technique for batch processing disk images and automatically extracting metadata from files and file contents. The technique is embodied in a program called fiwalk that has a plug-in architecture allowing new metadata extractors to be readily incorporated. Output from fiwalk can be provided in multiple formats such as ARFF and text. The plug-ins created for this thesis include one created by Simson Garfinkel for extracting metadata from .jpeg files, two for Microsoft Office documents (one for prior to Office 2007 release and one for Office 2007 release), and a default plug-in for extracting metadata from .gif, .pdf, and .mp3 files. To better understand the metadata available in common file formats such as .doc, .docx, .odt, .pdf, .mp3, .mp4, .jpeg, tiff, and .gif, an examination of these formats is provided.


Subjects: Metadata; Data mining
Language English
Publication date June 2008
Current location
IA Collections: navalpostgraduateschoollibrary; fedlink
Accession number
automatedmetadat109454057
Source
Internet Archive identifier: automatedmetadat109454057
https://archive.org/download/automatedmetadat109454057/automatedmetadat109454057.pdf
Permission
(Reusing this file)
Approved for public release, distribution unlimited

Licensing[edit]

Public domain
This work is in the public domain in the United States because it is a work prepared by an officer or employee of the United States Government as part of that person’s official duties under the terms of Title 17, Chapter 1, Section 105 of the US Code. Note: This only applies to original works of the Federal Government and not to the work of any individual U.S. state, territory, commonwealth, county, municipality, or any other subdivision. This template also does not apply to postage stamp designs published by the United States Postal Service since 1978. (See § 313.6(C)(1) of Compendium of U.S. Copyright Office Practices). It also does not apply to certain US coins; see The US Mint Terms of Use.

File history

Click on a date/time to view the file as it appeared at that time.

Date/TimeThumbnailDimensionsUserComment
current22:08, 14 July 2020Thumbnail for version as of 22:08, 14 July 20201,275 × 1,650, 84 pages (315 KB) (talk | contribs)FEDLINK - United States Federal Collection automatedmetadat109454057 (User talk:Fæ/IA books#Fork8) (batch 1993-2020 #8754)

Metadata