File:CLUSTER COMPUTING FOR AUTOMATED NETWORK ANALYSIS AT SCALE (IA clustercomputing1094559618).pdf

Size of this JPG preview of this PDF file: 463 × 599 pixels. Other resolutions: 185 × 240 pixels | 371 × 480 pixels | 593 × 768 pixels | 1,275 × 1,650 pixels.

Original file ‎(1,275 × 1,650 pixels, file size: 2.16 MB, MIME type: application/pdf, 106 pages)

Captions

English

Add a one-line explanation of what this file represents

Summary[edit]

CLUSTER COMPUTING FOR AUTOMATED NETWORK ANALYSIS AT SCALE ( )
Author	Brida, Benjamin J.
Title	CLUSTER COMPUTING FOR AUTOMATED NETWORK ANALYSIS AT SCALE
Publisher	Monterey, CA; Naval Postgraduate School
Description	Conventional single node packet analyzers are unable to monitor network traffic at scale. In this thesis, elements of the Apache Hadoop ecosystem, including HBase, Spark, and MapReduce, are employed to conduct network traffic analysis on a large collection of network traffic. Limited analysis is conducted directly on packet capture next generation (pcapng) files on the Hadoop Distributed File System (HDFS) using MapReduce. Next, to allow for repeated analysis on the same dataset without reading all source files in their entirety for every calculation, pcapng files are parsed and relevant meta-data is bulk loaded into HBase, a Not Only Structured Query Language (NoSQL) database employing the HDFS for parallelization. This NoSQL database is then accessed via Apache Spark where pertinent data is loaded into DataFrames and additional analysis on the network traffic takes place. This research demonstrates the viability of custom, modular, automated analytics, employing open-source software to enable parallelization, to conduct traffic analysis at scale. Subjects: big data; MapReduce; Hadoop; packet capture; traffic analysis; network analysis; Spark; HBase
Language	English
Publication date	June 2018
Current location	IA Collections: navalpostgraduateschoollibrary; fedlink
Accession number	clustercomputing1094559618
Source	Internet Archive identifier: clustercomputing1094559618 https://archive.org/download/clustercomputing1094559618/clustercomputing1094559618.pdf
Permission (Reusing this file)	This publication is a work of the U.S. Government as defined in Title 17, United States Code, Section 101. Copyright protection is not available for this work in the United States.

Licensing[edit]

	This work is in the public domain in the United States because it is a work prepared by an officer or employee of the United States Government as part of that person’s official duties under the terms of Title 17, Chapter 1, Section 105 of the US Code. Note: This only applies to original works of the Federal Government and not to the work of any individual U.S. state, territory, commonwealth, county, municipality, or any other subdivision. This template also does not apply to postage stamp designs published by the United States Postal Service since 1978. (See § 313.6(C)(1) of Compendium of U.S. Copyright Office Practices). It also does not apply to certain US coins; see The US Mint Terms of Use.
This file has been identified as being free of known restrictions under copyright law, including all related and neighboring rights.

PDMCreative Commons Public Domain Mark 1.0falsefalse

File history

Click on a date/time to view the file as it appeared at that time.

	Date/Time	Thumbnail	Dimensions	User	Comment
current	18:55, 15 July 2020		1,275 × 1,650, 106 pages (2.16 MB)	Fæ (talk \| contribs)	FEDLINK - United States Federal Collection clustercomputing1094559618 (User talk:Fæ/IA books#Fork8) (batch 1993-2020 #11510)

You cannot overwrite this file.

File usage on Commons

The following page uses this file:

File:CLUSTER COMPUTING FOR AUTOMATED NETWORK ANALYSIS AT SCALE (IA clustercomputing1094559618).pdf

Metadata

This file contains additional information such as Exif metadata which may have been added by the digital camera, scanner, or software program used to create or digitize it. If the file has been modified from its original state, some details such as the timestamp may not fully reflect those of the original file. The timestamp is only as accurate as the clock in the camera, and it may be completely wrong.

Short title	CLUSTER COMPUTING FOR AUTOMATED NETWORK ANALYSIS AT SCALE
Image title
Author	Brida, Benjamin J.
Keywords	NPS Thesis Template
Software used	Brida, Benjamin J.
Conversion program	Adobe PDF Library 11.0
Encrypted	no
Page size	612 x 792 pts (letter)
Version of PDF format	1.4

File:CLUSTER COMPUTING FOR AUTOMATED NETWORK ANALYSIS AT SCALE (IA clustercomputing1094559618).pdf

Captions

Captions

Summary[edit]

Licensing[edit]

File history

File usage on Commons

Metadata

Structured data

Items portrayed in this file

depicts

media type

application/pdf

checksum

e8c196e7f227d283112e4e28a9c3e725627473ee

data size

2,268,775 byte

height

1,650 pixel

width

1,275 pixel

number of pages

106

Navigation menu

File:CLUSTER COMPUTING FOR AUTOMATED NETWORK ANALYSIS AT SCALE (IA clustercomputing1094559618).pdf

Captions

Captions

Summary[edit]

Licensing[edit]

File history

File usage on Commons

Metadata

Navigation menu

Search