Entropy-based file type identification and partitioning

Saved in:
Bibliographic Details
Title: Entropy-based file type identification and partitioning
Authors: Paul, Calvin B.
Contributors: Cristi, Roberto, Fargues, Monique, Electrical and Computer Engineering (ECE)
Publisher Information: Naval Postgraduate School
Publication Year: 2017
Collection: Naval Postgraduate School: Calhoun
Subject Terms: file type identification, file partitioning, entropy, feature vector, detrended fluctuation analysis, Haar continuous wavelet, statistical measure
Description: The need for file identification and partitioning in the digital forensic, reverse engineering, and security analyst fields cannot be overstated. In this research, we investigate the use of the Shannon entropy profile derived from the file expressed in byte format to characterize specific file types and identify file segments based on entropy-level changes. The process consists of two stages. In the first stage, a binary representation of the file is partitioned into chunks of fixed-length data bytes and processed to extract the entropy profile. In the second stage, the detrended fluctuation analysis (DFA) method is applied to determine the level of structure in the entropy profile. The Haar continuous wavelet transform (CWT) is then used to partition the files identified as highly structured into areas of distinct changes in entropy level. Experimental results show that the proposed approach is effective in identifying file types and partitioning in segments of different entropy levels. ; Approved for public release; distribution is unlimited. ; Civilian, Armaments Corporation of South Africa (Armscor) ; http://archive.org/details/entropybasedfile1094555513
Document Type: thesis
File Description: application/pdf
Language: unknown
Relation: https://hdl.handle.net/10945/55513
Availability: https://hdl.handle.net/10945/55513
Rights: Copyright is reserved by the copyright owner.
Accession Number: edsbas.907D9B0E
Database: BASE
Be the first to leave a comment!
You must be logged in first