Data Type Classification: Hierarchical Class-to-Type Modeling

Uloženo v:
Podrobná bibliografie
Název: Data Type Classification: Hierarchical Class-to-Type Modeling
Autoři: Beebe, Nicole, Liu, Lishu, Sun, Minghe
Přispěvatelé: The University of Texas at San Antonio (UTSA), RetailMeNot, Gilbert Peterson, Sujeet Shenoi, TC 11, WG 11.9
Zdroj: IFIP Advances in Information and Communication Technology ; 12th IFIP International Conference on Digital Forensics (DF) ; https://inria.hal.science/hal-01758687 ; 12th IFIP International Conference on Digital Forensics (DF), Jan 2016, New Delhi, India. pp.325-343, ⟨10.1007/978-3-319-46279-0_17⟩
Informace o vydavateli: CCSD
Springer International Publishing
Rok vydání: 2016
Témata: Statistical classification, Data types, File types, Hierarchical model, [INFO]Computer Science [cs]
Geografické téma: New Delhi, India
Popis: Part 7: FORENSIC TECHNIQUES ; International audience ; Data and file type classification research conducted over the past ten to fifteen years has been dominated by competing experiments that only vary the number of classes, types of classes, machine learning technique and input vector. There has been surprisingly little innovation on fundamental approaches to data and file type classification. This chapter focuses on the empirical testing of a hypothesized, two-level hierarchical classification model and the empirical derivation and testing of several alternative classification models. Comparative evaluations are conducted on ten classification models to identify a final winning, two-level classification model consisting of five classes and 52 lower-level data and file types. Experimental results demonstrate that the approach leads to very good class-level classification performance, improved classification performance for data and file types without high entropy (e.g., compressed and encrypted data) and reasonably-equivalent classification performance for high-entropy data and file types.
Druh dokumentu: conference object
Jazyk: English
DOI: 10.1007/978-3-319-46279-0_17
Dostupnost: https://inria.hal.science/hal-01758687
https://inria.hal.science/hal-01758687v1/document
https://inria.hal.science/hal-01758687v1/file/431606_1_En_17_Chapter.pdf
https://doi.org/10.1007/978-3-319-46279-0_17
Rights: http://creativecommons.org/licenses/by/ ; info:eu-repo/semantics/OpenAccess
Přístupové číslo: edsbas.5E234991
Databáze: BASE
Popis
Abstrakt:Part 7: FORENSIC TECHNIQUES ; International audience ; Data and file type classification research conducted over the past ten to fifteen years has been dominated by competing experiments that only vary the number of classes, types of classes, machine learning technique and input vector. There has been surprisingly little innovation on fundamental approaches to data and file type classification. This chapter focuses on the empirical testing of a hypothesized, two-level hierarchical classification model and the empirical derivation and testing of several alternative classification models. Comparative evaluations are conducted on ten classification models to identify a final winning, two-level classification model consisting of five classes and 52 lower-level data and file types. Experimental results demonstrate that the approach leads to very good class-level classification performance, improved classification performance for data and file types without high entropy (e.g., compressed and encrypted data) and reasonably-equivalent classification performance for high-entropy data and file types.
DOI:10.1007/978-3-319-46279-0_17