A deep learning approach for detecting malicious JavaScript code.

Saved in:
Bibliographic Details
Title: A deep learning approach for detecting malicious JavaScript code.
Authors: Wang, Yao, Cai, Wan-dong, Wei, Peng-cheng
Source: Security & Communication Networks; 7/25/2016, Vol. 9 Issue 11, p1520-1534, 15p
Subject Terms: JAVASCRIPT programming language, MACHINE learning, DATA transmission systems, LOGISTIC regression analysis
Abstract: Malicious JavaScript code in webpages on the Internet is an emergent security issue because of its universality and potentially severe impact. Because of its obfuscation and complexities, detecting it has a considerable cost. Over the last few years, several machine learning-based detection approaches have been proposed; most of them use shallow discriminating models with features that are constructed with artificial rules. However, with the advent of the big data era for information transmission, these existing methods already cannot satisfy actual needs. In this paper, we present a new deep learning framework for detection of malicious JavaScript code, from which we obtained the highest detection accuracy compared with the control group. The architecture is composed of a sparse random projection, deep learning model, and logistic regression. Stacked denoising auto-encoders were used to extract high-level features from JavaScript code; logistic regression as a classifier was used to distinguish between malicious and benign JavaScript code. Experimental results indicated that our architecture, with over 27 000 labeled samples, can achieve an accuracy of up to 95%, with a false positive rate less than 4.2% in the best case. Copyright © 2016 John Wiley & Sons, Ltd. [ABSTRACT FROM AUTHOR]
Copyright of Security & Communication Networks is the property of Wiley-Blackwell and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Complementary Index
FullText Links:
  – Type: pdflink
Text:
  Availability: 0
Header DbId: edb
DbLabel: Complementary Index
An: 116123715
RelevancyScore: 853
AccessLevel: 6
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 853.196594238281
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: A deep learning approach for detecting malicious JavaScript code.
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Wang%2C+Yao%22">Wang, Yao</searchLink><br /><searchLink fieldCode="AR" term="%22Cai%2C+Wan-dong%22">Cai, Wan-dong</searchLink><br /><searchLink fieldCode="AR" term="%22Wei%2C+Peng-cheng%22">Wei, Peng-cheng</searchLink>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: Security & Communication Networks; 7/25/2016, Vol. 9 Issue 11, p1520-1534, 15p
– Name: Subject
  Label: Subject Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22JAVASCRIPT+programming+language%22">JAVASCRIPT programming language</searchLink><br /><searchLink fieldCode="DE" term="%22MACHINE+learning%22">MACHINE learning</searchLink><br /><searchLink fieldCode="DE" term="%22DATA+transmission+systems%22">DATA transmission systems</searchLink><br /><searchLink fieldCode="DE" term="%22LOGISTIC+regression+analysis%22">LOGISTIC regression analysis</searchLink>
– Name: Abstract
  Label: Abstract
  Group: Ab
  Data: Malicious JavaScript code in webpages on the Internet is an emergent security issue because of its universality and potentially severe impact. Because of its obfuscation and complexities, detecting it has a considerable cost. Over the last few years, several machine learning-based detection approaches have been proposed; most of them use shallow discriminating models with features that are constructed with artificial rules. However, with the advent of the big data era for information transmission, these existing methods already cannot satisfy actual needs. In this paper, we present a new deep learning framework for detection of malicious JavaScript code, from which we obtained the highest detection accuracy compared with the control group. The architecture is composed of a sparse random projection, deep learning model, and logistic regression. Stacked denoising auto-encoders were used to extract high-level features from JavaScript code; logistic regression as a classifier was used to distinguish between malicious and benign JavaScript code. Experimental results indicated that our architecture, with over 27 000 labeled samples, can achieve an accuracy of up to 95%, with a false positive rate less than 4.2% in the best case. Copyright © 2016 John Wiley & Sons, Ltd. [ABSTRACT FROM AUTHOR]
– Name: Abstract
  Label:
  Group: Ab
  Data: <i>Copyright of Security & Communication Networks is the property of Wiley-Blackwell and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.)
PLink https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edb&AN=116123715
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.1002/sec.1441
    Languages:
      – Code: eng
        Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 15
        StartPage: 1520
    Subjects:
      – SubjectFull: JAVASCRIPT programming language
        Type: general
      – SubjectFull: MACHINE learning
        Type: general
      – SubjectFull: DATA transmission systems
        Type: general
      – SubjectFull: LOGISTIC regression analysis
        Type: general
    Titles:
      – TitleFull: A deep learning approach for detecting malicious JavaScript code.
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Wang, Yao
      – PersonEntity:
          Name:
            NameFull: Cai, Wan-dong
      – PersonEntity:
          Name:
            NameFull: Wei, Peng-cheng
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 25
              M: 07
              Text: 7/25/2016
              Type: published
              Y: 2016
          Identifiers:
            – Type: issn-print
              Value: 19390114
          Numbering:
            – Type: volume
              Value: 9
            – Type: issue
              Value: 11
          Titles:
            – TitleFull: Security & Communication Networks
              Type: main
ResultId 1