Optimizing Small Files Operations in HDFS File Storage Mode.

Uložené v:
Podrobná bibliografia
Názov: Optimizing Small Files Operations in HDFS File Storage Mode.
Autori: Yi-Yang Chen, Rui-Jun Wang, Zhen Hong, Akhtar, Zahid, Siddique, Kamran
Zdroj: International Journal of Design, Analysis & Tools for Integrated Circuits & Systems; Nov2022, Vol. 11 Issue 1, p43-49, 7p
Predmety: RECORDS management, STORAGE, BIG data
Abstrakt: Hadoop Distributed File System (HDFS) is based on Google File System (GFS), a big data distributed file management system included in Hadoop. Nowadays, many HDFS and many other similar frameworks have the need to store small files in the system. In this aspect, HDFS affects its performance and Namenode memory management when dealing with a large number of small files. Therefore, researchers have proposed various solutions to address the shortcomings of HDFS for storing small and medium-sized files. This paper presents three HDFS schemes for merging small files and analyzes the importance of correlation and prefetching after merging small files. The efficiency of reading small files can be improved by correlated file prefetching. Finally, the small file storage architecture is obtained to stand superior to the NHAR architecture. [ABSTRACT FROM AUTHOR]
Copyright of International Journal of Design, Analysis & Tools for Integrated Circuits & Systems is the property of International Journal of Design, Analysis & Tools for Integrated Circuits & Systems and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Databáza: Complementary Index
FullText Text:
  Availability: 0
CustomLinks:
  – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=Chen%20Y
    Name: ISI
    Category: fullText
    Text: Nájsť tento článok vo Web of Science
    Icon: https://imagesrvr.epnet.com/ls/20docs.gif
    MouseOverText: Nájsť tento článok vo Web of Science
Header DbId: edb
DbLabel: Complementary Index
An: 164426619
RelevancyScore: 938
AccessLevel: 6
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 937.593139648438
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: Optimizing Small Files Operations in HDFS File Storage Mode.
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Yi-Yang+Chen%22">Yi-Yang Chen</searchLink><br /><searchLink fieldCode="AR" term="%22Rui-Jun+Wang%22">Rui-Jun Wang</searchLink><br /><searchLink fieldCode="AR" term="%22Zhen+Hong%22">Zhen Hong</searchLink><br /><searchLink fieldCode="AR" term="%22Akhtar%2C+Zahid%22">Akhtar, Zahid</searchLink><br /><searchLink fieldCode="AR" term="%22Siddique%2C+Kamran%22">Siddique, Kamran</searchLink>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: International Journal of Design, Analysis & Tools for Integrated Circuits & Systems; Nov2022, Vol. 11 Issue 1, p43-49, 7p
– Name: Subject
  Label: Subject Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22RECORDS+management%22">RECORDS management</searchLink><br /><searchLink fieldCode="DE" term="%22STORAGE%22">STORAGE</searchLink><br /><searchLink fieldCode="DE" term="%22BIG+data%22">BIG data</searchLink>
– Name: Abstract
  Label: Abstract
  Group: Ab
  Data: Hadoop Distributed File System (HDFS) is based on Google File System (GFS), a big data distributed file management system included in Hadoop. Nowadays, many HDFS and many other similar frameworks have the need to store small files in the system. In this aspect, HDFS affects its performance and Namenode memory management when dealing with a large number of small files. Therefore, researchers have proposed various solutions to address the shortcomings of HDFS for storing small and medium-sized files. This paper presents three HDFS schemes for merging small files and analyzes the importance of correlation and prefetching after merging small files. The efficiency of reading small files can be improved by correlated file prefetching. Finally, the small file storage architecture is obtained to stand superior to the NHAR architecture. [ABSTRACT FROM AUTHOR]
– Name: Abstract
  Label:
  Group: Ab
  Data: <i>Copyright of International Journal of Design, Analysis & Tools for Integrated Circuits & Systems is the property of International Journal of Design, Analysis & Tools for Integrated Circuits & Systems and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.)
PLink https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edb&AN=164426619
RecordInfo BibRecord:
  BibEntity:
    Languages:
      – Code: eng
        Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 7
        StartPage: 43
    Subjects:
      – SubjectFull: RECORDS management
        Type: general
      – SubjectFull: STORAGE
        Type: general
      – SubjectFull: BIG data
        Type: general
    Titles:
      – TitleFull: Optimizing Small Files Operations in HDFS File Storage Mode.
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Yi-Yang Chen
      – PersonEntity:
          Name:
            NameFull: Rui-Jun Wang
      – PersonEntity:
          Name:
            NameFull: Zhen Hong
      – PersonEntity:
          Name:
            NameFull: Akhtar, Zahid
      – PersonEntity:
          Name:
            NameFull: Siddique, Kamran
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 11
              Text: Nov2022
              Type: published
              Y: 2022
          Identifiers:
            – Type: issn-print
              Value: 2223523X
          Numbering:
            – Type: volume
              Value: 11
            – Type: issue
              Value: 1
          Titles:
            – TitleFull: International Journal of Design, Analysis & Tools for Integrated Circuits & Systems
              Type: main
ResultId 1