The Hadoop Distributed File System

The Hadoop Distributed File System (HDFS) is designed to store very large data sets reliably, and to stream those data sets at high bandwidth to user applications. In a large cluster, thousands of servers both host directly attached storage and execute user application tasks. By distributing storage...

Bibliographic details
Published in: 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST), pp. 1-10
Main authors: Shvachko, K; Hairong Kuang; Radia, S; Chansler, R
Format: Conference paper
Language: English
Published: IEEE, 1 May 2010
ISBN: 1424471524, 9781424471522
ISSN: 2160-195X
Abstract: The Hadoop Distributed File System (HDFS) is designed to store very large data sets reliably, and to stream those data sets at high bandwidth to user applications. In a large cluster, thousands of servers both host directly attached storage and execute user application tasks. By distributing storage and computation across many servers, the resource can grow with demand while remaining economical at every size. We describe the architecture of HDFS and report on experience using HDFS to manage 25 petabytes of enterprise data at Yahoo!.
Authors and affiliations:
1. Shvachko, K (Shv@Yahoo-Inc.com), Yahoo!, Sunnyvale, CA, USA
2. Hairong Kuang (Hairong@Yahoo-Inc.com), Yahoo!, Sunnyvale, CA, USA
3. Radia, S (SRadia@Yahoo-Inc.com), Yahoo!, Sunnyvale, CA, USA
4. Chansler, R (Chansler@Yahoo-Inc.com), Yahoo!, Sunnyvale, CA, USA
DOI: 10.1109/MSST.2010.5496972
Discipline: Engineering
EISBN: 9781424471539, 1424471532
Genre: original research
ISI cited references count: 1347
Subject terms: Bandwidth; Clustering algorithms; Computer architecture; Concurrent computing; Distributed computing; distributed file system; Facebook; File servers; File systems; Hadoop; HDFS; Protection; Protocols
URL: https://ieeexplore.ieee.org/document/5496972