Regen: An object layout regenerator on large-scale production HPC systems

This article proposes an object layout regenerator called Regen which regenerates and removes the object layout dynamically to improve the read performance of applications. Regen first detects frequent access patterns from the I/O requests of the applications. Second, Regen reorganizes the objects a...

Detailed bibliography
Published in: Future Generation Computer Systems, Vol. 171, Issue C, article 107830
Main authors: Sung, Dong Kyu; Kim, Sunggon; Lee, Sangjin; Tang, Houjun; Sim, Alex; Wu, Kesheng; Byna, Suren; Son, Yongseok
Format: Journal Article
Language: English
Published: Netherlands, Elsevier B.V., 1 October 2025
Elsevier
ISSN: 0167-739X
Abstract This article proposes an object layout regenerator called Regen, which dynamically regenerates and removes object layouts to improve the read performance of applications. Regen first detects frequent access patterns from the I/O requests of the applications. Second, Regen reorganizes the objects and regenerates or preallocates new object layouts according to the identified access patterns. Finally, Regen removes or reuses the obsolete or regenerated object layouts as necessary. As a result, Regen accelerates access to objects by providing a flexible object layout. We implement Regen as a framework on top of Proactive Data Container (PDC) and evaluate it on the Cori supercomputer, a production-scale HPC system, using realistic HPC I/O benchmarks. The experimental results show that Regen improves I/O performance by up to 16.92× compared with an existing system.
• Object storage systems are gaining interest and being adopted by the HPC community.
• Many HPC applications perform I/O with various access patterns.
• Regen, an object layout regenerator, regenerates object layouts based on frequent access patterns on the original objects.
• Regen regenerates or removes object layouts based on the read patterns of applications.
• Regen can improve I/O performance by up to 16.92× compared with an existing object storage system.
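The three steps named in the abstract (detect frequent access patterns, regenerate a layout for them, remove obsolete layouts) can be illustrated with a toy model. The following is only a minimal sketch under stated assumptions: the class `LayoutRegenerator`, the `(start, stride, count)` pattern encoding, the `hot_threshold` and `max_layouts` parameters, and the least-read eviction rule are all hypothetical and are not taken from Regen or from the PDC API.

```python
from collections import Counter


class LayoutRegenerator:
    """Toy model of pattern-driven layout regeneration (hypothetical, not Regen's API)."""

    def __init__(self, data, hot_threshold=3, max_layouts=2):
        self.data = data                    # the "original object" as a flat list
        self.hot_threshold = hot_threshold  # assumed: reads before a pattern counts as frequent
        self.max_layouts = max_layouts      # assumed: cap on regenerated layouts kept
        self.counts = Counter()             # pattern -> number of reads observed
        self.layouts = {}                   # pattern -> contiguous copy of the requested elements

    def read(self, start, stride, count):
        """Serve a strided read and report which layout answered it."""
        pattern = (start, stride, count)
        self.counts[pattern] += 1           # step 1: track access-pattern frequency
        if pattern in self.layouts:
            return list(self.layouts[pattern]), "regenerated"
        # Gather the strided elements from the original layout.
        values = [self.data[start + i * stride] for i in range(count)]
        if self.counts[pattern] >= self.hot_threshold:
            self._regenerate(pattern, values)
        return values, "original"

    def _regenerate(self, pattern, values):
        """Step 2: store a contiguous copy; step 3: drop the least-read layout if full."""
        if len(self.layouts) >= self.max_layouts:
            coldest = min(self.layouts, key=lambda p: self.counts[p])
            del self.layouts[coldest]
        self.layouts[pattern] = list(values)


regen = LayoutRegenerator(list(range(100)), hot_threshold=3)
for _ in range(4):
    values, source = regen.read(start=0, stride=10, count=5)
print(source, values)  # regenerated [0, 10, 20, 30, 40]
```

In this sketch the first three reads are served from the original layout and the fourth from the regenerated copy; a real system would weigh storage cost and pattern overlap rather than use a fixed threshold.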
ArticleNumber 107830
Author Sim, Alex
Son, Yongseok
Byna, Suren
Kim, Sunggon
Wu, Kesheng
Lee, Sangjin
Tang, Houjun
Sung, Dong Kyu
Author_xml – sequence: 1
  givenname: Dong Kyu
  orcidid: 0000-0003-3983-5585
  surname: Sung
  fullname: Sung, Dong Kyu
  email: davidsung1@snu.ac.kr
  organization: Seoul National University, Republic of Korea
– sequence: 2
  givenname: Sunggon
  surname: Kim
  fullname: Kim, Sunggon
  email: sunggonkim@seoultech.ac.kr
  organization: Seoul National University of Science and Technology, Republic of Korea
– sequence: 3
  givenname: Sangjin
  orcidid: 0000-0002-0891-5286
  surname: Lee
  fullname: Lee, Sangjin
  email: tkdwls0727@cau.ac.kr
  organization: Chung-Ang University, Republic of Korea
– sequence: 4
  givenname: Houjun
  orcidid: 0000-0001-7038-8360
  surname: Tang
  fullname: Tang, Houjun
  email: htang4@lbl.gov
  organization: Lawrence Berkeley National Laboratory, United States of America
– sequence: 5
  givenname: Alex
  orcidid: 0000-0002-6295-1982
  surname: Sim
  fullname: Sim, Alex
  email: asim@lbl.gov
  organization: Lawrence Berkeley National Laboratory, United States of America
– sequence: 6
  givenname: Kesheng
  orcidid: 0000-0002-6907-3393
  surname: Wu
  fullname: Wu, Kesheng
  email: kwu@lbl.gov
  organization: Lawrence Berkeley National Laboratory, United States of America
– sequence: 7
  givenname: Suren
  surname: Byna
  fullname: Byna, Suren
  email: sbyna@lbl.gov
  organization: Lawrence Berkeley National Laboratory, United States of America
– sequence: 8
  givenname: Yongseok
  orcidid: 0000-0003-4512-0121
  surname: Son
  fullname: Son, Yongseok
  email: sysganda@cau.ac.kr
  organization: Chung-Ang University, Republic of Korea
BackLink https://www.osti.gov/biblio/2559015$$D View this record in Osti.gov
ContentType Journal Article
Copyright 2025 Elsevier B.V.
Copyright_xml – notice: 2025 Elsevier B.V.
DOI 10.1016/j.future.2025.107830
Discipline Computer Science
ExternalDocumentID 2559015
10_1016_j_future_2025_107830
S0167739X25001256
ISSN 0167-739X
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue C
Keywords Pattern detection
Object storage
High-performance computing
Distributed file system
Language English
Notes USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
AC02-05CH11231
ORCID 0000-0002-6907-3393
0000-0001-7038-8360
0000-0002-0891-5286
0000-0002-6295-1982
0000-0003-3983-5585
0000-0003-4512-0121
OpenAccessLink https://escholarship.org/content/qt07t4k8ss/qt07t4k8ss.pdf
PublicationCentury 2000
PublicationDate October 2025
2025-10-00
2025-10-01
PublicationDateYYYYMMDD 2025-10-01
PublicationDate_xml – month: 10
  year: 2025
  text: October 2025
PublicationDecade 2020
PublicationPlace Netherlands
PublicationPlace_xml – name: Netherlands
PublicationTitle Future generation computer systems
PublicationYear 2025
Publisher Elsevier B.V
Elsevier
Publisher_xml – name: Elsevier B.V
– name: Elsevier
– volume: 42
  start-page: 49
  year: 2016
  end-page: 65
  ident: b12
  article-title: HACC: Simulating sky surveys on state-of-the-art supercomputing architectures
  publication-title: New Astron.
– volume: 31
  start-page: 163
  year: 2019
  end-page: 171
  ident: b44
  article-title: Parallel streaming between heterogeneous HPC resources for real-time analysis
  publication-title: J. Comput. Sci.
– reference: S.A. Weil, S.A. Brandt, E.L. Miller, D.D. Long, C. Maltzahn, Ceph: A Scalable, High-Performance Distributed File System, in: Proceedings of the 7th Symposium on Operating Systems Design and Implementation, 2006, pp. 307–320.
– volume: 42
  start-page: 49
  year: 2016
  ident: 10.1016/j.future.2025.107830_b12
  article-title: HACC: Simulating sky surveys on state-of-the-art supercomputing architectures
  publication-title: New Astron.
  doi: 10.1016/j.newast.2015.06.003
– volume: 13
  issue: 8
  year: 2018
  ident: 10.1016/j.future.2025.107830_b42
  article-title: Strategies of data layout and cache writing for input-output optimization in high performance scientific computing: Applications to the forward electrocardiographic problem
  publication-title: Plos One
  doi: 10.1371/journal.pone.0202410
– ident: 10.1016/j.future.2025.107830_b18
– ident: 10.1016/j.future.2025.107830_b67
  doi: 10.1145/3502181.3531461
– volume: Vol. 9
  start-page: 183
  year: 2009
  ident: 10.1016/j.future.2025.107830_b29
  article-title: BORG: Block-reORGanization for self-optimizing storage systems
– volume: 78
  issue: 1
  year: 2007
  ident: 10.1016/j.future.2025.107830_b40
  article-title: Petascale supernova simulation with CHIMERA
  publication-title: J. Phys.: Conf. Ser.
– volume: 67
  start-page: 1840
  issue: 12
  year: 2018
  ident: 10.1016/j.future.2025.107830_b61
  article-title: Towards adaptive parallel storage systems
  publication-title: IEEE Trans. Comput.
  doi: 10.1109/TC.2018.2836426
– year: 2019
  ident: 10.1016/j.future.2025.107830_b13
– ident: 10.1016/j.future.2025.107830_b68
  doi: 10.1145/3295500.3356183
– ident: 10.1016/j.future.2025.107830_b14
– start-page: 1
  year: 2012
  ident: 10.1016/j.future.2025.107830_b5
  article-title: Parallel I/O, analysis, and visualization of a trillion particle simulation
– ident: 10.1016/j.future.2025.107830_b54
  doi: 10.1145/3078468.3078485
– volume: 29
  start-page: 1605
  issue: 7
  year: 2018
  ident: 10.1016/j.future.2025.107830_b53
  article-title: ReCA: An efficient reconfigurable cache architecture for storage systems with online workload characterization
  publication-title: IEEE Trans. Parallel Distrib. Syst.
  doi: 10.1109/TPDS.2018.2796100
– start-page: 446
  year: 2019
  ident: 10.1016/j.future.2025.107830_b36
  article-title: Optimizing the ceph distributed file system for high performance computing
– ident: 10.1016/j.future.2025.107830_b39
  doi: 10.1145/1996130.1996139
– ident: 10.1016/j.future.2025.107830_b55
  doi: 10.1145/2749246.2749252
– start-page: 413
  year: 2014
  ident: 10.1016/j.future.2025.107830_b43
  article-title: Efficient I/O and storage of adaptive-resolution data
– start-page: 39
  year: 2020
  ident: 10.1016/j.future.2025.107830_b34
  article-title: Archival data repository services to enable HPC and cloud workflows in a federated research e-infrastructure
– start-page: 93
  year: 2011
  ident: 10.1016/j.future.2025.107830_b41
  article-title: EDO: Improving read performance for scientific applications through elastic data organization
– year: 2021
  ident: 10.1016/j.future.2025.107830_b66
– volume: 15
  issue: 6
  year: 2008
  ident: 10.1016/j.future.2025.107830_b1
  article-title: Spontaneous rotation sources in a quiescent tokamak edge plasma
  publication-title: Phys. Plasmas
  doi: 10.1063/1.2937116
– year: 2018
  ident: 10.1016/j.future.2025.107830_b38
– start-page: 585
  year: 2016
  ident: 10.1016/j.future.2025.107830_b26
  article-title: DAOS and friends: A proposal for an exascale storage system
– start-page: 1
  year: 2009
  ident: 10.1016/j.future.2025.107830_b50
  article-title: Object storage on CRAQ: High-throughput chain replication for read-mostly workloads
– year: 2021
  ident: 10.1016/j.future.2025.107830_b15
– start-page: 359
  year: 2017
  ident: 10.1016/j.future.2025.107830_b28
  article-title: Someta: Scalable object-centric metadata management for high performance computing
– volume: 5
  start-page: 1
  issue: 4
  year: 2019
  ident: 10.1016/j.future.2025.107830_b19
  article-title: Optimizing I/O performance of HPC applications with autotuning
  publication-title: ACM Trans. Parallel Comput. (TOPC)
– start-page: 2023
  year: 2020
  ident: 10.1016/j.future.2025.107830_b58
  article-title: An intelligent data placement strategy for hierarchical storage systems
– ident: 10.1016/j.future.2025.107830_b4
  doi: 10.1145/3078597.3078614
– ident: 10.1016/j.future.2025.107830_b33
  doi: 10.1145/1383519.1383526
– volume: 33
  start-page: 903
  issue: 4
  year: 2021
  ident: 10.1016/j.future.2025.107830_b20
  article-title: Accelerating HDF5 I/O for exascale using DAOS
  publication-title: IEEE Trans. Parallel Distrib. Syst.
  doi: 10.1109/TPDS.2021.3097884
– year: 2010
  ident: 10.1016/j.future.2025.107830_b22
– ident: 10.1016/j.future.2025.107830_b52
– ident: 10.1016/j.future.2025.107830_b56
– year: 2021
  ident: 10.1016/j.future.2025.107830_b49
– volume: 17
  start-page: 1
  issue: 1
  year: 2021
  ident: 10.1016/j.future.2025.107830_b51
  article-title: NVMM-oriented hierarchical persistent client caching for lustre
  publication-title: ACM Trans. Storage (TOS)
  doi: 10.1145/3404190
– volume: 26
  start-page: 74
  issue: 1
  year: 2012
  ident: 10.1016/j.future.2025.107830_b11
  article-title: CAM-SE: A scalable spectral element dynamical core for the community atmosphere model
  publication-title: Int. J. High Perform. Comput. Appl.
  doi: 10.1177/1094342011428142
– start-page: 24
  year: 2018
  ident: 10.1016/j.future.2025.107830_b2
  article-title: Evaluation of HPC application I/O on object storage systems
– start-page: 99
  year: 2022
  ident: 10.1016/j.future.2025.107830_b45
  article-title: Transitioning from file-based HPC workflows to streaming data pipelines with openPMD and ADIOS2
– volume: 5
  start-page: 1062
  issue: 10
  year: 2021
  ident: 10.1016/j.future.2025.107830_b64
  article-title: Accelerated, scalable and reproducible AI-driven gravitational wave detection
  publication-title: Nat. Astron.
  doi: 10.1038/s41550-021-01405-0
– year: 2018
  ident: 10.1016/j.future.2025.107830_b32
– start-page: 806
  year: 2022
  ident: 10.1016/j.future.2025.107830_b63
  article-title: Coupling streaming AI and HPC ensembles to achieve 100–1000× faster biomolecular simulations
– ident: 10.1016/j.future.2025.107830_b25
  doi: 10.1145/1374596.1374606
– volume: 33
  start-page: 3850
  issue: 12
  year: 2022
  ident: 10.1016/j.future.2025.107830_b21
  article-title: The state of the art of metadata managements in large-scale distributed file systems—Scalability, performance and availability
  publication-title: IEEE Trans. Parallel Distrib. Syst.
  doi: 10.1109/TPDS.2022.3170574
– volume: 15
  issue: 5
  year: 2008
  ident: 10.1016/j.future.2025.107830_b7
  article-title: Ultrahigh performance three-dimensional electromagnetic relativistic kinetic plasma simulation
  publication-title: Phys. Plasmas
  doi: 10.1063/1.2840133
– year: 2017
  ident: 10.1016/j.future.2025.107830_b16
– volume: 31
  start-page: 163
  year: 2019
  ident: 10.1016/j.future.2025.107830_b44
  article-title: Parallel streaming between heterogeneous HPC resources for real-time analysis
  publication-title: J. Comput. Sci.
  doi: 10.1016/j.jocs.2019.01.003
– ident: 10.1016/j.future.2025.107830_b59
  doi: 10.1145/2493123.2462909
– ident: 10.1016/j.future.2025.107830_b6
– start-page: 1
  year: 2010
  ident: 10.1016/j.future.2025.107830_b10
  article-title: Scalable earthquake simulation on petascale supercomputers
– start-page: 34
  year: 2020
  ident: 10.1016/j.future.2025.107830_b37
  article-title: Emulating I/O behavior in scientific workflows on high performance computing systems
– start-page: 52
  year: 2020
  ident: 10.1016/j.future.2025.107830_b60
  article-title: Stitch it up: Using progressive data storage to scale science
– start-page: 81
  year: 2024
  ident: 10.1016/j.future.2025.107830_b62
  article-title: TailorFS: An adaptive file system to support dynamic I/O requirements of HPC workloads
– ident: 10.1016/j.future.2025.107830_b3
  doi: 10.1145/2807591.2807616
– start-page: 113
  year: 2018
  ident: 10.1016/j.future.2025.107830_b27
  article-title: Toward scalable and asynchronous object-centric data management for HPC
– volume: 50
  start-page: 1352
  issue: 12
  year: 2001
  ident: 10.1016/j.future.2025.107830_b48
  article-title: LRFU: A spectrum of policies that subsumes the least recently used and least frequently used policies
  publication-title: IEEE Trans. Comput.
  doi: 10.1109/TC.2001.970573
– volume: 4
  start-page: 73
  issue: 2
  year: 2009
  ident: 10.1016/j.future.2025.107830_b65
  article-title: Montage: a grid portal and software toolkit for science-grade astronomical image mosaicking
  publication-title: Int. J. Comput. Sci. Eng.
– volume: 26
  start-page: 831
  issue: 7
  year: 2000
  ident: 10.1016/j.future.2025.107830_b46
  article-title: Fast algorithm for generating sorted contour strings
  publication-title: Comput. Geosci.
  doi: 10.1016/S0098-3004(00)00009-1
– start-page: 345
  year: 2013
  ident: 10.1016/j.future.2025.107830_b30
  article-title: Pattern-direct and layout-aware replication scheme for parallel I/O systems
– volume: 11
  start-page: 8540
  issue: 18
  year: 2021
  ident: 10.1016/j.future.2025.107830_b35
  article-title: Analyzing the performance of the S3 object storage API for HPC workloads
  publication-title: Appl. Sci.
  doi: 10.3390/app11188540
– volume: 33
  start-page: 878
  issue: 4
  year: 2021
  ident: 10.1016/j.future.2025.107830_b31
  article-title: Improving I/O performance for exascale applications through online data layout reorganization
  publication-title: IEEE Trans. Parallel Distrib. Syst.
  doi: 10.1109/TPDS.2021.3100784
– ident: 10.1016/j.future.2025.107830_b17
  doi: 10.1145/3458817.3476144
– volume: 2
  issue: 1
  year: 2009
  ident: 10.1016/j.future.2025.107830_b9
  article-title: Terascale direct numerical simulations of turbulent combustion using S3D
  publication-title: Comput. Sci. Discov.
  doi: 10.1088/1749-4699/2/1/015001
– volume: 34
  start-page: 1949
  issue: 22
  year: 2013
  ident: 10.1016/j.future.2025.107830_b47
  article-title: Continuous development of schemes for parallel computing of the electrostatics in biological systems: Implementation in DelPhi
  publication-title: J. Comput. Chem.
  doi: 10.1002/jcc.23340
– start-page: 412
  year: 2016
  ident: 10.1016/j.future.2025.107830_b57
  article-title: Bulk I/O storage management for big data applications
– volume: 125
  year: 2008
  ident: 10.1016/j.future.2025.107830_b8
  article-title: Toward a first-principles integrated simulation of tokamak edge plasmas
  publication-title: J. Phys.: Conf. Ser.
– ident: 10.1016/j.future.2025.107830_b23
– ident: 10.1016/j.future.2025.107830_b24
  doi: 10.1145/2628194.2628195
StartPage 107830
SubjectTerms Distributed file system; High-performance computing; Object storage; Pattern detection
Title Regen: An object layout regenerator on large-scale production HPC systems
URI https://dx.doi.org/10.1016/j.future.2025.107830
https://www.osti.gov/biblio/2559015
Volume 171