Leveraging National Science Data Fabric Services to Train Data Scientists

We document an interactive half-day tutorial in which participants explore the advanced applications of National Science Data Fabric (NSDF) services and strategies for comprehensive scientific data analysis. Targeting researchers, students, developers, and scientists, the tutorial provides valuable...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis s. 355 - 362
Hlavní autoři: Taufer, Michela, Martinez, Heberth, Panta, Aashish, Olaya, Paula, Marquez, Jack, Gooch, Amy, Scorzelli, Giorgio, Pascucci, Valerio
Médium: Konferenční příspěvek
Jazyk:angličtina
Vydáno: IEEE 17.11.2024
Témata:
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Abstract We document an interactive half-day tutorial in which participants explore the advanced applications of National Science Data Fabric (NSDF) services and strategies for comprehensive scientific data analysis. Targeting researchers, students, developers, and scientists, the tutorial provides valuable insights into managing and analyzing large datasets, particularly those exceeding 100TB. Participants gain hands-on experience by constructing modular workflows, leveraging public and private data storage and streaming solutions, and deploying sophisticated visualization and analysis dashboards. The tutorial emphasizes NSDF's role in supporting visualization conference themes by providing scalable visualization and visual analytics solutions. Our tutorial includes an overview of NSDF's capabilities, addressing common data analysis challenges, and intermediate hands-on exercises using NSDF services for Earth science data. Advanced applications cover handling and visualizing massive datasets requiring high-resolution data management. By the end of the session, attendees have a deeper understanding of integrating NSDF services into their research workflows, enhancing data accessibility, sharing, and collaborative scientific discovery. Our tutorial aims to advance knowledge in data-intensive computing and empower participants to harness the full potential of NSDF in their respective fields.
AbstractList We document an interactive half-day tutorial in which participants explore the advanced applications of National Science Data Fabric (NSDF) services and strategies for comprehensive scientific data analysis. Targeting researchers, students, developers, and scientists, the tutorial provides valuable insights into managing and analyzing large datasets, particularly those exceeding 100TB. Participants gain hands-on experience by constructing modular workflows, leveraging public and private data storage and streaming solutions, and deploying sophisticated visualization and analysis dashboards. The tutorial emphasizes NSDF's role in supporting visualization conference themes by providing scalable visualization and visual analytics solutions. Our tutorial includes an overview of NSDF's capabilities, addressing common data analysis challenges, and intermediate hands-on exercises using NSDF services for Earth science data. Advanced applications cover handling and visualizing massive datasets requiring high-resolution data management. By the end of the session, attendees have a deeper understanding of integrating NSDF services into their research workflows, enhancing data accessibility, sharing, and collaborative scientific discovery. Our tutorial aims to advance knowledge in data-intensive computing and empower participants to harness the full potential of NSDF in their respective fields.
Author Taufer, Michela
Martinez, Heberth
Gooch, Amy
Marquez, Jack
Pascucci, Valerio
Olaya, Paula
Scorzelli, Giorgio
Panta, Aashish
Author_xml – sequence: 1
  givenname: Michela
  surname: Taufer
  fullname: Taufer, Michela
  email: taufer@acm.org
  organization: University of Tennessee,Knoxville,USA
– sequence: 2
  givenname: Heberth
  surname: Martinez
  fullname: Martinez, Heberth
  email: hmarti46@utk.edu
  organization: University of Tennessee,Knoxville,USA
– sequence: 3
  givenname: Aashish
  surname: Panta
  fullname: Panta, Aashish
  email: aashish.panta@utah.edu
  organization: University of Utah Salt,Lake City,USA
– sequence: 4
  givenname: Paula
  surname: Olaya
  fullname: Olaya, Paula
  email: polaya@vols.utk.edu
  organization: University of Tennessee,Knoxville,USA
– sequence: 5
  givenname: Jack
  surname: Marquez
  fullname: Marquez, Jack
  email: jmarque4@utk.edu
  organization: University of Tennessee,Knoxville,USA
– sequence: 6
  givenname: Amy
  surname: Gooch
  fullname: Gooch, Amy
  email: amy.a.gooch@gmail.com
  organization: University of Utah Salt,Lake City,USA
– sequence: 7
  givenname: Giorgio
  surname: Scorzelli
  fullname: Scorzelli, Giorgio
  email: scrgiorgio@gmail.com
  organization: University of Utah Salt,Lake City,USA
– sequence: 8
  givenname: Valerio
  surname: Pascucci
  fullname: Pascucci, Valerio
  email: pascucci.valerio@gmail.com
  organization: University of Utah Salt,Lake City,USA
BookMark eNotjMtKw0AUQEdQUGu-QBfzA4l3HjfJLCVaLQRdpOKyzE1uykBNZDIU_HvFujqLczjX4nyaJxbiVkGhFLj7rvkojbZQaNC2AAA0ZyJzlasNgkFEay5FtiyBoASsLdR4JTYtHzn6fZj28tWnME_-ILs-8NSzfPTJy7WnGHrZcTyGnheZZrmNPkwn-5emsKTlRlyM_rBw9s-VeF8_bZuXvH173jQPbe41linnwVdjNdCoFRkwitlZNwAqtPVIjmxJ9GtsNQ6jwQGcgt7rmgwBVZrQrMTd6RuYefcVw6eP3zsFtYZKo_kB98RNwg
CODEN IEEPAD
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/SCW63240.2024.00053
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9798350355543
EndPage 362
ExternalDocumentID 10820725
Genre orig-research
GrantInformation_xml – fundername: National Science Foundation
  funderid: 10.13039/100000001
GroupedDBID 6IE
6IL
ACM
ALMA_UNASSIGNED_HOLDINGS
CBEJK
RIE
RIL
ID FETCH-LOGICAL-a256t-eda7f7dbf21b3031ee949d051548fb9b46bb1b347fdf35d0910ca28b3b0b72b53
IEDL.DBID RIE
ISICitedReferencesCount 0
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001451792300040&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
IngestDate Wed Aug 27 01:59:34 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a256t-eda7f7dbf21b3031ee949d051548fb9b46bb1b347fdf35d0910ca28b3b0b72b53
PageCount 8
ParticipantIDs ieee_primary_10820725
PublicationCentury 2000
PublicationDate 2024-Nov.-17
PublicationDateYYYYMMDD 2024-11-17
PublicationDate_xml – month: 11
  year: 2024
  text: 2024-Nov.-17
  day: 17
PublicationDecade 2020
PublicationTitle SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis
PublicationTitleAbbrev SC-W
PublicationYear 2024
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssib060584085
Score 1.8897899
Snippet We document an interactive half-day tutorial in which participants explore the advanced applications of National Science Data Fabric (NSDF) services and...
SourceID ieee
SourceType Publisher
StartPage 355
SubjectTerms Data analysis
Data visualization
Fabrics
Geoscience
High performance computing
Memory
Pain
Surveys
Tutorials
Visual analytics
workforce development
Title Leveraging National Science Data Fabric Services to Train Data Scientists
URI https://ieeexplore.ieee.org/document/10820725
WOSCitedRecordID wos001451792300040&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV09T8MwED3RioEJEEV8ywNrwHE-bM-FCqSqqkQR3Sq7Pk-oRSXl9_fOTYGFgS2Ko0R5tu_Ocd57ALcWZdBUN2ROWZOVVPNmTlqdOdQBKTpG5ZKI61CPRmY6teOWrJ64MIiYfj7DOz5Me_lhOV_zpzKa4ZSvtKo60NG63pK1doOHt_dYratVFsqlvX_pv7EYuaRVoGKNbMkOyL88VFIKGRz-8-FH0Psh44nxd5o5hj1cnMDzEGkMJoch0Upbv4t2nooH1zgxcJ5CnNjFAtEsxYTtILat6dKGuvizB6-Dx0n_KWtdEQjOqm4yDE5HHXxUuaf8kyPa0ga2ailN9NaXtffUUuoYYlEFrgfmThlfeOm18lVxCt3FcoFnICItT3M0rJlmSltIg6a2fC9CMqfC8Bx6jMPsYyt8MdtBcPHH-Us4YKiZqpfrK-g2qzVew_78i95ndZO6awNhZZSC
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV09T8MwELX4kmACRBHfeGANOI5T23OhakWoKlFEt8quzxNqUUn5_dy5KbAwsEVxlCjP9t05znuPsRsLImisGzInrckU1ryZE1ZnDnQAjI5RuiTiWunBwIzHdtiQ1RMXBgDSz2dwS4dpLz_Mp0v6VIYzHPOVluUm2y6VkmJF11oPH9rgI72uRlsoF_buufNKcuQC14GSVLIFeSD_clFJSaS7_8_HH7DWDx2PD78TzSHbgNkR61eAozB5DPFG3PqNNzOV37va8a7zGOT4Ohrwes5HZAixak2X1tjJHy320n0YdXpZ44uAgJbtOoPgdNTBR5l7zEA5gFU2kFmLMtFbr9reY4vSMcSiDFQRTJ00vvDCa-nL4phtzeYzOGE84gI1B0OqaUbZQhgwbUv3QiRzLA1PWYtwmLyvpC8mawjO_jh_zXZ7o6dqUvUHj-dsj2An4l6uL9hWvVjCJduZfuK7La5S130BTiKXyQ
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=SC24-W%3A+Workshops+of+the+International+Conference+for+High+Performance+Computing%2C+Networking%2C+Storage+and+Analysis&rft.atitle=Leveraging+National+Science+Data+Fabric+Services+to+Train+Data+Scientists&rft.au=Taufer%2C+Michela&rft.au=Martinez%2C+Heberth&rft.au=Panta%2C+Aashish&rft.au=Olaya%2C+Paula&rft.date=2024-11-17&rft.pub=IEEE&rft.spage=355&rft.epage=362&rft_id=info:doi/10.1109%2FSCW63240.2024.00053&rft.externalDocID=10820725