Leveraging National Science Data Fabric Services to Train Data Scientists
We document an interactive half-day tutorial in which participants explore the advanced applications of National Science Data Fabric (NSDF) services and strategies for comprehensive scientific data analysis. Targeting researchers, students, developers, and scientists, the tutorial provides valuable...
Saved in:
| Published in: | SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis pp. 355 - 362 |
|---|---|
| Main Authors: | , , , , , , , |
| Format: | Conference Proceeding |
| Language: | English |
| Published: |
IEEE
17.11.2024
|
| Subjects: | |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Abstract | We document an interactive half-day tutorial in which participants explore the advanced applications of National Science Data Fabric (NSDF) services and strategies for comprehensive scientific data analysis. Targeting researchers, students, developers, and scientists, the tutorial provides valuable insights into managing and analyzing large datasets, particularly those exceeding 100TB. Participants gain hands-on experience by constructing modular workflows, leveraging public and private data storage and streaming solutions, and deploying sophisticated visualization and analysis dashboards. The tutorial emphasizes NSDF's role in supporting visualization conference themes by providing scalable visualization and visual analytics solutions. Our tutorial includes an overview of NSDF's capabilities, addressing common data analysis challenges, and intermediate hands-on exercises using NSDF services for Earth science data. Advanced applications cover handling and visualizing massive datasets requiring high-resolution data management. By the end of the session, attendees have a deeper understanding of integrating NSDF services into their research workflows, enhancing data accessibility, sharing, and collaborative scientific discovery. Our tutorial aims to advance knowledge in data-intensive computing and empower participants to harness the full potential of NSDF in their respective fields. |
|---|---|
| AbstractList | We document an interactive half-day tutorial in which participants explore the advanced applications of National Science Data Fabric (NSDF) services and strategies for comprehensive scientific data analysis. Targeting researchers, students, developers, and scientists, the tutorial provides valuable insights into managing and analyzing large datasets, particularly those exceeding 100TB. Participants gain hands-on experience by constructing modular workflows, leveraging public and private data storage and streaming solutions, and deploying sophisticated visualization and analysis dashboards. The tutorial emphasizes NSDF's role in supporting visualization conference themes by providing scalable visualization and visual analytics solutions. Our tutorial includes an overview of NSDF's capabilities, addressing common data analysis challenges, and intermediate hands-on exercises using NSDF services for Earth science data. Advanced applications cover handling and visualizing massive datasets requiring high-resolution data management. By the end of the session, attendees have a deeper understanding of integrating NSDF services into their research workflows, enhancing data accessibility, sharing, and collaborative scientific discovery. Our tutorial aims to advance knowledge in data-intensive computing and empower participants to harness the full potential of NSDF in their respective fields. |
| Author | Taufer, Michela Martinez, Heberth Gooch, Amy Marquez, Jack Pascucci, Valerio Olaya, Paula Scorzelli, Giorgio Panta, Aashish |
| Author_xml | – sequence: 1 givenname: Michela surname: Taufer fullname: Taufer, Michela email: taufer@acm.org organization: University of Tennessee,Knoxville,USA – sequence: 2 givenname: Heberth surname: Martinez fullname: Martinez, Heberth email: hmarti46@utk.edu organization: University of Tennessee,Knoxville,USA – sequence: 3 givenname: Aashish surname: Panta fullname: Panta, Aashish email: aashish.panta@utah.edu organization: University of Utah Salt,Lake City,USA – sequence: 4 givenname: Paula surname: Olaya fullname: Olaya, Paula email: polaya@vols.utk.edu organization: University of Tennessee,Knoxville,USA – sequence: 5 givenname: Jack surname: Marquez fullname: Marquez, Jack email: jmarque4@utk.edu organization: University of Tennessee,Knoxville,USA – sequence: 6 givenname: Amy surname: Gooch fullname: Gooch, Amy email: amy.a.gooch@gmail.com organization: University of Utah Salt,Lake City,USA – sequence: 7 givenname: Giorgio surname: Scorzelli fullname: Scorzelli, Giorgio email: scrgiorgio@gmail.com organization: University of Utah Salt,Lake City,USA – sequence: 8 givenname: Valerio surname: Pascucci fullname: Pascucci, Valerio email: pascucci.valerio@gmail.com organization: University of Utah Salt,Lake City,USA |
| BookMark | eNotjMtKw0AUQEdQUGu-QBfzA4l3HjfJLCVaLQRdpOKyzE1uykBNZDIU_HvFujqLczjX4nyaJxbiVkGhFLj7rvkojbZQaNC2AAA0ZyJzlasNgkFEay5FtiyBoASsLdR4JTYtHzn6fZj28tWnME_-ILs-8NSzfPTJy7WnGHrZcTyGnheZZrmNPkwn-5emsKTlRlyM_rBw9s-VeF8_bZuXvH173jQPbe41linnwVdjNdCoFRkwitlZNwAqtPVIjmxJ9GtsNQ6jwQGcgt7rmgwBVZrQrMTd6RuYefcVw6eP3zsFtYZKo_kB98RNwg |
| CODEN | IEEPAD |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1109/SCW63240.2024.00053 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEL IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE/IET Electronic Library url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| EISBN | 9798350355543 |
| EndPage | 362 |
| ExternalDocumentID | 10820725 |
| Genre | orig-research |
| GrantInformation_xml | – fundername: National Science Foundation funderid: 10.13039/100000001 |
| GroupedDBID | 6IE 6IL ACM ALMA_UNASSIGNED_HOLDINGS CBEJK RIE RIL |
| ID | FETCH-LOGICAL-a256t-eda7f7dbf21b3031ee949d051548fb9b46bb1b347fdf35d0910ca28b3b0b72b53 |
| IEDL.DBID | RIE |
| ISICitedReferencesCount | 0 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001451792300040&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| IngestDate | Wed Aug 27 01:59:34 EDT 2025 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-a256t-eda7f7dbf21b3031ee949d051548fb9b46bb1b347fdf35d0910ca28b3b0b72b53 |
| PageCount | 8 |
| ParticipantIDs | ieee_primary_10820725 |
| PublicationCentury | 2000 |
| PublicationDate | 2024-Nov.-17 |
| PublicationDateYYYYMMDD | 2024-11-17 |
| PublicationDate_xml | – month: 11 year: 2024 text: 2024-Nov.-17 day: 17 |
| PublicationDecade | 2020 |
| PublicationTitle | SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis |
| PublicationTitleAbbrev | SC-W |
| PublicationYear | 2024 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| SSID | ssib060584085 |
| Score | 1.889684 |
| Snippet | We document an interactive half-day tutorial in which participants explore the advanced applications of National Science Data Fabric (NSDF) services and... |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 355 |
| SubjectTerms | Data analysis Data visualization Fabrics Geoscience High performance computing Memory Pain Surveys Tutorials Visual analytics workforce development |
| Title | Leveraging National Science Data Fabric Services to Train Data Scientists |
| URI | https://ieeexplore.ieee.org/document/10820725 |
| WOSCitedRecordID | wos001451792300040&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3PT8MgFCZu8eBJjTX-DgevKG1pXzlPF02WZYlTd1t4BU5mM7Pz7_dBW_XiwVsDJIT3gPcofN_H2LUDbSxlDgI91EJBXgvK8q0otUZLTsc6ah2-TGA6rRYLPevA6hEL45yLj8_cTfiMd_l2XW_DrzJa4RSvICsGbABQtmCtfvKE673A1tUxC6VS3z6NXgMZuaRTYBY4smVQQP6loRJDyHj_n50fsOQHjMdn32HmkO241RF7nDiag1FhiHfU1m-8W6f8zjSGjw3SFsf7vYA3az4PchBtbWzakIs_EvY8vp-PHkSniiAMpSeNcNaAB4s-S5HiT-qcVtoGqRZVedSoSkSqUeCtzwsb8oHaZBXmKBEyLPJjNlytV-6E8bQ0qGxhram8QmUMUPtCWqkcgDT-lCXBDsv3lvhi2Zvg7I_yc7YXTB2geilcsGGz2bpLtlt_0ng2V9FdX_t2lsc |
| linkProvider | IEEE |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1NTwMhEJ1oNdGTGmv8loNXlN1ly3KuNm1cmyZW7a1hFjiZ1tStv9-BbtWLB28bICHMADMsvPcArp3SxlLmwNGrikuVVZyyfMs7WqMlp2MVtQ5fSjUcFpOJHjVg9YiFcc7Fx2fuJnzGu3w7r5bhVxmtcIpXKs03YSuXMhUruNZ6-oQLvsDX1XALJULfPnVfAx25oHNgGliyRdBA_qWiEoNIb--f3e9D-weOx0bfgeYANtzsEAalo1kYNYZYQ279xpqVyu5MbVjPIG1ybL0bsHrOxkEQYlUbm9bk5I82PPfux90-b3QRuKEEpebOGuWVRZ8mSBEocU5LbYNYiyw8apQdRKqRyluf5TZkBJVJC8xQoEoxz46gNZvP3DGwpGNQ2txaU3iJ0hhF7XNhhXRKCeNPoB3sMH1fUV9M1yY4_aP8Cnb648dyWg6GD2ewG8wegHuJOodWvVi6C9iuPmlsi8voui8EVJoO |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=SC24-W%3A+Workshops+of+the+International+Conference+for+High+Performance+Computing%2C+Networking%2C+Storage+and+Analysis&rft.atitle=Leveraging+National+Science+Data+Fabric+Services+to+Train+Data+Scientists&rft.au=Taufer%2C+Michela&rft.au=Martinez%2C+Heberth&rft.au=Panta%2C+Aashish&rft.au=Olaya%2C+Paula&rft.date=2024-11-17&rft.pub=IEEE&rft.spage=355&rft.epage=362&rft_id=info:doi/10.1109%2FSCW63240.2024.00053&rft.externalDocID=10820725 |