Reproducibility Starts at the Source: R, Python, and Julia Packages for Retrieving USGS Hydrologic Data.

Saved in:
Bibliographic Details
Title: Reproducibility Starts at the Source: R, Python, and Julia Packages for Retrieving USGS Hydrologic Data.
Authors: Hodson, Timothy O., DeCicco, Laura A., Hariharan, Jayaram A., Stanish, Lee F., Black, Scott, Horsburgh, Jeffery S.
Source: Water (20734441); Dec2023, Vol. 15 Issue 24, p4236, 10p
Subject Terms: DATABASES, PYTHONS, PYTHON programming language, INFORMATION retrieval, WATER quality, SOURCE code
Company/Entity: GEOLOGICAL Survey (U.S.)
Abstract: Much of modern science takes place in a computational environment, and, increasingly, that environment is programmed using R, Python, or Julia. Furthermore, most scientific data now live on the cloud, so the first step in many workflows is to query a cloud database and load the response into a computational environment for further analysis. Thus, tools that facilitate programmatic data retrieval represent a critical component in reproducible scientific workflows. Earth science is no different in this regard. To fulfill that basic need, we developed R, Python, and Julia packages providing programmatic access to the U.S. Geological Survey's National Water Information System database and the multi-agency Water Quality Portal. Together, these packages create a common interface for retrieving hydrologic data in the Jupyter ecosystem, which is widely used in water research, operations, and teaching. Source code, documentation, and tutorials for the packages are available on GitHub. Users can go there to learn, raise issues, or contribute improvements within a single platform, which helps foster better engagement and collaboration between data providers and their users. [ABSTRACT FROM AUTHOR]
Copyright of Water (20734441) is the property of MDPI and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Biomedical Index
Description
Abstract:Much of modern science takes place in a computational environment, and, increasingly, that environment is programmed using R, Python, or Julia. Furthermore, most scientific data now live on the cloud, so the first step in many workflows is to query a cloud database and load the response into a computational environment for further analysis. Thus, tools that facilitate programmatic data retrieval represent a critical component in reproducible scientific workflows. Earth science is no different in this regard. To fulfill that basic need, we developed R, Python, and Julia packages providing programmatic access to the U.S. Geological Survey's National Water Information System database and the multi-agency Water Quality Portal. Together, these packages create a common interface for retrieving hydrologic data in the Jupyter ecosystem, which is widely used in water research, operations, and teaching. Source code, documentation, and tutorials for the packages are available on GitHub. Users can go there to learn, raise issues, or contribute improvements within a single platform, which helps foster better engagement and collaboration between data providers and their users. [ABSTRACT FROM AUTHOR]
ISSN:20734441
DOI:10.3390/w15244236