FFMDFPA: A FAIRification Framework for Materials Data with No-Code Flexible Semi-Structured Parser and Application Programming Interfaces
The FAIR Data Principles are guidelines to ensure Findability, Accessibility, Interoperability, and Reusability of digital resources, which are essential to accelerate data-driven materials science. Despite the development and growing adoption of the FAIR principles, appropriate implementation solut...
Saved in:
| Published in: | Journal of chemical information and modeling Vol. 63; no. 16; p. 4986 |
|---|---|
| Main Authors: | , , , |
| Format: | Journal Article |
| Language: | English |
| Published: |
United States
28.08.2023
|
| ISSN: | 1549-960X, 1549-960X |
| Online Access: | Get more information |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Abstract | The FAIR Data Principles are guidelines to ensure Findability, Accessibility, Interoperability, and Reusability of digital resources, which are essential to accelerate data-driven materials science. Despite the development and growing adoption of the FAIR principles, appropriate implementation solutions and software to make data FAIR are still sparse, particularly in standardization of heterogeneous data and subsequent data access. Here, we introduce a FAIRification Framework for Materials Data with No-Code Flexible Semi-Structured Parser and API (FFMDFPA) (API, application programming interface) for raw data processing. Using a template-based parser, FFMDFPA can extract and transform semistructured data in various text formats, providing the flexibility to extend data manipulation without coding. Additionally, FFMDFPA provides a standardized API with efficient query syntax that facilitates seamless data sharing. Taking various text files generated by computational software as examples, we demonstrate the potential utility of FFMDFPA. This work offers important insights toward efficient utilization and reuse of materials data, and the data semantic manipulation implemented in the parser and API can be extended to textual data, which has implications for future data FAIRification. |
|---|---|
| AbstractList | The FAIR Data Principles are guidelines to ensure Findability, Accessibility, Interoperability, and Reusability of digital resources, which are essential to accelerate data-driven materials science. Despite the development and growing adoption of the FAIR principles, appropriate implementation solutions and software to make data FAIR are still sparse, particularly in standardization of heterogeneous data and subsequent data access. Here, we introduce a FAIRification Framework for Materials Data with No-Code Flexible Semi-Structured Parser and API (FFMDFPA) (API, application programming interface) for raw data processing. Using a template-based parser, FFMDFPA can extract and transform semistructured data in various text formats, providing the flexibility to extend data manipulation without coding. Additionally, FFMDFPA provides a standardized API with efficient query syntax that facilitates seamless data sharing. Taking various text files generated by computational software as examples, we demonstrate the potential utility of FFMDFPA. This work offers important insights toward efficient utilization and reuse of materials data, and the data semantic manipulation implemented in the parser and API can be extended to textual data, which has implications for future data FAIRification.The FAIR Data Principles are guidelines to ensure Findability, Accessibility, Interoperability, and Reusability of digital resources, which are essential to accelerate data-driven materials science. Despite the development and growing adoption of the FAIR principles, appropriate implementation solutions and software to make data FAIR are still sparse, particularly in standardization of heterogeneous data and subsequent data access. Here, we introduce a FAIRification Framework for Materials Data with No-Code Flexible Semi-Structured Parser and API (FFMDFPA) (API, application programming interface) for raw data processing. Using a template-based parser, FFMDFPA can extract and transform semistructured data in various text formats, providing the flexibility to extend data manipulation without coding. Additionally, FFMDFPA provides a standardized API with efficient query syntax that facilitates seamless data sharing. Taking various text files generated by computational software as examples, we demonstrate the potential utility of FFMDFPA. This work offers important insights toward efficient utilization and reuse of materials data, and the data semantic manipulation implemented in the parser and API can be extended to textual data, which has implications for future data FAIRification. The FAIR Data Principles are guidelines to ensure Findability, Accessibility, Interoperability, and Reusability of digital resources, which are essential to accelerate data-driven materials science. Despite the development and growing adoption of the FAIR principles, appropriate implementation solutions and software to make data FAIR are still sparse, particularly in standardization of heterogeneous data and subsequent data access. Here, we introduce a FAIRification Framework for Materials Data with No-Code Flexible Semi-Structured Parser and API (FFMDFPA) (API, application programming interface) for raw data processing. Using a template-based parser, FFMDFPA can extract and transform semistructured data in various text formats, providing the flexibility to extend data manipulation without coding. Additionally, FFMDFPA provides a standardized API with efficient query syntax that facilitates seamless data sharing. Taking various text files generated by computational software as examples, we demonstrate the potential utility of FFMDFPA. This work offers important insights toward efficient utilization and reuse of materials data, and the data semantic manipulation implemented in the parser and API can be extended to textual data, which has implications for future data FAIRification. |
| Author | He, Bing Gong, Zhuming Avdeev, Maxim Shi, Siqi |
| Author_xml | – sequence: 1 givenname: Bing orcidid: 0000-0002-6796-941X surname: He fullname: He, Bing organization: School of Computer Engineering and Science, Shanghai University, Shanghai 20444, China – sequence: 2 givenname: Zhuming surname: Gong fullname: Gong, Zhuming organization: School of Computer Engineering and Science, Shanghai University, Shanghai 20444, China – sequence: 3 givenname: Maxim orcidid: 0000-0003-2366-5809 surname: Avdeev fullname: Avdeev, Maxim organization: School of Chemistry, The University of Sydney, Sydney 2006, Australia – sequence: 4 givenname: Siqi orcidid: 0000-0001-8988-9763 surname: Shi fullname: Shi, Siqi organization: State Key Laboratory of Advanced Special Steel, Shanghai Key Laboratory of Advanced Ferrometallurgy, School of Materials Science and Engineering, Shanghai University, Shanghai 200444, China |
| BackLink | https://www.ncbi.nlm.nih.gov/pubmed/37549383$$D View this record in MEDLINE/PubMed |
| BookMark | eNpN0E1Lw0AQBuBFKmqrd0-yRy-pu5k0H95Ca7TQarEK3sJmM6tbk2zdTaj-BP-1AS14GOaFeXkOMySDxjRIyDlnY858fiWkG2-krscgGYshPCAnfBIkXhKyl8G_fEyGzm0YA0hC_4gcQ9RfIIYT8p1ly1m2Sq9pSrN0_qiVlqLVpqGZFTXujH2nyli6FC1aLSpHZ6IVdKfbN3pvvKkpkWYVfuqiQrrGWnvr1nay7SyWdCWsQ0tFU9J0u6328sqa1x6vdfNK503vKiHRnZJD1ft49rdH5Dm7eZreeYuH2_k0XXgCeNR6UQxFwZifBEpBCRyAB4DYD49LlQifI2ASSuwzLxUUyEQECuJJwX2pIn9ELn_drTUfHbo2r7WTWFWiQdO53I-DKAriCQv76sVftStqLPOt1bWwX_n-ff4PGJd19A |
| ContentType | Journal Article |
| DBID | NPM 7X8 |
| DOI | 10.1021/acs.jcim.3c00836 |
| DatabaseName | PubMed MEDLINE - Academic |
| DatabaseTitle | PubMed MEDLINE - Academic |
| DatabaseTitleList | MEDLINE - Academic PubMed |
| Database_xml | – sequence: 1 dbid: NPM name: PubMed url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: 7X8 name: MEDLINE - Academic url: https://search.proquest.com/medline sourceTypes: Aggregation Database |
| DeliveryMethod | no_fulltext_linktorsrc |
| Discipline | Chemistry |
| EISSN | 1549-960X |
| ExternalDocumentID | 37549383 |
| Genre | Journal Article |
| GroupedDBID | --- -~X 4.4 55A 5GY 5VS 7~N AABXI ABJNI ABMVS ABQRX ABUCX ACGFS ACIWK ACNCT ACS ADHLV AEESW AENEX AFEFF AHGAQ ALMA_UNASSIGNED_HOLDINGS AQSVZ CUPRZ D0L DU5 EBS ED~ F5P GGK GNL IH9 JG~ NPM P2P PQQKQ RNS ROL UI2 VF5 VG9 W1F 7X8 ABBLG ABLBI |
| ID | FETCH-LOGICAL-a317t-783bb00294ff3d3133143ee43e18df9a21e3e96cef9a1df3be0a73f385b12cf72 |
| IEDL.DBID | 7X8 |
| ISICitedReferencesCount | 3 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001042710100001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 1549-960X |
| IngestDate | Thu Oct 02 11:06:25 EDT 2025 Thu Jan 02 22:52:21 EST 2025 |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 16 |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-a317t-783bb00294ff3d3133143ee43e18df9a21e3e96cef9a1df3be0a73f385b12cf72 |
| Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
| ORCID | 0000-0001-8988-9763 0000-0003-2366-5809 0000-0002-6796-941X |
| PMID | 37549383 |
| PQID | 2847748506 |
| PQPubID | 23479 |
| ParticipantIDs | proquest_miscellaneous_2847748506 pubmed_primary_37549383 |
| PublicationCentury | 2000 |
| PublicationDate | 2023-08-28 |
| PublicationDateYYYYMMDD | 2023-08-28 |
| PublicationDate_xml | – month: 08 year: 2023 text: 2023-08-28 day: 28 |
| PublicationDecade | 2020 |
| PublicationPlace | United States |
| PublicationPlace_xml | – name: United States |
| PublicationTitle | Journal of chemical information and modeling |
| PublicationTitleAlternate | J Chem Inf Model |
| PublicationYear | 2023 |
| SSID | ssj0033962 |
| Score | 2.4298155 |
| Snippet | The FAIR Data Principles are guidelines to ensure Findability, Accessibility, Interoperability, and Reusability of digital resources, which are essential to... |
| SourceID | proquest pubmed |
| SourceType | Aggregation Database Index Database |
| StartPage | 4986 |
| Title | FFMDFPA: A FAIRification Framework for Materials Data with No-Code Flexible Semi-Structured Parser and Application Programming Interfaces |
| URI | https://www.ncbi.nlm.nih.gov/pubmed/37549383 https://www.proquest.com/docview/2847748506 |
| Volume | 63 |
| WOSCitedRecordID | wos001042710100001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1LS8NAEF58gV58P-qLEbxu23RjN-tFQmvQQ0uwCr2VTXYXKppoH_4H_7UzSaonQfCQJZeEZXaY-XbmYz7GLlViMPgZKi-129yXznHte4pLampZK0zT-YXYhOz3g-FQxVXBbVrRKhcxsQjUJk-pRt6gMCp9mq928_bOSTWKuquVhMYyWxUIZYjSJYffXQQhVCEoSlPIOCL1YdWmxLTW0Om0_pyOX-sibVYDmn8BmEWiibb-u8VttllBTAhLn9hhSzbbZeudhbLbHvuMol43isNrCCEK7x-ILVQcEEQLqhYgloWenpX-CV0900AlW-jnvJMbCxEN0kxeLAzwt3xQTKGdT6yBGG_KdgI6MxD-NMchLnlgr5gpoahCOuKC7bOn6Paxc8crSQauEWjMuAxEQnld-c4JgzYXiLesxccLjFO65VlhVTu1-O4ZJxLb1FI4EVwlXit1snXAVrI8s0cMZIBIyHOSYoyfJElgfCfbymkEneg8psYuFlYeoX2oj6Ezm8-nox8719hheVSjt3I2x4gUfRXeuo__8PUJ2yDxeKoQt4JTturQoPaMraUfs_F0cl74Eq79uPcFK8rU_A |
| linkProvider | ProQuest |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=FFMDFPA%3A+A+FAIRification+Framework+for+Materials+Data+with+No-Code+Flexible+Semi-Structured+Parser+and+Application+Programming+Interfaces&rft.jtitle=Journal+of+chemical+information+and+modeling&rft.au=He%2C+Bing&rft.au=Gong%2C+Zhuming&rft.au=Avdeev%2C+Maxim&rft.au=Shi%2C+Siqi&rft.date=2023-08-28&rft.eissn=1549-960X&rft_id=info:doi/10.1021%2Facs.jcim.3c00836&rft_id=info%3Apmid%2F37549383&rft_id=info%3Apmid%2F37549383&rft.externalDocID=37549383 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1549-960X&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1549-960X&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1549-960X&client=summon |