Ensuring Model Performance Reliability through a Data-Centric Approach
Uloženo v:
| Název: | Ensuring Model Performance Reliability through a Data-Centric Approach |
|---|---|
| Autoři: | Sanka, Praveen Gupta, Minukuri, Vidya Sagar |
| Informace o vydavateli: | Zenodo, 2025. |
| Rok vydání: | 2025 |
| Témata: | model performance, data reliability, data quality |
| Popis: | Businesses optimize ML models for marginal performance gains, but how often are the business decisions made with full awareness of data quality? The importance and the level of effort to maintain data quality is not new. However, the industry still lacks a standard way to quantify and monitor data quality. While companies rigorously optimize the models, data issues can quietly undermine performance, introduce bias and can lead to costly failures. For example, credit agency’s misreporting of inquiry counts and tradeline ages can contribute to major financial losses. This study introduces the Data Reliability Score (DRS), a longitudinal metric for assessing data quality across training and inference. Similar to performance metrics such as Accuracy and Mean Squared Error, DRS provides continuous monitoring across six key pillars: Lineage, Completeness, Consistency, Bias, Frequency, and Accuracy anchored in data driven methodology. By proactively identifying issues, DRS helps businesses ensure data reliability, preventing failures. Just as low-performing models aren’t deployed, data with a low DRS should not be trusted for making business decisions. |
| Druh dokumentu: | Conference object |
| Jazyk: | English |
| DOI: | 10.5281/zenodo.17073070 |
| DOI: | 10.5281/zenodo.17073071 |
| Rights: | CC BY |
| Přístupové číslo: | edsair.doi.dedup.....32ce9f67a4d938364c71e70307cefa1a |
| Databáze: | OpenAIRE |
| FullText | Text: Availability: 0 CustomLinks: – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=Sanka%20PG Name: ISI Category: fullText Text: Nájsť tento článok vo Web of Science Icon: https://imagesrvr.epnet.com/ls/20docs.gif MouseOverText: Nájsť tento článok vo Web of Science |
|---|---|
| Header | DbId: edsair DbLabel: OpenAIRE An: edsair.doi.dedup.....32ce9f67a4d938364c71e70307cefa1a RelevancyScore: 1017 AccessLevel: 3 PubType: Conference PubTypeId: conference PreciseRelevancyScore: 1017.22802734375 |
| IllustrationInfo | |
| Items | – Name: Title Label: Title Group: Ti Data: Ensuring Model Performance Reliability through a Data-Centric Approach – Name: Author Label: Authors Group: Au Data: <searchLink fieldCode="AR" term="%22Sanka%2C+Praveen+Gupta%22">Sanka, Praveen Gupta</searchLink><br /><searchLink fieldCode="AR" term="%22Minukuri%2C+Vidya+Sagar%22">Minukuri, Vidya Sagar</searchLink> – Name: Publisher Label: Publisher Information Group: PubInfo Data: Zenodo, 2025. – Name: DatePubCY Label: Publication Year Group: Date Data: 2025 – Name: Subject Label: Subject Terms Group: Su Data: <searchLink fieldCode="DE" term="%22model+performance%22">model performance</searchLink><br /><searchLink fieldCode="DE" term="%22data+reliability%22">data reliability</searchLink><br /><searchLink fieldCode="DE" term="%22data+quality%22">data quality</searchLink> – Name: Abstract Label: Description Group: Ab Data: Businesses optimize ML models for marginal performance gains, but how often are the business decisions made with full awareness of data quality? The importance and the level of effort to maintain data quality is not new. However, the industry still lacks a standard way to quantify and monitor data quality. While companies rigorously optimize the models, data issues can quietly undermine performance, introduce bias and can lead to costly failures. For example, credit agency’s misreporting of inquiry counts and tradeline ages can contribute to major financial losses. This study introduces the Data Reliability Score (DRS), a longitudinal metric for assessing data quality across training and inference. Similar to performance metrics such as Accuracy and Mean Squared Error, DRS provides continuous monitoring across six key pillars: Lineage, Completeness, Consistency, Bias, Frequency, and Accuracy anchored in data driven methodology. By proactively identifying issues, DRS helps businesses ensure data reliability, preventing failures. Just as low-performing models aren’t deployed, data with a low DRS should not be trusted for making business decisions. – Name: TypeDocument Label: Document Type Group: TypDoc Data: Conference object – Name: Language Label: Language Group: Lang Data: English – Name: DOI Label: DOI Group: ID Data: 10.5281/zenodo.17073070 – Name: DOI Label: DOI Group: ID Data: 10.5281/zenodo.17073071 – Name: Copyright Label: Rights Group: Cpyrght Data: CC BY – Name: AN Label: Accession Number Group: ID Data: edsair.doi.dedup.....32ce9f67a4d938364c71e70307cefa1a |
| PLink | https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edsair&AN=edsair.doi.dedup.....32ce9f67a4d938364c71e70307cefa1a |
| RecordInfo | BibRecord: BibEntity: Identifiers: – Type: doi Value: 10.5281/zenodo.17073070 Languages: – Text: English Subjects: – SubjectFull: model performance Type: general – SubjectFull: data reliability Type: general – SubjectFull: data quality Type: general Titles: – TitleFull: Ensuring Model Performance Reliability through a Data-Centric Approach Type: main BibRelationships: HasContributorRelationships: – PersonEntity: Name: NameFull: Sanka, Praveen Gupta – PersonEntity: Name: NameFull: Minukuri, Vidya Sagar IsPartOfRelationships: – BibEntity: Dates: – D: 14 M: 09 Type: published Y: 2025 Identifiers: – Type: issn-locals Value: edsair |
| ResultId | 1 |
Nájsť tento článok vo Web of Science