Ensuring Model Performance Reliability through a Data-Centric Approach

Uloženo v:
Podrobná bibliografie
Název: Ensuring Model Performance Reliability through a Data-Centric Approach
Autoři: Sanka, Praveen Gupta, Minukuri, Vidya Sagar
Informace o vydavateli: Zenodo, 2025.
Rok vydání: 2025
Témata: model performance, data reliability, data quality
Popis: Businesses optimize ML models for marginal performance gains, but how often are the business decisions made with full awareness of data quality? The importance and the level of effort to maintain data quality is not new. However, the industry still lacks a standard way to quantify and monitor data quality. While companies rigorously optimize the models, data issues can quietly undermine performance, introduce bias and can lead to costly failures. For example, credit agency’s misreporting of inquiry counts and tradeline ages can contribute to major financial losses. This study introduces the Data Reliability Score (DRS), a longitudinal metric for assessing data quality across training and inference. Similar to performance metrics such as Accuracy and Mean Squared Error, DRS provides continuous monitoring across six key pillars: Lineage, Completeness, Consistency, Bias, Frequency, and Accuracy anchored in data driven methodology. By proactively identifying issues, DRS helps businesses ensure data reliability, preventing failures. Just as low-performing models aren’t deployed, data with a low DRS should not be trusted for making business decisions.
Druh dokumentu: Conference object
Jazyk: English
DOI: 10.5281/zenodo.17073070
DOI: 10.5281/zenodo.17073071
Rights: CC BY
Přístupové číslo: edsair.doi.dedup.....32ce9f67a4d938364c71e70307cefa1a
Databáze: OpenAIRE
FullText Text:
  Availability: 0
CustomLinks:
  – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=Sanka%20PG
    Name: ISI
    Category: fullText
    Text: Nájsť tento článok vo Web of Science
    Icon: https://imagesrvr.epnet.com/ls/20docs.gif
    MouseOverText: Nájsť tento článok vo Web of Science
Header DbId: edsair
DbLabel: OpenAIRE
An: edsair.doi.dedup.....32ce9f67a4d938364c71e70307cefa1a
RelevancyScore: 1017
AccessLevel: 3
PubType: Conference
PubTypeId: conference
PreciseRelevancyScore: 1017.22802734375
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: Ensuring Model Performance Reliability through a Data-Centric Approach
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Sanka%2C+Praveen+Gupta%22">Sanka, Praveen Gupta</searchLink><br /><searchLink fieldCode="AR" term="%22Minukuri%2C+Vidya+Sagar%22">Minukuri, Vidya Sagar</searchLink>
– Name: Publisher
  Label: Publisher Information
  Group: PubInfo
  Data: Zenodo, 2025.
– Name: DatePubCY
  Label: Publication Year
  Group: Date
  Data: 2025
– Name: Subject
  Label: Subject Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22model+performance%22">model performance</searchLink><br /><searchLink fieldCode="DE" term="%22data+reliability%22">data reliability</searchLink><br /><searchLink fieldCode="DE" term="%22data+quality%22">data quality</searchLink>
– Name: Abstract
  Label: Description
  Group: Ab
  Data: Businesses optimize ML models for marginal performance gains, but how often are the business decisions made with full awareness of data quality? The importance and the level of effort to maintain data quality is not new. However, the industry still lacks a standard way to quantify and monitor data quality. While companies rigorously optimize the models, data issues can quietly undermine performance, introduce bias and can lead to costly failures. For example, credit agency’s misreporting of inquiry counts and tradeline ages can contribute to major financial losses. This study introduces the Data Reliability Score (DRS), a longitudinal metric for assessing data quality across training and inference. Similar to performance metrics such as Accuracy and Mean Squared Error, DRS provides continuous monitoring across six key pillars: Lineage, Completeness, Consistency, Bias, Frequency, and Accuracy anchored in data driven methodology. By proactively identifying issues, DRS helps businesses ensure data reliability, preventing failures. Just as low-performing models aren’t deployed, data with a low DRS should not be trusted for making business decisions.
– Name: TypeDocument
  Label: Document Type
  Group: TypDoc
  Data: Conference object
– Name: Language
  Label: Language
  Group: Lang
  Data: English
– Name: DOI
  Label: DOI
  Group: ID
  Data: 10.5281/zenodo.17073070
– Name: DOI
  Label: DOI
  Group: ID
  Data: 10.5281/zenodo.17073071
– Name: Copyright
  Label: Rights
  Group: Cpyrght
  Data: CC BY
– Name: AN
  Label: Accession Number
  Group: ID
  Data: edsair.doi.dedup.....32ce9f67a4d938364c71e70307cefa1a
PLink https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edsair&AN=edsair.doi.dedup.....32ce9f67a4d938364c71e70307cefa1a
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.5281/zenodo.17073070
    Languages:
      – Text: English
    Subjects:
      – SubjectFull: model performance
        Type: general
      – SubjectFull: data reliability
        Type: general
      – SubjectFull: data quality
        Type: general
    Titles:
      – TitleFull: Ensuring Model Performance Reliability through a Data-Centric Approach
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Sanka, Praveen Gupta
      – PersonEntity:
          Name:
            NameFull: Minukuri, Vidya Sagar
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 14
              M: 09
              Type: published
              Y: 2025
          Identifiers:
            – Type: issn-locals
              Value: edsair
ResultId 1