Region-Based Active Learning with Hierarchical and Adaptive Region Construction
| Published in: | Proceedings of the ... SIAM International Conference on Data Mining Vol. 2019; p. 441 |
|---|---|
| Main Authors: | Luo, Zhipeng; Hauskrecht, Milos |
| Format: | Journal Article |
| Language: | English |
| Published: | United States, 2019-05-01 |
| ISSN: | 2167-0102 |
| Abstract | Learning of classification models in practice often relies on human annotation effort in which humans assign class labels to data instances. As this process can be very time-consuming and costly, finding effective ways to reduce the annotation cost becomes critical for building such models. To solve this problem, instead of soliciting instance-based annotation we explore region-based annotation as the human feedback. A region is defined as a hyper-cubic subspace of the input space X and it covers a subpopulation of data instances that fall into this region. Each region is labeled with a number in [0,1] (in binary classification setting), representing a human estimate of the positive (or negative) class proportion in the subpopulation. To quickly discover pure regions (in terms of class proportion) in the data, we have developed a novel active learning framework that constructs regions in a hierarchical and adaptive way. Hierarchical means that regions are incrementally built into a hierarchical tree, which is done by repeatedly splitting the input space. Adaptive means that our framework can adaptively choose the best heuristic for each of the region splits. Through experiments on numerous datasets we demonstrate that our framework can identify pure regions in very few region queries. Thus our approach is shown to be effective in learning classification models from very limited human feedback. |
|---|---|
| Author | Luo, Zhipeng; Hauskrecht, Milos |
| Affiliation | Department of Computer Science, University of Pittsburgh, PA, USA (both authors) |
| DOI | 10.1137/1.9781611975673.50 |
| Discipline | Computer Science |
| GrantInformation | NIGMS NIH HHS, grant R01 GM088224; NLM NIH HHS, grant R01 LM010019 |
| ISICitedReferencesCount | 4 |
| OpenAccessLink | https://epubs.siam.org/doi/pdf/10.1137/1.9781611975673.50 |
| PMID | 31929950 |
| PublicationTitleAlternate | Proc SIAM Int Conf Data Min |
| URI | https://www.ncbi.nlm.nih.gov/pubmed/31929950 https://www.proquest.com/docview/2338079027 |
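
The abstract describes regions as hyper-cubic subspaces labeled with a class proportion in [0,1], grown into a tree by repeated splitting, with a heuristic chosen per split. The record does not give the paper's actual heuristics or selection rule, so the following is only a minimal sketch under stated assumptions: the names `Region`, `midpoint_split`, `median_split`, and `build_regions` are hypothetical, the two split heuristics are illustrative stand-ins, and "adaptive" is approximated here by picking the candidate split whose two children are most balanced in size. The `oracle` callable stands in for the human annotator who estimates the positive-class proportion of a region.

```python
from collections import deque
from dataclasses import dataclass, field
from typing import Callable, List, Optional, Tuple

Point = Tuple[float, ...]


@dataclass
class Region:
    """Axis-aligned box [lower, upper] with an optional proportion label."""
    lower: Point
    upper: Point
    proportion: Optional[float] = None  # estimated positive-class share in [0, 1]
    children: List["Region"] = field(default_factory=list)


Split = Tuple[Tuple[Region, List[Point]], Tuple[Region, List[Point]]]


def cut_region(region: Region, points: List[Point], dim: int, cut: float) -> Split:
    """Split a region (and the points it covers) at `cut` along dimension `dim`."""
    left = Region(region.lower, region.upper[:dim] + (cut,) + region.upper[dim + 1:])
    right = Region(region.lower[:dim] + (cut,) + region.lower[dim + 1:], region.upper)
    return ((left, [p for p in points if p[dim] < cut]),
            (right, [p for p in points if p[dim] >= cut]))


def midpoint_split(region: Region, points: List[Point]) -> Split:
    """Hypothetical heuristic: halve the widest side of the box."""
    dim = max(range(len(region.lower)), key=lambda k: region.upper[k] - region.lower[k])
    return cut_region(region, points, dim, (region.lower[dim] + region.upper[dim]) / 2)


def median_split(region: Region, points: List[Point]) -> Split:
    """Hypothetical heuristic: cut at the data median of the most spread-out feature."""
    dim = max(range(len(region.lower)),
              key=lambda k: max(p[k] for p in points) - min(p[k] for p in points))
    return cut_region(region, points, dim, sorted(p[dim] for p in points)[len(points) // 2])


HEURISTICS = [midpoint_split, median_split]


def build_regions(root: Region, points: List[Point],
                  oracle: Callable[[List[Point]], float],
                  tol: float = 0.05, max_queries: int = 50):
    """Grow the region tree breadth-first; return (pure regions, queries used)."""
    pure, queries = [], 0
    queue = deque([(root, points)])
    while queue and queries < max_queries:
        region, pts = queue.popleft()
        if not pts:
            continue
        region.proportion = oracle(pts)  # one region query to the annotator
        queries += 1
        if min(region.proportion, 1 - region.proportion) <= tol:
            pure.append((region, pts))  # (nearly) one class only: stop splitting
            continue
        # Assumed selection rule: keep splits with two non-empty children,
        # then prefer the one whose children are most balanced in size.
        candidates = [s for s in (h(region, pts) for h in HEURISTICS) if s[0][1] and s[1][1]]
        if not candidates:
            continue
        best = min(candidates, key=lambda s: abs(len(s[0][1]) - len(s[1][1])))
        for child, child_pts in best:
            region.children.append(child)
            queue.append((child, child_pts))
    return pure, queries
```

To try the sketch, one could simulate the annotator with a function that computes the true positive share of the points in a region, for example `lambda pts: sum(p[0] >= 0.5 for p in pts) / len(pts)` on a grid of points in the unit square, and call `build_regions(Region((0.0, 0.0), (1.0, 1.0)), points, oracle)`. Splitting stops as soon as a region's proportion is within `tol` of 0 or 1, which is one plausible reading of "pure" in the abstract.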