Quantification of histone modification ChIP-seq enrichment for data mining and machine learning applications
Background The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups have applied statistical computing methods to ChIP-seq datasets to gain insight into the epigenetic regulation of transcription. However, meth...
Uložené v:
| Vydané v: | BMC research notes Ročník 4; číslo 1; s. 288 |
|---|---|
| Hlavní autori: | , , |
| Médium: | Journal Article |
| Jazyk: | English |
| Vydavateľské údaje: |
London
BioMed Central
11.08.2011
BioMed Central Ltd BMC |
| Predmet: | |
| ISSN: | 1756-0500, 1756-0500 |
| On-line prístup: | Získať plný text |
| Tagy: |
Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
|
| Abstract | Background
The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups have applied statistical computing methods to ChIP-seq datasets to gain insight into the epigenetic regulation of transcription. However, methods for estimating enrichment levels in ChIP-seq data for these computational studies are understudied and variable. Since the conclusions drawn from these data mining and machine learning applications strongly depend on the enrichment level inputs, a comparison of estimation methods with respect to the performance of statistical models should be made.
Results
Various methods were used to estimate the gene-wise ChIP-seq enrichment levels for 20 histone methylations and the histone variant H2A.Z. The Multivariate Adaptive Regression Splines (MARS) algorithm was applied for each estimation method using the estimation of enrichment levels as predictors and gene expression levels as responses. The methods used to estimate enrichment levels included tag counting and model-based methods that were applied to whole genes and specific gene regions. These methods were also applied to various sizes of estimation windows. The MARS model performance was assessed with the Generalized Cross-Validation Score (GCV). We determined that model-based methods of enrichment estimation that spatially weight enrichment based on average patterns provided an improvement over tag counting methods. Also, methods that included information across the entire gene body provided improvement over methods that focus on a specific sub-region of the gene (e.g., the 5' or 3' region).
Conclusion
The performance of data mining and machine learning methods when applied to histone modification ChIP-seq data can be improved by using data across the entire gene body, and incorporating the spatial distribution of enrichment. Refinement of enrichment estimation ultimately improved accuracy of model predictions. |
|---|---|
| AbstractList | The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups have applied statistical computing methods to ChIP-seq datasets to gain insight into the epigenetic regulation of transcription. However, methods for estimating enrichment levels in ChIP-seq data for these computational studies are understudied and variable. Since the conclusions drawn from these data mining and machine learning applications strongly depend on the enrichment level inputs, a comparison of estimation methods with respect to the performance of statistical models should be made. Various methods were used to estimate the gene-wise ChIP-seq enrichment levels for 20 histone methylations and the histone variant H2A.Z. The Multivariate Adaptive Regression Splines (MARS) algorithm was applied for each estimation method using the estimation of enrichment levels as predictors and gene expression levels as responses. The methods used to estimate enrichment levels included tag counting and model-based methods that were applied to whole genes and specific gene regions. These methods were also applied to various sizes of estimation windows. The MARS model performance was assessed with the Generalized Cross-Validation Score (GCV). We determined that model-based methods of enrichment estimation that spatially weight enrichment based on average patterns provided an improvement over tag counting methods. Also, methods that included information across the entire gene body provided improvement over methods that focus on a specific sub-region of the gene (e.g., the 5' or 3' region). The performance of data mining and machine learning methods when applied to histone modification ChIP-seq data can be improved by using data across the entire gene body, and incorporating the spatial distribution of enrichment. Refinement of enrichment estimation ultimately improved accuracy of model predictions. The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups have applied statistical computing methods to ChIP-seq datasets to gain insight into the epigenetic regulation of transcription. However, methods for estimating enrichment levels in ChIP-seq data for these computational studies are understudied and variable. Since the conclusions drawn from these data mining and machine learning applications strongly depend on the enrichment level inputs, a comparison of estimation methods with respect to the performance of statistical models should be made. Various methods were used to estimate the gene-wise ChIP-seq enrichment levels for 20 histone methylations and the histone variant H2A.Z. The Multivariate Adaptive Regression Splines (MARS) algorithm was applied for each estimation method using the estimation of enrichment levels as predictors and gene expression levels as responses. The methods used to estimate enrichment levels included tag counting and model-based methods that were applied to whole genes and specific gene regions. These methods were also applied to various sizes of estimation windows. The MARS model performance was assessed with the Generalized Cross-Validation Score (GCV). We determined that model-based methods of enrichment estimation that spatially weight enrichment based on average patterns provided an improvement over tag counting methods. Also, methods that included information across the entire gene body provided improvement over methods that focus on a specific sub-region of the gene (e.g., the 5' or 3' region). The performance of data mining and machine learning methods when applied to histone modification ChIP-seq data can be improved by using data across the entire gene body, and incorporating the spatial distribution of enrichment. Refinement of enrichment estimation ultimately improved accuracy of model predictions. Abstract Background The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups have applied statistical computing methods to ChIP-seq datasets to gain insight into the epigenetic regulation of transcription. However, methods for estimating enrichment levels in ChIP-seq data for these computational studies are understudied and variable. Since the conclusions drawn from these data mining and machine learning applications strongly depend on the enrichment level inputs, a comparison of estimation methods with respect to the performance of statistical models should be made. Results Various methods were used to estimate the gene-wise ChIP-seq enrichment levels for 20 histone methylations and the histone variant H2A.Z. The Multivariate Adaptive Regression Splines (MARS) algorithm was applied for each estimation method using the estimation of enrichment levels as predictors and gene expression levels as responses. The methods used to estimate enrichment levels included tag counting and model-based methods that were applied to whole genes and specific gene regions. These methods were also applied to various sizes of estimation windows. The MARS model performance was assessed with the Generalized Cross-Validation Score (GCV). We determined that model-based methods of enrichment estimation that spatially weight enrichment based on average patterns provided an improvement over tag counting methods. Also, methods that included information across the entire gene body provided improvement over methods that focus on a specific sub-region of the gene (e.g., the 5' or 3' region). Conclusion The performance of data mining and machine learning methods when applied to histone modification ChIP-seq data can be improved by using data across the entire gene body, and incorporating the spatial distribution of enrichment. Refinement of enrichment estimation ultimately improved accuracy of model predictions. Background The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups have applied statistical computing methods to ChIP-seq datasets to gain insight into the epigenetic regulation of transcription. However, methods for estimating enrichment levels in ChIP-seq data for these computational studies are understudied and variable. Since the conclusions drawn from these data mining and machine learning applications strongly depend on the enrichment level inputs, a comparison of estimation methods with respect to the performance of statistical models should be made. Results Various methods were used to estimate the gene-wise ChIP-seq enrichment levels for 20 histone methylations and the histone variant H2A.Z. The Multivariate Adaptive Regression Splines (MARS) algorithm was applied for each estimation method using the estimation of enrichment levels as predictors and gene expression levels as responses. The methods used to estimate enrichment levels included tag counting and model-based methods that were applied to whole genes and specific gene regions. These methods were also applied to various sizes of estimation windows. The MARS model performance was assessed with the Generalized Cross-Validation Score (GCV). We determined that model-based methods of enrichment estimation that spatially weight enrichment based on average patterns provided an improvement over tag counting methods. Also, methods that included information across the entire gene body provided improvement over methods that focus on a specific sub-region of the gene (e.g., the 5' or 3' region). Conclusion The performance of data mining and machine learning methods when applied to histone modification ChIP-seq data can be improved by using data across the entire gene body, and incorporating the spatial distribution of enrichment. Refinement of enrichment estimation ultimately improved accuracy of model predictions. Background The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups have applied statistical computing methods to ChIP-seq datasets to gain insight into the epigenetic regulation of transcription. However, methods for estimating enrichment levels in ChIP-seq data for these computational studies are understudied and variable. Since the conclusions drawn from these data mining and machine learning applications strongly depend on the enrichment level inputs, a comparison of estimation methods with respect to the performance of statistical models should be made. Results Various methods were used to estimate the gene-wise ChIP-seq enrichment levels for 20 histone methylations and the histone variant H2A.Z. The Multivariate Adaptive Regression Splines (MARS) algorithm was applied for each estimation method using the estimation of enrichment levels as predictors and gene expression levels as responses. The methods used to estimate enrichment levels included tag counting and model-based methods that were applied to whole genes and specific gene regions. These methods were also applied to various sizes of estimation windows. The MARS model performance was assessed with the Generalized Cross-Validation Score (GCV). We determined that model-based methods of enrichment estimation that spatially weight enrichment based on average patterns provided an improvement over tag counting methods. Also, methods that included information across the entire gene body provided improvement over methods that focus on a specific sub-region of the gene (e.g., the 5' or 3' region). Conclusion The performance of data mining and machine learning methods when applied to histone modification ChIP-seq data can be improved by using data across the entire gene body, and incorporating the spatial distribution of enrichment. Refinement of enrichment estimation ultimately improved accuracy of model predictions. The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups have applied statistical computing methods to ChIP-seq datasets to gain insight into the epigenetic regulation of transcription. However, methods for estimating enrichment levels in ChIP-seq data for these computational studies are understudied and variable. Since the conclusions drawn from these data mining and machine learning applications strongly depend on the enrichment level inputs, a comparison of estimation methods with respect to the performance of statistical models should be made.BACKGROUNDThe advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups have applied statistical computing methods to ChIP-seq datasets to gain insight into the epigenetic regulation of transcription. However, methods for estimating enrichment levels in ChIP-seq data for these computational studies are understudied and variable. Since the conclusions drawn from these data mining and machine learning applications strongly depend on the enrichment level inputs, a comparison of estimation methods with respect to the performance of statistical models should be made.Various methods were used to estimate the gene-wise ChIP-seq enrichment levels for 20 histone methylations and the histone variant H2A.Z. The Multivariate Adaptive Regression Splines (MARS) algorithm was applied for each estimation method using the estimation of enrichment levels as predictors and gene expression levels as responses. The methods used to estimate enrichment levels included tag counting and model-based methods that were applied to whole genes and specific gene regions. These methods were also applied to various sizes of estimation windows. The MARS model performance was assessed with the Generalized Cross-Validation Score (GCV). We determined that model-based methods of enrichment estimation that spatially weight enrichment based on average patterns provided an improvement over tag counting methods. Also, methods that included information across the entire gene body provided improvement over methods that focus on a specific sub-region of the gene (e.g., the 5' or 3' region).RESULTSVarious methods were used to estimate the gene-wise ChIP-seq enrichment levels for 20 histone methylations and the histone variant H2A.Z. The Multivariate Adaptive Regression Splines (MARS) algorithm was applied for each estimation method using the estimation of enrichment levels as predictors and gene expression levels as responses. The methods used to estimate enrichment levels included tag counting and model-based methods that were applied to whole genes and specific gene regions. These methods were also applied to various sizes of estimation windows. The MARS model performance was assessed with the Generalized Cross-Validation Score (GCV). We determined that model-based methods of enrichment estimation that spatially weight enrichment based on average patterns provided an improvement over tag counting methods. Also, methods that included information across the entire gene body provided improvement over methods that focus on a specific sub-region of the gene (e.g., the 5' or 3' region).The performance of data mining and machine learning methods when applied to histone modification ChIP-seq data can be improved by using data across the entire gene body, and incorporating the spatial distribution of enrichment. Refinement of enrichment estimation ultimately improved accuracy of model predictions.CONCLUSIONThe performance of data mining and machine learning methods when applied to histone modification ChIP-seq data can be improved by using data across the entire gene body, and incorporating the spatial distribution of enrichment. Refinement of enrichment estimation ultimately improved accuracy of model predictions. |
| ArticleNumber | 288 |
| Audience | Academic |
| Author | Hoang, Stephen A Bekiranov, Stefan Xu, Xiaojiang |
| AuthorAffiliation | 1 Department of Biochemistry and Molecular Genetics, University of Virginia Health System, Charlottesville, Virginia, USA |
| AuthorAffiliation_xml | – name: 1 Department of Biochemistry and Molecular Genetics, University of Virginia Health System, Charlottesville, Virginia, USA |
| Author_xml | – sequence: 1 givenname: Stephen A surname: Hoang fullname: Hoang, Stephen A organization: Department of Biochemistry and Molecular Genetics, University of Virginia Health System – sequence: 2 givenname: Xiaojiang surname: Xu fullname: Xu, Xiaojiang organization: Department of Biochemistry and Molecular Genetics, University of Virginia Health System – sequence: 3 givenname: Stefan surname: Bekiranov fullname: Bekiranov, Stefan email: sb3de@virginia.edu organization: Department of Biochemistry and Molecular Genetics, University of Virginia Health System |
| BackLink | https://www.ncbi.nlm.nih.gov/pubmed/21834981$$D View this record in MEDLINE/PubMed |
| BookMark | eNp9UkuP0zAYjNAi9gFnbigSB8Qhu3b8iHNBWlU8Kq20IAFX66sfqavE7toJgn-PuylliwD5EGsyM983yZwXJz54UxTPMbrEWPAr3DBeIYZQRataiEfF2QE5eXA_Lc5T2iDEsRD4SXFaY0FoK_BZ0X-awI_OOgWjC74Mtly7NOYp5RD0b3yxXn6skrkrjY9OrQfjx9KGWGoYoRycd74rwetyALV2WdwbiDO43fZ7k_S0eGyhT-bZ_nlRfHn39vPiQ3Vz-365uL6pFBNYVIy1hhDcUIIJBWgY44oIjQlrVhhyAk0Usi1rqcGsFdyA1YKjhlswtLY1uSiWs68OsJHb6AaIP2QAJ--BEDsJcXSqN7KhYqVrrXDNEaWIQE0ExdS0Tf48dEWy15vZazutBqNVDh6hPzI9fuPdWnbhm8wBECEsG7zaG8RwN5k0ysElZfoevAlTkkK0uOG83o16OTM7yJs5b0M2VDu2vK45Zy1HNc2sy7-w8tFmcCr_OOsyfiR4fSTInNF8HzuYUpLL26_H3BcP0x5i_ipMJlzNBBVDStHYAwUjuauk3JVO7konqcyVzAr2h0K58b4PeW_X_0eHZl3KE3xnotyEKfrcm39KfgJAsvB- |
| CitedBy_id | crossref_primary_10_1038_ncomms8052 crossref_primary_10_1186_1756_8935_6_28 crossref_primary_10_1038_s41598_017_03665_1 crossref_primary_10_1186_1471_2164_15_76 crossref_primary_10_1074_jbc_M114_626929 crossref_primary_10_1093_nar_gkt035 crossref_primary_10_1371_journal_pone_0099844 |
| Cites_doi | 10.1126/science.1063127 10.1146/annurev.genet.032608.103928 10.1038/nbt.1662 10.1038/ng.154 10.1016/j.cell.2007.05.009 10.1186/1471-2164-12-134 10.1073/pnas.0400782101 10.1093/bioinformatics/btp340 10.1016/j.cell.2007.02.006 10.1101/gr.073080.107 10.1101/gad.824700 10.1186/1471-2105-11-396 10.1073/pnas.0909344107 10.1038/ng.322 10.1074/jbc.M500796200 10.1038/nsmb1307 10.1038/47412 10.1016/S1097-2765(03)00092-3 10.1128/MCB.23.12.4207-4218.2003 10.1016/S0959-437X(02)00287-3 10.1186/gb-2011-12-2-r15 10.1186/1471-2105-12-57 10.1093/hmg/ddp409 10.1214/aos/1176347963 10.1214/aos/1176347973 10.1074/jbc.M513462200 10.1371/journal.pone.0006700 |
| ContentType | Journal Article |
| Copyright | Bekiranov et al; licensee BioMed Central Ltd. 2011 COPYRIGHT 2011 BioMed Central Ltd. Copyright ©2011 Bekiranov et al; licensee BioMed Central Ltd. 2011 Bekiranov et al; licensee BioMed Central Ltd. |
| Copyright_xml | – notice: Bekiranov et al; licensee BioMed Central Ltd. 2011 – notice: COPYRIGHT 2011 BioMed Central Ltd. – notice: Copyright ©2011 Bekiranov et al; licensee BioMed Central Ltd. 2011 Bekiranov et al; licensee BioMed Central Ltd. |
| DBID | C6C AAYXX CITATION NPM IOV 7X8 5PM DOA |
| DOI | 10.1186/1756-0500-4-288 |
| DatabaseName | Springer Nature OA Free Journals CrossRef PubMed Gale In Context: Opposing Viewpoints MEDLINE - Academic PubMed Central (Full Participant titles) DOAJ Directory of Open Access Journals |
| DatabaseTitle | CrossRef PubMed MEDLINE - Academic |
| DatabaseTitleList | PubMed MEDLINE - Academic |
| Database_xml | – sequence: 1 dbid: DOA name: DOAJ Directory of Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website – sequence: 2 dbid: NPM name: PubMed url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 3 dbid: 7X8 name: MEDLINE - Academic url: https://search.proquest.com/medline sourceTypes: Aggregation Database |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Biology |
| EISSN | 1756-0500 |
| EndPage | 288 |
| ExternalDocumentID | oai_doaj_org_article_748bd2dc12604403a238414e979814b3 PMC3170335 A266596024 21834981 10_1186_1756_0500_4_288 |
| Genre | Journal Article |
| GrantInformation_xml | – fundername: NIGMS NIH HHS grantid: T32 GM080186 |
| GroupedDBID | --- 0R~ 23N 2VQ 2WC 4.4 53G 5GY 5VS 6J9 7X7 88E 8FE 8FH 8FI 8FJ AAFWJ AAJSJ AASML ABDBF ABUWG ACGFO ACGFS ACIHN ACMJI ACPRK ACUHS ADBBV ADRAZ ADUKV AEAQA AFKRA AFPKN AHBYD AHMBA AHSBF AHYZX ALMA_UNASSIGNED_HOLDINGS AMKLP AMTXH AOIJS BAPOH BAWUL BBNVY BCNDV BENPR BFQNJ BHPHI BMC BPHCQ BVXVI C1A C6C CCPQU CS3 DIK E3Z EBLON EBS EJD EMOBN ESX F5P FYUFA GROUPED_DOAJ GX1 HCIFZ HMCUK HYE IAO IEA IHR INH INR IOV IPNFZ ITC KQ8 LK8 M1P M48 M7P MK0 M~E O5R O5S OK1 OVT P2P PGMZT PHGZM PHGZT PIMPY PJZUB PPXIY PQGLB PQQKQ PROAC PSQYO PUEGO RBZ RIG RNS ROL RPM RSV SBL SOJ SV3 TR2 TUS UKHRP ~8M AAYXX AFFHD CITATION ALIPV NPM 7X8 5PM |
| ID | FETCH-LOGICAL-c5818-559e331743134aa7556c38d1357b1a881d3c0f9594e15986eafd86076fae42f23 |
| IEDL.DBID | RSV |
| ISSN | 1756-0500 |
| IngestDate | Fri Oct 03 12:43:59 EDT 2025 Tue Nov 04 01:51:08 EST 2025 Wed Oct 01 12:56:19 EDT 2025 Tue Nov 11 10:34:46 EST 2025 Tue Nov 04 18:02:06 EST 2025 Thu Nov 13 15:17:47 EST 2025 Mon Jul 21 05:44:26 EDT 2025 Tue Nov 18 22:37:42 EST 2025 Sat Nov 29 05:36:44 EST 2025 Sat Sep 06 07:27:52 EDT 2025 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 1 |
| Keywords | Root Mean Square Deviation Transcription Start Site Multivariate Adaptive Regression Spline Model Multivariate Adaptive Regression Spline Enrichment Level |
| Language | English |
| License | http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c5818-559e331743134aa7556c38d1357b1a881d3c0f9594e15986eafd86076fae42f23 |
| Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
| OpenAccessLink | https://link.springer.com/10.1186/1756-0500-4-288 |
| PMID | 21834981 |
| PQID | 889176623 |
| PQPubID | 23479 |
| ParticipantIDs | doaj_primary_oai_doaj_org_article_748bd2dc12604403a238414e979814b3 pubmedcentral_primary_oai_pubmedcentral_nih_gov_3170335 proquest_miscellaneous_889176623 gale_infotracmisc_A266596024 gale_infotracacademiconefile_A266596024 gale_incontextgauss_IOV_A266596024 pubmed_primary_21834981 crossref_primary_10_1186_1756_0500_4_288 crossref_citationtrail_10_1186_1756_0500_4_288 springer_journals_10_1186_1756_0500_4_288 |
| PublicationCentury | 2000 |
| PublicationDate | 20110811 |
| PublicationDateYYYYMMDD | 2011-08-11 |
| PublicationDate_xml | – month: 8 year: 2011 text: 20110811 day: 11 |
| PublicationDecade | 2010 |
| PublicationPlace | London |
| PublicationPlace_xml | – name: London – name: England |
| PublicationTitle | BMC research notes |
| PublicationTitleAbbrev | BMC Res Notes |
| PublicationTitleAlternate | BMC Res Notes |
| PublicationYear | 2011 |
| Publisher | BioMed Central BioMed Central Ltd BMC |
| Publisher_xml | – name: BioMed Central – name: BioMed Central Ltd – name: BMC |
| References | Z Wang (1040_CR5) 2008; 40 JK Sims (1040_CR27) 2006; 281 H Yu (1040_CR7) 2008; 18 1040_CR22 1040_CR24 T Jenuwein (1040_CR2) 2001; 293 1040_CR26 AI Su (1040_CR21) 2004; 101 1040_CR6 GC Hon (1040_CR13) 2009; 18 1040_CR8 AJ Bannister (1040_CR15) 2005; 280 1040_CR9 HH Ng (1040_CR18) 2003; 11 JH Friedman (1040_CR19) 1991; 19 JH Friedman (1040_CR20) 1991; 19 P Kolasinska-Zwierz (1040_CR14) 2009; 41 T Kouzarides (1040_CR11) 2002; 12 EI Campos (1040_CR12) 2009; 43 A Barski (1040_CR10) 2007; 129 L Teytelman (1040_CR23) 2009; 4 AD Goldberg (1040_CR3) 2007; 128 JA Latham (1040_CR4) 2007; 14 C Zang (1040_CR25) 2009; 25 BD Strahl (1040_CR1) 2000; 403 P Komarnitsky (1040_CR17) 2000; 14 NJ Krogan (1040_CR16) 2003; 23 11498575 - Science. 2001 Aug 10;293(5532):1074-80 19808796 - Hum Mol Genet. 2009 Oct 15;18(R2):R195-201 18562678 - Genome Res. 2008 Aug;18(8):1314-24 19886812 - Annu Rev Genet. 2009;43:559-99 15760899 - J Biol Chem. 2005 May 6;280(18):17732-6 17984964 - Nat Struct Mol Biol. 2007 Nov;14(11):1017-24 21338513 - BMC Bioinformatics. 2011 Feb 21;12:57 16517599 - J Biol Chem. 2006 May 5;281(18):12760-6 20653935 - BMC Bioinformatics. 2010 Jul 23;11:396 10638745 - Nature. 2000 Jan 6;403(6765):41-5 19182803 - Nat Genet. 2009 Mar;41(3):376-81 17320500 - Cell. 2007 Feb 23;128(4):635-8 11018013 - Genes Dev. 2000 Oct 1;14(19):2452-60 11893494 - Curr Opin Genet Dev. 2002 Apr;12(2):198-209 18552846 - Nat Genet. 2008 Jul;40(7):897-903 17512414 - Cell. 2007 May 18;129(4):823-37 19505939 - Bioinformatics. 2009 Aug 1;25(15):1952-8 20657582 - Nat Biotechnol. 2010 Aug;28(8):817-25 15075390 - Proc Natl Acad Sci U S A. 2004 Apr 20;101(16):6062-7 21324173 - Genome Biol. 2011;12(2):R15 12773564 - Mol Cell Biol. 2003 Jun;23(12):4207-18 19693276 - PLoS One. 2009;4(8):e6700 20133639 - Proc Natl Acad Sci U S A. 2010 Feb 16;107(7):2926-31 21356108 - BMC Genomics. 2011 Feb 28;12:134 12667453 - Mol Cell. 2003 Mar;11(3):709-19 |
| References_xml | – volume: 293 start-page: 1074 issue: 5532 year: 2001 ident: 1040_CR2 publication-title: Science doi: 10.1126/science.1063127 – volume: 43 start-page: 559 year: 2009 ident: 1040_CR12 publication-title: Annu Rev Genet doi: 10.1146/annurev.genet.032608.103928 – ident: 1040_CR22 doi: 10.1038/nbt.1662 – volume: 40 start-page: 897 issue: 7 year: 2008 ident: 1040_CR5 publication-title: Nat Genet doi: 10.1038/ng.154 – volume: 129 start-page: 823 issue: 4 year: 2007 ident: 1040_CR10 publication-title: Cell doi: 10.1016/j.cell.2007.05.009 – ident: 1040_CR24 doi: 10.1186/1471-2164-12-134 – volume: 101 start-page: 6062 issue: 16 year: 2004 ident: 1040_CR21 publication-title: Proc Natl Acad Sci USA doi: 10.1073/pnas.0400782101 – volume: 25 start-page: 1952 issue: 15 year: 2009 ident: 1040_CR25 publication-title: Bioinformatics doi: 10.1093/bioinformatics/btp340 – volume: 128 start-page: 635 issue: 4 year: 2007 ident: 1040_CR3 publication-title: Cell doi: 10.1016/j.cell.2007.02.006 – volume: 18 start-page: 1314 issue: 8 year: 2008 ident: 1040_CR7 publication-title: Genome Res doi: 10.1101/gr.073080.107 – volume: 14 start-page: 2452 issue: 19 year: 2000 ident: 1040_CR17 publication-title: Genes Dev doi: 10.1101/gad.824700 – ident: 1040_CR6 doi: 10.1186/1471-2105-11-396 – ident: 1040_CR9 doi: 10.1073/pnas.0909344107 – volume: 41 start-page: 376 issue: 3 year: 2009 ident: 1040_CR14 publication-title: Nat Genet doi: 10.1038/ng.322 – volume: 280 start-page: 17732 issue: 18 year: 2005 ident: 1040_CR15 publication-title: J Biol Chem doi: 10.1074/jbc.M500796200 – volume: 14 start-page: 1017 issue: 11 year: 2007 ident: 1040_CR4 publication-title: Nat Struct Mol Biol doi: 10.1038/nsmb1307 – volume: 403 start-page: 41 issue: 6765 year: 2000 ident: 1040_CR1 publication-title: Nature doi: 10.1038/47412 – volume: 11 start-page: 709 issue: 3 year: 2003 ident: 1040_CR18 publication-title: Mol Cell doi: 10.1016/S1097-2765(03)00092-3 – volume: 23 start-page: 4207 issue: 12 year: 2003 ident: 1040_CR16 publication-title: Mol Cell Biol doi: 10.1128/MCB.23.12.4207-4218.2003 – volume: 12 start-page: 198 issue: 2 year: 2002 ident: 1040_CR11 publication-title: Curr Opin Genet Dev doi: 10.1016/S0959-437X(02)00287-3 – ident: 1040_CR8 doi: 10.1186/gb-2011-12-2-r15 – ident: 1040_CR26 doi: 10.1186/1471-2105-12-57 – volume: 18 start-page: R195 issue: R2 year: 2009 ident: 1040_CR13 publication-title: Hum Mol Genet doi: 10.1093/hmg/ddp409 – volume: 19 start-page: 1 issue: 1 year: 1991 ident: 1040_CR19 publication-title: Annals of Statistics doi: 10.1214/aos/1176347963 – volume: 19 start-page: 123 issue: 1 year: 1991 ident: 1040_CR20 publication-title: Annals of Statistics doi: 10.1214/aos/1176347973 – volume: 281 start-page: 12760 issue: 18 year: 2006 ident: 1040_CR27 publication-title: J Biol Chem doi: 10.1074/jbc.M513462200 – volume: 4 start-page: e6700 issue: 8 year: 2009 ident: 1040_CR23 publication-title: PLoS One doi: 10.1371/journal.pone.0006700 – reference: 11018013 - Genes Dev. 2000 Oct 1;14(19):2452-60 – reference: 18562678 - Genome Res. 2008 Aug;18(8):1314-24 – reference: 17984964 - Nat Struct Mol Biol. 2007 Nov;14(11):1017-24 – reference: 15760899 - J Biol Chem. 2005 May 6;280(18):17732-6 – reference: 16517599 - J Biol Chem. 2006 May 5;281(18):12760-6 – reference: 17320500 - Cell. 2007 Feb 23;128(4):635-8 – reference: 15075390 - Proc Natl Acad Sci U S A. 2004 Apr 20;101(16):6062-7 – reference: 21356108 - BMC Genomics. 2011 Feb 28;12:134 – reference: 20657582 - Nat Biotechnol. 2010 Aug;28(8):817-25 – reference: 12667453 - Mol Cell. 2003 Mar;11(3):709-19 – reference: 20133639 - Proc Natl Acad Sci U S A. 2010 Feb 16;107(7):2926-31 – reference: 21338513 - BMC Bioinformatics. 2011 Feb 21;12:57 – reference: 20653935 - BMC Bioinformatics. 2010 Jul 23;11:396 – reference: 19182803 - Nat Genet. 2009 Mar;41(3):376-81 – reference: 19886812 - Annu Rev Genet. 2009;43:559-99 – reference: 18552846 - Nat Genet. 2008 Jul;40(7):897-903 – reference: 12773564 - Mol Cell Biol. 2003 Jun;23(12):4207-18 – reference: 11498575 - Science. 2001 Aug 10;293(5532):1074-80 – reference: 11893494 - Curr Opin Genet Dev. 2002 Apr;12(2):198-209 – reference: 19693276 - PLoS One. 2009;4(8):e6700 – reference: 21324173 - Genome Biol. 2011;12(2):R15 – reference: 19505939 - Bioinformatics. 2009 Aug 1;25(15):1952-8 – reference: 17512414 - Cell. 2007 May 18;129(4):823-37 – reference: 10638745 - Nature. 2000 Jan 6;403(6765):41-5 – reference: 19808796 - Hum Mol Genet. 2009 Oct 15;18(R2):R195-201 |
| SSID | ssj0061881 |
| Score | 1.9481295 |
| Snippet | Background
The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups... The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups have applied... Background The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups... Abstract Background The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several... |
| SourceID | doaj pubmedcentral proquest gale pubmed crossref springer |
| SourceType | Open Website Open Access Repository Aggregation Database Index Database Enrichment Source Publisher |
| StartPage | 288 |
| SubjectTerms | Biomedical and Life Sciences Biomedicine Data mining Genetic research Life Sciences Medicine/Public Health Methods Research Article Technology application |
| SummonAdditionalLinks | – databaseName: DOAJ Directory of Open Access Journals dbid: DOA link: http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1Lb9QwELZQVSQuqOWZUpCFkICDaRw7iXMsFRW9lCIB6s1y_GBX6mah6SLx75lxnO2mqOLC1Z5I9nwTz4wz-YaQV6Fqgw-eMx5CxeCUFKy1bclsrdrCCdU2dmg2UZ-eqvPz5myj1RfWhA30wIPiDmqpWlc4yyHwljIXBnyM5NI3daO4bCPPJ0Q9YzI1nMEVV7E9KfhGyJfLPE-kPlxVB-sxJlkRG65c-6NI2__34bzhnW5WTt74fBq90vEOuZ_CSXo4bGOX3PHdA3J3aDD5-yG5-LwyQzVQBIAuA40Ew52ni6W7Hj-anZyx3v-kYE1zO8MbQwrRLMX6UbqIPSSo6RxdxNJLT1OvCRjc-P79iHw9_vDl6CNL_RWYLcFPM0gmvBCYknAhjanLsrJCOS7KuuUGNOiEzUNTNtJzpHH3JjhV5XUVjJdFKMRjstXBip8SWoiqKWwNEoCNy11jlHMm4EWoghikzMi7UcvaJvJx7IFxoWMSoiqNsGiERUsNsGTkzfqBHwPvxu2i7xG2tRgSZscBMCOdzEj_y4wy8hJB10iJ0WHNzXez6nt98umbPoQYpoREr5AZeZ2EwhJWb036hQF0gCxaE8n9iSS8s3YyTUfb0jiFhW6dX656rVSDlJ0FLOjJYGrrfWEwK2G1GaknRjjZ-HSmm88iYziAnAsBKLwdzVWno6q_Tat7_0Orz8i98Q6e832ydXW58s_Jtv11Ne8vX8RX9g8o4T5n priority: 102 providerName: Directory of Open Access Journals |
| Title | Quantification of histone modification ChIP-seq enrichment for data mining and machine learning applications |
| URI | https://link.springer.com/article/10.1186/1756-0500-4-288 https://www.ncbi.nlm.nih.gov/pubmed/21834981 https://www.proquest.com/docview/889176623 https://pubmed.ncbi.nlm.nih.gov/PMC3170335 https://doaj.org/article/748bd2dc12604403a238414e979814b3 |
| Volume | 4 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVADU databaseName: BioMed Central Open Access Free customDbUrl: eissn: 1756-0500 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0061881 issn: 1756-0500 databaseCode: RBZ dateStart: 20080101 isFulltext: true titleUrlDefault: https://www.biomedcentral.com/search/ providerName: BioMedCentral – providerCode: PRVAON databaseName: DOAJ Directory of Open Access Journals customDbUrl: eissn: 1756-0500 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0061881 issn: 1756-0500 databaseCode: DOA dateStart: 20080101 isFulltext: true titleUrlDefault: https://www.doaj.org/ providerName: Directory of Open Access Journals – providerCode: PRVHPJ databaseName: ROAD: Directory of Open Access Scholarly Resources customDbUrl: eissn: 1756-0500 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0061881 issn: 1756-0500 databaseCode: M~E dateStart: 20080101 isFulltext: true titleUrlDefault: https://road.issn.org providerName: ISSN International Centre – providerCode: PRVPQU databaseName: Biological Science Database customDbUrl: eissn: 1756-0500 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0061881 issn: 1756-0500 databaseCode: M7P dateStart: 20090101 isFulltext: true titleUrlDefault: http://search.proquest.com/biologicalscijournals providerName: ProQuest – providerCode: PRVPQU databaseName: Health & Medical Collection customDbUrl: eissn: 1756-0500 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0061881 issn: 1756-0500 databaseCode: 7X7 dateStart: 20090101 isFulltext: true titleUrlDefault: https://search.proquest.com/healthcomplete providerName: ProQuest – providerCode: PRVPQU databaseName: ProQuest Central (subscription) customDbUrl: eissn: 1756-0500 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0061881 issn: 1756-0500 databaseCode: BENPR dateStart: 20090101 isFulltext: true titleUrlDefault: https://www.proquest.com/central providerName: ProQuest – providerCode: PRVPQU databaseName: Publicly Available Content Database customDbUrl: eissn: 1756-0500 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0061881 issn: 1756-0500 databaseCode: PIMPY dateStart: 20090101 isFulltext: true titleUrlDefault: http://search.proquest.com/publiccontent providerName: ProQuest – providerCode: PRVAVX databaseName: Springer Journals customDbUrl: eissn: 1756-0500 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0061881 issn: 1756-0500 databaseCode: RSV dateStart: 20081201 isFulltext: true titleUrlDefault: https://link.springer.com/search?facet-content-type=%22Journal%22 providerName: Springer Nature |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9QwELb6AIkLb-hCWVkICTikxLETO8e2asUeWEKBajlZjh_dlbrZsuki8e8ZO8nSFHqASw72RJrMjD0z9uQbhF65rHTWWRIR57IIdkkalbpMI81FmRgqylw3zSb4eCwmk7zYQKT7FyZUu3dXkmGnDstaZO_Az0Hum8ZxxKJEiE20Db5O-LV48vm023wzIgRpEXz-8lLP-QSM_j934iuu6HqZ5LW70uCCju_9B_P30d023sT7jYE8QBu2eohuNx0ofz5C559WqikXChrCC4cDAnFl8Xxhfo8fTkdFVNvvGMxtpqf-SBFDuIt9gSmehyYTWFUGz0NtpsVtMwoYvHJB_hh9PT76cvg-ahswRDoFRx5BtmEp9TkLoUwpnqaZpsIQmvKSKJC0oTp2eZozSzzOu1XOiCzmmVOWJS6hT9BWBRzvIJzQLE80BwpGmIlNroQxyvmTUgFBSjpAe51mpG7RyX2TjHMZshSRSS9C6UUomQQRDtCb9QsXDTDHzaQHXtVrMo-oHQYWyzPZLlDJmShNYjSBBI-xmCqIZYBVm3MwMlbSAXrpDUV6zIzKF-WcqVVdy9HHU7kPQU4KmWDCBuh1S-QWwL1W7T8OIAMPs9Wj3O1RwqLWvWnc2aP0U74SrrKLVS2FyD2mZwIMPW3Mc_1dPtplwO0A8Z7h9j68P1PNpgFSHJQcUwpaeNuZr2z3svomqT77B9rn6E53Fk_ILtq6XK7sC3RL_7ic1csh2uQTHp5iiLYPjsbFyTCcjQx9JW4BY8XoQ_FtGFb6L_U5Rxo |
| linkProvider | Springer Nature |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3db9QwDLfGAMEL3x8HAyKEBHsoNE3apo9jYtqJcQwY096iNB-7k3YtXHdI_Pc4aXusgz3Aa-JKru3EduL8DPDCZaWzztKIOpdFuEuyqNRlGulclIlhoix022win0zE0VGxvwa0fwsTqt37K8mwU4dlLbI36Ocw903jOOJRIsQluMzRXfkqvs9fDvvNN6NC0A7B5y8fDZxPwOj_cyc-44rOl0meuysNLmjn5n8wfwtudPEm2WoN5Das2eoOXG07UP68CyeflqotFwoaIrUjAYG4smRem9_j29PxftTY7wTNbaan_kiRYLhLfIEpmYcmE0RVhsxDbaYlXTMKHDxzQX4Pvu68O9jejboGDJFO0ZFHmG1YxnzOQhlXKk_TTDNhKEvzkiqUtGE6dkVacEs9zrtVzogszjOnLE9cwu7DeoUcPwSSsKxIdI4UnHITm0IJY5TzJ6UCg5R0BK97zUjdoZP7JhknMmQpIpNehNKLUHKJIhzBq9UH31pgjotJ33pVr8g8onYYqBfHslugMueiNInRFBM8zmOmMJZBVm2RF4Lyko3guTcU6TEzKl-Uc6yWTSPHHw_lFgY5KWaCCR_By47I1ci9Vt0bB5SBh9kaUG4MKHFR68E06e1R-ilfCVfZetlIIQqP6ZkgQw9a81z9l492OXI7gnxguIMfH85Us2mAFEclx4yhFjZ785XdXtZcJNVH_0D7DK7tHnzYk3vjyfvHcL0_l6d0A9ZPF0v7BK7oH6ezZvE0rOZfdJBC-Q |
| linkToPdf | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV3db9QwDI9gwMQL34zCgAghAQ9lTZO26eMYnDiBjkPAtLcozcfupF07rndI_PfY_TjWwR4Qr4krObYT24n7MyHPfVp45x0LmfdpCKckDwtTJKHJZBFbLovctM0msslEHh3l0642p-6r3fsnyfafBkRpKld7p9a3W1yme-DzIA9OoigUYSzlZXJFYMcgTNa_HPYHccqkZB2az18-GjiiBq__z1P5jFs6XzJ57t20cUejm_-5kFvkRheH0v3WcG6TS668Q661nSl_3iUnn9e6LSNqNEcrTxtk4tLRRWV_jx_MxtOwdt8pmOHczPCqkUIYTLHwlC6a5hNUl5YumppNR7smFTB45uH8Hvk2evf14H3YNWYITQIOPoQsxHGOuQzjQussSVLDpWU8yQqmQeqWm8jnSS4cQ_x3p72VaZSlXjsR-5jfJ1slcPyA0JineWwyoBBM2MjmWlqrPd6gSghekoC87rWkTIdajs0zTlSTvchUoQgVilAJBSIMyMvNB6ctYMfFpG9Q7RsyRNpuBqrlseo2rsqELGxsDYPET4iIa4hxgFWXZ7lkouABeYZGoxBLo8RinWO9rms1_nSo9iH4SSBDjEVAXnREvgLuje7-fQAZIPzWgHJ3QAmb3QymaW-bCqewQq501bpWUuaI9RkDQzutqW7WhVGwAG4Dkg2MeLDw4Uw5nzVQ46DkiHPQwqvelFV3xtUXSfXhP9A-JdvTtyP1cTz58Ihc76_rGdslW6vl2j0mV82P1bxePmk29i8Zv0vd |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Quantification+of+histone+modification+ChIP-seq+enrichment+for+data+mining+and+machine+learning+applications&rft.jtitle=BMC+research+notes&rft.au=Hoang%2C+Stephen+A&rft.au=Xu%2C+Xiaojiang&rft.au=Bekiranov%2C+Stefan&rft.date=2011-08-11&rft.pub=BioMed+Central&rft.eissn=1756-0500&rft.volume=4&rft.spage=288&rft.epage=288&rft_id=info:doi/10.1186%2F1756-0500-4-288&rft_id=info%3Apmid%2F21834981&rft.externalDocID=PMC3170335 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1756-0500&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1756-0500&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1756-0500&client=summon |