Quantification of histone modification ChIP-seq enrichment for data mining and machine learning applications

Background The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups have applied statistical computing methods to ChIP-seq datasets to gain insight into the epigenetic regulation of transcription. However, meth...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:BMC research notes Ročník 4; číslo 1; s. 288
Hlavní autori: Hoang, Stephen A, Xu, Xiaojiang, Bekiranov, Stefan
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: London BioMed Central 11.08.2011
BioMed Central Ltd
BMC
Predmet:
ISSN:1756-0500, 1756-0500
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Abstract Background The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups have applied statistical computing methods to ChIP-seq datasets to gain insight into the epigenetic regulation of transcription. However, methods for estimating enrichment levels in ChIP-seq data for these computational studies are understudied and variable. Since the conclusions drawn from these data mining and machine learning applications strongly depend on the enrichment level inputs, a comparison of estimation methods with respect to the performance of statistical models should be made. Results Various methods were used to estimate the gene-wise ChIP-seq enrichment levels for 20 histone methylations and the histone variant H2A.Z. The Multivariate Adaptive Regression Splines (MARS) algorithm was applied for each estimation method using the estimation of enrichment levels as predictors and gene expression levels as responses. The methods used to estimate enrichment levels included tag counting and model-based methods that were applied to whole genes and specific gene regions. These methods were also applied to various sizes of estimation windows. The MARS model performance was assessed with the Generalized Cross-Validation Score (GCV). We determined that model-based methods of enrichment estimation that spatially weight enrichment based on average patterns provided an improvement over tag counting methods. Also, methods that included information across the entire gene body provided improvement over methods that focus on a specific sub-region of the gene (e.g., the 5' or 3' region). Conclusion The performance of data mining and machine learning methods when applied to histone modification ChIP-seq data can be improved by using data across the entire gene body, and incorporating the spatial distribution of enrichment. Refinement of enrichment estimation ultimately improved accuracy of model predictions.
AbstractList The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups have applied statistical computing methods to ChIP-seq datasets to gain insight into the epigenetic regulation of transcription. However, methods for estimating enrichment levels in ChIP-seq data for these computational studies are understudied and variable. Since the conclusions drawn from these data mining and machine learning applications strongly depend on the enrichment level inputs, a comparison of estimation methods with respect to the performance of statistical models should be made. Various methods were used to estimate the gene-wise ChIP-seq enrichment levels for 20 histone methylations and the histone variant H2A.Z. The Multivariate Adaptive Regression Splines (MARS) algorithm was applied for each estimation method using the estimation of enrichment levels as predictors and gene expression levels as responses. The methods used to estimate enrichment levels included tag counting and model-based methods that were applied to whole genes and specific gene regions. These methods were also applied to various sizes of estimation windows. The MARS model performance was assessed with the Generalized Cross-Validation Score (GCV). We determined that model-based methods of enrichment estimation that spatially weight enrichment based on average patterns provided an improvement over tag counting methods. Also, methods that included information across the entire gene body provided improvement over methods that focus on a specific sub-region of the gene (e.g., the 5' or 3' region). The performance of data mining and machine learning methods when applied to histone modification ChIP-seq data can be improved by using data across the entire gene body, and incorporating the spatial distribution of enrichment. Refinement of enrichment estimation ultimately improved accuracy of model predictions.
Background The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups have applied statistical computing methods to ChIP-seq datasets to gain insight into the epigenetic regulation of transcription. However, methods for estimating enrichment levels in ChIP-seq data for these computational studies are understudied and variable. Since the conclusions drawn from these data mining and machine learning applications strongly depend on the enrichment level inputs, a comparison of estimation methods with respect to the performance of statistical models should be made. Results Various methods were used to estimate the gene-wise ChIP-seq enrichment levels for 20 histone methylations and the histone variant H2A.Z. The Multivariate Adaptive Regression Splines (MARS) algorithm was applied for each estimation method using the estimation of enrichment levels as predictors and gene expression levels as responses. The methods used to estimate enrichment levels included tag counting and model-based methods that were applied to whole genes and specific gene regions. These methods were also applied to various sizes of estimation windows. The MARS model performance was assessed with the Generalized Cross-Validation Score (GCV). We determined that model-based methods of enrichment estimation that spatially weight enrichment based on average patterns provided an improvement over tag counting methods. Also, methods that included information across the entire gene body provided improvement over methods that focus on a specific sub-region of the gene (e.g., the 5' or 3' region). Conclusion The performance of data mining and machine learning methods when applied to histone modification ChIP-seq data can be improved by using data across the entire gene body, and incorporating the spatial distribution of enrichment. Refinement of enrichment estimation ultimately improved accuracy of model predictions.
The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups have applied statistical computing methods to ChIP-seq datasets to gain insight into the epigenetic regulation of transcription. However, methods for estimating enrichment levels in ChIP-seq data for these computational studies are understudied and variable. Since the conclusions drawn from these data mining and machine learning applications strongly depend on the enrichment level inputs, a comparison of estimation methods with respect to the performance of statistical models should be made. Various methods were used to estimate the gene-wise ChIP-seq enrichment levels for 20 histone methylations and the histone variant H2A.Z. The Multivariate Adaptive Regression Splines (MARS) algorithm was applied for each estimation method using the estimation of enrichment levels as predictors and gene expression levels as responses. The methods used to estimate enrichment levels included tag counting and model-based methods that were applied to whole genes and specific gene regions. These methods were also applied to various sizes of estimation windows. The MARS model performance was assessed with the Generalized Cross-Validation Score (GCV). We determined that model-based methods of enrichment estimation that spatially weight enrichment based on average patterns provided an improvement over tag counting methods. Also, methods that included information across the entire gene body provided improvement over methods that focus on a specific sub-region of the gene (e.g., the 5' or 3' region). The performance of data mining and machine learning methods when applied to histone modification ChIP-seq data can be improved by using data across the entire gene body, and incorporating the spatial distribution of enrichment. Refinement of enrichment estimation ultimately improved accuracy of model predictions.
Abstract Background The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups have applied statistical computing methods to ChIP-seq datasets to gain insight into the epigenetic regulation of transcription. However, methods for estimating enrichment levels in ChIP-seq data for these computational studies are understudied and variable. Since the conclusions drawn from these data mining and machine learning applications strongly depend on the enrichment level inputs, a comparison of estimation methods with respect to the performance of statistical models should be made. Results Various methods were used to estimate the gene-wise ChIP-seq enrichment levels for 20 histone methylations and the histone variant H2A.Z. The Multivariate Adaptive Regression Splines (MARS) algorithm was applied for each estimation method using the estimation of enrichment levels as predictors and gene expression levels as responses. The methods used to estimate enrichment levels included tag counting and model-based methods that were applied to whole genes and specific gene regions. These methods were also applied to various sizes of estimation windows. The MARS model performance was assessed with the Generalized Cross-Validation Score (GCV). We determined that model-based methods of enrichment estimation that spatially weight enrichment based on average patterns provided an improvement over tag counting methods. Also, methods that included information across the entire gene body provided improvement over methods that focus on a specific sub-region of the gene (e.g., the 5' or 3' region). Conclusion The performance of data mining and machine learning methods when applied to histone modification ChIP-seq data can be improved by using data across the entire gene body, and incorporating the spatial distribution of enrichment. Refinement of enrichment estimation ultimately improved accuracy of model predictions.
The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups have applied statistical computing methods to ChIP-seq datasets to gain insight into the epigenetic regulation of transcription. However, methods for estimating enrichment levels in ChIP-seq data for these computational studies are understudied and variable. Since the conclusions drawn from these data mining and machine learning applications strongly depend on the enrichment level inputs, a comparison of estimation methods with respect to the performance of statistical models should be made.BACKGROUNDThe advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups have applied statistical computing methods to ChIP-seq datasets to gain insight into the epigenetic regulation of transcription. However, methods for estimating enrichment levels in ChIP-seq data for these computational studies are understudied and variable. Since the conclusions drawn from these data mining and machine learning applications strongly depend on the enrichment level inputs, a comparison of estimation methods with respect to the performance of statistical models should be made.Various methods were used to estimate the gene-wise ChIP-seq enrichment levels for 20 histone methylations and the histone variant H2A.Z. The Multivariate Adaptive Regression Splines (MARS) algorithm was applied for each estimation method using the estimation of enrichment levels as predictors and gene expression levels as responses. The methods used to estimate enrichment levels included tag counting and model-based methods that were applied to whole genes and specific gene regions. These methods were also applied to various sizes of estimation windows. The MARS model performance was assessed with the Generalized Cross-Validation Score (GCV). We determined that model-based methods of enrichment estimation that spatially weight enrichment based on average patterns provided an improvement over tag counting methods. Also, methods that included information across the entire gene body provided improvement over methods that focus on a specific sub-region of the gene (e.g., the 5' or 3' region).RESULTSVarious methods were used to estimate the gene-wise ChIP-seq enrichment levels for 20 histone methylations and the histone variant H2A.Z. The Multivariate Adaptive Regression Splines (MARS) algorithm was applied for each estimation method using the estimation of enrichment levels as predictors and gene expression levels as responses. The methods used to estimate enrichment levels included tag counting and model-based methods that were applied to whole genes and specific gene regions. These methods were also applied to various sizes of estimation windows. The MARS model performance was assessed with the Generalized Cross-Validation Score (GCV). We determined that model-based methods of enrichment estimation that spatially weight enrichment based on average patterns provided an improvement over tag counting methods. Also, methods that included information across the entire gene body provided improvement over methods that focus on a specific sub-region of the gene (e.g., the 5' or 3' region).The performance of data mining and machine learning methods when applied to histone modification ChIP-seq data can be improved by using data across the entire gene body, and incorporating the spatial distribution of enrichment. Refinement of enrichment estimation ultimately improved accuracy of model predictions.CONCLUSIONThe performance of data mining and machine learning methods when applied to histone modification ChIP-seq data can be improved by using data across the entire gene body, and incorporating the spatial distribution of enrichment. Refinement of enrichment estimation ultimately improved accuracy of model predictions.
Background The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups have applied statistical computing methods to ChIP-seq datasets to gain insight into the epigenetic regulation of transcription. However, methods for estimating enrichment levels in ChIP-seq data for these computational studies are understudied and variable. Since the conclusions drawn from these data mining and machine learning applications strongly depend on the enrichment level inputs, a comparison of estimation methods with respect to the performance of statistical models should be made. Results Various methods were used to estimate the gene-wise ChIP-seq enrichment levels for 20 histone methylations and the histone variant H2A.Z. The Multivariate Adaptive Regression Splines (MARS) algorithm was applied for each estimation method using the estimation of enrichment levels as predictors and gene expression levels as responses. The methods used to estimate enrichment levels included tag counting and model-based methods that were applied to whole genes and specific gene regions. These methods were also applied to various sizes of estimation windows. The MARS model performance was assessed with the Generalized Cross-Validation Score (GCV). We determined that model-based methods of enrichment estimation that spatially weight enrichment based on average patterns provided an improvement over tag counting methods. Also, methods that included information across the entire gene body provided improvement over methods that focus on a specific sub-region of the gene (e.g., the 5' or 3' region). Conclusion The performance of data mining and machine learning methods when applied to histone modification ChIP-seq data can be improved by using data across the entire gene body, and incorporating the spatial distribution of enrichment. Refinement of enrichment estimation ultimately improved accuracy of model predictions.
ArticleNumber 288
Audience Academic
Author Hoang, Stephen A
Bekiranov, Stefan
Xu, Xiaojiang
AuthorAffiliation 1 Department of Biochemistry and Molecular Genetics, University of Virginia Health System, Charlottesville, Virginia, USA
AuthorAffiliation_xml – name: 1 Department of Biochemistry and Molecular Genetics, University of Virginia Health System, Charlottesville, Virginia, USA
Author_xml – sequence: 1
  givenname: Stephen A
  surname: Hoang
  fullname: Hoang, Stephen A
  organization: Department of Biochemistry and Molecular Genetics, University of Virginia Health System
– sequence: 2
  givenname: Xiaojiang
  surname: Xu
  fullname: Xu, Xiaojiang
  organization: Department of Biochemistry and Molecular Genetics, University of Virginia Health System
– sequence: 3
  givenname: Stefan
  surname: Bekiranov
  fullname: Bekiranov, Stefan
  email: sb3de@virginia.edu
  organization: Department of Biochemistry and Molecular Genetics, University of Virginia Health System
BackLink https://www.ncbi.nlm.nih.gov/pubmed/21834981$$D View this record in MEDLINE/PubMed
BookMark eNp9UkuP0zAYjNAi9gFnbigSB8Qhu3b8iHNBWlU8Kq20IAFX66sfqavE7toJgn-PuylliwD5EGsyM983yZwXJz54UxTPMbrEWPAr3DBeIYZQRataiEfF2QE5eXA_Lc5T2iDEsRD4SXFaY0FoK_BZ0X-awI_OOgWjC74Mtly7NOYp5RD0b3yxXn6skrkrjY9OrQfjx9KGWGoYoRycd74rwetyALV2WdwbiDO43fZ7k_S0eGyhT-bZ_nlRfHn39vPiQ3Vz-365uL6pFBNYVIy1hhDcUIIJBWgY44oIjQlrVhhyAk0Usi1rqcGsFdyA1YKjhlswtLY1uSiWs68OsJHb6AaIP2QAJ--BEDsJcXSqN7KhYqVrrXDNEaWIQE0ExdS0Tf48dEWy15vZazutBqNVDh6hPzI9fuPdWnbhm8wBECEsG7zaG8RwN5k0ysElZfoevAlTkkK0uOG83o16OTM7yJs5b0M2VDu2vK45Zy1HNc2sy7-w8tFmcCr_OOsyfiR4fSTInNF8HzuYUpLL26_H3BcP0x5i_ipMJlzNBBVDStHYAwUjuauk3JVO7konqcyVzAr2h0K58b4PeW_X_0eHZl3KE3xnotyEKfrcm39KfgJAsvB-
CitedBy_id crossref_primary_10_1038_ncomms8052
crossref_primary_10_1186_1756_8935_6_28
crossref_primary_10_1038_s41598_017_03665_1
crossref_primary_10_1186_1471_2164_15_76
crossref_primary_10_1074_jbc_M114_626929
crossref_primary_10_1093_nar_gkt035
crossref_primary_10_1371_journal_pone_0099844
Cites_doi 10.1126/science.1063127
10.1146/annurev.genet.032608.103928
10.1038/nbt.1662
10.1038/ng.154
10.1016/j.cell.2007.05.009
10.1186/1471-2164-12-134
10.1073/pnas.0400782101
10.1093/bioinformatics/btp340
10.1016/j.cell.2007.02.006
10.1101/gr.073080.107
10.1101/gad.824700
10.1186/1471-2105-11-396
10.1073/pnas.0909344107
10.1038/ng.322
10.1074/jbc.M500796200
10.1038/nsmb1307
10.1038/47412
10.1016/S1097-2765(03)00092-3
10.1128/MCB.23.12.4207-4218.2003
10.1016/S0959-437X(02)00287-3
10.1186/gb-2011-12-2-r15
10.1186/1471-2105-12-57
10.1093/hmg/ddp409
10.1214/aos/1176347963
10.1214/aos/1176347973
10.1074/jbc.M513462200
10.1371/journal.pone.0006700
ContentType Journal Article
Copyright Bekiranov et al; licensee BioMed Central Ltd. 2011
COPYRIGHT 2011 BioMed Central Ltd.
Copyright ©2011 Bekiranov et al; licensee BioMed Central Ltd. 2011 Bekiranov et al; licensee BioMed Central Ltd.
Copyright_xml – notice: Bekiranov et al; licensee BioMed Central Ltd. 2011
– notice: COPYRIGHT 2011 BioMed Central Ltd.
– notice: Copyright ©2011 Bekiranov et al; licensee BioMed Central Ltd. 2011 Bekiranov et al; licensee BioMed Central Ltd.
DBID C6C
AAYXX
CITATION
NPM
IOV
7X8
5PM
DOA
DOI 10.1186/1756-0500-4-288
DatabaseName Springer Nature OA Free Journals
CrossRef
PubMed
Gale In Context: Opposing Viewpoints
MEDLINE - Academic
PubMed Central (Full Participant titles)
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
PubMed
MEDLINE - Academic
DatabaseTitleList

PubMed


MEDLINE - Academic

Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 2
  dbid: NPM
  name: PubMed
  url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 3
  dbid: 7X8
  name: MEDLINE - Academic
  url: https://search.proquest.com/medline
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Biology
EISSN 1756-0500
EndPage 288
ExternalDocumentID oai_doaj_org_article_748bd2dc12604403a238414e979814b3
PMC3170335
A266596024
21834981
10_1186_1756_0500_4_288
Genre Journal Article
GrantInformation_xml – fundername: NIGMS NIH HHS
  grantid: T32 GM080186
GroupedDBID ---
0R~
23N
2VQ
2WC
4.4
53G
5GY
5VS
6J9
7X7
88E
8FE
8FH
8FI
8FJ
AAFWJ
AAJSJ
AASML
ABDBF
ABUWG
ACGFO
ACGFS
ACIHN
ACMJI
ACPRK
ACUHS
ADBBV
ADRAZ
ADUKV
AEAQA
AFKRA
AFPKN
AHBYD
AHMBA
AHSBF
AHYZX
ALMA_UNASSIGNED_HOLDINGS
AMKLP
AMTXH
AOIJS
BAPOH
BAWUL
BBNVY
BCNDV
BENPR
BFQNJ
BHPHI
BMC
BPHCQ
BVXVI
C1A
C6C
CCPQU
CS3
DIK
E3Z
EBLON
EBS
EJD
EMOBN
ESX
F5P
FYUFA
GROUPED_DOAJ
GX1
HCIFZ
HMCUK
HYE
IAO
IEA
IHR
INH
INR
IOV
IPNFZ
ITC
KQ8
LK8
M1P
M48
M7P
MK0
M~E
O5R
O5S
OK1
OVT
P2P
PGMZT
PHGZM
PHGZT
PIMPY
PJZUB
PPXIY
PQGLB
PQQKQ
PROAC
PSQYO
PUEGO
RBZ
RIG
RNS
ROL
RPM
RSV
SBL
SOJ
SV3
TR2
TUS
UKHRP
~8M
AAYXX
AFFHD
CITATION
ALIPV
NPM
7X8
5PM
ID FETCH-LOGICAL-c5818-559e331743134aa7556c38d1357b1a881d3c0f9594e15986eafd86076fae42f23
IEDL.DBID RSV
ISSN 1756-0500
IngestDate Fri Oct 03 12:43:59 EDT 2025
Tue Nov 04 01:51:08 EST 2025
Wed Oct 01 12:56:19 EDT 2025
Tue Nov 11 10:34:46 EST 2025
Tue Nov 04 18:02:06 EST 2025
Thu Nov 13 15:17:47 EST 2025
Mon Jul 21 05:44:26 EDT 2025
Tue Nov 18 22:37:42 EST 2025
Sat Nov 29 05:36:44 EST 2025
Sat Sep 06 07:27:52 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Keywords Root Mean Square Deviation
Transcription Start Site
Multivariate Adaptive Regression Spline Model
Multivariate Adaptive Regression Spline
Enrichment Level
Language English
License http://creativecommons.org/licenses/by/2.0
This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c5818-559e331743134aa7556c38d1357b1a881d3c0f9594e15986eafd86076fae42f23
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
OpenAccessLink https://link.springer.com/10.1186/1756-0500-4-288
PMID 21834981
PQID 889176623
PQPubID 23479
ParticipantIDs doaj_primary_oai_doaj_org_article_748bd2dc12604403a238414e979814b3
pubmedcentral_primary_oai_pubmedcentral_nih_gov_3170335
proquest_miscellaneous_889176623
gale_infotracmisc_A266596024
gale_infotracacademiconefile_A266596024
gale_incontextgauss_IOV_A266596024
pubmed_primary_21834981
crossref_primary_10_1186_1756_0500_4_288
crossref_citationtrail_10_1186_1756_0500_4_288
springer_journals_10_1186_1756_0500_4_288
PublicationCentury 2000
PublicationDate 20110811
PublicationDateYYYYMMDD 2011-08-11
PublicationDate_xml – month: 8
  year: 2011
  text: 20110811
  day: 11
PublicationDecade 2010
PublicationPlace London
PublicationPlace_xml – name: London
– name: England
PublicationTitle BMC research notes
PublicationTitleAbbrev BMC Res Notes
PublicationTitleAlternate BMC Res Notes
PublicationYear 2011
Publisher BioMed Central
BioMed Central Ltd
BMC
Publisher_xml – name: BioMed Central
– name: BioMed Central Ltd
– name: BMC
References Z Wang (1040_CR5) 2008; 40
JK Sims (1040_CR27) 2006; 281
H Yu (1040_CR7) 2008; 18
1040_CR22
1040_CR24
T Jenuwein (1040_CR2) 2001; 293
1040_CR26
AI Su (1040_CR21) 2004; 101
1040_CR6
GC Hon (1040_CR13) 2009; 18
1040_CR8
AJ Bannister (1040_CR15) 2005; 280
1040_CR9
HH Ng (1040_CR18) 2003; 11
JH Friedman (1040_CR19) 1991; 19
JH Friedman (1040_CR20) 1991; 19
P Kolasinska-Zwierz (1040_CR14) 2009; 41
T Kouzarides (1040_CR11) 2002; 12
EI Campos (1040_CR12) 2009; 43
A Barski (1040_CR10) 2007; 129
L Teytelman (1040_CR23) 2009; 4
AD Goldberg (1040_CR3) 2007; 128
JA Latham (1040_CR4) 2007; 14
C Zang (1040_CR25) 2009; 25
BD Strahl (1040_CR1) 2000; 403
P Komarnitsky (1040_CR17) 2000; 14
NJ Krogan (1040_CR16) 2003; 23
11498575 - Science. 2001 Aug 10;293(5532):1074-80
19808796 - Hum Mol Genet. 2009 Oct 15;18(R2):R195-201
18562678 - Genome Res. 2008 Aug;18(8):1314-24
19886812 - Annu Rev Genet. 2009;43:559-99
15760899 - J Biol Chem. 2005 May 6;280(18):17732-6
17984964 - Nat Struct Mol Biol. 2007 Nov;14(11):1017-24
21338513 - BMC Bioinformatics. 2011 Feb 21;12:57
16517599 - J Biol Chem. 2006 May 5;281(18):12760-6
20653935 - BMC Bioinformatics. 2010 Jul 23;11:396
10638745 - Nature. 2000 Jan 6;403(6765):41-5
19182803 - Nat Genet. 2009 Mar;41(3):376-81
17320500 - Cell. 2007 Feb 23;128(4):635-8
11018013 - Genes Dev. 2000 Oct 1;14(19):2452-60
11893494 - Curr Opin Genet Dev. 2002 Apr;12(2):198-209
18552846 - Nat Genet. 2008 Jul;40(7):897-903
17512414 - Cell. 2007 May 18;129(4):823-37
19505939 - Bioinformatics. 2009 Aug 1;25(15):1952-8
20657582 - Nat Biotechnol. 2010 Aug;28(8):817-25
15075390 - Proc Natl Acad Sci U S A. 2004 Apr 20;101(16):6062-7
21324173 - Genome Biol. 2011;12(2):R15
12773564 - Mol Cell Biol. 2003 Jun;23(12):4207-18
19693276 - PLoS One. 2009;4(8):e6700
20133639 - Proc Natl Acad Sci U S A. 2010 Feb 16;107(7):2926-31
21356108 - BMC Genomics. 2011 Feb 28;12:134
12667453 - Mol Cell. 2003 Mar;11(3):709-19
References_xml – volume: 293
  start-page: 1074
  issue: 5532
  year: 2001
  ident: 1040_CR2
  publication-title: Science
  doi: 10.1126/science.1063127
– volume: 43
  start-page: 559
  year: 2009
  ident: 1040_CR12
  publication-title: Annu Rev Genet
  doi: 10.1146/annurev.genet.032608.103928
– ident: 1040_CR22
  doi: 10.1038/nbt.1662
– volume: 40
  start-page: 897
  issue: 7
  year: 2008
  ident: 1040_CR5
  publication-title: Nat Genet
  doi: 10.1038/ng.154
– volume: 129
  start-page: 823
  issue: 4
  year: 2007
  ident: 1040_CR10
  publication-title: Cell
  doi: 10.1016/j.cell.2007.05.009
– ident: 1040_CR24
  doi: 10.1186/1471-2164-12-134
– volume: 101
  start-page: 6062
  issue: 16
  year: 2004
  ident: 1040_CR21
  publication-title: Proc Natl Acad Sci USA
  doi: 10.1073/pnas.0400782101
– volume: 25
  start-page: 1952
  issue: 15
  year: 2009
  ident: 1040_CR25
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btp340
– volume: 128
  start-page: 635
  issue: 4
  year: 2007
  ident: 1040_CR3
  publication-title: Cell
  doi: 10.1016/j.cell.2007.02.006
– volume: 18
  start-page: 1314
  issue: 8
  year: 2008
  ident: 1040_CR7
  publication-title: Genome Res
  doi: 10.1101/gr.073080.107
– volume: 14
  start-page: 2452
  issue: 19
  year: 2000
  ident: 1040_CR17
  publication-title: Genes Dev
  doi: 10.1101/gad.824700
– ident: 1040_CR6
  doi: 10.1186/1471-2105-11-396
– ident: 1040_CR9
  doi: 10.1073/pnas.0909344107
– volume: 41
  start-page: 376
  issue: 3
  year: 2009
  ident: 1040_CR14
  publication-title: Nat Genet
  doi: 10.1038/ng.322
– volume: 280
  start-page: 17732
  issue: 18
  year: 2005
  ident: 1040_CR15
  publication-title: J Biol Chem
  doi: 10.1074/jbc.M500796200
– volume: 14
  start-page: 1017
  issue: 11
  year: 2007
  ident: 1040_CR4
  publication-title: Nat Struct Mol Biol
  doi: 10.1038/nsmb1307
– volume: 403
  start-page: 41
  issue: 6765
  year: 2000
  ident: 1040_CR1
  publication-title: Nature
  doi: 10.1038/47412
– volume: 11
  start-page: 709
  issue: 3
  year: 2003
  ident: 1040_CR18
  publication-title: Mol Cell
  doi: 10.1016/S1097-2765(03)00092-3
– volume: 23
  start-page: 4207
  issue: 12
  year: 2003
  ident: 1040_CR16
  publication-title: Mol Cell Biol
  doi: 10.1128/MCB.23.12.4207-4218.2003
– volume: 12
  start-page: 198
  issue: 2
  year: 2002
  ident: 1040_CR11
  publication-title: Curr Opin Genet Dev
  doi: 10.1016/S0959-437X(02)00287-3
– ident: 1040_CR8
  doi: 10.1186/gb-2011-12-2-r15
– ident: 1040_CR26
  doi: 10.1186/1471-2105-12-57
– volume: 18
  start-page: R195
  issue: R2
  year: 2009
  ident: 1040_CR13
  publication-title: Hum Mol Genet
  doi: 10.1093/hmg/ddp409
– volume: 19
  start-page: 1
  issue: 1
  year: 1991
  ident: 1040_CR19
  publication-title: Annals of Statistics
  doi: 10.1214/aos/1176347963
– volume: 19
  start-page: 123
  issue: 1
  year: 1991
  ident: 1040_CR20
  publication-title: Annals of Statistics
  doi: 10.1214/aos/1176347973
– volume: 281
  start-page: 12760
  issue: 18
  year: 2006
  ident: 1040_CR27
  publication-title: J Biol Chem
  doi: 10.1074/jbc.M513462200
– volume: 4
  start-page: e6700
  issue: 8
  year: 2009
  ident: 1040_CR23
  publication-title: PLoS One
  doi: 10.1371/journal.pone.0006700
– reference: 11018013 - Genes Dev. 2000 Oct 1;14(19):2452-60
– reference: 18562678 - Genome Res. 2008 Aug;18(8):1314-24
– reference: 17984964 - Nat Struct Mol Biol. 2007 Nov;14(11):1017-24
– reference: 15760899 - J Biol Chem. 2005 May 6;280(18):17732-6
– reference: 16517599 - J Biol Chem. 2006 May 5;281(18):12760-6
– reference: 17320500 - Cell. 2007 Feb 23;128(4):635-8
– reference: 15075390 - Proc Natl Acad Sci U S A. 2004 Apr 20;101(16):6062-7
– reference: 21356108 - BMC Genomics. 2011 Feb 28;12:134
– reference: 20657582 - Nat Biotechnol. 2010 Aug;28(8):817-25
– reference: 12667453 - Mol Cell. 2003 Mar;11(3):709-19
– reference: 20133639 - Proc Natl Acad Sci U S A. 2010 Feb 16;107(7):2926-31
– reference: 21338513 - BMC Bioinformatics. 2011 Feb 21;12:57
– reference: 20653935 - BMC Bioinformatics. 2010 Jul 23;11:396
– reference: 19182803 - Nat Genet. 2009 Mar;41(3):376-81
– reference: 19886812 - Annu Rev Genet. 2009;43:559-99
– reference: 18552846 - Nat Genet. 2008 Jul;40(7):897-903
– reference: 12773564 - Mol Cell Biol. 2003 Jun;23(12):4207-18
– reference: 11498575 - Science. 2001 Aug 10;293(5532):1074-80
– reference: 11893494 - Curr Opin Genet Dev. 2002 Apr;12(2):198-209
– reference: 19693276 - PLoS One. 2009;4(8):e6700
– reference: 21324173 - Genome Biol. 2011;12(2):R15
– reference: 19505939 - Bioinformatics. 2009 Aug 1;25(15):1952-8
– reference: 17512414 - Cell. 2007 May 18;129(4):823-37
– reference: 10638745 - Nature. 2000 Jan 6;403(6765):41-5
– reference: 19808796 - Hum Mol Genet. 2009 Oct 15;18(R2):R195-201
SSID ssj0061881
Score 1.947553
Snippet Background The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups...
The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups have applied...
Background The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several groups...
Abstract Background The advent of ChIP-seq technology has made the investigation of epigenetic regulatory networks a computationally tractable problem. Several...
SourceID doaj
pubmedcentral
proquest
gale
pubmed
crossref
springer
SourceType Open Website
Open Access Repository
Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 288
SubjectTerms Biomedical and Life Sciences
Biomedicine
Data mining
Genetic research
Life Sciences
Medicine/Public Health
Methods
Research Article
Technology application
SummonAdditionalLinks – databaseName: DOAJ Directory of Open Access Journals
  dbid: DOA
  link: http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1Lb9QwELZQVSQuqOWZUpCFkICDafyMcywVFb2UIgHqzXIch12pm4Wmi8S_Z8ZJtpuiigvH2BPJnpl4ZpLJ9xHyivtactkYFrxQTMEVs5WomQjRWB8L6XVIZBPF6ak9Py_PNqi-sCeshwfuFXdQKFvVog4cEm-lcukhxiiuYlmUlqsq4XxC1jMWU_0ZbLhN9KQQG6Fe1nk-gPpwaw7WY0wxkQhXruNRgu3_-3DeiE43OydvfD5NUel4h9wf0kl62G9jl9yJ7QNytyeY_P2QXHxe-b4bKBmALhuaAIbbSBfL-nr8aHZyxrr4k4I3zcMM3xhSyGYp9o_SReKQoL6t6SK1XkY6cE3A4Mb370fk6_GHL0cf2cCvwIKGOM2gmIhSYknCpfK-0NoEaWsudVFxDxqsZcibUpcqcoRxj76prckL0_ioRCPkY7LVwoqfEhplJWNQEcFhlPDGl17zpiqFrZSRSmbk3ahlFwbwceTAuHCpCLHGoVkcmsUpB2bJyJv1DT963I3bRd-j2dZiCJidBsCN3OBG7l9ulJGXaHSHkBgt9tx896uucyefvrlDyGE0FHpCZeT1INQsYfXBD78wgA4QRWsiuT-RhGc2TKbp6FsOp7DRrY3LVeesLRGyU8CCnvSutt4XJrMKVpuRYuKEk41PZ9r5LCGGg5FzKXVG3o7u6oajqrtNq3v_Q6vPyL3xHTzn-2Tr6nIVn5Pt8Otq3l2-SI_sHwjtPiE
  priority: 102
  providerName: Directory of Open Access Journals
Title Quantification of histone modification ChIP-seq enrichment for data mining and machine learning applications
URI https://link.springer.com/article/10.1186/1756-0500-4-288
https://www.ncbi.nlm.nih.gov/pubmed/21834981
https://www.proquest.com/docview/889176623
https://pubmed.ncbi.nlm.nih.gov/PMC3170335
https://doaj.org/article/748bd2dc12604403a238414e979814b3
Volume 4
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVADU
  databaseName: BioMed Central Open Access Free
  customDbUrl:
  eissn: 1756-0500
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0061881
  issn: 1756-0500
  databaseCode: RBZ
  dateStart: 20080101
  isFulltext: true
  titleUrlDefault: https://www.biomedcentral.com/search/
  providerName: BioMedCentral
– providerCode: PRVAON
  databaseName: DOAJ Directory of Open Access Journals
  customDbUrl:
  eissn: 1756-0500
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0061881
  issn: 1756-0500
  databaseCode: DOA
  dateStart: 20080101
  isFulltext: true
  titleUrlDefault: https://www.doaj.org/
  providerName: Directory of Open Access Journals
– providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  customDbUrl:
  eissn: 1756-0500
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0061881
  issn: 1756-0500
  databaseCode: M~E
  dateStart: 20080101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
– providerCode: PRVPQU
  databaseName: Biological Science Database
  customDbUrl:
  eissn: 1756-0500
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0061881
  issn: 1756-0500
  databaseCode: M7P
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: http://search.proquest.com/biologicalscijournals
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Health & Medical Collection
  customDbUrl:
  eissn: 1756-0500
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0061881
  issn: 1756-0500
  databaseCode: 7X7
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/healthcomplete
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: ProQuest Central (subscription)
  customDbUrl:
  eissn: 1756-0500
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0061881
  issn: 1756-0500
  databaseCode: BENPR
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: https://www.proquest.com/central
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Publicly Available Content Database
  customDbUrl:
  eissn: 1756-0500
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0061881
  issn: 1756-0500
  databaseCode: PIMPY
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: http://search.proquest.com/publiccontent
  providerName: ProQuest
– providerCode: PRVAVX
  databaseName: SpringerLINK Contemporary 1997-Present
  customDbUrl:
  eissn: 1756-0500
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0061881
  issn: 1756-0500
  databaseCode: RSV
  dateStart: 20081201
  isFulltext: true
  titleUrlDefault: https://link.springer.com/search?facet-content-type=%22Journal%22
  providerName: Springer Nature
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3db9MwED_tAyRe-IYVRmUhJOAhI_5I4jxu0yb6QAkDpvJkOY6zVlrT0axI_PecnaQsgz3AS6XaF-lyvvPd2ZffAbyiuuCUl3FgNBOBwH-BzFkRMGNjqW3CdWR8s4lkPJaTSZptAO2-hfHV7t2VpN-pvVnL-B36Ocx9ozAMRMCk3IRt9HXS2eLJ59Nu842plLRF8PnLQz3n4zH6_9yJr7ii62WS1-5KvQs6vvcfzN-Hu228SfYbBXkAG7Z6CLebDpQ_H8H5p5VuyoX8CpFFSTwCcWXJfFH8Hj-cjrKgtt8JqtvMTN2RIsFwl7gCUzL3TSaIrgoy97WZlrTNKHDwygX5Y_h6fPTl8H3QNmAITISOPMBsw3LuchbKhdZJFMWGy4LyKMmpRkkX3IRlGqXCUofzbnVZyDhM4lJbwUrGn8BWhRzvALE859YI69BjBNOxTnVEyzxlMhcxF3wAe93KKNOik7smGefKZykyVk6EyolQCYUiHMCb9QMXDTDHzaQHbqnXZA5R2w8slmeqNVCVCJkXrDAUEzwhQq4xlhFU2DRBJRM5MvjSKYpymBmVK8o506u6VqOPp2ofg5wIM0EmBvC6JSoXyL3R7TcOKAMHs9Wj3O1RolGb3jTp9FG5KVcJV9nFqlZSpg7TkyFDTxv1XL-Xi3YFcjuApKe4vRfvz1SzqYcUx0UOOY8G8LZTX9XuZfVNUn32D7TP4U53Fk_pLmxdLlf2BdwyPy5n9XIIm8kk8b9yCNsHR-PsZOjPRoauEjfDsWz0Ifs29Jb-C8iIRtQ
linkProvider Springer Nature
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9QwELZKAcGF9yNQwEJI0IMhfsY5loqqK8pSoFS9WY7jdFfqJrDpIvHvGXuTpSn0AMfYY2kyHntm7PE3CL2gtuSUV4o4ywQR8EV0wUrCnFfa-oxb6WKxiWw81kdH-f4aov1bmJjt3l9Jxp06Lmut3oCdg9hXpikRhGl9CV0WYK5CFt_nL4f95quo1rRD8PnLoIHxiRj9f-7EZ0zR-TTJc3el0QTt3PwP5m-hG52_ibeWCnIbrfn6Drq6rED58y46-bSwy3ShOEO4qXBEIK49njXl7_btyWiftP47BnWbukk4UsTg7uKQYIpnscgEtnWJZzE30-OuGAU0nrkgv4e-7rw72N4lXQEG4iQYcgLRhuc8xCyUC2szKZXjuqRcZgW1IOmSu7TKZS48DTjv3lalVmmmKusFqxi_j9Zr4Pghwp4X3DvhA3qMYFbZ3EpaFTnThVBc8AS97mfGuA6dPBTJODExStHKBBGaIEIjDIgwQa9WA74tgTkuJn0bpnpFFhC1Y0MzPzbdAjWZ0EXJSkchwBMi5RZ8GUGFz7NcU1EAg8-DopiAmVGHpJxju2hbM_p4aLbAyZEQCTKRoJcdUdUA9852bxxABgFma0C5MaCERe0G3bjXRxO6QiZc7ZtFa7TOA6YnA4YeLNVz9V_B2xXAbYKygeIOfnzYU08nEVIcJjnlXCZos1df0-1l7UVSffQPtM_Qtd2DD3tmbzR-_xhd78_lKd1A66fzhX-Crrgfp9N2_jSu5l9Ph0Kz
linkToPdf http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV3db9MwELdgwMQL34zAAAshAQ9h8UcS53EMKipQKQKmvVmO46yV1mQ0LRL_PXdOUpbBHhCPsS_S-Xz23dnn3xHynJlCMFEmoTVchhK-QpXzIuTWJcq4VJjY-mIT6WSijo6yaZeb0_TZ7v2VZPumAVGaqtXeaVG2S1wle2DzIA6OoyiUIVfqMrkisWIQButfDvuNOGFKsQ7N5y8_DQyRx-v_c1c-Y5bOp0yeuzf15mh08z8Hcovc6PxQut8qzm1yyVV3yLW2MuXPu-Tk89q0aUR-5mhdUo9MXDm6qIvf7Qez8TRs3HcKaji3MzxqpOAGU0w8pQtffIKaqqALn7PpaFekAhrPXJzfI99G774evA-7wgyhjcHAhxCFOCEwlmFCGpPGcWKFKpiI05wZkHohbFRmcSYdQ_x3Z8pCJVGalMZJXnJxn2xVwPEDQp3IhbPSIaqM5CYxmYlZmWdc5TIRUgTkdT9L2nao5Vg840T76EUlGkWoUYRaahBhQF5ufjhtATsuJn2D074hQ6Rt31Avj3W3cHUqVV7wwjII_KSMhAEfRzLpsjRTTObA4DNUGo1YGhUm6xybddPo8adDvQ_OTwwRIpcBedERlTVwb0339gFkgPBbA8rdASUsdjvopr1uauzCDLnK1etGK5Uh1icHhnZaVd2MC71gCdwGJB0o8WDgw55qPvNQ4zDJkRBxQF71qqy7Pa65SKoP_4H2Kdmevh3pj-PJh0fken9cz9gu2Vot1-4xuWp_rObN8olf2L8A7qVLlw
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Quantification+of+histone+modification+ChIP-seq+enrichment+for+data+mining+and+machine+learning+applications&rft.jtitle=BMC+research+notes&rft.au=Hoang%2C+Stephen+A&rft.au=Xu%2C+Xiaojiang&rft.au=Bekiranov%2C+Stefan&rft.date=2011-08-11&rft.pub=BioMed+Central+Ltd&rft.issn=1756-0500&rft.eissn=1756-0500&rft.volume=4&rft.spage=288&rft_id=info:doi/10.1186%2F1756-0500-4-288&rft.externalDocID=A266596024
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1756-0500&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1756-0500&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1756-0500&client=summon