MetaProm: A neural network based meta-predictor for alternative human promoter prediction

Junwen Wang; Lyle H. Ungar; Hung Tseng; Sridhar Hannenhalli

doi:10.1186/1471-2164-8-374



Research output: Contribution to journal › Article › peer-review

22 Scopus citations

Abstract

Background: De novo eukaryotic promoter prediction is important for discovering novel genes and understanding gene regulation. In spite of the great advances made in the past decade, recent studies revealed that the overall performances of the current promoter prediction programs (PPPs) are still poor, and predictions made by individual PPPs do not overlap each other. Furthermore, most PPPs are trained and tested on the most-upstream promoters; their performances on alternative promoters have not been assessed. Results: In this paper, we evaluate the performances of current major promoter prediction programs (i.e., PSPA, FirstEF, McPromoter, DragonGSF, DragonPF, and FProm) using 42,536 distinct human gene promoters on a genome-wide scale, and with emphasis on alternative promoters. We describe an artificial neural network (ANN) based meta-predictor program that integrates predictions from the current PPPs and the predicted promoters' relation to CpG islands. Our specific analysis of recently discovered alternative promoters reveals that although only 41% of the 3′ most promoters overlap a CpG island, 74% of 5′ most promoters overlap a CpG island. Conclusion: Our assessment of six PPPs on 1.06 × 10⁹ bps of human genome sequence reveals the specific strengths and weaknesses of individual PPPs. Our meta-predictor outperforms any individual PPP in sensitivity and specificity. Furthermore, we discovered that the 5′ alternative promoters are more likely to be associated with a CpG island.

Original language	English (US)
Article number	374
Journal	BMC Genomics
Volume	8
DOIs	https://doi.org/10.1186/1471-2164-8-374
State	Published - Oct 17 2007

ASJC Scopus subject areas

Biotechnology
Genetics


Cite this

@article{da45dc31c3f24c28975ed90f104740b8,

title = "MetaProm: A neural network based meta-predictor for alternative human promoter prediction",

abstract = "Background: De novo eukaryotic promoter prediction is important for discovering novel genes and understanding gene regulation. In spite of the great advances made in the past decade, recent studies revealed that the overall performances of the current promoter prediction programs (PPPs) are still poor, and predictions made by individual PPPs do not overlap each other. Furthermore, most PPPs are trained and tested on the most-upstream promoters; their performances on alternative promoters have not been assessed. Results: In this paper, we evaluate the performances of current major promoter prediction programs (i.e., PSPA, FirstEF, McPromoter, DragonGSF, DragonPF, and FProm) using 42,536 distinct human gene promoters on a genome-wide scale, and with emphasis on alternative promoters. We describe an artificial neural network (ANN) based meta-predictor program that integrates predictions from the current PPPs and the predicted promoters' relation to CpG islands. Our specific analysis of recently discovered alternative promoters reveals that although only 41% of the 3′ most promoters overlap a CpG island, 74% of 5′ most promoters overlap a CpG island. Conclusion: Our assessment of six PPPs on 1.06 × 10⁹ bps of human genome sequence reveals the specific strengths and weaknesses of individual PPPs. Our meta-predictor outperforms any individual PPP in sensitivity and specificity. Furthermore, we discovered that the 5′ alternative promoters are more likely to be associated with a CpG island.",

author = "Junwen Wang and Ungar, {Lyle H.} and Hung Tseng and Sridhar Hannenhalli",

year = "2007",

month = oct,

day = "17",

doi = "10.1186/1471-2164-8-374",

language = "English (US)",

volume = "8",

journal = "BMC Genomics",

issn = "1471-2164",

publisher = "BioMed Central",

}

TY - JOUR

T1 - MetaProm

T2 - A neural network based meta-predictor for alternative human promoter prediction

AU - Wang, Junwen

AU - Ungar, Lyle H.

AU - Tseng, Hung

AU - Hannenhalli, Sridhar

PY - 2007/10/17

Y1 - 2007/10/17

AB - Background: De novo eukaryotic promoter prediction is important for discovering novel genes and understanding gene regulation. In spite of the great advances made in the past decade, recent studies revealed that the overall performances of the current promoter prediction programs (PPPs) are still poor, and predictions made by individual PPPs do not overlap each other. Furthermore, most PPPs are trained and tested on the most-upstream promoters; their performances on alternative promoters have not been assessed. Results: In this paper, we evaluate the performances of current major promoter prediction programs (i.e., PSPA, FirstEF, McPromoter, DragonGSF, DragonPF, and FProm) using 42,536 distinct human gene promoters on a genome-wide scale, and with emphasis on alternative promoters. We describe an artificial neural network (ANN) based meta-predictor program that integrates predictions from the current PPPs and the predicted promoters' relation to CpG islands. Our specific analysis of recently discovered alternative promoters reveals that although only 41% of the 3′ most promoters overlap a CpG island, 74% of 5′ most promoters overlap a CpG island. Conclusion: Our assessment of six PPPs on 1.06 × 10⁹ bps of human genome sequence reveals the specific strengths and weaknesses of individual PPPs. Our meta-predictor outperforms any individual PPP in sensitivity and specificity. Furthermore, we discovered that the 5′ alternative promoters are more likely to be associated with a CpG island.

UR - http://www.scopus.com/inward/record.url?scp=38049156077&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=38049156077&partnerID=8YFLogxK

U2 - 10.1186/1471-2164-8-374

DO - 10.1186/1471-2164-8-374

M3 - Article

C2 - 17941982

AN - SCOPUS:38049156077

SN - 1471-2164

VL - 8

JO - BMC Genomics

JF - BMC Genomics

M1 - 374

ER -
