Advanced computational algorithms for microbial community analysis using massive 16S rRNA sequence data

Yijun Sun, Yunpeng Cai, Volker Mai, William Farmerie, Fahong Yu, Jian Li, Steven Goodison

Research output: Contribution to journalArticle

36 Citations (Scopus)

Abstract

With the aid of next-generation sequencing technology, researchers can now obtain millions of microbial signature sequences for diverse applications ranging from human epidemiological studies to global ocean surveys. The development of advanced computational strategies to maximally extract pertinent information from massive nucleotide data has become a major focus of the bioinformatics community. Here, we describe a novel analytical strategy including discriminant and topology analyses that enables researchers to deeply investigate the hidden world of microbial communities, far beyond basic microbial diversity estimation. We demonstrate the utility of our approach through a computational study performed on a previously published massive human gut 16S rRNA data set. The application of discriminant and topology analyses enabled us to derive quantitative disease-associated microbial signatures and describe microbial community structure in far more detail than previously achievable. Our approach provides rigorous statistical tools for sequence-based studies aimed at elucidating associations between known or unknown organisms and a variety of physiological or environmental conditions.

Original languageEnglish (US)
JournalNucleic Acids Research
Volume38
Issue number22
DOIs
StatePublished - Dec 1 2010
Externally publishedYes

Fingerprint

Discriminant Analysis
Research Personnel
Computational Biology
Oceans and Seas
Epidemiologic Studies
Nucleotides
Technology
Datasets
Surveys and Questionnaires

ASJC Scopus subject areas

  • Genetics

Cite this

Advanced computational algorithms for microbial community analysis using massive 16S rRNA sequence data. / Sun, Yijun; Cai, Yunpeng; Mai, Volker; Farmerie, William; Yu, Fahong; Li, Jian; Goodison, Steven.

In: Nucleic Acids Research, Vol. 38, No. 22, 01.12.2010.

Research output: Contribution to journalArticle

Sun, Yijun ; Cai, Yunpeng ; Mai, Volker ; Farmerie, William ; Yu, Fahong ; Li, Jian ; Goodison, Steven. / Advanced computational algorithms for microbial community analysis using massive 16S rRNA sequence data. In: Nucleic Acids Research. 2010 ; Vol. 38, No. 22.
@article{9950e91afb294d90811bda3ff8402cef,
title = "Advanced computational algorithms for microbial community analysis using massive 16S rRNA sequence data",
abstract = "With the aid of next-generation sequencing technology, researchers can now obtain millions of microbial signature sequences for diverse applications ranging from human epidemiological studies to global ocean surveys. The development of advanced computational strategies to maximally extract pertinent information from massive nucleotide data has become a major focus of the bioinformatics community. Here, we describe a novel analytical strategy including discriminant and topology analyses that enables researchers to deeply investigate the hidden world of microbial communities, far beyond basic microbial diversity estimation. We demonstrate the utility of our approach through a computational study performed on a previously published massive human gut 16S rRNA data set. The application of discriminant and topology analyses enabled us to derive quantitative disease-associated microbial signatures and describe microbial community structure in far more detail than previously achievable. Our approach provides rigorous statistical tools for sequence-based studies aimed at elucidating associations between known or unknown organisms and a variety of physiological or environmental conditions.",
author = "Yijun Sun and Yunpeng Cai and Volker Mai and William Farmerie and Fahong Yu and Jian Li and Steven Goodison",
year = "2010",
month = "12",
day = "1",
doi = "10.1093/nar/gkq872",
language = "English (US)",
volume = "38",
journal = "Nucleic Acids Research",
issn = "0305-1048",
publisher = "Oxford University Press",
number = "22",

}

TY - JOUR

T1 - Advanced computational algorithms for microbial community analysis using massive 16S rRNA sequence data

AU - Sun, Yijun

AU - Cai, Yunpeng

AU - Mai, Volker

AU - Farmerie, William

AU - Yu, Fahong

AU - Li, Jian

AU - Goodison, Steven

PY - 2010/12/1

Y1 - 2010/12/1

N2 - With the aid of next-generation sequencing technology, researchers can now obtain millions of microbial signature sequences for diverse applications ranging from human epidemiological studies to global ocean surveys. The development of advanced computational strategies to maximally extract pertinent information from massive nucleotide data has become a major focus of the bioinformatics community. Here, we describe a novel analytical strategy including discriminant and topology analyses that enables researchers to deeply investigate the hidden world of microbial communities, far beyond basic microbial diversity estimation. We demonstrate the utility of our approach through a computational study performed on a previously published massive human gut 16S rRNA data set. The application of discriminant and topology analyses enabled us to derive quantitative disease-associated microbial signatures and describe microbial community structure in far more detail than previously achievable. Our approach provides rigorous statistical tools for sequence-based studies aimed at elucidating associations between known or unknown organisms and a variety of physiological or environmental conditions.

AB - With the aid of next-generation sequencing technology, researchers can now obtain millions of microbial signature sequences for diverse applications ranging from human epidemiological studies to global ocean surveys. The development of advanced computational strategies to maximally extract pertinent information from massive nucleotide data has become a major focus of the bioinformatics community. Here, we describe a novel analytical strategy including discriminant and topology analyses that enables researchers to deeply investigate the hidden world of microbial communities, far beyond basic microbial diversity estimation. We demonstrate the utility of our approach through a computational study performed on a previously published massive human gut 16S rRNA data set. The application of discriminant and topology analyses enabled us to derive quantitative disease-associated microbial signatures and describe microbial community structure in far more detail than previously achievable. Our approach provides rigorous statistical tools for sequence-based studies aimed at elucidating associations between known or unknown organisms and a variety of physiological or environmental conditions.

UR - http://www.scopus.com/inward/record.url?scp=78650464356&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=78650464356&partnerID=8YFLogxK

U2 - 10.1093/nar/gkq872

DO - 10.1093/nar/gkq872

M3 - Article

VL - 38

JO - Nucleic Acids Research

JF - Nucleic Acids Research

SN - 0305-1048

IS - 22

ER -