TY - JOUR
T1 - IM-TORNADO
T2 - A tool for comparison of 16S reads from paired-end libraries
AU - Jeraldo, Patricio
AU - Kalari, Krishna
AU - Chen, Xianfeng
AU - Bhavsar, Jaysheel
AU - Mangalam, Ashutosh
AU - White, Bryan
AU - Nelson, Heidi
AU - Kocher, Jean Pierre
AU - Chia, Nicholas
N1 - Publisher Copyright:
© 2014 Jeraldo et al.
PY - 2014/12/15
Y1 - 2014/12/15
N2 - Motivation: 16S rDNA hypervariable tag sequencing has become the de facto method for accessing microbial diversity. Illumina paired-end sequencing, which produces two separate reads for each DNA fragment, has become the platform of choice for this application. However, when the two reads do not overlap, existing computational pipelines analyze data from read separately and underutilize the information contained in the paired-end reads. Results: We created a workflow known as Illinois Mayo Taxon Organization from RNA Dataset Operations (IM-TORNADO) for processing non-overlapping reads while retaining maximal information content. Using synthetic mock datasets, we show that the use of both reads produced answers with greater correlation to those from full length 16S rDNA when looking at taxonomy, phylogeny, and beta-diversity. Availability and Implementation: IM-TORNADO is freely available at http://sourceforge.net/projects/imtornado and produces BIOM format output for cross compatibility with other pipelines such as QIIME, mothur, and phyloseq.
AB - Motivation: 16S rDNA hypervariable tag sequencing has become the de facto method for accessing microbial diversity. Illumina paired-end sequencing, which produces two separate reads for each DNA fragment, has become the platform of choice for this application. However, when the two reads do not overlap, existing computational pipelines analyze data from read separately and underutilize the information contained in the paired-end reads. Results: We created a workflow known as Illinois Mayo Taxon Organization from RNA Dataset Operations (IM-TORNADO) for processing non-overlapping reads while retaining maximal information content. Using synthetic mock datasets, we show that the use of both reads produced answers with greater correlation to those from full length 16S rDNA when looking at taxonomy, phylogeny, and beta-diversity. Availability and Implementation: IM-TORNADO is freely available at http://sourceforge.net/projects/imtornado and produces BIOM format output for cross compatibility with other pipelines such as QIIME, mothur, and phyloseq.
UR - http://www.scopus.com/inward/record.url?scp=84918559643&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84918559643&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0114804
DO - 10.1371/journal.pone.0114804
M3 - Article
C2 - 25506826
AN - SCOPUS:84918559643
SN - 1932-6203
VL - 9
JO - PLoS One
JF - PLoS One
IS - 12
M1 - e114804
ER -