-
ToNER: A tool for identifying nucleotide enrichment signals in feature-enriched RNA-seq data
- Back
Document Title
ToNER: A tool for identifying nucleotide enrichment signals in feature-enriched RNA-seq data
Author
Promworn Y, Kaewprommal P, Shaw PJ, Intarapanich A, Tongsima S, Piriyapongsa J
Name from Authors Collection
Scopus Author ID
7801321390
Affiliations
National Science & Technology Development Agency - Thailand; National Center Genetic Engineering & Biotechnology (BIOTEC); National Science & Technology Development Agency - Thailand; National Electronics & Computer Technology Center (NECTEC)
Type
Article
Source Title
PLOS ONE
ISSN
1932-6203
Year
2017
Volume
12
Issue
20
Open Access
Green Submitted, Green Published, gold
Publisher
PUBLIC LIBRARY SCIENCE
DOI
10.1371/journal.pone.0178483
Format
Abstract
Background Biochemical methods are available for enriching 50 ends of RNAs in prokaryotes, which are employed in the differential RNA-seq (dRNA-seq) and the more recent Cappable-seq protocols. Computational methods are needed to locate RNA 50 ends from these data by statistical analysis of the enrichment. Although statistical-based analysis methods have been developed for dRNA-seq, they may not be suitable for Cappable-seq data. The more efficient enrichment method employed in Cappable-seq compared with dRNA-seq could affect data distribution and thus algorithm performance. Results We present Transformation of Nucleotide Enrichment Ratios (ToNER), a tool for statistical modeling of enrichment from RNA-seq data obtained from enriched and unenriched libraries. The tool calculates nucleotide enrichment scores and determines the global transformation for fitting to the normal distribution using the Box-Cox procedure. From the transformed distribution, sites of significant enrichment are identified. To increase power of detection, meta-analysis across experimental replicates is offered. We tested the tool on Cappableseq and dRNA-seq data for identifying Escherichia coli transcript 50 ends and compared the results with those from the TSSAR tool, which is designed for analyzing dRNA-seq data. When combining results across Cappable-seq replicates, ToNER detects more known transcript 50 ends than TSSAR. In general, the transcript 50 ends detected by ToNER but not TSSAR occur in regions which cannot be locally modeled by TSSAR. Conclusion ToNER uses a simple yet robust statistical modeling approach, which can be used for detecting RNA 50ends from Cappable-seq data, in particular when combining information from experimental replicates. The ToNER tool could potentially be applied for analyzing other RNA-seq datasets in which enrichment for other structural features of RNA is employed. The program is freely available for download at ToNER webpage (http://www4a. biotec.or.th/GI/tools/toner) and GitHub repository (https://github.com/PavitaKae/ToNER).
Industrial Classification
Knowledge Taxonomy Level 1
Knowledge Taxonomy Level 2
Knowledge Taxonomy Level 3
Funding Sponsor
platform technology, National Center for Genetic Engineering and Biotechnology, Thailand [P-15-51103, P-12-01270]
License
CC BY
Rights
Authors
Publication Source
WOS