Supporting data for "RED-ML: a novel, effective RNA editing detection method based on machine learning"
Dataset type: Software, Epigenomic
Data released on February 28, 2017
Xiong H; Liu D; Li Q; Lei M; Xu L; Wu L; Wang Z; Ren S; Li W; Xia M; Lu L; Lu H; Hou Y; Zhu S; Liu X; Sun Y; Wang J; Yang H; Wu K; Xu X; Lee LJ (2017): Supporting data for "RED-ML: a novel, effective RNA editing detection method based on machine learning" GigaScience Database. http://dx.doi.org/10.5524/100275
With the advancement of second generation sequencing techniques, our ability to detect and quantify RNA editing on a global scale has been vastly improved. As a result, RNA editing is now being studied under a growing number of biological conditions so that its biochemical mechanisms and functional roles can be further understood. However, a major barrier that prevents RNA editing from being a routine RNA-seq analysis, similar to gene expression and splicing analysis for example, is the lack of user-friendly and effective computational tools.
Based on years of experience of analyzing RNA editing using diverse RNA-seq datasets, we have developed a software tool RED-ML: RNA Editing Detection based on Machine learning (pronounced as “red ML”). The input to RED-ML can be as simple as a single BAM file, while it can also take advantage of matched genomic variant information when available. The output not only contains detected RNA editing sites, but also a confidence score to facilitate downstream filtering. We have carefully designed validation experiments and performed extensive comparison and analysis to show the efficiency and effectiveness of RED-ML under different conditions, and it can accurately detect novel RNA editing sites without relying on curated RNA editing databases. We have also made this tool freely available via GitHub.
We have developed a highly accurate, speedy and general-purpose tool for RNA editing detection using RNA-seq data. With the availability of RED-ML, it is now possible to conveniently make RNA editing a routine analysis of RNA-seq. We believe this can greatly benefit the RNA editing research community and has profound impact to accelerate our understanding of this intriguing post-transcriptional modification process.
Additional details
Read the peer-reviewed publication(s):
(PubMed: 28328004)
Additional information:
https://github.com/BGIRED/RED-ML
Accessions (data generated as part of this study):
BioProject:
PRJNA373807
Accessions (data referenced by this study):
SRA:
SRP007605
Sample ID | Taxonomic ID | Common Name | Genbank Name | Scientific Name | Sample Attributes |
---|---|---|---|---|---|
SAMN06319117 | 9606 | Human | human | Homo sapiens | Description:DNA extracted from CH24T Alternative names:CH24T Sex:not collected ... + |
SAMN06319118 | 9606 | Human | human | Homo sapiens | Description:RNA extracted from CH24T Alternative names:CH24T Sex:not collected ... + |
SAMN06319119 | 9606 | Human | human | Homo sapiens | Description:RNA extracted from CH24T Alternative names:CH24T Sex:not collected ... + |
SAMN06319120 | 9606 | Human | human | Homo sapiens | Description:DNA extracted from CH62T Alternative names:CH62T Sex:not collected ... + |
SAMN06319121 | 9606 | Human | human | Homo sapiens | Description:RNA extracted from CH62T Alternative names:CH62T Sex:not collected ... + |
SAMN06319122 | 9606 | Human | human | Homo sapiens | Description:RNA extracted from CH62T Alternative names:CH62T Sex:not collected ... + |
SAMN06319123 | 9606 | Human | human | Homo sapiens | Description:RNA extracted from YH Alternative names:YH Sex:not collected ... + |
SAMN06319124 | 9606 | Human | human | Homo sapiens | Description:RNA extracted from Hela Alternative names:Hela Sex:not collected ... + |
SAMN06319125 | 9606 | Human | human | Homo sapiens | Description:RNA extracted from Hela Alternative names:Hela Sex:not collected ... + |