Abstract
Transcription of eukaryotic genomes involves complex alternative processing of RNAs. Sequencing of full-length RNAs using long-reads reveals the true complexity of processing, however the relatively high error rates of long-read technologies can reduce the accuracy of intron identification. Here we present a two-pass approach, combining alignment metrics and machine-learning-derived sequence information to filter spurious examples from splice junctions identified in long-read alignments. The remaining junctions are then used to guide realignment. This method, available in the software package 2passtools (https://github.com/bartongroup/2passtools), improves the accuracy of spliced alignment and transcriptome annotation without requiring orthogonal information from short read RNAseq or existing annotations.
| Original language | English |
|---|---|
| Place of Publication | Cold Spring Harbor Laboratory |
| Publisher | BioRxiv |
| Number of pages | 36 |
| DOIs | |
| Publication status | Published - 30 May 2020 |
Fingerprint
Dive into the research topics of 'Two-pass alignment using machine-learning-filtered splice junctions increases the accuracy of intron detection in long-read RNA sequencing'. Together they form a unique fingerprint.Research output
- 1 Article
-
2passtools: two-pass alignment using machine-learning-filtered splice junctions increases the accuracy of intron detection in long-read RNA sequencing
Parker, M. T. (Lead / Corresponding author), Knop, K., Barton, G. J. & Simpson, G. G. (Lead / Corresponding author), 1 Mar 2021, In: Genome Biology. 22, 1, 24 p., 72.Research output: Contribution to journal › Article › peer-review
Open AccessFile17 Link opens in a new tab Citations (Scopus)310 Downloads (Pure)
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver