This project contains the links to the datasets, software and code that was used for our study “Comprehensive analysis of RNA-sequencing to find the source of 1 trillion reads across diverse adult human tissues” (https://www.biorxiv.org/content/early/2017/06/12/053041)
ROP-project (beta version). Please expect frequent updates.
Following Baruzzo et al. (2017), we have selected five RNA-Seq aligners that were reported to map a minimum of 75% of the reads simulated from human transcriptome using recommended parameters. For each of the RNA-Seq aligners we provide easy use wrappers with default and optomized parameters. The user is expected to install the tool of interest and downlaod the corresponding genome index. Before using, modify the following variable in the script :
To demonstrate the accuracy of ROP’s read assignment, we simulated RNA-Seq data as a mixture of transcriptomic, repeat, immune, and microbial reads.
Mapped simulation reads are available here
We have used TCRB-Seq data prepared from three samples of kidney renal clear cell carcinoma (KIRC) by Li, Bo, et al. (2016) to demonstrate the assignment accuracy of immune reads. We downloaded matching RNA-Seq samples from the TCGA portal. In total, we obtained 301 million 2x50bp reads from three RNA-Seq samples. RNA-Seq FASTQ file are available here