MISSSL: Multiple Insert Size Strand-specific Library
Abstract
Deep transcriptome sequencing is a powerful method for parallel quantitative and qualitative analysis of all mRNAs in a sample. According to a recent comparison of the diverse methods available for transcriptome library preparation [1], the top two approaches are dUTP second-strand marking and the Illumina RNA ligation protocol. A major limitation of the Illumina RNA ligation protocol is the inability to implement paired-end sequencing, although Levin et al. note that modifications to overcome this drawback should be possible. We have adjusted the Illumina RNA ligation protocol to allow for paired-end sequencing and evaluated a C. elegans test library using the methods proposed in Levin et al. Article Supplemental materialEvaluation Code
For the evaluation of our RNA-Seq library, we downloaded the scripts used for the analyses by Levin et al. from the supplementary website and adapted them to the C. elegans genome and our read data. The modified code is available on our FTP server.Supplementary Data
You can download the data that was used for library evaluation here:- C. elegans annotation WS200: C. elegans annotation WS200 as Matlab objects
- Aligned C. elegans reads in the same Matlab data format as in Levin et al.
You can download the Matlab code used to determine the optimal insert sizes here:
Full alignments and sequenced reads are available on request.