Transcript Discovery

The plugin relies heavily on reads mapped with a gap as evidence for transcripts, so it is developed primarily for eukaryotic genomes. The proposed workflow for using the ab initio Transcript Discovery plugin in combination with the existing RNA-seq tool in CLC Genomics Workbench is as follows:

  • Run the large gap mapper on all your RNA-seq reads against a genomic reference sequence
  • Run the transcript discovery algorithm on the resulting read mapping to predict transcripts and genes
  • Inspect the results and, if necessary, re-run the transcript discovery with adjusted settings until it produces the desired result
  • Locate the copy of the reference genome in the transcript discovery output; it includes the new transcript and gene annotations
  • Use this annotated copy as a common reference for measuring gene expression with the existing RNA-seq tool in the workbench
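
To make the idea of "reads mapped with a gap" concrete, the sketch below extracts the gaps from spliced alignments and counts how many reads support each one. It is only an illustration of the kind of evidence involved, not the plugin's algorithm, which this page does not describe. It assumes the read mapping has been exported in SAM-style form, where the CIGAR operation N marks a region of the reference skipped by the read; the function names and the example alignments are hypothetical.

```python
import re
from collections import Counter

# One CIGAR element: a length followed by an operation code (SAM convention).
CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")


def junctions_from_cigar(start, cigar):
    """Return the gaps in one alignment as (first, last) reference positions.

    `start` is the 1-based leftmost mapped position of the read.
    Each N operation (skipped reference region) yields one gap.
    """
    pos = start
    junctions = []
    for length, op in CIGAR_OP.findall(cigar):
        length = int(length)
        if op == "N":
            # The gap covers reference positions pos .. pos + length - 1.
            junctions.append((pos, pos + length - 1))
            pos += length
        elif op in "MD=X":
            # These operations consume reference bases without opening a gap.
            pos += length
        # I, S, H and P do not consume reference bases, so pos is unchanged.
    return junctions


def count_junction_support(alignments):
    """Count how many alignments support each gap.

    `alignments` is an iterable of (start, cigar) pairs.
    """
    support = Counter()
    for start, cigar in alignments:
        support.update(junctions_from_cigar(start, cigar))
    return support


# Hypothetical alignments: two spliced reads sharing one gap, one unspliced read.
example = [(100, "50M200N50M"), (120, "30M200N70M"), (500, "100M")]
support = count_junction_support(example)
```

In this toy input, both spliced reads skip the same stretch of the reference, so that gap is supported by two reads, while the unspliced read contributes no gap evidence.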

If you have sequenced several samples that need to be compared, we suggest using the reads from all samples for the large gap mapping and subsequent transcript discovery. This way, you establish a common set of reference transcripts and genes, which makes it possible to compare gene expression levels across samples (using the RNA-seq tool in CLC Genomics Workbench). After that, the initial read mapping created by the large gap mapper is no longer needed and can be deleted, unless you wish to be able to go back and double-check the basis of the prediction.

We frequently release updates and improvements such as bug fixes or new features. To get a complete overview, please visit the Latest Improvements page.
