Using CLC Genomics Workbench, we will go through a pipeline for analyzing metatranscriptome NGS data from microbial communities and perform pathways interpretation on it.
- Import "raw" NGS sequencing data and prepare the samples for analysis
 
- Find relevant annotated genomes with a curated reference database while removing ribosomal RNA with the SILVA database (database of rRNAs)
 
- De novo assemble the unclassified reads into contigs to also find transcripts of taxa not present in the reference database
 
- Map the reads to the assembled contigs and database · Build a functional profile of the samples to get an abundance of GO-terms.
 
- Statistical analysis of the groups
 
- Pathway analysis on the differential abundance analysis using MetaCyc database.