Child pages
  • RNA-seq Analysis with Tuxedo Tools

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

 

Time series analysis

If set to True, instructs Cuffdiff to analyze the provided samples as a time series, rather than testing for differences between all pairs of samples. Samples should be provided in increasing time order.

Upper quartile norm

If set to True, normalizes by the upper quartile of the number of fragments mapping to individual loci instead of the total number of sequenced fragments. This can improve robustness of differential expression calls for less abundant genes and transcripts.

Hits norm

Instructs how to count all fragments. Total specifies to count all fragments, including those not compatible with any reference transcript, towards the number of mapped fragments used in the FPKM denominator. Compatible specifies to use only compatible fragments. Selecting Compatible is generally recommended in Cuffdiff to reduce certain types of bias caused by differential amounts of ribosomal reads which can create the impression of falsely differentially expressed genes.

Frag bias correct

Providing the sequences your reads were mapped to instructs Cuffdiff to run bias detection and correction algorithm which can significantly improve accuracy of transcript abundance estimates.

Multi read correct

Do an initial estimation procedure to more accurately weight reads mapping to multiple locations in the genome.

Library type

Specifies RNA-Seq protocol.

Mask file

Ignore all reads that could have come from transcripts in this file. It is recommended to include any annotated rRNA, mitochondrial transcripts other abundant transcripts you wish to ignore in your analysis in this file. Due to variable efficiency of mRNA enrichment methods and rRNA depletion kits, masking these transcripts often improves the overall robustness of transcript abundance estimates.

Min alignment count

The minimum number of alignments in a locus for needed to conduct significance testing on changes in that locus observed between samples. If no testing is performed, changes in the locus are deemed not significant, and the locus’ observed changes don’t contribute to correction for multiple testing.

FDR

Allowed false discovery rate used in testing.

Max MLE iterations

Sets the number of iterations allowed during maximum likelihood estimation of abundances.

Emit count tables

Include information about the fragment counts, fragment count variances, and fitted variance model into the report.

Cuffdiff tool path

The path to the Cuffdiff external tool in UGENE.

Temporary directory

The directory for temporary files.

Configure parameters, if necessary, and click Next. The last page of the wizard appears:

HTML
<center>
  <br>
  <img src="/wiki/download/attachments/3244623/RNA-seq analysis with Tuxedo tools_13.png"/>
  <br> 
</center>

Choose output directories for each tools and click Finish. 

 Note that default button reverts all parameters to default settings. 

 Now let’s validate and run the workflow. To validate that the workflow is correct and all parameters are set properly click the Validate workflow button on the Workflow Designer toolbar:

 

HTML
<center>
  <br>
  <img src="/wiki/download/attachments/3244617/Call variants with SAMtools_7.png"/>
  <br> 
</center>

If there are some errors, they will be shown in the Error list at the bottom of the Workflow Designer window, for example:

...

However, if you have set all the required parameters, then there shouldn’t be errors. To  After that you can estimate the workflow. To run estimation click the Estimate workflow button:  

HTML
<center>
  <br>
  <img src="/wiki/download/attachments/3244623/Call variants with SAMtools_11.jpg"/>
  <br> 
</center>

To run a valid workflow, click the Run workflow button on the Workflow Designer toolbar: 

...