Suppose you have genomes and you want to characterize them. One of the ways to do that is to build a table of what genes are in each genome and what are not there.
As the result you will get the report file. With "Yes" and "No" field. "Yes" answer means that the gene is in the genome. "No" answer MIGHT mean that there is no gene in the genome. It is a good idea to analyze all the "No" sequences using annotated files. Just open a file and find a sequence with a name of a gene that has "No" result.
If you haven't used the workflow samples in UGENE before, look at the "How to Use Sample Workflows" section of the documentation. |
The workflow sample "Gene-by-gene Approach for Characterization of Genomes" can be found in the "Scenarios" section of the Workflow Designer samples.
The workflow looks as follows:
<center> <br> <img src="/wiki/download/attachments/16122734/Gene-by-gene Approach for Characterization of Genomes.png"/> <br> </center> |
The wizard has 3 pages.
Input sequence(s): On this page you must input sequence(s).
<center> <br> <img src="/wiki/download/attachments/16122734/Gene-by-gene Approach for Characterization of Genomes_1.png"/> <br> </center> |
BLAST search: On this page you can modify BLAST search parameters.
<center> <br> <img src="/wiki/download/attachments/16122734/Gene-by-gene Approach for Characterization of Genomes_2.png"/> <br> </center> |
The following parameters are available:
Search type | Select type of BLAST searches. |
Database Path | Path with database files. |
Database Name | Base name for BLAST DB files. |
Expected value | This setting specifies the statistical significance threshold for reporting matches against database sequences. |
Annotate as | Name for annotations. |
Gapped alignment | Perform gapped alignment.
|
Tool Path | External tool path.
|
BLAST output | Location of BLAST output file. |
BLAST output type | Type of BLAST output file. |
Temporary directory | Directory for temporary files. |
Gap costs | Cost to create and extend a gap in an alignment. |
Match scores | Reward and penalty for matching and mismatching bases. |
Output data: On this page you can modify output parameters.
<center> <br> <img src="/wiki/download/attachments/16122734/Gene-by-gene Approach for Characterization of Genomes_3.png"/> <br> </center> |