This sample workflow describes shows how to find substrings in input sequences and group these sequences by different parameters.First, the workflow reads sequences and , annotate them, and merge the found substring annotations with the original sequence annotations.
The steps of the workflow are these:
- The workflow reads sequences from the input sequence files (e.g. GenBank). The input data may also contain the annotations, associated with the sequence.
- The workflow reads text strings (patterns) from the input text files.
...
- The data
...
- are multiplexed using
...
- the Multiplexer element. Multiplexing rule "1 to many" is used, so each input sequence is concatenated with each pattern and sent to the Find Substrings element.
- The Find Substrings element searches for the specified patterns in each sequence.
- The next element Grouper merges annotations, read for the sequence in the Read Sequence element, with annotations, found for the sequence by Find Substrings element. A sequence ID is used to group the appropriate sets of annotations.
- And finally, the data are written to the output file ("substrings.gb" , by default).
Tip | ||
---|---|---|
| ||
If you haven't used the workflow samples in UGENE before, look at the "How to Use Sample Workflows" section of the documentation. |
...