Child pages
  • Find Substrings in Sequences

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

This sample workflow describes shows how to find substrings in input sequences and group these sequences by different parameters.First, the workflow reads sequences and , annotate them, and merge the found substring annotations with the original sequence annotations.

The steps of the workflow are these:

  1. The workflow reads sequences from the input sequence files (e.g. GenBank). The input data may also contain the annotations, associated with the sequence.
  2. The workflow reads text strings (patterns) from the input text files.

...

  1. The data

...

  1. are multiplexed using

...

  1. the Multiplexer element. Multiplexing rule "1 to many" is used, so each input sequence is concatenated with each pattern and sent to the Find Substrings element.
  2. The Find Substrings element searches for the specified patterns in each sequence.
  3. The next element Grouper merges annotations, read for the sequence in the Read Sequence element, with annotations, found for the sequence by Find Substrings element. A sequence ID is used to group the appropriate sets of annotations.
  4. And finally, the data are written to the output file ("substrings.gb" , by default).

Tip
titleHow to Use This Sample

If you haven't used the workflow samples in UGENE before, look at the "How to Use Sample Workflows" section of the documentation.

...