...
- The workflow reads sequences from the input sequence files (e.g. GenBank). The input data may also contain annotations associated with the sequences.
- The workflow reads text strings (patterns) from the input text files.
- The data are multiplexed using the Multiplexer element. The "1 to many" multiplexing rule is used, so each input sequence is combined with each pattern. The result is sent to the Find Substrings element.
- The Find Substrings element searches for the specified patterns in each sequence.
- Next, the Grouper element merges the annotations read for the sequence by the Read Sequence element with the annotations found for the sequence by the Find Substrings element. The sequence ID is used to group the matching sets of annotations.
- Finally, the data are written to the output file ("substrings.gb" by default).
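
The data flow described in the steps above can be sketched in plain Python. This is only an illustration of the logic, not the workflow engine's own API: the function names (`multiplex`, `find_substrings`, `group_by_sequence`), the tuple layout of an annotation, and the sample data are all assumptions made for the example, and only exact matching is shown.

```python
from collections import defaultdict
from itertools import product


def multiplex(sequences, patterns):
    """Pair every sequence with every pattern (the "1 to many" rule)."""
    return list(product(sequences, patterns))


def find_substrings(seq_id, sequence, pattern):
    """Return (seq_id, pattern, start) for each exact occurrence, overlaps included."""
    hits = []
    start = sequence.find(pattern)
    while start != -1:
        hits.append((seq_id, pattern, start))
        start = sequence.find(pattern, start + 1)
    return hits


def group_by_sequence(existing, found):
    """Merge annotations read with a sequence and annotations found for it, keyed by sequence ID."""
    grouped = defaultdict(list)
    for seq_id, annotations in existing.items():
        grouped[seq_id].extend(annotations)
    for seq_id, pattern, start in found:
        grouped[seq_id].append((pattern, start))
    return dict(grouped)


# Hypothetical input: two sequences, two patterns, one pre-existing annotation.
sequences = [("seq1", "ACGTACGT"), ("seq2", "TTACGG")]
patterns = ["ACG", "TT"]
existing = {"seq1": [("gene", 0)]}

found = []
for (seq_id, sequence), pattern in multiplex(sequences, patterns):
    found.extend(find_substrings(seq_id, sequence, pattern))

merged = group_by_sequence(existing, found)
print(merged)
```

Grouping by sequence ID is what keeps the results of the separate (sequence, pattern) pairs together: every pair produced by the multiplexing step carries the same ID for a given sequence, so its hits end up in one annotation set alongside the annotations read from the input file.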
...