Workflow Demos

Return to all workflow demos

cd-hit-est bioActor

Description

DNA sequence clustering.

Inputs

dnaSequenceFileA dna sequence file in FASTA format.

Outputs

clusterFileA cluster file indicating which sequences are clustered.

Parameters

outputPrefixThe prefix of output files.
seqIdCutoffClustering sequences with sequence identity > cutoff.
programThe command line program to execute.
checkOutputTimestampIf true, verify the last modification timestamp for each output file has increased after execution. If the timestamp has not increased, throw an error.
additionalOptionsAdditional command line options to run cd-hit-est
ChoiceThe execution choice to run.