Workflow Demos

cd-hit-454 bioActor

Description

Clusters duplicate reads in 454 sequencing data.

Inputs

dnaSequenceFile: A DNA sequence file in FASTA format.

Outputs

clusterFile: A cluster file indicating which sequences are clustered.
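
The cluster file can be read back into a mapping from cluster to member sequences. The following is a minimal sketch, assuming the file uses the usual CD-HIT `.clstr` layout (a `>Cluster N` header followed by one line per member such as `0	120nt, >read_1... *`). This page does not describe the layout, so treat that as an assumption; `parse_clstr` is a hypothetical helper name, not part of the actor.

```python
import re

# Assumed member line layout: "0\t120nt, >read_1... *"
# The sequence name sits between ">" and the trailing "...".
_MEMBER = re.compile(r">(\S+?)\.\.\.")


def parse_clstr(text):
    """Map each cluster number to the list of sequence names in it."""
    clusters = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">Cluster"):
            # Header line, e.g. ">Cluster 0"
            current = int(line.split()[1])
            clusters[current] = []
        elif current is not None:
            match = _MEMBER.search(line)
            if match:
                clusters[current].append(match.group(1))
    return clusters
```

Clusters with more than one member are the ones that contain candidate duplicates; single-member clusters hold reads that matched nothing else.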

Parameters

outputPrefix: The prefix of output files.
seqIdCutoff: Cluster sequences whose sequence identity is greater than this cutoff.
program: The command line program to execute.
checkOutputTimestamp: If true, verify that the last modification timestamp of each output file has increased after execution. If the timestamp has not increased, throw an error.
additionalOptions: Additional command line options for running cd-hit-454.
Choice: The execution choice to run.
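
How these parameters might combine into a command line, and what the checkOutputTimestamp test amounts to, can be sketched as follows. This is an illustration under stated assumptions, not the actor's implementation: the mapping of dnaSequenceFile, outputPrefix and seqIdCutoff to the cd-hit-454 options `-i`, `-o` and `-c` is assumed, and `build_command`, `snapshot_mtimes` and `check_timestamps` are hypothetical helper names.

```python
import os
import shlex


def build_command(program, dna_sequence_file, output_prefix,
                  seq_id_cutoff, additional_options=""):
    """Assemble an argument list from the actor parameters.

    Assumption: the parameters map to the -i (input), -o (output)
    and -c (identity threshold) options of cd-hit-454.
    """
    cmd = [program, "-i", dna_sequence_file,
           "-o", output_prefix, "-c", str(seq_id_cutoff)]
    # additionalOptions is split the way a shell would split it.
    cmd += shlex.split(additional_options)
    return cmd


def snapshot_mtimes(paths):
    """Record the modification time of each path (None if it is missing)."""
    return {p: os.stat(p).st_mtime_ns if os.path.exists(p) else None
            for p in paths}


def check_timestamps(before, after):
    """Raise if any output file is missing or was not modified.

    Assumption: a file that did not exist before the run counts as
    updated as long as it exists afterwards.
    """
    for path, new in after.items():
        old = before.get(path)
        if new is None:
            raise RuntimeError(f"output file missing: {path}")
        if old is not None and new <= old:
            raise RuntimeError(f"output file not updated: {path}")
```

A caller would take one snapshot of the expected output files before launching the program and a second one afterwards, then pass both to `check_timestamps`. This catches the case where the program exits without rewriting a stale output file left over from an earlier run.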