Workflow Demos

Return to all workflow demos

cd-hit bioActor

Description

Protein sequence clustering.

Inputs

proteinSequenceFileProtein sequence file in FASTA format.

Outputs

clusterFileA cluster file indicating which seuences are clustered.

Parameters

outputPrefixThe prefix of output files.
seqIdCutoffClustering sequences with sequence identity > cutoff.
programThe command line program to execute.
checkOutputTimestampIf true, verify the last modification timestamp for each output file has increased after execution. If the timestamp has not increased, throw an error.
additionalOptionsAdditional command line options to run cd-hit
ChoiceThe execution choice to run.