Workflow: analysis for assembled sequences

Fetched 2023-01-11 12:52:55 GMT

rna / protein - qc, annotation, index, abundance

children parents
Workflow as SVG
  • Selected
  • Default Values
  • Nested Workflows
  • Tools
  • Inputs/Outputs

Inputs

ID Type Title Doc
jobid String
m5nrBDB File
m5nrSCG File
m5nrFull File[]
m5rnaFull File
sequences File
m5rnaClust File
m5rnaIndex Directory
m5rnaPrefix String

Steps

ID Runs Label Doc
abundance abundance

abundace profiles from annotated files, for protein and/or rna

darkmatter
../Tools/extract_darkmatter.tool.cwl (CommandLineTool)
extract darkmatter

retrieve predicted proteins that have no similarity hits >extract_darkmatter.py -i <input> -s <sim 1> -s <sim 2> -m <clust map 1> -m <clust map 2> -o <outName>

qcAssemble
indexSimSeq index sim seq

create sorted / filtered similarity file with feature sequences, and index by md5

rnaAnnotate rna annotation

RNAs - predict, cluster, identify, annotate

protAnnotate protein annotation

Proteins - predict, filter, cluster, identify, annotate

Outputs

ID Type Label Doc
qcStatOut File
seqBinOut File
simSeqOut File
rnaSimsOut File
seqStatOut File
protSimsOut File
qcSummaryOut File
darkmatterOut File
lcaProfileOut File
md5ProfileOut File
rnaFeatureOut File
protFeatureOut File
rnaClustMapOut File
rnaClustSeqOut File
sourceStatsOut File
protClustMapOut File
protClustSeqOut File
assemblyCoverage File
protFilterFeatureOut File
Permalink: https://w3id.org/cwl/view/git/9aba38fd1569287b7256ace7163ac84320909f8a/CWL/Workflows/assembled.workflow.cwl