WORKFLOW assemble_denovo
| File Path |
pipes/WDL/workflows/assemble_denovo.wdl
|
|---|---|
| WDL Version | 1.0 |
| Type | workflow |
Imports
| Namespace | Path |
|---|---|
taxon_filter
|
../tasks/tasks_taxon_filter.wdl
|
read_utils
|
../tasks/tasks_read_utils.wdl
|
assembly
|
../tasks/tasks_assembly.wdl
|
ncbi
|
../tasks/tasks_ncbi.wdl
|
assemble_refbased
|
assemble_refbased.wdl
|
Workflow: assemble_denovo
Assisted de novo viral genome assembly from raw reads.
Author: Broad Viral Genomics
Inputs
| Name | Type | Description | Default |
|---|---|---|---|
reads_unmapped_bams
|
Array[File]+
|
- | - |
reference_genome_fasta
|
Array[File]+
|
After denovo assembly, large contigs are scaffolded against a reference genome to determine orientation and to join contigs together, before further polishing by reads. You must supply at least one reference genome (all segments/chromomes in a single fasta file). If more than one reference is provided, contigs will be scaffolded against all of them and the one with the most complete assembly will be chosen for downstream polishing. | - |
filter_to_taxon_db
|
File?
|
Optional database to use to filter read set to those that match by LASTAL. Sequences in fasta format will be indexed on the fly. | - |
trim_clip_db
|
File
|
- | - |
sample_original_name
|
String?
|
a (possibly filename-unfriendly) sample name for fasta and bam headers | - |
reheader_table
|
File?
|
- | - |
query_chunk_size
|
Int?
|
- | - |
sample_name
|
String?
|
- | - |
reheader_table
|
File?
|
- | - |
sample_name
|
String?
|
- | - |
reheader_table
|
File?
|
- | - |
sample_name
|
String?
|
- | - |
reheader_table
|
File?
|
- | - |
spades_min_contig_len
|
Int?
|
- | - |
spades_options
|
String?
|
- | - |
machine_mem_gb
|
Int?
|
- | - |
min_length_fraction
|
Float?
|
- | - |
min_unambig
|
Float?
|
- | - |
skani_m
|
Int?
|
- | - |
skani_s
|
Int?
|
- | - |
skani_c
|
Int?
|
- | - |
nucmer_max_gap
|
Int?
|
- | - |
nucmer_min_match
|
Int?
|
- | - |
nucmer_min_cluster
|
Int?
|
- | - |
scaffold_min_contig_len
|
Int?
|
- | - |
scaffold_min_pct_contig_aligned
|
Float?
|
- | - |
machine_mem_gb
|
Int?
|
- | - |
sample_original_name
|
String?
|
a (possibly filename-unfriendly) sample name for fasta and bam headers | - |
novocraft_license
|
File?
|
- | - |
trim_coords_bed
|
File?
|
- | - |
machine_mem_gb
|
Int?
|
- | - |
min_keep_length
|
Int?
|
- | - |
sliding_window
|
Int?
|
- | - |
primer_offset
|
Int?
|
- | - |
machine_mem_gb
|
Int?
|
- | - |
reheader_table
|
File?
|
- | - |
amplicon_set
|
String?
|
- | - |
max_coverage_depth
|
Int?
|
- | - |
base_q_threshold
|
Int?
|
- | - |
mapping_q_threshold
|
Int?
|
- | - |
read_length_threshold
|
Int?
|
- | - |
plotXLimits
|
String?
|
- | - |
plotYLimits
|
String?
|
- | - |
machine_mem_gb
|
Int?
|
- | - |
reheader_table
|
File?
|
- | - |
max_coverage_depth
|
Int?
|
- | - |
base_q_threshold
|
Int?
|
- | - |
mapping_q_threshold
|
Int?
|
- | - |
read_length_threshold
|
Int?
|
- | - |
plotXLimits
|
String?
|
- | - |
plotYLimits
|
String?
|
- | - |
92 optional inputs with default values |
|||
Outputs
| Name | Type | Expression |
|---|---|---|
final_assembly_fasta
|
File
|
select_first([rename_fasta_header.renamed_fasta, refine.assembly_fasta])
|
aligned_only_reads_bam
|
File
|
refine.align_to_self_merged_aligned_only_bam
|
coverage_plot
|
File
|
refine.align_to_self_merged_coverage_plot
|
assembly_length
|
Int
|
refine.assembly_length
|
assembly_length_unambiguous
|
Int
|
refine.assembly_length_unambiguous
|
reads_aligned
|
Int
|
refine.align_to_self_merged_reads_aligned
|
mean_coverage
|
Float
|
refine.align_to_self_merged_mean_coverage
|
cleaned_bam
|
File
|
merge_cleaned_reads.out_bam
|
cleaned_fastqc
|
File?
|
merge_cleaned_reads.fastqc
|
depletion_read_count_post
|
Int
|
merge_cleaned_reads.read_count
|
taxfilt_bam
|
File
|
merge_taxfilt_reads.out_bam
|
taxfilt_fastqc
|
File?
|
merge_taxfilt_reads.fastqc
|
filter_read_count_post
|
Int
|
merge_taxfilt_reads.read_count
|
dedup_bam
|
File
|
merge_dedup_reads.out_bam
|
dedup_fastqc
|
File?
|
merge_dedup_reads.fastqc
|
dedup_read_count_post
|
Int
|
merge_dedup_reads.read_count
|
contigs_fasta
|
File
|
assemble.contigs_fasta
|
subsampBam
|
File
|
assemble.subsampBam
|
subsample_read_count
|
Int
|
assemble.subsample_read_count
|
scaffold_fasta
|
File
|
scaffold.scaffold_fasta
|
intermediate_scaffold_fasta
|
File
|
scaffold.intermediate_scaffold_fasta
|
intermediate_gapfill_fasta
|
File
|
scaffold.intermediate_gapfill_fasta
|
assembly_preimpute_length
|
Int
|
scaffold.assembly_preimpute_length
|
assembly_preimpute_length_unambiguous
|
Int
|
scaffold.assembly_preimpute_length_unambiguous
|
scaffolding_chosen_ref_names
|
Array[String]
|
scaffold.scaffolding_chosen_ref_names
|
scaffolding_stats
|
File
|
scaffold.scaffolding_stats
|
scaffolding_alt_contigs
|
File
|
scaffold.scaffolding_alt_contigs
|
replicate_concordant_sites
|
Int
|
refine.replicate_concordant_sites
|
replicate_discordant_snps
|
Int
|
refine.replicate_discordant_snps
|
replicate_discordant_indels
|
Int
|
refine.replicate_discordant_indels
|
num_read_groups
|
Int
|
refine.num_read_groups
|
num_libraries
|
Int
|
refine.num_libraries
|
replicate_discordant_vcf
|
File
|
refine.replicate_discordant_vcf
|
isnvs_vcf
|
File
|
refine.align_to_self_isnvs_vcf
|
aligned_bam
|
File
|
refine.align_to_self_merged_aligned_only_bam
|
aligned_only_reads_fastqc
|
File
|
refine.align_to_ref_fastqc
|
coverage_tsv
|
File
|
refine.align_to_self_merged_coverage_tsv
|
read_pairs_aligned
|
Int
|
refine.align_to_self_merged_read_pairs_aligned
|
bases_aligned
|
Int
|
refine.align_to_self_merged_bases_aligned
|
assembly_method
|
String
|
"viral-ngs/assemble_denovo"
|
assemble_viral_assemble_version
|
String
|
assemble.viralngs_version
|
scaffold_viral_assemble_version
|
String
|
scaffold.viralngs_version
|
Calls
This workflow calls the following tasks or subworkflows:
CALL
TASKS
renamed_reads
→ merge_and_reheader_bams
Input Mappings (3)
| Input | Value |
|---|---|
in_bams
|
[reads_unmapped_bam]
|
sample_name
|
sample_original_name
|
out_basename
|
out_basename
|
CALL
TASKS
deplete_taxa
Input Mappings (4)
| Input | Value |
|---|---|
raw_reads_unmapped_bam
|
reads_unmapped_renamed_bams
|
bmtaggerDbs
|
deplete_bmtaggerDbs
|
blastDbs
|
deplete_blastDbs
|
bwaDbs
|
deplete_bwaDbs
|
CALL
TASKS
filter_to_taxon
Input Mappings (2)
| Input | Value |
|---|---|
reads_unmapped_bam
|
reads_depleted_bams
|
lastal_db_fasta
|
select_first([filter_to_taxon_db])
|
CALL
TASKS
rmdup_ubam
Input Mappings (1)
| Input | Value |
|---|---|
reads_unmapped_bam
|
reads_taxfilt_bams
|
CALL
TASKS
merge_dedup_reads
→ merge_and_reheader_bams
Input Mappings (2)
| Input | Value |
|---|---|
in_bams
|
rmdup_ubam.dedup_bam
|
out_basename
|
out_basename
|
CALL
TASKS
merge_cleaned_reads
→ merge_and_reheader_bams
Input Mappings (2)
| Input | Value |
|---|---|
in_bams
|
reads_depleted_bams
|
out_basename
|
out_basename
|
CALL
TASKS
merge_taxfilt_reads
→ merge_and_reheader_bams
Input Mappings (2)
| Input | Value |
|---|---|
in_bams
|
reads_taxfilt_bams
|
out_basename
|
out_basename
|
CALL
TASKS
assemble
Input Mappings (4)
| Input | Value |
|---|---|
reads_unmapped_bam
|
merge_dedup_reads.out_bam
|
trim_clip_db
|
trim_clip_db
|
always_succeed
|
true
|
sample_name
|
out_basename
|
CALL
TASKS
scaffold
Input Mappings (3)
| Input | Value |
|---|---|
contigs_fasta
|
assemble.contigs_fasta
|
reads_bam
|
merge_dedup_reads.out_bam
|
reference_genome_fasta
|
reference_genome_fasta
|
CALL
WORKFLOW
refine
→ assemble_refbased
Input Mappings (3)
| Input | Value |
|---|---|
reads_unmapped_bams
|
reads_depleted_bams
|
reference_fasta
|
scaffold.scaffold_fasta
|
sample_name
|
out_basename
|
CALL
TASKS
rename_fasta_header
Input Mappings (2)
| Input | Value |
|---|---|
genome_fasta
|
refine.assembly_fasta
|
new_name
|
select_first([sample_original_name])
|
Images
Container images used by tasks in this workflow:
Parameterized Image
⚙️ Parameterized
Configured via input:
docker
Used by 6 tasks:
-
merge_dedup_reads -
merge_cleaned_reads -
merge_taxfilt_reads -
rmdup_ubam -
rename_fasta_header -
renamed_reads
Parameterized Image
⚙️ Parameterized
Configured via input:
docker
Used by 2 tasks:
-
assemble -
scaffold
Parameterized Image
⚙️ Parameterized
Configured via input:
docker
Used by 2 tasks:
-
deplete_taxa -
filter_to_taxon