Skip to content

Commit 419d56e

Browse files
authored
Merge pull request #1135 from Delphine-L/vgp8
Update vgp 8 and remove Markduplicates to replace it with samtools markdup
2 parents 205f553 + ef42b35 commit 419d56e

File tree

4 files changed

+2103
-1422
lines changed

4 files changed

+2103
-1422
lines changed

workflows/VGP-assembly-v2/Scaffolding-HiC-VGP8/CHANGELOG.md

Lines changed: 22 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,8 +1,29 @@
11
# Changelog
22

3+
## [3.4] - 2026-03-09
4+
5+
### Changes
6+
- Remove "Alignment General Stats 2" output that was incompatible with MultiQC v1.33 (element `general_stats_table` no longer exists in the `png_plot` output collection)
7+
- Set new PretextMap `high_res` parameter to `false`
8+
- Replace Picards MarkDuplicates tool by Samtools markdup
9+
10+
### Automatic update
11+
- `toolshed.g2.bx.psu.edu/repos/lparsons/cutadapt/cutadapt/5.2+galaxy0` was updated to `toolshed.g2.bx.psu.edu/repos/lparsons/cutadapt/cutadapt/5.2+galaxy2`
12+
- `toolshed.g2.bx.psu.edu/repos/iuc/pretext_map/pretext_map/0.1.9+galaxy1` was updated to `toolshed.g2.bx.psu.edu/repos/iuc/pretext_map/pretext_map/0.2.3+galaxy0`
13+
- `toolshed.g2.bx.psu.edu/repos/devteam/samtools_stats/samtools_stats/2.0.8` was updated to `toolshed.g2.bx.psu.edu/repos/devteam/samtools_stats/samtools_stats/2.0.9`
14+
- `toolshed.g2.bx.psu.edu/repos/bgruening/gfastats/gfastats/1.3.11+galaxy0` was updated to `toolshed.g2.bx.psu.edu/repos/bgruening/gfastats/gfastats/1.3.11+galaxy1`
15+
- `toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_find_and_replace/9.5+galaxy2` was updated to `toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_find_and_replace/9.5+galaxy3`
16+
- `toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_awk_tool/9.5+galaxy2` was updated to `toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_awk_tool/9.5+galaxy3`
17+
- `toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_cut_tool/9.5+galaxy2` was updated to `toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_cut_tool/9.5+galaxy3`
18+
- `toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_easyjoin_tool/9.5+galaxy2` was updated to `toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_easyjoin_tool/9.5+galaxy3`
19+
- `toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_tail_tool/9.5+galaxy2` was updated to `toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_tail_tool/9.5+galaxy3`
20+
- `toolshed.g2.bx.psu.edu/repos/devteam/bamtools_filter/bamFilter/2.5.2+galaxy3` was updated to `toolshed.g2.bx.psu.edu/repos/devteam/bamtools_filter/bamFilter/2.5.3+galaxy0`
21+
- `toolshed.g2.bx.psu.edu/repos/iuc/multiqc/multiqc/1.27+galaxy4` was updated to `toolshed.g2.bx.psu.edu/repos/iuc/multiqc/multiqc/1.33+galaxy0`
22+
23+
324
## [3.3] - 2025-12-10
425

5-
## Changes
26+
### Changes
627
- Show PretextMap files in history for easier export.
728

829
### Automatic update

workflows/VGP-assembly-v2/Scaffolding-HiC-VGP8/README.md

Lines changed: 18 additions & 15 deletions
Original file line numberDiff line numberDiff line change
@@ -10,25 +10,28 @@ Example of VGP trajectory :
1010

1111
## Inputs
1212

13-
1. Genome assembly [gfa]
14-
2. Haplotype being scaffolded (Will be added to scaffold names: e.g. `>scaffold_01_H1`)
15-
3. HiC reads paired collection [fastq]
16-
5. Trim Hi-C data? If `yes`, trim five bases at the beginning of each read. Use with Arima Hi-C data if the Hi-C map looks "noisy" and the reads haven't been trimmed before.
17-
6. Minimum Mapping Quality [int] (Default:20). Minimum mapping quality for Hi-C alignments. Set to 0 if you want no filtering.
18-
6. Database for busco lineage (recommended: latest)
19-
7. Busco lineage (recommended for VGP data: vertebrata)
20-
8. Restriction enzyme sequence (recommended for VGP data: Arima Hi-C 2.0)
21-
9. Estimated genome size [txt] (Output from the contigging workflows 3,4, or 5). A simple text file containing the estimated genome size as an integer. E.g. `2288021`
13+
1. Species Name [text] Name of the species being assembled.
14+
2. Assembly Name [text] Name of the assembly.
15+
3. Genome assembly [gfa]
16+
4. Haplotype being scaffolded (Will be added to scaffold names: e.g. `>scaffold_01_H1`)
17+
5. HiC reads paired collection [fastq]
18+
6. Trim Hi-C data? If `yes`, trim five bases at the beginning of each read. Use with Arima Hi-C data if the Hi-C map looks "noisy" and the reads haven't been trimmed before.
19+
7. Minimum Mapping Quality [int] (Default:20). Minimum mapping quality for Hi-C alignments. Set to 0 if you want no filtering.
20+
8. Database for Compleasm lineage (recommended: latest)
21+
9. Compleasm lineage (recommended for VGP data: vertebrata)
22+
10. Restriction enzyme sequence (recommended for VGP data: Arima Hi-C 2.0)
23+
11. Estimated genome size [txt] (Output from the contigging workflows 3,4, or 5). A simple text file containing the estimated genome size as an integer. E.g. `2288021`
2224

2325

2426
### Outputs
2527

2628
1. Scaffolds in [fasta] and [gfa] format with the haplotype in the scaffold names.
2729
2. If you selected `yes` for Hi-C trimming, the trimmed collections of Hi-C reads.
2830
3. QC: Assembly statistics.
29-
4. QC: Hi-C duplications statistics.
30-
5. QC: Nx plot.
31-
6. QC: Size plot.
32-
7. QC: Compleasm report.
33-
8. QC: Pretext Maps before and after scaffolding.
34-
9. QC: Statistics on Hi-C alignements before and after scaffolding
31+
4. QC: Hi-C duplications statistics on contigs and scaffolds.
32+
5. QC: Pairtools MultiQC stats on contigs and scaffolds.
33+
6. QC: Nx plot.
34+
7. QC: Size plot.
35+
8. QC: Compleasm summary and full table.
36+
9. QC: Pretext Maps before and after scaffolding.
37+
10. QC: Statistics on Hi-C alignements before and after scaffolding

workflows/VGP-assembly-v2/Scaffolding-HiC-VGP8/Scaffolding-HiC-VGP8-tests.yml

Lines changed: 14 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -171,10 +171,6 @@
171171
Lineage: vertebrata_odb10
172172
Restriction enzymes: arima2
173173
outputs:
174-
Markdup Stats:
175-
asserts:
176-
- has_text:
177-
text: "Unknown Library\t4\t16976\t3251\t4\t0\t2255\t1970\t0.132819\t390034"
178174
Suffixed AGP:
179175
asserts:
180176
- has_text:
@@ -190,13 +186,13 @@
190186
Scaffolding Plots:
191187
asserts:
192188
- has_size:
193-
value: 93200
194-
delta: 5000
189+
value: 118911
190+
delta: 30000
195191
Hi-C Alignment Stats scaffolds:
196192
asserts:
197193
- has_text:
198194
text: "SN\treads mapped and paired:\t33952"
199-
Hi-C alignments stats multiqc:
195+
Hi-C alignments on Scaffolds stats multiqc:
200196
asserts:
201197
- has_text:
202198
text: "Hi-C Alignment Stats pre-scaffolding for Haplotype 1\t1.26354"
@@ -211,10 +207,6 @@
211207
asserts:
212208
- has_text:
213209
text: "SN\treads mapped and paired:\t33952"
214-
Markduplicates Summary:
215-
asserts:
216-
- has_text:
217-
text: "Unknown Library\t4\t16976\t3251\t4\t0\t2255\t1970\t0.132819\t390034"
218210
Compleasm Full Table Busco:
219211
asserts:
220212
- has_n_lines:
@@ -231,6 +223,17 @@
231223
asserts:
232224
- has_text:
233225
text: "bases mapped:\t5093400"
226+
'Hi-C duplication stats on scaffolds: Raw':
227+
asserts:
228+
- has_text:
229+
text: "total_dups\t1327"
230+
- has_text:
231+
text: "total_nodups\t5990"
232+
'Hi-C duplication stats on scaffolds: MultiQc':
233+
asserts:
234+
- has_size:
235+
value: 2185479
236+
delta: 500000
234237
Scaffolds Compleasm Summary:
235238
asserts:
236239
- has_text:

0 commit comments

Comments
 (0)