You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: workflows/VGP-assembly-v2/hi-c-contact-map-for-assembly-manual-curation/CHANGELOG.md
+11Lines changed: 11 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,7 +1,18 @@
1
1
# Changelog
2
2
3
+
4
+
5
+
## [2.2] - 2026-03-10
6
+
7
+
### Changed
8
+
- The gene tracks generation can now be skipped (Useful for large genomes running out of memory)
9
+
- Added option to generate high resolution Hi-C Pretext maps
10
+
- Replace MarkDuplicates with Samtools markdup
11
+
12
+
3
13
## [2.1] - 2026-03-02
4
14
15
+
5
16
### Automatic update
6
17
-`toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_replace_in_line/9.5+galaxy2` was updated to `toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_replace_in_line/9.5+galaxy3`
7
18
-`toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_cat/9.5+galaxy2` was updated to `toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_cat/9.5+galaxy3`
13.**Do you want to trim the Hi-C data?**[boolean] - If yes, removes 5bp at the end of Hi-C reads (recommended for Arima Hi-C data if the map looks "noisy")
37
-
14.**Remove duplicated Hi-C reads?**[boolean] - Remove PCR duplicates from Hi-C alignments using Picard MarkDuplicates
38
-
15.**Minimum Mapping Quality**[integer] - Minimum MAPQ score for filtered PretextMap (default: 10; set to 0 to keep all mapped reads)
37
+
14.**Do you want to trim the Hi-C data?**[boolean] - If yes, removes 5bp at the end of Hi-C reads (recommended for Arima Hi-C data if the map looks "noisy")
38
+
15.**Remove duplicated Hi-C reads?**[boolean] - Remove PCR duplicates from Hi-C alignments using Samtools markdup
39
+
16.**Minimum Mapping Quality**[integer] - Minimum MAPQ score for filtered PretextMap (default: 10; set to 0 to keep all mapped reads)
39
40
40
41
### PacBio Processing Options
41
42
42
-
16.**Remove adapters from HiFi reads?**[boolean] - Trim adapters from PacBio HiFi reads using Cutadapt
43
+
17.**Remove adapters from HiFi reads?**[boolean] - Trim adapters from PacBio HiFi reads using Cutadapt
43
44
44
45
### Telomere Detection
45
46
46
-
17.**Canonical telomeric pattern**[text] - Expected telomere repeat sequence (default: TTAGGG for vertebrates; use CCCTAA for reverse complement)
47
-
18.**Telomeric Patterns to explore (comma-separated), IUPAC allowed**[text] - Additional telomeric patterns to search for (e.g., TTAGGG,CCCTAA)
47
+
18.**Canonical telomeric pattern**[text] - Expected telomere repeat sequence (default: TTAGGG for vertebrates; use CCCTAA for reverse complement)
48
+
19.**Telomeric Patterns to explore (comma-separated), IUPAC allowed**[text] - Additional telomeric patterns to search for (e.g., TTAGGG,CCCTAA)
49
+
50
+
### Hi-C Map Resolution
51
+
52
+
20.**Generate high resolution Hi-C maps**[boolean] - Generate high resolution Pretext maps (slower, requires more resources)
48
53
49
54
### Visualization Options
50
55
51
-
19.**Bin Size for Bigwig files**[integer] - Resolution for coverage tracks (default: 100; larger values = smaller files but lower resolution)
56
+
21.**Bin Size for Bigwig files**[integer] - Resolution for coverage tracks (default: 100; larger values = smaller files but lower resolution)
52
57
53
58
## Outputs
54
59
@@ -60,48 +65,57 @@ The Pretext files can be opened in PretextView for manual curation of genome ass
Copy file name to clipboardExpand all lines: workflows/VGP-assembly-v2/hi-c-contact-map-for-assembly-manual-curation/hi-c-map-for-assembly-manual-curation-tests.yml
+20-21Lines changed: 20 additions & 21 deletions
Original file line number
Diff line number
Diff line change
@@ -63,6 +63,8 @@
63
63
Assembly Name: toLid
64
64
Lineage for Compleasm: vertebrata_odb10
65
65
Database for Compleasm: v5
66
+
Generate gene annotations: false
67
+
Generate high resolution Hi-C maps: false
66
68
outputs:
67
69
Assembly for curation:
68
70
asserts:
@@ -85,17 +87,12 @@
85
87
has_size:
86
88
value: 10000
87
89
delta: 5000
88
-
Terminal Telomeres:
90
+
P telomeres bed:
89
91
asserts:
90
92
has_text:
91
-
text: "scaffold_10.H1"
93
+
text: "scaffold_10.H2"
92
94
has_text:
93
95
text: "scaffold_1.H1"
94
-
Merged Hi-C Alignments:
95
-
asserts:
96
-
has_size:
97
-
value: 7500000
98
-
delta: 3000000
99
96
Pretext All tracks:
100
97
asserts:
101
98
has_size:
@@ -106,22 +103,18 @@
106
103
has_size:
107
104
value: 800000
108
105
delta: 400000
109
-
Compleasm Genes track:
110
-
asserts:
111
-
has_text:
112
-
text: "scaffold_1.H2\t106669\t106765"
113
-
Markduplicates Summary:
114
-
asserts:
115
-
has_text:
116
-
text: "1042\t217\t3942"
117
-
"Hi-C duplication stats: Raw":
106
+
"Hi-C duplication stats on Scaffolds: Raw":
118
107
asserts:
119
108
has_text:
120
109
text: "total_dups\t3941"
121
110
P telomeres bed:
122
111
asserts:
123
112
has_text:
124
113
text: "scaffold_1.H1\t488600\t500600\t12000"
114
+
Hi-C duplication stats on Scaffolds:
115
+
asserts:
116
+
has_text:
117
+
text: "EXCLUDED: 1042"
125
118
- doc: Test 2 - Multiple read sets with collections
0 commit comments