v1.0 updates #69

Merged
nakib103 merged 96 commits into main from dev-v1.0
Mar 2, 2026
@nakib103 nakib103 commented Mar 2, 2026

Major Features & Enhancements

  • Support for structural variants:
    • Support for variable-length extra fields in bed files for additional SV fields.
    • Script for merging dbVar call and region files
    • Script to process HPRC files to generate haplotype VCF
    • Script to generate segdup track files for HPRC
  • MAGIC-16 added as a source in metadata: updated the sources_meta config to include the new data source.
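
As an illustration of the variable-length extra fields mentioned above, here is a minimal sketch of reading a BED line whose trailing columns may vary in number. It is not the pipeline's actual parser: the `BedRecord` class, the `parse_bed_line` function, and the choice of four fixed columns (chrom, start, end, name) are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import List

# Assumption: the first four columns are fixed; everything after is "extra".
N_FIXED = 4


@dataclass
class BedRecord:
    chrom: str
    start: int  # 0-based, inclusive (BED convention)
    end: int    # 0-based, exclusive (BED convention)
    name: str
    extra: List[str] = field(default_factory=list)  # zero or more trailing columns


def parse_bed_line(line: str) -> BedRecord:
    """Parse one tab-separated BED line, keeping any trailing columns as-is."""
    cols = line.rstrip("\n").split("\t")
    if len(cols) < N_FIXED:
        raise ValueError(f"expected at least {N_FIXED} columns, got {len(cols)}")
    chrom, start, end, name = cols[:N_FIXED]
    return BedRecord(chrom, int(start), int(end), name, cols[N_FIXED:])


# Hypothetical SV record with two extra columns (SV type and length).
record = parse_bed_line("chr1\t100\t200\tsv_example\tDEL\t100\n")
print(record.extra)
```

Keeping the extra columns as an untyped list means records with and without SV-specific fields can share one code path; a real implementation would likely map them to named fields.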

Pipeline & Performance Improvements

  • Optimised Memory Allocation: Several steps, especially the VEP step, have been optimised for better memory usage.
  • Update to data checks: added skip xfail to skip time-consuming and expected-to-fail tests; variant lists used for testing are now generated once per test run; plus other improvements.

Bug Fixes & Maintenance

  • Symlink issue: Fixed symlink issues when multiple source is unset.
  • Allele mismatch in Summary Stats: Use the allele number instead of allele names to avoid mismatched alleles.
  • More Restrictive Regex for HPRC gnomAD: Only allow the 89 haplotypes we have data for.
  • VEP config generation: Fix cache version and only allow gencode primary for GRCh38.
  • Source name extraction: Fix minor bug in extracting source name from the input config.
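
To show why matching on allele number can be more robust than matching on allele names, here is a minimal sketch. It is not the code from this PR: the function name `value_for_allele` and the sample data are hypothetical, and it assumes the VCF convention that REF is allele 0 and ALT alleles are numbered from 1 in the order they are listed.

```python
from typing import List, Sequence


def value_for_allele(alts: Sequence[str], per_alt_values: List[float],
                     allele_number: int) -> float:
    """Return the per-ALT value for a 1-based allele number (0 would be REF)."""
    if len(per_alt_values) != len(alts):
        raise ValueError("expected one value per ALT allele")
    if not 1 <= allele_number <= len(alts):
        raise ValueError(f"allele number {allele_number} is not an ALT allele")
    return per_alt_values[allele_number - 1]


# Hypothetical multi-allelic site: ALT strings as written in the source file.
alts = ["CT", "C"]
frequencies = [0.10, 0.02]

# If another step rewrites allele strings (for example "C" becoming "-"),
# a lookup by name would miss, but the position in the ALT list is unchanged.
print(value_for_allele(alts, frequencies, 2))
```

The lookup never compares allele strings, so differences in how alleles are represented between steps cannot cause a mismatch.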

Other Notable Changes

  • Post-Handover Test Script Updates: Added a script to check that handover was successful (file staging is OK).
  • Index Generation Improvements: Added -f flag and fixed symlink issues in track files.

nakib103 and others added 29 commits December 23, 2025 10:55
Bugfix: multiple source construction
Add MAGIC-16 as source in sources_meta config
Bug fix - HGSV3 haplo generation script and small patches to metadata gen
Added initial script for merging call and region files
Fix allele matching in summary stats + population_data regex update
Bugfix: wrong field extraction in vcf_to_bed
Modify MAGIC-16 source metadata
use gencode_primary filter only for human GRCh38
@nakib103 nakib103 merged commit 8b56436 into main Mar 2, 2026
0 of 2 checks passed
3 participants