Skip to content

Commit 80122c4

Browse files
Will-Tylerjeromekelleher
authored andcommitted
Add multichunk test dataset
1 parent 1584639 commit 80122c4

File tree

3 files changed

+12
-3
lines changed

3 files changed

+12
-3
lines changed

tests/data/vcf/chr22.vcf.gz

46.5 KB
Binary file not shown.

tests/data/vcf/chr22.vcf.gz.csi

116 Bytes
Binary file not shown.

tests/utils.py

Lines changed: 12 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -140,8 +140,17 @@ def vcz_path_cache(vcf_path):
140140
cache_path.mkdir()
141141
cached_vcz_path = (cache_path / vcf_path.name).with_suffix(".vcz")
142142
if not cached_vcz_path.exists():
143-
vcf2zarr.convert(
144-
[vcf_path], cached_vcz_path, worker_processes=0, local_alleles=False
145-
)
143+
if vcf_path.name.startswith("chr22"):
144+
vcf2zarr.convert(
145+
[vcf_path],
146+
cached_vcz_path,
147+
worker_processes=0,
148+
variants_chunk_size=10,
149+
samples_chunk_size=10,
150+
)
151+
else:
152+
vcf2zarr.convert(
153+
[vcf_path], cached_vcz_path, worker_processes=0, local_alleles=False
154+
)
146155
create_index(cached_vcz_path)
147156
return cached_vcz_path

0 commit comments

Comments
 (0)