File tree Expand file tree Collapse file tree 3 files changed +23
-3
lines changed Expand file tree Collapse file tree 3 files changed +23
-3
lines changed Original file line number Diff line number Diff line change 1
1
# bio2zarr
2
2
3
3
` bio2zarr ` efficiently converts common bioinformatics formats to
4
- [ Zarr] ( https://zarr.readthedocs.io/en/stable/ ) format. Initially supporting converting
5
- VCF to the [ VCF Zarr specification] ( https://github.com/sgkit-dev/vcf-zarr-spec/ ) .
4
+ [ Zarr] ( https://zarr.readthedocs.io/en/stable/ ) format.
5
+
6
+ ## Tools
7
+
8
+ - {ref}` sec-vcf2zarr ` converts VCF data to
9
+ [ VCF Zarr] ( https://github.com/sgkit-dev/vcf-zarr-spec/ ) format.
10
+
11
+ - {ref}` sec-vcfpartition ` is a utility to split an input (set of)
12
+ VCFs into a given number of partitions. This is useful for
13
+ parallel processing.
14
+
15
+ ## Development status
6
16
7
17
` bio2zarr ` is in development, contributions, feedback and issues are welcome
8
18
at the [ GitHub repository] ( https://github.com/sgkit-dev/bio2zarr ) .
9
19
20
+ Support for converting PLINK data to VCF Zarr is partially implemented,
21
+ and adding BGEN support is also planned. If you would like to see
22
+ support for other formats (or an interested in helping with implementing),
23
+ please open an [ issue on Github] ( https://github.com/sgkit-dev/bio2zarr/issues )
24
+ to discuss!
25
+
Original file line number Diff line number Diff line change
1
+ (sec-vcf2zarr)=
1
2
# vcf2zarr
2
3
3
4
4
- Convert a VCF to zarr format:
5
+ Convert VCF data to the
6
+ [ VCF Zarr specification] ( https://github.com/sgkit-dev/vcf-zarr-spec/ )
7
+ reliably, in parallel.
5
8
6
9
```
7
10
$ vcf2zarr convert <VCF1> <VCF2> <zarr>
Original file line number Diff line number Diff line change
1
+ (sec-vcfpartition)=
1
2
# vcfpartition
2
3
3
4
## Overview
You can’t perform that action at this time.
0 commit comments