Skip to content

Commit 67c3526

Browse files
Update Dorado to 1.4.0 version (#1562)
* Create README.md for 1.4.0 * Create Dockerfile fop dorado cuda 13 * Update Dockerfile * Rename dorado folder to 1.4.0-cuda13.1.1 * Update README.md with new cuba version * Update README.md to 1.4.0cuba13.1.1 for dorado * Update README.md updated to the current models * Create README.md for 1.4.0-cuba13.1.1-no_model * Create Dockerfile for 1.4.0 cuba no model * Update README.md for 1.4.0cuba13.1.1-no_model dorado * Update README.md fix the link * Update README.md back to old Cuda Removing this: 13.1.1 is preferred. * Update Dockerfile updated back to Cuda 12.2 * Rename dorado folder back to 1.4.0-cuda12.2.0 * Update Dockerfile all the way back nvidia/cuda:12.2.0-base-ubuntu22.04 * Update README.md back to old cuda Removed the line 13.1.1 is preferred. * Update Dockerfile to go back to old cuda Added: nvidia/cuda:12.2.0-base-ubuntu22.04 * Update Dockerfile * Rename dorado folder back to 1.4.0-cuda12.2.0-no_model * Update README.md to cuda 12.2.0 * Update README.md updated the models details * Update README.md correcting my packages * Fix Docker run command for CUDA 12.2.0 Updated the README to reflect the correct Docker image tag for CUDA 12.2.0. * Delete build-files/dorado/1.1.0-cuda12.2.0/README.md accidental change in PR --------- Co-authored-by: Kutluhan Incekara <46578029+Kincekara@users.noreply.github.com>
1 parent deb2484 commit 67c3526

File tree

6 files changed

+311
-161
lines changed

6 files changed

+311
-161
lines changed

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -156,7 +156,7 @@ To learn more about the docker pull rate limits and the open source software pro
156156
| [datasets-sars-cov-2](https://github.com/CDCgov/datasets-sars-cov-2) <br/> [![docker pulls](https://badgen.net/docker/pulls/staphb/datasets-sars-cov-2)](https://hub.docker.com/r/staphb/datasets-sars-cov-2) | <details><summary>Click to see all versions</summary> <ul><li>[0.6.2](./build-files/datasets-sars-cov-2/0.6.2/)</li><li>[0.6.3](./build-files/datasets-sars-cov-2/0.6.3/)</li><li>[0.7.2](./build-files/datasets-sars-cov-2/)</li></ul> </details> | https://github.com/CDCgov/datasets-sars-cov-2 |
157157
| [dnaapler](https://hub.docker.com/r/staphb/dnaapler) <br/> [![docker pulls](https://badgen.net/docker/pulls/staphb/dnaapler)](https://hub.docker.com/r/staphb/dnaapler) | <details><summary>Click to see all versions</summary> <ul><li>[0.1.0](./build-files/dnaapler/0.1.0/)</li></ul> <ul><li>[0.4.0](./build-files/dnaapler/0.4.0/)</li><li>[0.5.0](./build-files/dnaapler/0.5.0/)</li><li>[0.5.1](./build-files/dnaapler/0.5.1/)</li><li>[0.7.0](./build-files/dnaapler/0.7.0/)</li><li>[0.8.0](./build-files/dnaapler/0.8.0/)</li><li>[1.0.1](./build-files/dnaapler/1.0.1/)</li><li>[1.1.0](./build-files/dnaapler/1.1.0/)</li><li>[1.2.0](./build-files/dnaapler/1.2.0/)</li><li>[1.3.0](./build-files/dnaapler/1.3.0/)</li></ul> </details> | https://github.com/gbouras13/dnaapler |
158158
| [diamond](https://github.com/bbuchfink/diamond) <br/> [![docker pulls](https://badgen.net/docker/pulls/staphb/diamond)](https://hub.docker.com/r/staphb/diamond) | <details><summary>Click to see all versions</summary> <ul><li>[2.1.9](./build-files/diamond/2.1.9)</li><li>[2.1.10](./build-files/diamond/2.1.10)</li><li>[2.1.11](./build-files/diamond/2.1.11)</li><li>[2.1.12](./build-files/diamond/2.1.12)</li><li>[2.1.13](./build-files/diamond/2.1.13)</li><li>[2.1.16](./build-files/diamond/2.1.16)</li><li>[2.1.22](./build-files/diamond/2.1.22)</li><li>[2.1.23](./build-files/diamond/2.1.23)</li></ul> </details> | https://github.com/bbuchfink/diamond|
159-
| [dorado](https://hub.docker.com/r/staphb/dorado) <br/> [![docker pulls](https://badgen.net/docker/pulls/staphb/dorado)](https://hub.docker.com/r/staphb/dorado) | <details><summary>Click to see all versions</summary> <ul><li>[0.8.0](./build-files/dorado/0.8.0/)</li><li>[0.8.3](./build-files/dorado/0.8.3/)</li><li>[0.9.0](./build-files/dorado/0.9.0-cuda12.2.0/)</li><li>[0.9.0-cuda12.2.0-no_model](./build-files/dorado/0.9.0-cuda12.2.0-no_model/)</li><li>[1.1.0-cuda12.2.0](./build-files/dorado/1.1.0-cuda12.2.0/)</li><li>[1.1.0-cuda12.2.0-no_model](./build-files/dorado/1.1.0-cuda12.2.0-no_model/)</li></ul></ul> </details> | [https://github.com/nanoporetech/dorado](https://github.com/nanoporetech/dorado) |
159+
| [dorado](https://hub.docker.com/r/staphb/dorado) <br/> [![docker pulls](https://badgen.net/docker/pulls/staphb/dorado)](https://hub.docker.com/r/staphb/dorado) | <details><summary>Click to see all versions</summary> <ul><li>[0.8.0](./build-files/dorado/0.8.0/)</li><li>[0.8.3](./build-files/dorado/0.8.3/)</li><li>[0.9.0](./build-files/dorado/0.9.0-cuda12.2.0/)</li><li>[0.9.0-cuda12.2.0-no_model](./build-files/dorado/0.9.0-cuda12.2.0-no_model/)</li><li>[1.1.0-cuda12.2.0](./build-files/dorado/1.1.0-cuda12.2.0/)</li><li>[1.1.0-cuda12.2.0-no_model](./build-files/dorado/1.1.0-cuda12.2.0-no_model/)</li><li>[1.4.0-cuda12.2.0](./build-files/dorado/1.4.0-cuda12.2.0/)</li><li>[1.4.0-cuda12.2.0-no_model](./build-files/dorado/1.4.0-cuda12.2.0-no_model/)</li></ul></ul> </details> | [https://github.com/nanoporetech/dorado](https://github.com/nanoporetech/dorado) |
160160
| [dragonflye](https://hub.docker.com/r/staphb/dragonflye) <br/> [![docker pulls](https://badgen.net/docker/pulls/staphb/dragonflye)](https://hub.docker.com/r/staphb/dragonflye) | <details><summary>Click to see all versions</summary> <ul><li>[1.0.14](./build-files/dragonflye/1.0.14/)</li><li>[1.1.1](./build-files/dragonflye/1.1.1/)</li><li>[1.1.2](./build-files/dragonflye/1.1.2/)</li><li>[1.2.0](./build-files/dragonflye/1.2.0/)</li><li>[1.2.1](./build-files/dragonflye/1.2.1/)</li></ul> </details> | https://github.com/rpetit3/dragonflye |
161161
| [Dr. PRG ](https://hub.docker.com/r/staphb/drprg) <br/> [![docker pulls](https://badgen.net/docker/pulls/staphb/drprg)](https://hub.docker.com/r/staphb/drprg) | <details><summary>Click to see all versions</summary> <ul><li>[0.1.1](./build-files/drprg/0.1.1/)</li></ul> </details> | https://mbh.sh/drprg/ |
162162
| [DSK](https://hub.docker.com/r/staphb/dsk) <br/> [![docker pulls](https://badgen.net/docker/pulls/staphb/dsk)](https://hub.docker.com/r/staphb/dsk) | <details><summary>Click to see all versions</summary> <ul><li>[0.0.100](./build-files/dsk/0.0.100/)</li><li>[2.3.3](./build-files/dsk/2.3.3/)</li></ul> </details> | https://gatb.inria.fr/software/dsk/ |

build-files/dorado/1.1.0-cuda12.2.0/README.md

Lines changed: 0 additions & 160 deletions
This file was deleted.
Lines changed: 36 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,36 @@
1+
FROM nvidia/cuda:12.2.0-base-ubuntu22.04 AS app
2+
3+
ARG DORADO_VER="1.4.0"
4+
5+
LABEL base.image="nvidia/cuda:12.2.0-base-ubuntu22.04"
6+
LABEL dockerfile.version="1"
7+
LABEL software="Dorado"
8+
LABEL software.version="${DORADO_VER}"
9+
LABEL description="A tool for basecalling Fast5/Pod5 files from Oxford Nanopore sequencing"
10+
LABEL website="https://github.com/nanoporetech/dorado"
11+
LABEL license="https://github.com/nanoporetech/dorado/blob/master/LICENSE"
12+
LABEL maintainer="Kutluhan Incekara"
13+
LABEL maintainer.email="kutluhan.incekara@ct.gov"
14+
15+
RUN apt-get update && apt-get install --no-install-recommends -y\
16+
wget \
17+
ca-certificates \
18+
pigz \
19+
procps &&\
20+
apt-get autoclean && rm -rf /var/lib/apt/lists/*
21+
22+
RUN wget -q https://cdn.oxfordnanoportal.com/software/analysis/dorado-${DORADO_VER}-linux-x64.tar.gz &&\
23+
tar --strip-components=1 -C /usr/local/ -xvf dorado-${DORADO_VER}-linux-x64.tar.gz &&\
24+
rm dorado-${DORADO_VER}-linux-x64.tar.gz
25+
26+
ENV LC_ALL=C
27+
28+
CMD ["dorado", "-h"]
29+
30+
WORKDIR /data
31+
32+
## Test ##
33+
FROM app AS test
34+
35+
RUN wget https://github.com/nanoporetech/dorado/raw/release-v1.1/tests/data/pod5/dna_r10.4.1_e8.2_400bps_5khz/dna_r10.4.1_e8.2_400bps_5khz-FLO_PRO114M-SQK_LSK114_XL-5000.pod5 &&\
36+
dorado basecaller fast dna_r10.4.1_e8.2_400bps_5khz-FLO_PRO114M-SQK_LSK114_XL-5000.pod5
Lines changed: 33 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,33 @@
1+
# Dorado container
2+
3+
Main tool: [dorado](https://github.com/nanoporetech/dorado)
4+
5+
Code repository: https://github.com/nanoporetech/dorado
6+
7+
Basic information on how to use this tool:
8+
- executable: dorado
9+
- help: `-h`, `--help`
10+
- version: `-v`, `--version`
11+
- description: Dorado is a high-performance, easy-to-use, open source basecaller for Oxford Nanopore reads.
12+
13+
Additional information:
14+
15+
- This container does not contain any models. The models can be downloaded manually by `dorado download` command. If no specific model is provided, Dorado will automatically select and download the appropriate one using the model selection complex.
16+
- Cuda drivers >= 12.2.0 are required for gpu support.
17+
- Use **"--gpus all"** flag to activate gpu support in docker. E.g.
18+
`docker run -it --gpus all staphb/dorado:1.4.0-cuda13.1.1-no_model bash`
19+
- Support for Fast5 files and basecalling models for DNA R10.4.1 4kHz data, DNA R9.4.1, and RNA002 were removed after v0.9.6
20+
21+
Full documentation: https://dorado-docs.readthedocs.io/en/latest/
22+
23+
## Example Usage
24+
```bash
25+
# list models
26+
dorado download --list
27+
# download a single model
28+
dorado download --model {model_name}
29+
# download all models
30+
dorado download
31+
# basecaller
32+
dorado basecaller {model} {data} > calls.bam
33+
```
Lines changed: 60 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,60 @@
1+
# Use NVIDIA CUDA image as the base image
2+
FROM nvidia/cuda:12.2.0-base-ubuntu22.04 AS app
3+
4+
ARG DORADO_VER="1.4.0"
5+
6+
# Metadata
7+
LABEL base.image="nvidia/cuda:12.2.0-base-ubuntu22.04"
8+
LABEL dockerfile.version="1"
9+
LABEL software="dorado ${DORADO_VER}"
10+
LABEL software.version="${DORADO_VER}"
11+
LABEL description="A tool for basecalling Fast5/Pod5 files from Oxford Nanopore sequencing"
12+
LABEL website="https://github.com/nanoporetech/dorado"
13+
LABEL license="https://github.com/nanoporetech/dorado/blob/master/LICENSE"
14+
LABEL original.website="https://nanoporetech.github.io/dorado/"
15+
LABEL maintainer="Fraser Combe"
16+
LABEL maintainer.email="fraser.combe@theiagen.com"
17+
18+
# Install dependencies
19+
RUN apt-get update && apt-get install -y --no-install-recommends \
20+
wget \
21+
curl \
22+
ca-certificates \
23+
pigz && \
24+
rm -rf /var/lib/apt/lists/* && apt-get autoclean
25+
26+
# Download and extract Dorado package
27+
RUN wget -q https://cdn.oxfordnanoportal.com/software/analysis/dorado-${DORADO_VER}-linux-x64.tar.gz &&\
28+
tar --strip-components=1 -C /usr/local/ -xvf dorado-${DORADO_VER}-linux-x64.tar.gz &&\
29+
rm dorado-${DORADO_VER}-linux-x64.tar.gz
30+
31+
# Set environment variables
32+
ENV LC_ALL=C
33+
34+
# Download basecalling models
35+
RUN mkdir /dorado_models && \
36+
cd /dorado_models && \
37+
dorado download --model all
38+
39+
# Default command
40+
CMD ["dorado", "-h"]
41+
42+
WORKDIR /data
43+
44+
# -----------------------------
45+
# Test Stage
46+
# -----------------------------
47+
FROM app AS test
48+
49+
# Download the specific Pod5 test file
50+
RUN wget https://github.com/nanoporetech/dorado/raw/release-v1.1/tests/data/pod5/dna_r10.4.1_e8.2_400bps_5khz/dna_r10.4.1_e8.2_400bps_5khz-FLO_PRO114M-SQK_LSK114_XL-5000.pod5
51+
52+
# Run test command (using CPU mode)
53+
RUN dorado basecaller \
54+
--device cpu \
55+
/dorado_models/dna_r10.4.1_e8.2_400bps_sup@v5.2.0 \
56+
dna_r10.4.1_e8.2_400bps_5khz-FLO_PRO114M-SQK_LSK114_XL-5000.pod5 \
57+
--emit-moves --max-reads 10 > basecalled.sam
58+
59+
# Verify the output file exists and is not empty
60+
RUN test -s basecalled.sam

0 commit comments

Comments
 (0)