Dorado

High-performance, easy-to-use, open source basecaller for Oxford Nanopore reads. Dorado Homepage

Available Modules¶

module load Dorado/0.9.1

Description¶

Dorado is a high-performance, easy-to-use, open source basecaller for Oxford Nanopore reads.

Features¶

One executable with sensible defaults, automatic hardware detection and configuration.
Nvidia GPUs including multi-GPU with linear scaling.
Modified basecalling (Remora models).
Duplex basecalling.
POD5 support for highest basecalling performance.
Based on libtorch, the C++ API for pytorch.
Multiple custom optimisations in CUDA and Metal for maximising inference performance.

License and Disclaimer¶

Dorado is distributed under the terms of the Oxford Nanopore Technologies, Ltd. Public License, v. 1.0. If a copy of the License was not distributed with this file, You can obtain one at http://nanoporetech.com

Example Slurm script¶

The following Slurm script is a template to run Basecalling on the NVIDIA A100 GPUs. We do not recommend running Dorado jobs on CPUs.
--device 'cuda:all' will automatically pick up the GPU over CPU
We are not providing the models as part of the module yet.

#!/bin/bash -e

#SBATCH --account        nesi12345
#SBATCH --job-name       dorado
#SBATCH --gpus-per-node  A100:1
#SBATCH --mem            6G
#SBATCH --cpus-per-task  4
#SBATCH --time           00:10:00
#SBATCH --output         slurmout.%j.out

module purge
module load Dorado/0.4.3

dorado download --model dna_r10.4.1_e8.2_400bps_hac@v4.1.0

dorado basecaller  --device 'cuda:all' dna_r10.4.1_e8.2_400bps_hac@v4.1.0 pod5s/ > calls.bam