Supported Applications

For more information on environment-modules see Finding Software.

On Mahuika, and Ancillary nodes (both Mahuika and Māui) software packages are provided using 'Lmod' an implementation of Environment Modules with additional features.
A list of available software can be obtained with the module spider command.

On Māui (XC50), software packages are provided using traditional Environment Modules. No modules are loaded by default.
A list of available software can be obtained with the module avail command.

Finite Element Analysis software for modeling, visualization and best-in-class implicit and explicit dynamics FEA.

Available Modules

module load ABAQUS/2022

module load ABAQUS/2022
Licence

ABAQUS is proprietary software.

Access to a valid licence token is required to run ABAQUS.

Institution Faculty Uptime
University of Auckland Faculty of Engineering 98%
University of Waikato 99%

Mass screening of contigs for antimicrobial and virulence genes

Available Modules

module load ABRicate/1.0.0-GCC-11.3.0-Perl-5.34.1

Assembly By Short Sequences - a de novo, parallel, paired-end sequence assembler

Available Modules

module load ABySS/2.2.5-gimpi-2020a

ACTC converts independent triangles into triangle strips or fans.

Available Modules

module load ACTC/1.1-GCCcore-9.2.0

Suite of tools to handle gene annotations in any GTF/GFF format.

Available Modules

module load AGAT/1.0.0-gimkl-2022a-Perl-5.34.1-R-4.2.1

Alignment of sequences with structural variants.

Available Modules

module load AGE/2017-GCC-7.4.0-serial
Links

Collection of tools for genome assembly

Available Modules

module load AMOS/3.1.0-GCC-11.3.0

NCBI Antimicrobial Resistance Gene Finder Plus

Available Modules

module load AMRFinderPlus/3.10.45-GCC-11.3.0

Calculate the bidirectional average nucleotide identity (gANI) and Alignment Fraction (AF) between two genomes.

Available Modules

module load ANIcalculator/1.0-GCC-11.3.0

Efficient software tool to utilize update-to-date information to functionally annotate genetic variants detected from diverse genomes .

Available Modules

module load ANNOVAR/2020Oct01-GCC-9.2.0-Perl-5.30.1

Available Modules

module load ANSYS/19.2
Licence

ANSYS is proprietary software.

Access to a valid licence token is required to run ANSYS.

Institution Faculty Uptime
University of Auckland Faculty of Engineering 72%
Orbis Diagnostics 100%
Auckland University of Technology Faculty of Engineering 72%
University of Waikato 72%
University of Canterbury 99%
University of Otago 99%

ANTLR, ANother Tool for Language Recognition, (formerly PCCTS) is a language tool that provides a framework for constructing recognizers, compilers, and translators from grammatical descriptions containing Java, C#, C++, or Python actions.

Available Modules

module load ANTLR/2.7.7-GCC-11.3.0

module load ANTLR/2.7.7-GCC-7.1.0

Ants is a versatile, easy to use Python library for developing ancillary applications. This package is restricted to the UM user group.

Available Modules

module load ANTS/0.9.0

ANTs extracts information from complex datasets that include imaging. ANTs is useful for managing, interpreting and visualizing multidimensional data.

Available Modules

module load ANTs/2.3.5-gimkl-2020a

AMD Optimized C/C++ & Fortran compilers (AOCC) based on LLVM 13.0

Available Modules

module load AOCC/4.0.0-GCC-11.3.0

Optimized version of BLIS for AMD EPYC family of processors..

Available Modules

module load AOCL-BLIS/4.0-GCC-11.3.0

Optimized version of FFTW for AMD EPYC family of processors.

Available Modules

module load AOCL-FFTW/4.0-gompi-2022a

Optimized version of ScaLAPACK for AMD EPYC family of processors.

Available Modules

module load AOCL-ScaLAPACK/4.0-gompi-2022a

Apache Portable Runtime (APR) libraries.

Available Modules

module load APR/1.6.5
Links

Apache Portable Runtime (APR) util libraries.

Available Modules

module load APR-util/1.6.1

Antimicrobial Resistance Identification By Assembly

Available Modules

module load ARIBA/2.14.6-gimkl-2022a-Python-3.10.5

a pArallel Server for Adaptive GeoInformation

Available Modules

module load ASAGI/20240327-foss-2023a

ATK provides the set of accessibility interfaces that are implemented by other toolkits and applications.

Available Modules

module load ATK/2.36.0-GCCcore-9.2.0
Links

AUGUSTUS is a program that predicts genes in eukaryotic genomic sequences

Collection of C++ library code designed to augment the C++ standard library.

Available Modules

module load Abseil/20230125.3-GCC-11.3.0

Ssearches for and removes remnant adapter sequences from High-Throughput Sequencing data.

Available Modules

module load AdapterRemoval/2.3.1-GCCcore-7.4.0

AdaptiveCpp (formerly hipSYCL) is a SYCL implementation targeting CPUs and GPUs, with a focus on leveraging existing toolchains such as CUDA or HIP

Available Modules

module load AdaptiveCpp/23.10.0-GCC-12.3.0-CUDA-12.2.2

Vectorization Optimization and Thread Prototyping - Vectorize & thread code or performance “dies” - Easy workflow + data + tips = faster code faster - Prioritize, Prototype & Predict performance gain

Available Modules

module load Advisor/2023.1.0

AlphaFold can predict protein structures with atomic accuracy even where no similar structure is known

Available Modules

module load AlphaFold/2.3.2

AlphaFold2 databases

Available Modules

module load AlphaFold2DB/2023-04
Links

Overrides the MKL internal utility function mkl_serv_intel_cpu_true so that AVX2 optimised kernels will be used, even when running on an AMD CPU.

Available Modules

module load AlwaysIntelMKL/1.0

Built to complement the rich, open source Python community, the Anaconda platform provides an enterprise-ready data analytics platform that empowers companies to adopt a modern open data science analytics architecture. IMPORTANT: This version of Anaconda Python comes with Intel MKL support to speed up certain types of mathematical computations, such as linear algebra or FFT. The module sets MKL_NUM_THREADS=1 to run MKL on a single thread by default, avoiding accidental oversubscription of cores. The number of threads can be increased for large problems, please refer to the Intel MKL documentation for guidance.

Available Modules

module load Anaconda2/2019.10-GCC-7.1.0

Built to complement the rich, open source Python community, the Anaconda platform provides an enterprise-ready data analytics platform that empowers companies to adopt a modern open data science analytics architecture.

Available Modules

module load Anaconda3/2021.05-gimkl-2020a

module load Anaconda3/2020.02-GCC-7.1.0

Apptainer is a portable application stack packaging and runtime utility.

Available Modules

module load Apptainer/1.3.1

C++ linear algebra library (matrix maths) aiming towards a good balance between speed and ease of use.

Available Modules

module load Armadillo/11.1.1-gimkl-2022a

Apache Arrow, a cross-language development platform for in-memory data.

Available Modules

module load Arrow/14.0.1-GCC-12.3.0

module load Arrow/8.0.0-GCCcore-9.2.0

IBM Aspera Command-Line Interface (the Aspera CLI) is a collection of Aspera tools for performing high-speed, secure data transfers from the command line. The Aspera CLI is for users and organizations who want to automate their transfer workflows.

Available Modules

module load Aspera-CLI/3.9.6.1467.159c5b1

OpenCL and Cuda accelerated version of AutoDock. It leverages its embarrasingly parallelizable LGA by processing ligand-receptor poses in parallel over multiple compute units.

Available Modules

module load AutoDock-GPU/1.5.3-GCC-11.3.0-CUDA-11.6.2

AutoDock Vina is an open-source program for doing molecular docking.

Available Modules

module load AutoDock_Vina/1.1.2_linux_x86

A collection of more than 500 macros for GNU Autoconf

Available Modules

module load Autoconf-archive/2023.02.20

BBMap short read aligner, and other bioinformatic tools.

Manipulate variant calls in the Variant Call Format (VCF) and its binary counterpart BCF.

Converts per cycle binary data output by Illumina sequencers containing basecall files and quality scores to per read FASTQ files

Available Modules

module load BCL-Convert/4.2.4

Bayesian MCMC phylogenetic analysis of molecular sequences for reconstructing phylogenies and testing evolutionary hypotheses.

Available Modules

BEDOPS is an open-source command-line toolkit that performs highly efficient and scalable Boolean and other set operations, statistical calculations, archiving, conversion and other management of genomic data of arbitrary scale.

Available Modules

module load BEDOPS/2.4.41-gimkl-2022a

The BEDTools utilities allow one to address common genomics tasks such as finding feature overlaps and computing coverage. The utilities are largely based on four widely-used file formats: BED, GFF/GTF, VCF, and SAM/BAM.

Available Modules

BEEF is a library implementing the Bayesian Error Estimation Functional, a description of which can be found here: http://dx.doi.org/10.1103/PhysRevB.85.235149

Available Modules

Collection of code for Bayesian genomic cline analyses.

Available Modules

module load BGC-Bayesian-genomic-clines/1.03-gompi-2023a

Basic Local Alignment Search Tool, or BLAST, is an algorithm for comparing primary biological sequence information, such as the amino-acid sequences of different proteins or the nucleotides of DNA sequences.

BLAST databases downloaded from NCBI.

Available Modules

module load BLASTDB/2024-04
Links

BLAT on DNA is designed to quickly find sequences of 95% and greater similarity of length 25 bases or more.

Available Modules

module load BLAT/3.5-gimkl-2018b

BLIS is a portable software framework for instantiating high-performance BLAS-like dense linear algebra libraries.

Available Modules

module load BLIS/4.0-GCC-11.3.0-amd

The BOLT-LMM algorithm for mixed model association testing, and the BOLT-REML algorithm for variance components analysis

Available Modules

module load BOLT-LMM/2.4

Pipeline for fully automated prediction of protein coding genes with GeneMark-ES/ET and AUGUSTUS in novel eukaryotic genomes.

Assessing genome assembly and annotation completeness with Benchmarking Universal Single-Copy Orthologs

Burrows-Wheeler Aligner (BWA) is an efficient program that aligns relatively short nucleotide sequences against a long reference sequence such as the human genome.

Available Modules
Links

BamTools provides both a programmer's API and an end-user's toolkit for handling BAM files.

Available Modules

module load BamTools/2.5.2-GCC-11.3.0

Bandage is a program for visualising de novo assembly graphs

Available Modules

module load Bandage/0.8.1_Centos

Basilisk is a Free Software program for the solution of partial differential equations on adaptive Cartesian meshes.

Available Modules

module load Basilisk/20180226-gimpi-2018b

Genome-Wide Scan for Adaptive Differentiation and Association Analysis with population-specific covariables

Available Modules

module load BayPass/2.31-intel-2022a

Identify candidate loci under natural selection from genetic data, using differences in allele frequencies between populations.

Available Modules

module load BayeScan/2.1-GCCcore-7.4.0

Program for inference of recent immigration rates between populations using unlinked multilocus genotypes

Available Modules

module load BayesAss/3.0.4-GCC-9.2.0

Bazel is a build tool that builds code quickly and reliably. It is used to build the majority of Google's software.

Available Modules

Package for phasing genotypes and for imputing ungenotyped markers.

Available Modules

module load Beagle/5.4-22Jul22.46e

Constructs sequence similarity networks of Biosynthetic Gene Clusters (BGCs) and groups them into Gene Cluster Families (GCFs).

Available Modules

module load BiG-SCAPE/1.1.5-gimkl-2022a-Python-3.11.3

Highly parallel construction, indexing and querying of colored and compacted de Bruijn graphs.

Available Modules

module load Bifrost/1.0.4-GCC-9.2.0

Read BigWig and BigBed genome feature databases

Available Modules

module load Bio-DB-BigFile/1.07-GCC-11.3.0-Perl-5.34.1

Read files using HTSlib including BAM/CRAM, Tabix and BCF database files

Available Modules

module load Bio-DB-HTS/3.01-GCC-11.3.0-Perl-5.34.1

Bio++ is a set of C++ libraries for Bioinformatics, including sequence analysis, phylogenetics, molecular evolution and population genetics. Bio++ is Object Oriented and is designed to be both easy to use and computer efficient. Bio++ intends to help programmers to write computer expensive programs, by providing them a set of re-usable tools.

Available Modules

module load BioPP/2.4.1-GCC-9.2.0

A tool to map bisulfite converted sequence reads and determine cytosine methylation states

Available Modules

module load Bismark/0.24.0-GCC-11.3.0

Bison is a general-purpose parser generator that converts an annotated context-free grammar into a deterministic LR or generalized LR (GLR) parser employing LALR(1) parser tables.

Available Modules

module load Bison/3.3.2-GCCcore-7.4.0

module load Bison/3.0.4-GCC-7.1.0

Blender provides a pipeline for 3D modeling, rigging, animation, simulation, rendering, compositing, motion tracking, video editing and 2D animation. This particular build of Blender provides a Python package 'bpy' rather than the stand-alone application.

Available Modules

module load BlenderPy/2.93.1-gimkl-2020a-Python-3.9.5

Ultrafast, memory-efficient short read aligner.

Available Modules

Ultrafast and memory-efficient tool for aligning sequencing reads to long reference sequences.

Available Modules

module load Bowtie2/2.4.5-GCC-11.3.0

A platform for running big bioinformatics jobs that consist of a series of processing stages

Available Modules

module load Bpipe/0.9.9.8

Hghly accurate statistical method that computes the abundance of species in DNA sequences from a metagenomics sample.

Available Modules

module load Bracken/2.7-GCC-11.3.0

Nucleotide-resolution analysis of structural variants

Available Modules

module load BreakSeq2/2.2-gimkl-2018b-Python-2.7.16

Clozure CL (often called CCL for short) is a free Common Lisp implementation

Available Modules

module load CCL/1.12-GCCcore-9.2.0
Links

CD-HIT is a very widely used program for clustering and comparing protein or nucleotide sequences.

Available Modules

module load CD-HIT/4.8.1-gimkl-2017a

CDO is a collection of command line Operators to manipulate and analyse Climate and NWP model Data.

Available Modules

module load CDO/2.1.1-gimpi-2022a

module load CDO/2.0.5-GCC-7.1.0
Links

CFITSIO is a library of C and Fortran subroutines for reading and writing data files in FITS (Flexible Image Transport System) data format.

Available Modules

module load CFITSIO/3.45-GCCcore-7.4.0

The goal of the CGAL Open Source Project is to provide easy access to efficient and reliable geometric algorithms in the form of a C++ library.

CMake, the cross-platform, open-source build system. CMake is a family of tools designed to build, test and package software.

Available Modules

module load CMake/3.27.7

module load CMake/3.22.1

Copy Number Variation discovery and genotyping from depth of read mapping.

Available Modules

module load CNVnator/0.4-GCC-7.4.0

Python package and command line tool for CNV/CNA analysis from depth-of-coverage by mapped read

Available Modules

module load CNVpytor/1.3.1-gimkl-2022a-Python-3.11.3

COMSOL is a multiphysics solver that provides a unified workflow for electrical, mechanical, fluid, and chemical applications.

Available Modules

module load COMSOL/6.1
Licence

COMSOL is proprietary software.

Access to a valid licence token is required to run COMSOL.

Institution Faculty Uptime
University of Auckland Physics 98%
Auckland Bioengineering Institute Implantable devices group 98%
University of Auckland Faculty of Engineering 98%
University of Auckland Deptartment of Engineering Science 98%
University of Auckland Deptartment of Engineering Science 98%
University of Otago 99%
University of Canterbury 100%

Program for unsupervised binning of metagenomic contigs by using nucleotide composition, coverage data in multiple samples and linkage data from paired end reads.

Available Modules

module load CONCOCT/1.1.0-gimkl-2020a-Python-3.8.2

CP2K is a freely available (GPL) program, written in Fortran 95, to perform atomistic and molecular simulations of solid state, liquid, molecular and biological systems. It provides a general framework for different methods such as e.g. density functional theory (DFT) using a mixed Gaussian and plane waves approach (GPW), and classical pair and many-body potentials.

Available Modules

The CPMD code is a parallelized plane wave / pseudopotential implementation of DFT, particularly designed for ab-initio molecular dynamics.

Available Modules

module load CPMD/4.3-foss-2023a

A tool for quick quality assessment of cram and bam files, intended for long read sequencing

Available Modules

module load CRAMINO/0.9.7-GCC-11.3.0

C++ Thread Pool Library

Available Modules

module load CTPL/0.0.2

CUDA (formerly Compute Unified Device Architecture) is a parallel computing platform and programming model created by NVIDIA and implemented by the graphics processing units (GPUs) that they produce. CUDA gives developers access to the virtual instruction set and memory of the parallel computational elements in CUDA GPUs.

Automated testing framework for C.

Available Modules

module load CUnit/2.1-3-GCCcore-9.2.0

Sequence assembler designed for high-noise single-molecule sequencing.

Available Modules

module load Canu/2.2-GCC-11.3.0

Fast data interchange format and capability-based RPC system.

Available Modules

A modern, C++-native, header-only, test framework for unit-tests, TDD and BDD - using C++11, C++14, C++17 and later (or C++03 on the Catch1.x branch)

Available Modules

module load Catch2/2.13.4

Cell Ranger is a set of analysis pipelines that process Chromium single-cell RNA-seq output to align reads, generate gene-cell matrices and perform clustering and gene expression analysis.

Available Modules

module load CellRanger/7.1.0

Classifier for metagenomic sequences

Available Modules

C++11 serialization library

Available Modules

module load Cereal/1.3.0

An asynchronous message-driven C++ communication library targeted towards tightly coupled, high-performance parallel machines.

Available Modules

module load Charm++/6.8.0-CrayIntel-23.02-19

CheckM provides a set of tools for assessing the quality of genomes recovered from isolates, single cells, or metagenomes.

Available Modules

module load CheckM/1.2.1-gimkl-2022a-Python-3.10.5

Rapid assessment of genome bin quality using machine learning

Available Modules

module load CheckM2/1.0.1-Miniconda3

Assess the quality of metagenome-assembled viral genomes.

Available Modules

A tool to circularize genome assemblies

Available Modules

module load Circlator/1.5.5-gimkl-2022a-Python-3.10.5

Package for visualizing data in a circular layout - this makes Circos ideal for exploring relationships between objects or positions.

Available Modules

module load Circos/0.69-9-GCC-11.3.0

Syumphonizing pileup and full-alignment for high-performance long-read variant calling.

Available Modules

module load Clair3/1.0.5-Miniconda3

C, C++, Objective-C compiler, based on LLVM. Does not include C++ standard library -- use libstdc++ from GCC.

Available Modules

module load Clang/17.0.6-GCC-12.3.0-CUDA-12.2.2

Clustal Omega is a multiple sequence alignment program for proteins. It produces biologically meaningful multiple sequence alignments of divergent sequences. Evolutionary relationships can be seen via viewing Cladograms or Phylograms

Available Modules

module load Clustal-Omega/1.2.4-gimkl-2020a

ClustalW2 is a general purpose multiple sequence alignment program for DNA or proteins.

Available Modules

module load ClustalW2/2.1-gimkl-2017a

Clusters contigs and counts reads from de novo assembled transcriptomes.

Available Modules

module load Corset/1.09-GCC-9.2.0

DNA read coverage and relative abundance calculator focused on metagenomics applications

Available Modules

module load CoverM/0.7.0-GCC-12.3.0

C++ port of the JUnit framework for unit testing.

Available Modules

module load CppUnit/1.15.1-GCCcore-11.3.0

Toolchain using Cray compiler wrapper, using PrgEnv-cray (PE release: February 2023).

Available Modules

module load CrayCCE/23.02

Toolchain using Cray compiler wrapper, using PrgEnv-gnu module (PE release: 23.02).

Available Modules

module load CrayGNU/23.02

Toolchain using Cray compiler wrapper, using PrgEnv-intel (PE release: February 2023 with Intel 19 compiler).

Available Modules

module load CrayIntel/23.02-19

Graphical report explorer report explorer for Scalasca and Score-P

Available Modules

module load CubeGUI/4.4.4-GCCcore-9.2.0

Cube general purpose C++ library component and command-line tools.

Available Modules

module load CubeLib/4.6-GCCcore-9.2.0

Cube high-performance C writer library component.

Available Modules

module load CubeWriter/4.5-GCCcore-9.2.0

Transcript assembly, differential expression, and differential regulation for RNA-Seq

Available Modules

module load Cufflinks/2.2.1-gimkl-2017a

Cytoscape is an open source software platform for visualizing molecular interaction networks and biological pathways and integrating these networks with annotations, gene expression profiles and other state data.

Available Modules

module load Cytoscape/3.9.1

D-Genies also allows to display dot plots from other aligners by uploading their PAF or MAF alignment file.

Available Modules

module load D-Genies/1.5.0-gimkl-2022a-Python-3.11.3

DAS Tool is an automated method that integrates the results of a flexible number of binning algorithms to calculate an optimized, non-redundant set of bins from a single assembly.

Available Modules

module load DAS_Tool/1.1.5-gimkl-2022a-R-4.2.1

Berkeley DB enables the development of custom data management solutions, without the overhead traditionally associated with such custom projects.

Available Modules

module load DB/6.2.23-gimkl-2017a
Links

D-Bus is a message bus system, a simple way for applications to talk to one another. In addition to interprocess communication, D-Bus helps coordinate process lifecycle; it makes it simple and reliable to code a "single instance" application or daemon, and to launch applications and daemons on demand when their services are needed.

Available Modules

module load DBus/1.14.0-GCC-11.3.0

Generally Applicable Atomic-Charge Dependent London Dispersion Correction.

Available Modules

module load DFT-D4/3.5.0-intel-2022a

Sequence aligner for protein and translated DNA searches

Assembler suitable for large genomes based on Illumina reads of length 250 or longer.

Available Modules

module load DISCOVARdenovo/52488

Tool for annotating metagenomic assembled genomes and VirSorter identified viral contigs..

Available Modules

module load DRAM/1.3.5-Miniconda3

Tool set for simulating/evaluating SVs, merging and comparing SVs within and among samples, and includes various methods to reformat or summarize SVs.

Available Modules

module load DaliLite/5.0-iimpi-2020a

A tool that can be used to automatically detect and efficiently remove sequence contaminations from genomic and metagenomic datasets.

Available Modules

module load DeconSeq/0.4.3

Available Modules

module load DeePMD-kit/2.1.1

Efficient method for 3D markerless pose estimation based on transfer learning with deep neural networks.

Available Modules

module load DeepLabCut/2.3.5

Defines the set of modules loaded by default

Available Modules

module load DefaultModules/
Links

Integrated simulation of sediment transport and morphology, waves, water quality and ecology.

Available Modules

module load Delft3D/141732-intel-2022a

3D modeling suite to investigate hydrodynamics, sediment transport and morphology and water quality for fluvial, estuarine and coastal environments

Available Modules

module load Delft3D_FM/68819-intel-2020a

Structural variant discovery by integrated paired-end and split-read analysis

Available Modules

module load Delly/1.1.3

High-performance, easy-to-use, open source basecaller for Oxford Nanopore reads.

Available Modules

Doxygen is a documentation system for C++, C, Java, Objective-C, Python, IDL (Corba and Microsoft flavors), Fortran, VHDL, PHP, C#, and to some extent D.

Available Modules

module load Doxygen/1.8.17-GCCcore-9.2.0

Fast calculation of the ABBA-BABA statistics across many populations/species

Available Modules

module load Dsuite/20210718-intel-compilers-2022.0.2

Automated whole-genome de-novo TE annotation and benchmarking the annotation performance of TE libraries.

Available Modules

module load EDTA/2.1.0

The EIGENSOFT package combines functionality from our population genetics methods (Patterson et al. 2006) and our EIGENSTRAT stratification correction method (Price et al. 2006). The EIGENSTRAT method uses principal components analysis to explicitly model ancestry differences between cases and controls along continuous axes of variation; the resulting correction is specific to a candidate marker’s variation in frequency across ancestral populations, minimizing spurious associations while maximizing power to detect true associations. The EIGENSOFT package has a built-in plotting script and supports multiple file formats and quantitative phenotypes.

Available Modules

module load EIGENSOFT/7.2.1-gimkl-2018b

Eigenvalue SoLvers for Petaflop-Applications .

Available Modules

Greyscale scientific image processing suite with a primary focus on processing data from transmission electron microscopes

Available Modules

module load EMAN2/2.99.47-gpu

EMBOSS is 'The European Molecular Biology Open Software Suite'. EMBOSS is a free Open Source software analysis package specially developed for the needs of the molecular biology (e.g. EMBnet) user community.

Available Modules

module load EMBOSS/6.6.0-gimkl-2020a

R package for integrated construction of Ecological Niche Models.

Available Modules

module load ENMTML/2021-06-24-gimkl-2020a-R-4.1.0

The Earth System Modeling Framework (ESMF) is software for building and coupling weather, climate, and related models.

Available Modules

module load ESMF/7.1.0r-intel-2018b

module load ESMF/7.1.0r-intel-2018b

A Python framework for the analysis and visualization of phylogenetic trees

EasyBuild is a software build and installation framework written in Python that allows you to install software in a structured, repeatable and robust way.

Available Modules

module load EasyBuild/4.9.0

module load EasyBuild/4.9.0

Eigen is a C++ template library for linear algebra: matrices, vectors, numerical solvers, and related algorithms.

Available Modules

module load Eigen/3.4.0

Embree is a collection of high-performance ray tracing kernels, developed at Intel. The target users of Embree are graphics application engineers who want to improve the performance of their photo-realistic rendering application by leveraging Embree's performance-optimized ray tracing kernels.

Available Modules

module load Embree/3.12.2-GCCcore-9.2.0

module load Embree/3.2.0-GCC-7.1.0

Completeness and contamination estimator for metagenomic assembled microbial eukaryotic genomes. Also condatains smetana, carveme and memote .

Available Modules

module load EukRep-EukCC/0.6.7-2.1.0-Miniconda3

Bayesian tree inference, particularly suitable for large-scale analyses.

Available Modules

module load ExaBayes/1.5.1-gimpi-2020a

Exascale Maximum Likelihood for phylogenetic inference using MPI.

Available Modules

module load ExaML/3.0.22-gimkl-2020a

Tool for estimating repeat sizes

Available Modules

module load ExpansionHunter/5.0.0

Extrae is capable of instrumenting applications based on MPI, OpenMP, pthreads, CUDA1, OpenCL1, and StarSs1 using different instrumentation approaches

Available Modules

module load Extrae/3.8.3-intel-2020a

Falcon: a set of tools for fast aligning long reads for consensus and assembly

Available Modules

module load FALCON/1.8.8-gimkl-2020a-Python-2.7.18

Tools for Short-Reads FASTA/FASTQ files preprocessing.

Available Modules

module load FASTX-Toolkit/0.0.14-gimkl-2017a

FCM Build - A powerful build system for modern Fortran software applications. FCM Version Control - Wrappers to the Subversion version control system, usage conventions and processes for scientific software development.

Available Modules

module load FCM/2019.09.0

module load FCM/2019.09.0-GCC-9.2.0
Links

Fire Dynamics Simulator (FDS) is a large-eddy simulation (LES) code for low-speed flows, with an emphasis on smoke and heat transport from fires.

Available Modules

module load FDS/6.7.1-CrayIntel-23.02-19

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions, of arbitrary input size, and of both real and complex data.

Available Modules

module load FFTW/3.3.9-CrayIntel-23.02-19

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions, of arbitrary input size, and of both real and complex data.

Available Modules

module load FFTW.MPI/3.3.10-gompi-2023a

A complete, cross-platform solution to record, convert and stream audio and video.

Available Modules

module load FFmpeg/3.4.2-GCC-7.1.0

An efficient and objective tool for optimizing microbiome rRNA gene trimming parameters.

Available Modules

module load FIGARO/1.1.2-gimkl-2022a-Python-3.11.3

FLTK is a cross-platform C++ GUI toolkit for UNIX/Linux (X11), Microsoft Windows, and MacOS X. FLTK provides modern GUI functionality without the bloat and supports 3D graphics via OpenGL and its built-in GLUT emulation.

Available Modules

module load FLTK/1.3.5-GCCcore-9.2.0

FTGL is a free cross-platform Open Source C++ library that uses Freetype2 to simplify rendering fonts in OpenGL applications. FTGL supports bitmaps, pixmaps, texture maps, outlines, polygon mesh, and extruded polygon rendering modes.

Available Modules

module load FTGL/2.1.3-rc5-GCCcore-7.4.0

module load FTGL/2.1.3-rc5-GCC-7.1.0

Tool for fast alignment-free computation of whole-genome Average Nucleotide Identity (ANI).

Available Modules

FastME: a comprehensive, accurate and fast distance-based phylogeny inference program.

Available Modules

module load FastME/2.1.6.2-GCC-9.2.0

A set of tools (in Java) for working with next generation sequencing data in the BAM format.

Available Modules

module load FastQC/0.12.1

FastQ Screen allows you to screen a library of sequences in FastQ format against a set of sequence databases so you can see if the composition of the library matches with what you expect.

Available Modules

module load FastQ_Screen/0.15.1-GCC-9.2.0-Perl-5.30.1

FastTree infers approximately-maximum-likelihood phylogenetic trees from alignments of nucleotide or protein sequences. FastTree can handle alignments with up to a million of sequences in a reasonable amount of time and memory.

Available Modules

A Perl version of the rename utility, with support for regular expressions.

Available Modules

module load File-Rename/1.13-GCC-9.2.0

Tool for filtering long reads by quality.

Available Modules

module load Filtlong/0.2.0

Identifies the FimH type in total or partial sequenced isolates of E. coli..

Available Modules

module load FimTyper/1.0.1-GCC-11.3.0-Perl-5.34.1

FlexiBLAS is a wrapper library that enables the exchange of the BLAS and LAPACK implementation used by a program without recompiling or relinking it.

Available Modules

module load FlexiBLAS/3.3.1-GCC-12.3.0

Flye is a de novo assembler for long and noisy reads, such as those produced by PacBio and Oxford Nanopore Technologies.

FragGeneScan is an application for finding (fragmented) genes in short reads.

Available Modules

module load FragGeneScan/1.31-gimkl-2018b

Genetic variant detector designed to find polymorphisms smaller than the length of a short-read sequencing alignment.

Available Modules

module load FreeBayes/1.3.2-GCC-9.2.0

FreeSurfer is a set of tools for analysis and visualization of structural and functional brain imaging data.

Available Modules

module load FreeSurfer/7.2.0-centos7_x86_64

FreeXL is an open source library to extract valid data from within an Excel (.xls) spreadsheet.

Available Modules

Free Implementation of the Unicode Bidirectional Algorithm.

Available Modules

module load FriBidi/1.0.12-GCC-12.3.0

The Genome Analysis Toolkit or GATK is a software package developed at the Broad Institute to analyse next-generation resequencing data. The toolkit offers a wide variety of tools, with a primary focus on variant discovery and genotyping as well as strong emphasis on data quality assurance. Its robust architecture, powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

The GNU Compiler Collection includes front ends for C, C++, Objective-C, Fortran, Java, and Ada, as well as libraries for these languages (libstdc++, libgcj,...).

Available Modules

module load GCC/12.3.0

module load GCC/9.2.0
Links

The GNU Compiler Collection includes front ends for C, C++, Objective-C, Fortran, Java, and Ada, as well as libraries for these languages (libstdc++, libgcj,...).

Available Modules

module load GCCcore/12.3.0

module load GCCcore/9.2.0

Interface to Gd Graphics Library

Available Modules

module load GD/2.76-GCC-11.3.0-Perl-5.34.1
Links

GDAL is a translator library for raster geospatial data formats that is released under an X/MIT style Open Source license by the Open Source Geospatial Foundation. As a library, it presents a single abstract data model to the calling application for all supported formats. It also comes with a variety of useful command-line utilities for data translation and processing. NOTE: The GDAL IO cache by default uses 5% of total memory. This seems not necessary. This module sets GDAL_CACHEMAX=256 (256MB), which should have no performance impact. Feel free to change if necessary, using 'export GDAL_CACHEMAX=xxx' (in your job script) after loading the GDAL module.

Available Modules

Genome-wide Efficient Mixed Model Association

Available Modules

module load GEMMA/0.98.4-GCC-9.2.0

GEOS (Geometry Engine - Open Source) is a C++ port of the Java Topology Suite (JTS)

Available Modules

OpenGL Mathematics (GLM) is a header only C++ mathematics library for graphics software based on the OpenGL Shading Language (GLSL) specifications.

Available Modules

module load GLM/0.9.9.8
Links

GNU Linear Programming Kit is intended for solving large-scale linear programming (LP), mixed integer programming (MIP), and other related problems.

Available Modules

module load GLPK/5.0-GCCcore-11.3.0

GLib is one of the base libraries of the GTK+ project

Available Modules

module load GLib/2.53.1-GCC-7.1.0

GMAP: A Genomic Mapping and Alignment Program for mRNA and EST Sequences GSNAP: Genomic Short-read Nucleotide Alignment Program

Available Modules

module load GMAP-GSNAP/2021-12-17-GCC-11.3.0

GMP is a free library for arbitrary precision arithmetic, operating on signed integers, rational numbers, and floating point numbers.

GMT is an open source collection of about 80 command-line tools for manipulating geographic and Cartesian data sets (including filtering, trend fitting, gridding, projecting, etc.) and producing PostScript illustrations ranging from simple x-y plots via contour maps to artificially illuminated surfaces and 3D perspective views; the GMT supplements add another 40 more specialized and discipline-specific tools.

Available Modules

module load GMT/6.4.0-GCC-7.1.0
Links

A genetic algorithm for docking flexible ligands into protein binding sites

Available Modules

module load GOLD/2022
Licence

GOLD is proprietary software.

GObject introspection is a middleware layer between C libraries (using GObject) and language bindings. The C library can be scanned at compile time and generate a metadata file, in addition to the actual native C library. Then at runtime, language bindings can read this metadata and automatically provide bindings to call into the C library.

Available Modules

module load GObject-Introspection/1.72.0-GCC-11.3.0

GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). It uses real-space uniform grids and multigrid methods or atom-centered basis-functions.

Available Modules

module load GPAW/24.1.0-foss-2023a

The Grid Analysis and Display System (GrADS) is an interactive desktop tool that is used for easy access, manipulation, and visualization of earth science data.

Available Modules

module load GRADS/2.2.0-GCC-7.1.0

The Geographic Resources Analysis Support System - used for geospatial data management and analysis, image processing, graphics and maps production, spatial modeling, and visualization

Available Modules

module load GRASS/8.2.1-gimkl-2022a

GRIDSS is a module software suite containing tools useful for the detection of genomic rearrangements.

Available Modules

module load GRIDSS/2.13.2-gimkl-2020a

GROMACS is a versatile package to perform molecular dynamics, i.e. simulate the Newtonian equations of motion for systems with hundreds to millions of particles.

The GNU Scientific Library (GSL) is a numerical library for C and C++ programmers. The library provides a wide range of mathematical routines such as random number generators, special functions and least-squares fitting.

GStreamer plug-ins and elements.

Available Modules

module load GST-plugins-base/1.16.2-GCC-9.2.0

GStreamer is a library for constructing graphs of media-handling components. The applications it supports range from simple Ogg/Vorbis playback, audio/video streaming to complex audio (mixing) and video (non-linear editing) processing.

Available Modules

module load GStreamer/1.16.2-GCC-9.2.0

module load GStreamer/1.16.0-GCC-7.1.0

A toolkit for assigning objective taxonomic classifications to bacterial and archaeal genomes.

GTK+ is the primary library used to construct user interfaces in GNOME.

Available Modules

module load GTK+/3.24.23-GCC-9.2.0

GTS stands for the GNU Triangulated Surface Library. It is an Open Source Free Software Library intended to provide a set of useful functions to deal with 3D surfaces meshed with interconnected triangles.

Available Modules

module load GTS/121130-GCCcore-7.4.0

module load GTS/0.7.6-CrayGNU-23.02

module load GTS/121130-GCC-7.1.0
Links

Assembly-free construction of UTRs from short read RNA-Seq data on the basis of coding sequence annotation.

Available Modules

module load GUSHR/2020-09-28-gimkl-2022a

Available Modules

module load Gaussian/09-D.01
Licence

Gaussian is proprietary software.

The Gdk Pixbuf is a toolkit for image loading and pixel buffer manipulation. It is used by GTK+ 2 and GTK+ 3 to load and manipulate images. In the past it was distributed as part of GTK+ 2 but it was split off into a separate package in preparation for the change to GTK+ 3.

Available Modules

module load Gdk-Pixbuf/2.42.8-GCC-11.3.0

Eukaryotic gene prediction suite with automatic training

Available Modules

module load GeneMark-ES/4.71-GCC-11.3.0

Generates circular genome representations for complete, draft, and multiple bacterial and archaeal genomes.

Available Modules

module load GenoVi/0.2.16-Miniconda3

GenomeThreader is a software tool to compute gene structure predictions.

Available Modules

module load GenomeThreader/1.7.3-Linux_x86_64-64bit

Gerris is a Free Software program for the solution of the partial differential equations describing fluid flow. This module also includes GfsView, a visualisation application for Gerris output.

Available Modules

module load Gerris/20180227-gimkl-2018b

module load Gerris/20180227-gimpi-2018b

Toolkit to assemble organelle genome from genomic skimming data.

Available Modules

module load GetOrganelle/1.7.7.0-gimkl-2022a

Gene finder based on a Generalized Hidden Markov Model.

Available Modules

module load GlimmerHMM/3.0.4-gimkl-2018b

An open source programming language

Available Modules

module load Go/1.21.3

module load Go/1.19.1
Links

Graphviz is open source graph visualization software. Graph visualization is a way of representing structural information as diagrams of abstract graphs and networks. It has important applications in networking, bioinformatics, software engineering, database and web design, machine learning, and in visual interfaces for other technical domains.

Available Modules

module load Graphviz/2.42.2-GCC-9.2.0

Genealogies Unbiased By recomBinations In Nucleotide Sequences

Available Modules

module load Gubbins/3.2.2-gimkl-2022a-Python-3.10.5

Guile is the GNU Ubiquitous Intelligent Language for Extensions, the official extension language for the GNU operating system.

Available Modules

module load Guile/1.8.8-GCC-5.4.0

HDF (also known as HDF4) is a library and multi-object file format for storing and managing data between machines.

Available Modules

module load HDF/4.2.13-iccifort-2018.1.163-GCC-7.1.0-fortran
Links

HDF-EOS (Hierarchical Data Format - Earth Observing System) is a self-describing file format for transfer of various types of data between different machines based upon HDF. HDF-EOS is a standard format to store data collected from EOS satellites: Terra, Aqua and Aura.

Available Modules

module load HDF-EOS/2.19v1.00-GCC-7.1.0

HDF-EOS (Hierarchical Data Format - Earth Observing System) is a self-describing file format for transfer of various types of data between different machines based upon HDF. HDF-EOS is a standard format to store data collected from EOS satellites: Terra, Aqua and Aura.

Available Modules

module load HDF-EOS5/5.1.15-GCC-7.1.0

HDF5 is a unique technology suite that makes possible the management of extremely large and complex data collections.

HISAT2 is a fast and sensitive alignment program for mapping next-generation sequencing reads (both DNA and RNA) against the general human population (as well as against a single reference genome).

Available Modules

HMMER is used for searching sequence databases for homologs of protein sequences, and for making protein sequence alignments. It implements methods using probabilistic models called profile hidden Markov models (profile HMMs). Compared to BLAST, FASTA, and other sequence alignment and database search tools based on older scoring methodology, HMMER aims to be significantly more accurate and more able to detect remote homologs because of the strength of its underlying mathematical models. In the past, this strength came at significant computational expense, but in the new HMMER3 project, HMMER is now essentially as fast as BLAST.

HMMER is used for searching sequence databases for homologs of protein sequences, and for making protein sequence alignments. It implements methods using probabilistic models called profile hidden Markov models (profile HMMs). Compared to BLAST, FASTA, and other sequence alignment and database search tools based on older scoring methodology, HMMER aims to be significantly more accurate and more able to detect remote homologs because of the strength of its underlying mathematical models. In the past, this strength came at significant computational expense, but in the new HMMER3 project, HMMER is now essentially as fast as BLAST.

Available Modules

module load HMMER2/2.4i-GCC-11.3.0

Pipeline which focuses on screening MALT data for the presence of a user-specified list of target species.

Available Modules

module load HOPS/0.33

HTSeq is a Python library to facilitate processing and analysis of data from high-throughput sequencing (HTS) experiments.

Available Modules

module load HTSeq/0.13.5-gimkl-2020a-Python-3.9.5

A C library for reading/writing high-throughput sequencing data. This package includes the utilities bgzip and tabix

HarfBuzz is an OpenType text shaping engine.

Available Modules

module load HarfBuzz/4.4.1-GCC-12.3.0

HPC GridRunner is a simple command-line interface to high throughput computing using a variety of different grid computing platforms, including LSF, SGE, SLURM, and PBS.

Available Modules

module load HpcGridRunner/20210803

Pipeline for efficiently and accurately determining the coverage and abundance of microbial pathways in a community from metagenomic data.

Available Modules

module load Humann/3.0.0.alpha.3

Extracting Coding Sequence and Introns for Phylogenetics from High-Throughput Sequencing Reads Using Target Enrichment.

Available Modules

module load HybPiper/2.0.1rc-Miniconda3

Hypre is a library for solving large, sparse linear systems of equations on massively parallel computers. The problems of interest arise in the simulation codes being developed at LLNL and elsewhere to study physical phenomena in the defense, environmental, energy, and biological sciences.

Available Modules

module load Hypre/2.25.0-gimkl-2022a

ICU is a mature, widely used set of C/C++ and Java libraries providing Unicode and Globalization support for software applications.

Available Modules

module load ICU/71.1-GCC-11.3.0

module load ICU/62.1-GCC-7.1.0
Links

IDBA-UD is a iterative De Bruijn Graph De Novo Assembler for Short Reads Sequencing data with Highly Uneven Sequencing Depth.

Available Modules

module load IDBA-UD/1.1.3-gimkl-2018b

IDL is the trusted scientific programming language used across disciplines to extract meaningful visualizations from complex numerical data.

Available Modules

module load IDL/8.7
Links

The Integrative Genomics Viewer (IGV) is a high-performance visualization tool for interactive exploration of large, integrated genomic datasets. It supports a wide variety of data types, including array-based and next-generation sequence data

Available Modules

module load IGV/2.16.1
Links

Genotype imputation and haplotype phasing.

Available Modules

module load IMPUTE/2.3.2

Efficient phylogenomic software by maximum likelihood

Available Modules

Available Modules

module load IQmol/2.11
Links
Licence

IQmol is proprietary software.

R packages for providing R kernel for Jupyter.

Available Modules

module load IRkernel/1.0.1-gimkl-2018b

Intelligent Storage Acceleration Library

Available Modules

ImageMagick is a software suite to create, edit, compose, or convert bitmap images

Available Modules

module load ImageMagick/7.0.9-5-GCCcore-9.2.0

module load ImageMagick/7.0.8-11-GCC-7.1.0

Infernal ('INFERence of RNA ALignment') is for searching DNA sequence databases for RNA structure and sequence similarities.

Available Modules

Intel Inspector XE is an easy to use memory error checker and thread checker for serial and parallel applications

Available Modules

module load Inspector/2019_update5

Sequence analysis application (nucleotide and protein sequences) that combines different protein signature recognition methods into one resource.

Just Another Gibbs Sampler - a program for the statistical analysis of Bayesian hierarchical models by Markov Chain Monte Carlo.

Available Modules

A programmer-oriented testing framework for Java.

Available Modules

module load JUnit/4.12

The JasPer Project is an open-source initiative to provide a free software-based reference implementation of the codec specified in the JPEG-2000 Part-1 standard.

Available Modules

module load JasPer/2.0.14-GCC-7.1.0

Java Platform, Standard Edition (Java SE) lets you develop and deploy Java applications on desktops and servers.

Available Modules

module load Java/20.0.2

module load Java/1.8.0_144

Jellyfish is a tool for fast, memory-efficient counting of k-mers in DNA.

Available Modules

JsonCpp is a C++ library that allows manipulating JSON values, including serialization and deserialization to and from strings. It can also preserve existing comment in unserialization/serialization steps, making it a convenient format to store user input files.

Available Modules

module load JsonCpp/1.9.3-GCCcore-9.2.0

A high-level, high-performance dynamic language for technical computing. This version was compiled from source with USE_INTEL_JITEVENTS=1 to enable profiling with VTune.

The K-mer Analysis Toolkit (KAT) contains a number of tools that analyse and compare K-mer spectra.

Available Modules

module load KAT/2.4.2-gimkl-2018b-Python-3.7.3
Links

KEALib provides an implementation of the GDAL data model. The format supports raster attribute tables, image pyramids, meta-data and in-built statistics while also handling very large files and compression throughout. Based on the HDF5 standard, it also provides a base from which other formats can be derived and is a good choice for long term data archiving. An independent software library (libkea) provides complete access to the KEA image format and a GDAL driver allowing KEA images to be used from any GDAL supported software.

Available Modules

module load KEALib/1.4.15-gimpi-2020a

Disk-based programm for counting k-mers from (possibly gzipped) FASTQ/FASTA files.

Available Modules

module load KMC/3.1.2rc1-gimkl-2020a-Python-3.8.2
Links

Kaiju is a program for sensitive taxonomic classification of high-throughput sequencing reads from metagenomic whole genome sequencing experiments

Available Modules

module load Kaiju/1.8.2-gimkl-2020a-Python-3.9.9-Perl-5.30.1-mt

Collection of tools used by the UCSC genome browser.

Available Modules

module load Kent_tools/20200519-linux.x86_64

KmerGenie estimates the best k-mer length for genome de novo assembly.

Available Modules

module load KmerGenie/1.7051-gimkl-2022a

Semi-HMM-based Nucleic Acid Parser

Available Modules

module load KorfSNAP/2013-11-29-GCC-9.2.0

Taxonomic sequence classifier.

Krona Tools is a set of scripts to create Krona charts from several Bioinformatics tools as well as from text and XML files.

Available Modules

module load KronaTools/2.8.1-GCC-11.3.0-Perl-5.34.1

Library of routines for managing a database.

Available Modules

module load KyotoCabinet/1.2.77-GCCcore-7.4.0

LAME is a high quality MPEG Audio Layer III (MP3) encoder licensed under the LGPL.

Available Modules

module load LAME/3.100-GCCcore-9.2.0

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. LAMMPS has potentials for solid-state materials (metals, semiconductors) and soft matter (biomolecules, polymers) and coarse-grained or mesoscopic systems. It can be used to model atoms or, more generically, as a parallel particle simulator at the atomic, meso, or continuum scale. LAMMPS runs on single processors or in parallel using message-passing techniques and a spatial-decomposition of the simulation domain. The code is designed to be easy to modify or extend with new functionality.

Available Modules

LAST finds similar regions between sequences.

Available Modules

module load LAST/1060-gimkl-2020a

LASTZ is a program for aligning DNA sequences, a pairwise aligner. Originally designed to handle sequences the size of human chromosomes and from different species, it is also useful for sequences produced by NGS sequencing technologies such as Roche 454.

Available Modules

module load LASTZ/1.04.03-GCC-9.2.0

D programming language compiler

Available Modules

module load LDC/1.26.0
Links

Determines the features most likely to explain differences between classes by coupling standard tests for statistical significance with additional tests encoding biological consistency and effect relevance

Available Modules

module load LEfSe/1.1.2-gimkl-2022a-R-4.2.1-Python-3.10.5

Alignment-free scaffolding of genome assembly drafts with long reads

Available Modules

module load LINKS/1.8.7-gimkl-2018b

The LLVM Core libraries provide a modern source- and target-independent optimizer, along with code generation support for many popular CPUs (as well as some less common ones!) These libraries are built around a well specified code representation known as the LLVM intermediate representation ("LLVM IR"). The LLVM Core libraries are well documented, and it is particularly easy to invent your own language (or port an existing compiler) to use LLVM as an optimizer and code generator. This build includes the clang C/C++ compiler frontend.

Available Modules

LMDB is a fast, memory-efficient database. With memory-mapped files, it has the read performance of a pure in-memory database while retaining the persistence of standard disk-based databases.

Available Modules

Least-squares methods to estimate rates and dates from phylogenies

Available Modules

module load LSD2/2.3-GCC-11.3.0

Highly accurate and sensitive program for identification of LTR retrotransposons; The LTR Assembly Index (LAI) is also included in this package.

Available Modules

module load LTR_retriever/2.9.0-gimkl-2020a

A probabilistic framework for structural variant discovery.

Available Modules

module load LUMPY/0.3.1-gimkl-2020a-Python-3.8.2

Portable lossless data compression library

Available Modules

module load LZO/2.10-GCCcore-9.2.0
Links

Available Modules

module load LegacySystemLibs/7
Links

tiff: Library and tools for reading and writing TIFF data files

Available Modules

module load LibTIFF/4.3.0-CrayIntel-23.02-19

module load LibTIFF/4.4.0-GCCcore-9.2.0

Libint library is used to evaluate the traditional (electron repulsion) and certain novel two-body matrix elements (integrals) over Cartesian Gaussian functions used in modern atomic and molecular theory.

Available Modules

module load Libint/2.6.0-iimpi-2022a-lmax-6-cp2k

module load Libint/1.1.6-CrayGNU-23.02

Tool that accurately maps annotations in GFF or GTF between assemblies of the same, or closely-related species.

Available Modules

module load Liftoff/1.6.3.2-gimkl-2022a-Python-3.11.3

Color management engine.

Available Modules

module load LittleCMS/2.9-GCCcore-9.2.0

A genome assembly correction and scaffolding pipeline using long reads

Available Modules

module load LongStitch/1.0.4-Miniconda3

GNU M4 is an implementation of the traditional Unix macro processor. It is mostly SVR4 compatible although it has some extensions (for example, handling more than 9 positional parameters to macros). GNU M4 also has built-in functions for including files, running shell commands, doing arithmetic, etc.

Available Modules

module load M4/1.4.18-GCCcore-7.4.0
Links

Multiple sequence alignment program offering a range of methods.

Available Modules

Tool for gene analysis and generalized gene-set analysis of GWAS data.

Available Modules

module load MAGMA/1.08-GCCcore-9.2.0

Genome annotation pipeline

Available Modules

module load MAKER/2.31.9-gimkl-2020a

matio is an C library for reading and writing Matlab MAT files.

Available Modules

module load MATIO/1.5.17-GCCcore-9.2.0

A high-level language and interactive environment for numerical computing.

Available Modules

module load MATLAB/2023b

module load MATLAB/2022a
Licence

MATLAB is proprietary software.

Access to a valid licence token is required to run MATLAB.

Institution Uptime
University of Waikato 99%
National Institute of Water and Atmospheric Research 0%
Institute of Geological and Nuclear Sciences 99%
Massey University 36%
University of Auckland 98%
University of Otago 100%
Victoria University of Wellington 100%
Auckland University of Technology 35%
University of Canterbury 35%
Plant & Food Research 36%

The MCL algorithm is short for the Markov Cluster Algorithm, a fast and scalable unsupervised cluster algorithm for graphs (also known as networks) based on simulation of (stochastic) flow in graphs.

Available Modules

module load MCL/14.137-gimkl-2020a
Links

The Matlab Compiler Runtime is required for running compiled MATLAB executables without MATLAB itself.

Available Modules

module load MCR/2021a
Links

An ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph

Available Modules

module load MEGAHIT/1.2.9-gimkl-2022a-Python-3.10.5

Metabolic And Biogeochemistry anaLyses In microbes

Available Modules

module load METABOLIC/4.0-Miniconda3

METIS is a set of serial programs for partitioning graphs, partitioning finite element meshes, and producing fill reducing orderings for sparse matrices. The algorithms implemented in METIS are based on the multilevel recursive-bisection, multilevel k-way, and multi-constraint partitioning schemes.

MMseqs2: ultra fast and sensitive search and clustering suite

Available Modules

Software tools for clustering, reconstruction and typing of plasmids from draft assemblies.

Available Modules

module load MOB-suite/3.0.3-Miniconda3

MODFLOW is the U.S. Geological Survey modular finite-difference flow model, which is a computer code that solves the groundwater flow equation. The program is used by hydrogeologists to simulate the flow of groundwater through aquifers.

Available Modules

module load MODFLOW/6.2.2-GCC-9.2.0

The MPFR library is a C library for multiple-precision floating-point computations with correct rounding.

Available Modules

Multiple Sequentially Markovian Coalescent, infers population size and gene flow from multiple genome sequences

Available Modules

module load MSMC/1.1.0-GCC-7.4.0

A parallel sparse direct solver

MUMmer is a system for rapidly aligning entire genomes, whether in complete or draft form. AMOS makes use of it.

Available Modules

MUSCLE is a program for creating multiple alignments of amino acid or nucleotide sequences. A range of options is provided that give you the choice of optimizing accuracy, speed, or some compromise between the two.

Available Modules

module load MUSCLE/3.8.1551

MUST detects usage errors of the Message Passing Interface (MPI) and reports them to the user.

Available Modules

module load MUST/1.6-gimkl-2020a-Python-3.8.2

MaSuRCA is whole genome assembly software. It combines the efficiency of the de Bruijn graph and Overlap-Layout-Consensus (OLC) approaches. MaSuRCA can assemble data sets containing only short reads from Illumina sequencing or a mixture of short reads and long reads (Sanger, 454, Pacbio and Nanopore).

Magma is a large, well-supported software package designed for computations in algebra, number theory, algebraic geometry and algebraic combinatorics. It provides a mathematically rigorous environment for defining and working with structures such as groups, rings, fields, modules, algebras, schemes, curves, graphs, designs, codes and many others. Magma also supports a number of databases designed to aid computational research in those areas of mathematics which are algebraic in nature. whatis([==[Homepage: http://magma.maths.usyd.edu.au/magma/

Available Modules

module load Magma/Magma-2.26.9
Links

Mamba is a fast, robust, and cross-platform package manager.

Available Modules

module load Mamba/23.1.0-1

Workflow for effective discovery of SCN loci in flowering plants angiosperms

Available Modules

module load MarkerMiner/1.0-gimkl-2020a-Python-2.7.18

Fast genome and metagenome distance estimation using MinHash

Available Modules

Implements a fast and approximate algorithm for computing local alignment boundaries between long DNA sequences

Available Modules

module load MashMap/3.0.4-Miniconda3

Binary maven install, Apache Maven is a software project management and comprehension tool. Based on the concept of a project object model (POM), Maven can manage a project's build, reporting and documentation from a central piece of information.

Available Modules

module load Maven/3.6.0

MaxBin is software for binning assembled metagenomic sequences based on an Expectation-Maximization algorithm.

Available Modules

module load MaxBin/2.2.7-GCC-11.3.0-Perl-5.34.1

Evaluate genome assemblies with k-mers and more

Available Modules

module load Merqury/1.3-Miniconda3

Mesa is an open-source implementation of the OpenGL specification - a system for rendering interactive 3D graphics. Note that this build enables CPU-based rendering with OpenSWR and LLVM. The module is intended to be used with visualisation software, such as ParaView, on nodes where no GPU hardware is available. Both on-screen and off-screen rendering are supported. IMPORTANT: The OpenSWR software rasteriser can use multiple threads for best performance. The number of threads is controlled by the environment variable KNOB_MAX_WORKER_THREADS. The module sets KNOB_MAX_WORKER_THREADS=1 by default to avoid accidental oversubscription of nodes.

Available Modules

module load Mesa/19.2.1-GCCcore-9.2.0

module load Mesa/18.1.8-GCC-7.1.0

Meson is a cross-platform build system designed to be both as fast and as user friendly as possible.

Available Modules

module load Meson/0.62.1

An efficient tool for accurately reconstructing single genomes from complex microbial communities

Available Modules

module load MetaBAT/2.15-GCC-11.3.0

MetaEuk is a modular toolkit designed for large-scale gene discovery and annotation in eukaryotic metagenomic contigs.

Available Modules

module load MetaEuk/6-GCC-11.3.0

MetaGeneAnnotator is a gene-finding program for prokaryote and phage.

Available Modules

module load MetaGeneAnnotator/20080819-x86-64

MetaPhlAn is a computational tool for profiling the composition of microbial communities (Bacteria, Archaea, Eukaryotes and Viruses) from metagenomic shotgun sequencing data (i.e. not 16S) with species-level. With the newly added StrainPhlAn module, it is now possible to perform accurate strain-level microbial profiling.

MetaPhlAn is a computational tool for profiling the composition of microbial communities (Bacteria, Archaea, Eukaryotes and Viruses) from metagenomic shotgun sequencing data (i.e. not 16S) with species-level. With the newly added StrainPhlAn module, it is now possible to perform accurate strain-level microbial profiling.

Available Modules

module load MetaPhlAn2/2.7.8-gimkl-2018b-Python-3.7.3

Structural-variant caller

Available Modules

module load MetaSV/0.5.4-gimkl-2018b-Python-2.7.16

Available Modules

module load Metashape/1.6
Links

Taxonomic classification of rRNA.

Available Modules

module load Metaxa2/2.2.3-gimkl-2022a

Available Modules

module load MiMiC/0.2.0-gimkl-2022a-Python-3.11.3

The MiMiC communication library (MCL) enables communication between external programs coupled through the MiMiC framework.

Available Modules

module load MiMiC-CommLib/2.0.2-gimkl-2022a

A platform for Python-based data analytics

Available Modules

module load Miniconda3/23.10.0-1

module load Miniconda3/22.11.1-1

module load Miniconda3/23.10.0-1

Low memory and more computationally efficient implementation of the genotype imputation algorithms.

Available Modules

module load Minimac3/2.0.1

Low memory and more computationally efficient implementation of the genotype imputation algorithms.

Available Modules

module load Minimac4/4.1.2

Toolkit which aims to automatically filter pair-end raw data, assemble genome, search for mitogenome sequences from the genome assembly result, annotate mitogenome, and mitogenome visualization.

Available Modules

module load MitoZ/2.3-gimkl-2020a-Python-3.8.2

Novel dot plot visualization tool used to view tandem repeats

Available Modules

module load ModDotPlot/2023-06-gimkl-2022a-Python-3.11.3

Tool for selecting the best-fit model of evolution for DNA and protein alignments.

Available Modules

module load ModelTest-NG/0.1.7-gompi-2022a

Molcas is an ab initio quantum chemistry software package developed by scientists to be used by scientists. The basic philosophy is is to be able to treat general electronic structures for molecules consisting of atoms from most of the periodic table. As such, the primary focus of the package is on multiconfigurational methods with applications typically connected to the treatment of highly degenerate states.

Available Modules

module load Molcas/8.0-15.06.18_CentOS_6.6_x86_64

Molpro is a complete system of ab initio programs for molecular electronic structure calculations.

Available Modules

module load Molpro/mpp-2019.2.2.linux_x86_64_openmp

An open source, cross-platform, implementation of C# and the CLR that is binary compatible with Microsoft.NET.

Available Modules

module load Mono/6.12.0.122

An analysis toolkit for single-cell RNA-seq.

Available Modules

module load Monocle3/1.0.0-gimkl-2020a-R-4.0.1

Mothur is a single piece of open-source, expandable software to fill the bioinformatics needs of the microbial ecology community.

Available Modules

module load Mothur/1.41.0-gimkl-2018b-Python-2.7.16

MrBayes is a program for the Bayesian estimation of phylogeny.

Available Modules

module load MrBayes/3.2.7-gimpi-2022a

Mule is an API written in Python which allows you to access and manipulate files produced by the UM (Unified Model, of the Met Office (UK)).

Aggregate results from bioinformatics analyses across many samples into a single report. MultiQC searches a given directory for analysis logs and compiles a HTML report. It's a general use tool, perfect for summarising the output from numerous bioinformatics tools.

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems.

Available Modules

module load NAMD/2.14-gimkl-2022a-mpi

NASM: General-purpose x86 assembler

Available Modules

module load NASM/2.14.02-CrayGNU-23.02

module load NASM/2.15.05

The NVIDIA Collective Communications Library (NCCL) implements multi-GPU and multi-node collective communication primitives that are performance optimized for NVIDIA GPUs.

NCL is an interpreted language designed specifically for scientific data analysis and visualization.

Available Modules

module load NCL/6.6.2-intel-2018b

module load NCL/6.4.0-GCC-7.1.0
Links

manipulates and analyzes data stored in netCDF-accessible formats, including DAP, HDF4, and HDF5

Available Modules

module load NCO/4.8.1-gimpi-2022a

module load NCO/4.8.1-GCC-7.1.0
Links

Ncview is a visual browser for netCDF format files. Typically you would use ncview to get a quick and easy, push-button look at your netCDF files. You can view simple movies of the data, view along various dimensions, take a look at the actual data values, change color maps, invert the data, etc.

Available Modules

module load NCVIEW/2.1.8-GCC-7.1.0

Rrror correction and de-novo assembly tool for Nanopore long noisy reads

Available Modules

module load NECAT/0.0.1-GCC-11.3.0

NGS is a new, domain-specific API for accessing reads, alignments and pileups produced from Next Generation Sequencing.

NLopt is a free/open-source library for nonlinear optimization, providing a common interface for a number of different free optimization routines available online as well as original implementations of various other algorithms

Available Modules

module load NLopt/2.5.0-gimkl-2018b

Available Modules

module load NONMEM/7.5.1-iimpi-2022a
Links

Netscape Portable Runtime (NSPR) provides a platform-neutral API for system level and libc-like functions.

Available Modules

module load NSPR/4.34-GCC-11.3.0

Network Security Services (NSS) is a set of libraries designed to support cross-platform development of security-enabled client and server applications.

Available Modules

module load NSS/3.64-GCC-11.3.0
Links

C, C++ and Fortran compilers included with the NVIDIA HPC SDK (previously: PGI)

NWChem aims to provide its users with computational chemistry tools that are scalable both in their ability to treat large scientific computational chemistry problems efficiently, and in their use of available parallel computing resources from high-performance parallel supercomputers to conventional workstation clusters. NWChem software can handle: biomolecules, nanostructures, and solid-state; from quantum to classical, and all combinations; Gaussian basis functions or plane-waves; scaling from one to thousands of processors; properties and relativity.

Available Modules

module load NWChem/6.8.1.revision133-gimkl-2018b-2018-06-14-Python-2.7.16

Comparing runs of Oxford Nanopore sequencing data and alignments

Available Modules

module load NanoComp/1.20.0-gimkl-2022a-Python-3.10.5

Removing reads mapping to the lambda genome.

Available Modules

module load NanoLyse/1.2.0-gimkl-2020a

Plotting suite for Oxford Nanopore sequencing data and alignments.

Available Modules

module load NanoPlot/1.41.0-gimkl-2022a-Python-3.10.5

Tool for phasing genomic variants using DNA sequencing reads, also called read-based phasing or haplotype assembly.

Available Modules

module load NanoStat/1.5.0-gimkl-2020a-Python-3.8.2

This implements a Gibbs sampler to estimate the posterior probability that genetically sampled individuals fall into each of a set of user-defined hybrid categories.

Available Modules

module load NewHybrids/1.1_Beta3-GCC-11.3.0

NX is a general-purpose program package for simulating the dynamics of electronically excited molecules and molecular assemblies.

Available Modules

module load Newton-X/2.4

NextGenMap is a flexible highly sensitive short read mapping tool that handles much higher mismatch rates than comparable algorithms while still outperforming them in terms of runtime.

Available Modules

module load NextGenMap/0.5.5-GCC-11.3.0

Nextflow is a reactive workflow framework and a programming DSL that eases writing computational pipelines with complex data

Available Modules

Nim is a systems and applications programming language.

Available Modules

module load Nim/1.6.14-GCC-11.3.0
Links

Ninja is a small build system with a focus on speed.

Available Modules

module load Ninja/1.11.1

NVIDIA® Nsight™ Compute is an interactive kernel profiler for CUDA applications. It provides detailed performance metrics and API debugging via a user interface and command line tool.

Available Modules

module load Nsight-Compute/2023.2.2.3

NVIDIA® Nsight™ Systems is a system-wide performance analysis tool designed to visualize an application’s algorithm, help you select the largest opportunities to optimize, and tune to scale efficiently across any quantity of CPUs and GPUs

Available Modules

module load Nsight-Systems/2023.3.1.92

The OASIS coupler is a software allowing synchronized exchanges of coupling information between numerical codes representing different components of the climate system.

Available Modules

module load OASIS3-MCT/r1660-CrayIntel-23.02-19

Manipulate various data and sequence files.

Available Modules

module load OBITools/1.2.13

Oracle Call Interface (OCI) is the comprehensive, high performance, native C language interface to Oracle Database for custom or packaged applications. NOTE: This package is only available on Maui Ancil nodes that provide database access.

Available Modules

module load OCI/12.2
Links

Orthologous MAtrix project is a method and database for the inference of orthologs among complete genomes

Available Modules

module load OMA/2.4.1
Links

source-to-source instrumentation tool for OpenMP and hybrid codes. It surrounds OpenMP directives and runtime library calls with calls to the POMP2 measurement interface.

Available Modules

module load OPARI2/2.0.5-GCCcore-9.2.0

ORCA is a flexible, efficient and easy-to-use general purpose tool for quantum chemistry with specific emphasis on spectroscopic properties of open-shell molecules. It features a wide variety of standard quantum chemical methods ranging from semiempirical methods to DFT to single- and multireference correlated ab initio methods. It can also treat environmental and relativistic effects.

OSPRay features interactive CPU rendering capabilities geared towards Scientific Visualization applications. Advanced shading effects such as Ambient Occlusion, shadows, and transparency can be rendered interactively, enabling new insights into data exploration.

Available Modules

module load OSPRay/1.8.2-GCCcore-7.4.0

module load OSPRay/1.6.1-GCC-7.1.0

OSU Micro-Benchmarks for MPI

Available Modules

module load OSU-Micro-Benchmarks/6.2-gompi-2022a

module load OSU-Micro-Benchmarks/6.2-CrayIntel-23.02-19

The Open Trace Format 2 is a highly scalable, memory efficient event trace data format plus support library

Available Modules

module load OTF2/2.2-GCCcore-9.2.0

GNU Octave is a high-level interpreted language, primarily intended for numerical computations.

Available Modules

module load Octave/6.3.0-gimkl-2020a

OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.

Available Modules

Open Babel is a chemical toolbox designed to speak the many languages of chemical data. It's an open, collaborative project allowing anyone to search, convert, analyze, or store data from molecular modeling, chemistry, solid-state materials, biochemistry, or related areas.

Available Modules

module load OpenBabel/2.4.1-GCC-7.4.0

OpenCV (Open Source Computer Vision Library) is an open source computer vision and machine learning software library. OpenCV was built to provide a common infrastructure for computer vision applications and to accelerate the use of machine perception in the commercial products.

Available Modules

module load OpenCV/3.4.7-gimkl-2020a

Wind turbine multiphysics simulation tool

Available Modules

module load OpenFAST/2.1.0-gimkl-2018b

OpenFOAM is a free, open source CFD software package. OpenFOAM has an extensive range of features to solve anything from complex fluid flows involving chemical reactions, turbulence and heat transfer, to solid dynamics and electromagnetics.

An open-source JPEG 2000 codec written in C

Available Modules

module load OpenJPEG/2.5.0-GCCcore-9.2.0

The Open MPI Project is an open source MPI-3 implementation.

The OpenSSL Project is a collaborative effort to develop a robust, commercial-grade, full-featured, and Open Source toolchain implementing the Secure Sockets Layer (SSL v2/v3) and Transport Layer Security (TLS v1) protocols as well as a full-strength general purpose cryptography library.

Available Modules

OpenSees is a software framework for developing applications to simulate the performance of structural and geotechnical systems subjected to earthquakes.

OpenSees is a software framework for developing applications to simulate the performance of structural and geotechnical systems subjected to earthquakes.

Available Modules

module load OpenSeesPy/20220411-gimkl-2020a

OpenSlide is a C library that provides a simple interface to read whole-slide images (also known as virtual slides).

Available Modules

module load OpenSlide/3.4.1-GCC-11.3.0

A simple and not slow open reading frame (ORF) caller.

Available Modules

module load OrfM/0.7.1-GCC-11.3.0

Identifies missing annotations for evolutionarily conserved genes.

Available Modules

module load OrthoFiller/1.1.4-gimkl-2018b-Python-2.7.16

OrthoFinder is a fast, accurate and comprehensive platform for comparative genomics

Available Modules

module load OrthoFinder/2.5.2-gimkl-2020a-Python-3.8.2

Genome-scale algorithm for grouping orthologous protein sequences.

Available Modules

module load OrthoMCL/2.0.9-gimkl-2020a-Perl-5.30.1

pipelines and tools designed to aid the rapid processing of High-Throughput Sequencing (HTS) data.

Available Modules

module load PALEOMIX/1.2.14

PAML is a package of programs for phylogenetic analyses of DNA or protein sequences using maximum likelihood.

Available Modules

module load PAML/4.10.7-GCC-12.3.0

PAPI provides the tool designer and application engineer with a consistent interface and methodology for use of the performance counter hardware found in most major microprocessors. PAPI enables software engineers to see, in near real time, the relation between software performance and processor events. In addition Component PAPI provides access to a collection of components that expose performance measurement opportunites across the hardware and software stack.

Available Modules

module load PAPI/6.0.0-GCCcore-9.2.0

The PCRE library is a set of functions that implement regular expression pattern matching using the same syntax and semantics as Perl 5.

Available Modules

module load PCRE/8.43-GCCcore-7.1.0

The PCRE library is a set of functions that implement regular expression pattern matching using the same syntax and semantics as Perl 5.

Available Modules

module load PCRE2/10.42-GCCcore-12.3.0

module load PCRE2/10.40-GCCcore-9.2.0

Program Database Toolkit (PDT) is a framework for analyzing source code written in several programming languages and for making rich program knowledge accessible to developers of static and dynamic analysis tools.

Available Modules

module load PDT/3.25.1-GCCcore-9.2.0
Links

Memory-efficient,fully parallelized and highly accurate pair-end read merger.

Available Modules

module load PEAR/0.9.11-GCCcore-9.2.0

PEST++ is a software suite aimed at supporting complex numerical models in the decision-support context. Much focus has been devoted to supporting environmental models (groundwater, surface water, etc) but these tools are readily applicable to any computer model.

Available Modules

module load PEST++/5.1.5-gimkl-2020a

PETSc, pronounced PET-see (the S is silent), is a suite of data structures and routines for the scalable (parallel) solution of scientific applications modeled by partial differential equations.

Available Modules

module load PETSc/3.19-gimkl-2022a

A tool to visualize phase block structure from (many) BAM or CRAM files together with BED annotation

Available Modules

module load PHASIUS/0.1.0d-GCC-11.3.0

PLINK is a free, open-source whole genome association analysis toolset, designed to perform a range of basic, large-scale analyses in a computationally efficient manner. The focus of PLINK is purely on analysis of genotype/phenotype data, so there is no support for steps prior to this (e.g. study design and planning, generating genotype or CNV calls from raw data). Through integration with gPLINK and Haploview, there is some support for the subsequent visualization, annotation and storage of results.

Available Modules

module load PLINK/2.00a2.3

PLUMED is an open source library for free energy calculations in molecular systems which works together with some of the most popular molecular dynamics engines. Free energy calculations can be performed as a function of many order parameters with a particular focus on biological problems, using state of the art methods such as metadynamics, umbrella sampling and Jarzynski-equation based steered MD. The software, written in C++, can be easily interfaced with both fortran and C/C++ codes.

Probabilistic multiple alignment program for DNA, codon and amino-acid sequences. .

Available Modules

module load PRANK/170427-gimkl-2022a

Program proj is a standard Unix filter function which converts geographic longitude and latitude coordinates into cartesian coordinates

Generates inline-Assembly for sparse Matrix Multiplication.

Available Modules

module load PSpaMM/0.2.1-foss-2023a-Python-3.11.6

parallel unstructured mesh infrastructure API

Available Modules

module load PUMI/2.2.7-foss-2023a

Pango is a library for laying out and rendering of text, with an emphasis on internationalization.

Available Modules

module load Pango/1.50.8-GCC-11.3.0

ParMETIS is an MPI-based parallel library that implements a variety of algorithms for partitioning unstructured graphs, meshes, and for computing fill-reducing orderings of sparse matrices. ParMETIS extends the functionality provided by METIS and includes routines that are especially suited for parallel AMR computations and large scale numerical simulations. The algorithms implemented in ParMETIS are based on the parallel multilevel k-way graph-partitioning, adaptive repartitioning, and parallel multi-constrained partitioning schemes.

Available Modules

ParaView is a scientific parallel visualizer. This version supports CPU-only rendering without X context using the OSMesa library, it does not support GPU rendering, and it does not provide a GUI. Use the GALLIUM_DRIVER environment variable to choose a software renderer, it is recommended to use GALLIUM_DRIVER=swr for best performance. Ray tracing using the OSPRay library is also supported.

parallel: Build and execute shell commands in parallel

Available Modules

module load Parallel/20220922

module load Parallel/20230522

A high-level Parallel I/O Library for structured grid applications

Available Modules

module load ParallelIO/2.5.4-intel-2020a

Performance visualization and analysis tool based on traces.

Available Modules

module load Paraver/4.9.0-GCC-9.2.0

Genome assembler for long reads (length > 10kb, accuracy > 99%). Based on Sparse HIereachical MimiMizER (SHIMMER) for fast read-to-read overlaping

Available Modules

module load Peregrine/0.1.6.1-gimkl-2020a

Larry Wall's Practical Extraction and Report Language

Available Modules

Phylogenetic estimation using Maximum Likelihood

Available Modules

module load PhyML/3.3.20211231-gimpi-2022a

Integrated pipeline for large-scale phylogenetic profiling of genomes and metagenomes.

Available Modules

Pilon is an automated genome assembly improvement and variant detection tool

Available Modules

module load Pilon/1.24-Java-15.0.2

Parallel netCDF: A Parallel I/O Library for NetCDF File Access

Available Modules

module load PnetCDF/1.13.0-iimpi-2022a

Porechop is a tool for finding and removing adapters from Oxford Nanopore reads. Adapters on the ends of reads are trimmed off, and when a read has an adapter in its middle, it is treated as chimeric and chopped into separate reads. Porechop performs thorough alignments to effectively find adapters, even at low sequence identity

Available Modules

Client-side programs and libraries for accessing PostgreSQL databases.

Available Modules

module load PostgreSQL/12.2-gimkl-2020a-PostGIS-3.1.1

module load PostgreSQL/11.0-GCC-7.1.0

Prodigal (Prokaryotic Dynamic Programming Genefinding Algorithm) is a microbial (bacterial and archaeal) gene finding program developed at Oak Ridge National Laboratory and the University of Tennessee.

Available Modules

module load Prodigal/2.6.3-GCC-9.2.0

Pipeline for predicting and scoring hints (in the form of introns, start and stop codons) in the genome of interest by mapping and spliced aligning predicted genes to a database of reference protein sequences.

Available Modules

Proteinortho is a tool to detect orthologous genes within different species.

Available Modules

module load Proteinortho/6.0.28-gimkl-2020a-Python-3.8.2-Perl-5.30.1

PyOpenGL is the most common cross platform Python binding to OpenGL and related APIs.

Available Modules

module load PyOpenGL/3.1.5-gimkl-2018b-Python-2.7.16

PyQt5 is a set of Python bindings for v5 of the Qt application framework from The Qt Company. This bundle includes PyQtWebEngine, a set of Python bindings for The Qt Company’s Qt WebEngine framework.

Available Modules

Tensors and Dynamic neural networks in Python with strong GPU acceleration.

Available Modules

module load PyTorch/1.12.1-gimkl-2022a-Python-3.10.5-CUDA-11.6.2

Python is a programming language that lets you work more quickly and integrate your systems more effectively.

Python packages for geospatial data I/O, mostly based on the OSGEO libraries GDAL and OGR

Available Modules

module load QChem/6.0.2-mpi
Links
Licence

QChem is proprietary software.

An open-source bioinformatics pipeline for microbiome analysis from raw DNA sequencing data.

Available Modules

Evaluates genome assemblies

Available Modules

module load QUAST/5.2.0-gimkl-2022a

Qt is a comprehensive cross-platform C++ application framework.

Available Modules
Links

Quantum ESPRESSO is an integrated suite of computer codes for electronic-structure calculations and materials modeling at the nanoscale. It is based on density-functional theory, plane waves, and pseudopotentials (both norm-conserving and ultrasoft).

Available Modules

module load QuantumESPRESSO/7.2-intel-2022a

Efficient implementation of the Neighbor-Joining algorithm, capable of reconstructing phylogenies from huge alignments .

Available Modules

module load QuickTree/2.5-GCC-11.3.0

R is a free software environment for statistical computing and graphics.

R packages for Geometric and Geospatial data which depend on GEOS and/or GDAL.

Bioconductor provides tools for the analysis and comprehension of high-throughput genomic data.

Available Modules

module load R-bundle-Bioconductor/3.17-gimkl-2022a-R-4.3.1

A binary file set RANGS (Regionally Accessible Nested Global Shorelines) based on GSHHS (Global Self-consistent Hierarchical High-resolution Shorelines) data. Note: RANGS-GSHHS is used with NCL.

Available Modules

module load RANGS-GSHHS/20171003

module load RANGS-GSHHS/20171003

RAxML search algorithm for maximum likelihood based inference of phylogenetic trees.

Available Modules

module load RAxML/8.2.12-gimkl-2022a

RAxML-NG is a phylogenetic tree inference tool which uses maximum-likelihood (ML) optimality criterion. Its search heuristic is based on iteratively performing a series of Subtree Pruning and Regrafting (SPR) moves, which allows to quickly navigate to the best-known ML tree.

Available Modules

module load RAxML-NG/1.1.0-gimkl-2022a

The RDP Classifier is a naive Bayesian classifier that can rapidly and accurately provides taxonomic assignments from domain to genus, with confidence estimates for each assignment.

Available Modules

module load RDP-Classifier/2.12

fast, safe, thread-friendly alternative to backtracking regular expression engines like those used in PCRE, Perl, and Python. It is a C++library.

Available Modules

module load RE2/2023-08-01-GCC-11.3.0
Links

De novo identification and classification of repeat sequence families from genomic sequences

Available Modules

module load RECON/1.08-GCC-9.2.0

Tool for visualizing alignments of reads in regions containing tandem repeats

Available Modules

module load REViewer/0.2.7

Predicting plasmid contigs from assemblies

Available Modules

module load RFPlasmid/0.0.18-gimkl-2022a-Python-3.10.5

Structure generation, with or without conditional information (a motif, target etc) It can perform a whole range of protein design challenges as we have outlined in the RFdiffusion paper.

Available Modules

module load RFdiffusion/1.1.0-gimkl-2022a-Python-3.10.5

RMBlast supports RepeatMasker searches by adding a few necessary features to the stock NCBI blastn program. These include: Support for custom matrices ( without KA-Statistics ). Support for cross_match-like complexity adjusted scoring. Cross_match is Phil Green's seeded smith-waterman search algorithm. Support for cross_match-like masklevel filtering..

Available Modules

module load RMBlast/2.10.0-GCC-9.2.0

consistent and rapid annotation of ribosomal RNA genes.

Available Modules

module load RNAmmer/1.2-GCC-11.3.0-Perl-5.34.1

Platform for GPU Enabled HPC and UltraScale Computing

Available Modules

module load ROCm/4.2.0-GCC-9.2.0-amd

The ROOT system provides a set of OO frameworks with all the functionality needed to handle and analyze large amounts of data in a very efficient way.

Available Modules

module load ROOT/6.16.00-GCC-7.4.0

Estimates gene and isoform expression levels from RNA-Seq data

Available Modules

module load RSEM/1.3.3-gimkl-2022a

The Remote Sensing and GIS software library (RSGISLib) is a collection of tools for processing remote sensing and GIS datasets. The tools are accessed using Python bindings or an XML interface.

Ultrafast consensus module for raw de novo genome assembly of long uncorrected reads.

Tool for chromosome assembly using multiple references.

Available Modules

module load Ragout/2.2-gimkl-2018b-Python-2.7.16

An algorithmic engineered implementation of canonical neighbour-joining.

Available Modules

module load RapidNJ/2.3.2

De novo genome assembler for long uncorrected reads.

Available Modules

module load Raven/1.5.0-GCC-9.2.0

kmer-based error correction method for RNA-seq data.

Available Modules

module load Rcorrector/1.0.4-gimkl-2020a

RELION (for REgularised LIkelihood OptimisatioN, pronounce rely-on) is a stand-alone computer program that employs an empirical Bayesian approach to refinement of (multiple) 3D reconstructions or 2D class averages in electron cryo-microscopy (cryo-EM).

Available Modules

module load Relion/4.0.1-gimkl-2020a

Screens DNA sequences for interspersed repeats and low complexity DNA sequences. For licensing reasons RepBase is not included, instead you must set LIBDIR to point at a directory which contains your copy of it.

Available Modules

module load RepeatMasker/4.1.0-gimkl-2020a

De novo transposable element (TE) family identification and modeling package.

Available Modules

module load RepeatModeler/2.0.3-Miniconda3

De novo identification of repeat families in large genomes

Available Modules

module load RepeatScout/1.0.5-GCC-9.2.0-Perl-5.30.1-mt

RiskScape is an open-source spatial data processing application used for multi-hazard risk analysis. RiskScape is highly customisable, letting modellers tailor the risk analysis to suit the problem domain and input data being modelled.

Available Modules

module load Riskscape/1.2.0

Rapid large-scale prokaryote pan genome analysis

Available Modules

module load Roary/3.13.0-gimkl-2020a

Rosetta is the premier software suite for modeling macromolecular structures. As a flexible, multi-purpose application, it includes tools for structure prediction, design, and remodeling of proteins and nucleic acids.

Available Modules

module load Rosetta/3.13-gimkl-2022a

RStudio is a set of integrated tools designed to help you be more productive with R.

Available Modules

module load Rstudio/1.1.456

Ruby is a dynamic, open source programming language with a focus on simplicity and productivity. It has an elegant syntax that is natural to read and easy to write.

Available Modules

module load Ruby/3.0.1-GCC-11.3.0

Systems programming language that runs blazingly fast, prevents segfaults, and guarantees thread safety.

Ppackage containing programs for use in the genetic analysis of family, pedigree and individual data.

Available Modules

module load SAGE/6.4

Samtools is a suite of programs for interacting with high-throughput sequencing data. SAMtools - Reading/writing/editing/indexing/viewing SAM/BAM/CRAM format

SAS is a statistical software suite developed by SAS Institute for data management, advanced analytics, multivariate analysis, business intelligence, criminal investigation, and predictive analytics. - Homepage: https://www.sas.com/en_nz/home.html/

Available Modules

module load SAS/9.4

module load SAS/9.4
Links

Software package and libraries for sequential and parallel graph partitioning, static mapping, and sparse matrix block ordering, and sequential mesh and hypergraph partitioning.

Available Modules

SCons is a software construction tool.

Available Modules

module load SCons/3.0.1

module load SCons/3.0.1

Simple DirectMedia Layer, a cross-platform multimedia library

Available Modules

module load SDL2/2.0.9-GCCcore-9.2.0

SATe-enabled Phylogenetic Placement. Phylogenetic placement of short reads into reference alignments and trees.

Estimation of haplotypes (aka phasing) for SNP array and high coverage sequencing data.

Available Modules

module load SHAPEIT4/4.2.0-GCC-9.2.0

Scalable I/O library for parallel access to task-local files.

Available Modules

module load SIONlib/1.7.7-GCC-11.3.0

SIP is a tool that makes it very easy to create Python bindings for C and C++ libraries.

SKESA is a de-novo sequence read assembler for cultured single isolate genomes based on DeBruijn graphs.

Available Modules

module load SKESA/2.4.0-gimpi-2022a_saute.1.3.0_1

PacBio’s open-source software suite is designed for use with Single Molecule, Real-Time (SMRT) Sequencing data.

Available Modules

module load SMRT-Link/11.1.0.166339-cli-tools-only

SNVoter - A top up tool to enhance SNV calling from Nanopore sequencing data & NanoMethPhase - Phase long reads and CpG methylations from Oxford Nanopore Technologies.

Available Modules

module load SNVoter-NanoMethPhase/1.0

Database access library for C++ that makes the illusion of embedding SQL queries in the regular C++ code, staying entirely within the Standard C++.

Available Modules

module load SOCI/4.0.2-GCC-9.2.0

Genome assembler for single-cell and isolates data sets

Available Modules

SPECFEM3D Cartesian simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra (structured or not.) It can, for instance, model seismic waves propagating in sedimentary basins or any other regional geological model following earthquakes. It can also be used for non-destructive testing or for ocean acoustics.

Available Modules

module load SPECFEM3D/20171123-CrayIntel-23.02-19

System for Processing Image Data from Electron microscopy and Related fields

Available Modules

module load SPIDER/22.02

SQLite: SQL Database Engine in a C Library

Available Modules

module load SQLite/3.36.0-CrayIntel-23.02-19

module load SQLite/3.31.1-GCC-9.2.0

SQL*Plus is an interactive and batch query tool that is installed with every Oracle Database installation. It has a command-line user interface and a Windows Graphical User Interface (GUI).

Available Modules

module load SQLplus/18.3.0.0.0

Pairwise sequence alignment program designed for the efficient mapping of sequencing reads onto genomic reference sequences.

Available Modules

module load SSAHA2/2.5.5-x86_64

Fast universal RNA-seq aligner

Available Modules

Processes the output generated by the STAR aligner to map junction reads and spanning reads to a reference annotation set

Available Modules

module load STAR-Fusion/1.6.0-gimkl-2018b

SUNDIALS: SUite of Nonlinear and DIfferential/ALgebraic Equation Solvers

Available Modules

Tool set for simulating/evaluating SVs, merging and comparing SVs within and among samples, and includes various methods to reformat or summarize SVs.

Available Modules

module load SURVIVOR/1.0.7

SWIG is a software development tool that connects programs written in C and C++ with a variety of high-level programming languages.

Available Modules

module load SWIG/4.0.2

module load SWIG/3.0.12

Salmon is a wicked-fast program to produce a highly-accurate, transcript-level quantification estimates from RNA-seq data.

Tools for working with SAM/BAM data

Available Modules

module load Sambamba/0.8.0

The ScaLAPACK (or Scalable LAPACK) library includes a subset of LAPACK routines redesigned for distributed memory MIMD parallel computers.

Available Modules

module load ScaLAPACK/2.2.0-gompi-2023a-fb

Tool that supports the performance optimization of parallel programs by measuring and analyzing their runtime behavior. The analysis identifies potential performance bottlenecks -- in particular those concerning communication and synchronization -- and offers guidance in exploring their causes.

Available Modules

module load Scalasca/2.5-gimpi-2020a

Measurement infrastructure is a highly scalable and easy-to-use tool suite for profiling, event tracing, and online analysis of HPC applications.

Available Modules

module load Score-P/6.0-gimpi-2020a

SeisSol is a software package for simulating wave propagation and dynamic rupture based on the arbitrary high-order accurate derivative discontinuous Galerkin method (ADER-DG).

Available Modules

module load SeisSol/1.1.3-foss-2023a-Python-3.11.6

SeqAn is an open source C++ library of efficient algorithms and data structures for the analysis of sequences with the focus on biological data.

Available Modules

module load SeqAn/2.4.0-GCCcore-9.2.0

C++ library of efficient algorithms and data structures for the analysis of sequences with the focus on biological data.

Available Modules

module load SeqAn3/3.1.0

Ultrafast toolkit for FASTA/Q file manipulation

Available Modules

module load SeqKit/2.4.0

A comparative genomics tool for analysing genomic variations that correlate with pathogens, or microorganisms adapt in different environments.

Available Modules

module load SiBELia/3.0.7

SIESTA is both a method and its computer program implementation, to perform efficient electronic structure calculations and ab initio molecular dynamics simulations of molecules and solids.

Available Modules

SignalP predicts the presence and location of signal peptide cleavage sites in amino acid sequences from different organisms

Available Modules

module load SignalP/6.0g-gimkl-2022a-Python-3.10.5

Available Modules

module load Singularity/3.9.4

A fast structural variant caller for long-read sequencing.

Available Modules

module load Sniffles/2.3.3-gimkl-2022a-Python-3.10.5

SortMeRNA is a biological sequence analysis tool for filtering, mapping and OTU-picking NGS reads.

Available Modules

module load SortMeRNA/4.3.6

SourceTracker is a Bayesian approach to estimating the proportion of a novel community that comes from a set of source environments.

Available Modules

module load SourceTracker/0.9.6

Spack is a package manager for supercomputers, Linux, and macOS. It makes installing scientific software easy. With Spack, you can build a package with multiple versions, configurations, platforms, and compilers, and all of these builds can coexist on the same machine.

Available Modules

module load Spack/0.17.2

Spark is Hadoop MapReduce done in memory

Available Modules

module load Spark/2.3.1-gimkl-2018b-Hadoop-2.7

module load Spark/2.3.1-gimkl-2018b-Hadoop-2.7

fully automated metagenomics pipeline, from reads to bins.

Available Modules

module load SqueezeMeta/1.6.0-Miniconda3

Stacks is a software pipeline for building loci from short-read sequences, such as those generated on the Illumina platform. Stacks was developed to work with restriction enzyme-based data, such as RAD-seq, for the purpose of building genetic maps and conducting population genomics and phylogeography.

StringTie is a fast and highly efficient assembler of RNA-Seq alignments into potential transcripts.

Available Modules

module load StringTie/2.2.0-gimkl-2022a

The program structure is a free software package for using multi-locus genotype data to investigate population structure.

Available Modules

module load Structure/2.3.4-iccifort-2020.0.166

High performance read alignment, quantification and mutation discovery

Available Modules

module load Subread/2.0.3-GCC-11.3.0

Subversion is an open source version control system.

Available Modules

module load Subversion/1.9.7

module load Subversion/1.13.0-GCC-9.2.0

Solution of large, sparse, nonsymmetric systems of linear equations.

Available Modules

module load SuperLU/5.2.1-gimkl-2020a

Supernova is a software package for de novo assembly from Chromium Linked-Reads that are made from a single whole-genome library from an individual DNA source

Available Modules

module load Supernova/2.1.1

Szip compression software, providing lossless compression of scientific data

Takes RNA-seq (and similar data) and annotates reads to both genes & transposable elements.

Available Modules

module load TEtranscripts/2.2.3-gimkl-2022a

Prediction of transmembrane helices in proteins

Available Modules

module load TMHMM/2.0c

Implements a novel machine learning based paradigm to infer orthologous genes between related species and to accurately distinguish orthologs from paralogs or processed pseudogenes.

Available Modules

module load TOGA/1.1.4-gimkl-2022a-Python-3.11.3

Transcript Selector for BRAKER

Available Modules

module load TSEBRA/1.1.1-gimkl-2022a-Python-3.11.3

Program Package For Electronic Structure Calculations.

Available Modules

module load TURBOMOLE/7.2.1-gompi-2022a

Nearly Infinite Neighbor Joining Application.

Available Modules

module load TWL-NINJA/0.97-cluster_only-GCC-9.2.0

Tcl (Tool Command Language) is a very powerful but easy to learn dynamic programming language, suitable for a very wide range of uses, including web and desktop applications, networking, administration, testing and many more.

NVIDIA TensorRT is a platform for high-performance deep learning inference

Available Modules

module load TensorRT/8.6.1.6-gimkl-2022a-Python-3.11.3-CUDA-11.8.0

Theano is a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently.

Available Modules

module load Theano/1.0.5-gimkl-2020a-Python-3.8.2

Tk is an open source, cross-platform widget toolchain that provides a library of basic elements for building a graphical user interface (GUI) in many different programming languages.

Available Modules
Links

TransDecoder identifies candidate coding regions within transcript sequences.

Available Modules

module load TransDecoder/5.7.1-GCC-11.3.0-Perl-5.34.1

Available Modules

module load Transrate/1.0.3
Links

TreeMix is a method for inferring the patterns of population splits and mixtures in the history of a set of populations.

Available Modules

module load TreeMix/1.13-GCC-11.3.0

A wrapper of FastQC and cutadapt to automate quality and adapter trimming as well as quality control, with some added functionality to remove biased methylation positions for RRBS sequence files

Available Modules

module load TrimGalore/0.6.10-gimkl-2022a-Python-3.11.3-Perl-5.34.1

Trimmomatic performs a variety of useful trimming tasks for illumina paired-end and single ended data.The selection of trimming steps and their associated parameters are supplied on the command line.

Available Modules

module load Trimmomatic/0.39-Java-1.8.0_144

Trinity represents a novel method for the efficient and robust de novo reconstruction of transcriptomes from RNA-Seq data. Trinity combines three independent software modules: Inchworm, Chrysalis, and Butterfly, applied sequentially to process large volumes of RNA-Seq reads.

C++ library of efficient algorithms and data structures for the analysis of sequences with the focus on biological data.

Available Modules

module load Trinotate/4.0.2-GCC-11.3.0

Tool for generating consensus long-read assemblies for bacterial genomes.

Available Modules

TuiView is a lightweight raster GIS with powerful raster attribute table manipulation abilities.

Available Modules

TurboVNC is a derivative of VNC (Virtual Network Computing) that is tuned to provide peak performance for 3D and video workloads.

Available Modules

module load TurboVNC/2.2.4-GCC-9.2.0

UCC (Unified Collective Communication) is a collective communication operations API and library that is flexible, complete, and feature-rich for current and emerging programming models and runtimes.

Available Modules

module load UCC/1.2.0-GCCcore-12.3.0
Links

Unified Communication X An open-source production grade communication framework for data centric and high-performance applications

UDUNITS supports conversion of unit specifications between formatted and binary forms, arithmetic manipulation of units, and conversion of values between compatible scales of measurement.

Available Modules

module load UDUNITS/2.2.28-CrayGNU-23.02

module load UDUNITS/2.2.25-GCC-7.1.0

USEARCH is a unique sequence analysis tool which offers search and clustering algorithms that are often orders of magnitude faster than BLAST.

Available Modules

module load USEARCH/11.0.667-i86linux32

Assembly pipeline for bacterial genomes. It can assemble Illumina-only read sets where it functions as a SPAdes-optimiser.

Available Modules

The Vienna Ab initio Simulation Package (VASP) is a computer program for atomic scale materials modelling, e.g. electronic structure calculations and quantum-mechanical molecular dynamics, from first principles.

VCF-kit is a command-line based collection of utilities for performing analysis on Variant Call Format (VCF) files.

Available Modules

module load VCF-kit/0.2.6-gimkl-2020a-Python-3.8.2

The aim of VCFtools is to provide methods for working with VCF files: validating, merging, comparing and calculate some basic population genetic statistics.

Available Modules

Variant Effect Predictor (VEP) determines the effect of your variants (SNPs, insertions, deletions, CNVs or structural variants) on genes, transcripts, and protein sequence, as well as regulatory regions. Includes EnsEMBL-XS, which provides pre-compiled replacements for frequently used routines in VEP.

Available Modules
Links

VESTA is a 3D visualization program for structural models, volumetric data such as electron/nuclear densities, and crystal morphologies. Run 'VESTA-gui' to launch.

Available Modules

module load VESTA/3.4.6
Links

Virus Identification By iteRative ANnoTation

Available Modules

module load VIBRANT/1.2.1-gimkl-2020a

VMD is a molecular visualization program for displaying, animating, and analyzing large biomolecular systems using 3-D graphics and built-in scripting.

Available Modules

module load VMD/1.9.3-gimkl-2020a
Links

An open source alternative to the metagenomics tool USEARCH. Performs chimera detection, clustering, full-length and prefix dereplication, rereplication, masking, all-vs-all pairwise global alignment, exact and global alignment searching, shuffling, subsampling and sorting. It also supports FASTQ file analysis, filtering, conversion and merging of paired-end reads.

Available Modules

The Visualization Toolkit (VTK) is an open-source, freely available software system for 3D computer graphics, image processing and visualization. VTK consists of a C++ class library and several interpreted interface layers including Tcl/Tk, Java, and Python. VTK supports a wide variety of visualization algorithms including: scalar, vector, tensor, texture, and volumetric methods; and advanced modeling techniques such as: implicit modeling, polygon reduction, mesh smoothing, cutting, contouring, and Delaunay triangulation.

Available Modules

module load VTK/7.1.1-gimkl-2018b-Python-2.7.16

module load VTK/8.1.1-GCC-7.1.0-Anaconda2-5.2.0
Links

Intel VTune Amplifier XE is the premier performance profiler for C, C++, C#, Fortran, Assembly and Java.

Available Modules

module load VTune/2023.1.0

module load VTune/2019_update8

Valgrind: Debugging and profiling tools

Available Modules

module load Valgrind/3.19.0-GCC-11.3.0

module load Valgrind/3.14.0-GCC-7.1.0

Variant calling and somatic mutation/CNV detection for next-generation sequencing data

Available Modules

module load VarScan/2.4.4-Java-11.0.4

Sequence assembler for very short reads

Available Modules

module load Velvet/1.2.10-GCC-9.2.0

Perl script for optimising the three primary parameter options of the Velvet de novo sequence assembler.

Available Modules

module load VelvetOptimiser/2.2.5-GCC-9.2.0

The Vienna RNA Package consists of a C code library and several stand-alone programs for the prediction and comparison of RNA secondary structures.

Available Modules

module load ViennaRNA/2.4.17-gimkl-2020a

Vim is an advanced text editor that seeks to provide the power of the de-facto Unix editor 'Vi', with a more complete feature set.

Available Modules

module load Vim/9.0.0500

module load Vim/9.0.0500
Links

Tools for computing various oligonucleotide frequency (ONF) based distance/dissimialrity measures.

Available Modules

module load VirHostMatcher/1.0.0-gimkl-2020a

VirSorter: mining viral signal from microbial genomic data.

Available Modules

VirtualGL is an open source toolkit that gives any Linux or Unix remote display software the ability to run OpenGL applications with full hardware acceleration.

Available Modules

module load VirtualGL/2.6.2-gimkl-2020a

VisIt is an Open Source, interactive, scalable, visualization, animation and analysis tool. This version supports interactive CPU-only rendering with the VisIt GUI using the Mesa library. It does not support GPU rendering. Use the GALLIUM_DRIVER environment variable to choose a software renderer, it is recommended to use GALLIUM_DRIVER=swr for best performance.

Available Modules

module load VisIt/2.13.3-gimpi-2018b-GUI-Mesa

Workflow to Annotate Assemblies and Find LGT Events.

Available Modules

module load WAAFLE/0.1.0-gimkl-2022a-Python-3.10.5

Tool for phasing genomic variants using DNA sequencing reads, also called read-based phasing or haplotype assembly.

Available Modules

module load WhatsHap/1.6-gimkl-2022a-Python-3.10.5

Winnowmap is a long-read mapping algorithm, and a result of our exploration into superior minimizer sampling techniques.

Available Modules

module load Winnowmap/2.03-GCC-11.3.0

Aligning proteins or protein HMMs to DNA

Available Modules

module load Wise2/2.4.1-GCC-11.3.0

Xconv is a program designed to convert model output into a format suitable for use in various plotting packages. Xconv is designed to be simple to use with a point and click, windows based interface.

Available Modules

module load XCONV/1.93

Calls copy number variation (CNV) from normalized read-depth data from exome capture or other targeted sequencing experiments.

Available Modules

module load XHMM/1.0-gimkl-2018b

XIOS stands for XML-IO-Server and is a library dedicated to I/O management in climate codes. This version uses netCDF4 with the parallel HDF5 library, and it contains a patch for various problems in the mesh connectivity algorithm

Available Modules

Fast integrator of stochastic partial differential equations.

Available Modules

module load XMDS2/3.0.0-beta-gimkl-2020a-Python-3.8.2

CodeSynthesis XSD is an open-source, cross-platform W3C XML Schema to C++ data binding compiler. Provided with an XML instance specification (XML Schema), it generates C++ classes that represent the given vocabulary as well as XML parsing and serialization code. You can then access the data stored in XML using types and functions that semantically correspond to your application domain rather than dealing with the intricacies of reading and writing XML

Available Modules

module load XSD/4.0.0-GCCcore-7.4.0
Links

xz: XZ utilities

Available Modules
Links

Xerces-C++ is a validating XML parser written in a portable subset of C++. Xerces-C++ makes it easy to give your application the ability to read and write XML data. A shared library is provided for parsing, generating, manipulating, and validating XML documents using the DOM, SAX, and SAX2 APIs.

Available Modules

module load Xerces-C++/3.2.3-GCCcore-9.2.0

Yet Another eXchange Tool - Library that performs halo exchange with MPI for domain decomposed simulations.

Available Modules

module load YAXT/0.9.3.1-gimpi-2022a

module load YAXT/0.9.0-CrayGNU-23.02

Yasm: Complete rewrite of the NASM assembler with BSD license

Available Modules

module load Yasm/1.3.0-gimkl-2017a

module load Yasm/1.3.0-GCC-7.1.0

A theorem prover from Microsoft Research.

Available Modules

module load Z3/4.12.2-GCCcore-12.3.0
Links

ZeroMQ looks like an embeddable networking library but acts like a concurrency framework. It gives you sockets that carry atomic messages across various transports like in-process, inter-process, TCP, and multicast. You can connect sockets N-to-N with patterns like fanout, pub-sub, task distribution, and request-reply.

Available Modules

Zip is a compression and file packaging/archive utility. Although highly compatible both with PKWARE's PKZIP and PKUNZIP utilities for MS-DOS and with Info-ZIP's own UnZip, our primary objectives have been portability and other-than-MSDOS functionality

Available Modules

module load Zip/3.0-GCCcore-9.2.0
Links

AMR gene detection pipeline that runs AMRFinderPlus on a single (or list ) of given isolates

Available Modules

module load abritamr/1.0.14-gimkl-2022a-Python-3.11.3

Program for analysing NGS data.

Available Modules

module load angsd/0.935-GCC-9.2.0

Apache Ant is a Java library and command-line tool whose mission is to drive processes described in build files as targets and extension points dependent upon each other. The main known usage of Ant is the build of Java applications.

Available Modules
Links

antiSMASH allows the rapid genome-wide identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genomes.

Available Modules

module load antiSMASH/6.0.1-gimkl-2020a-Python-3.8.2

Convert various sequence formats to FASTA

Available Modules

module load any2fasta/0.4.2-GCC-11.3.0

Argtable is an ANSI C library for parsing GNU style command line options with a minimum of fuss.

Available Modules

module load argtable/2.13-gimkl-2020a

aria2 is a lightweight multi-protocol & multi-source command-line download utility.

Available Modules

module load aria2/1.36.0-GCCcore-11.3.0

ARPACK is a collection of Fortran77 subroutines designed to solve large scale eigenvalue problems.

Available Modules

AT-SPI 2 toolkit bridge

Available Modules

module load at-spi2-atk/2.38.0-GCCcore-9.2.0

Assistive Technology Service Provider Interface.

Available Modules

module load at-spi2-core/2.38.0-GCCcore-9.2.0

Commands for Manipulating Filesystem Extended Attributes

Available Modules

module load attr/2.5.1-GCCcore-9.2.0

Java Development Kit (JDK), and a compliant implementation of the Java Standard Edition (SE) specification.

Available Modules

module load azul-zulu/17.36.19-jdk-17.0.4.1

Repository that contains several programs that perform operations on SAM/BAM files.

Available Modules

module load bamUtil/1.0.15

Barrnap predicts the location of ribosomal RNA genes in genomes.

Available Modules

module load barrnap/0.9-GCC-9.2.0

bcl2fastq Conversion Software both demultiplexes data and converts BCL files generated by Illumina sequencing systems to standard FASTQ file formats for downstream analysis.

Available Modules

module load bcl2fastq2/2.20.0-gimkl-2018b

Available Modules

Bam Error Stats Tool (best): analysis of error types in aligned reads

Available Modules

module load best/0.1.0

An extension to awk, adding the support of several common biological data formats

Available Modules

module load bioawk/1.0

breseq is a computational pipeline for the analysis of short-read re-sequencing data

Available Modules

bsddb3 is a nearly complete Python binding of the Oracle/Sleepycat C API for the Database Environment, Database, Cursor, Log Cursor, Sequence and Transaction objects.

Available Modules

module load bsddb3/6.2.6-gimkl-2018b-Python-2.7.16

bzip2 is a freely available, patent free, high-quality data compressor. It typically compresses files to within 10% to 15% of the best available techniques (the PPM family of statistical compressors), whilst being around twice as fast at compression and six times faster at decompression.

Available Modules

c-ares is a C library for asynchronous DNS requests (including name resolves)

Available Modules

module load c-ares/1.18.1-GCCcore-11.3.0

libcurl is a free and easy-to-use client-side URL transfer library, supporting DICT, FILE, FTP, FTPS, Gopher, HTTP, HTTPS, IMAP, IMAPS, LDAP, LDAPS, POP3, POP3S, RTMP, RTSP, SCP, SFTP, SMTP, SMTPS, Telnet and TFTP. libcurl supports SSL certificates, HTTP POST, HTTP PUT, FTP uploading, HTTP form based upload, proxies, cookies, user+password authentication (Basic, Digest, NTLM, Negotiate, Kerberos), file transfer resume, http proxy tunneling and more.

Available Modules

Cairo is a 2D graphics library with support for multiple output devices. Currently supported output targets include the X Window System (via both Xlib and XCB), Quartz, Win32, image buffers, PostScript, PDF, and SVG file output. Experimental backends include OpenGL, BeOS, OS/2, and DirectFB

Available Modules

Fasta file indexing and retrival tool

Available Modules

module load cdbfasta/0.99-GCC-11.3.0

Available Modules

module load cdt/23.02-19
Links

A complete suite for gene-by-gene schema creation and strain identification..

Available Modules

module load chewBBACA/3.1.2-gimkl-2022a-Python-3.11.3

Rust implementation of NanoFilt+NanoLyse

Available Modules

module load chopper/0.5.0-GCC-11.3.0

faster and more accurate reimplementation of BUSCO.

Available Modules

module load compleasm/0.2.5-gimkl-2022a

Workflow Management System geared towards scientific workflows.

Available Modules

module load cromwell/84-Java-1.8.0_144

Ctags generates an index (or tag) file of language objects found in source files that allows these items to be quickly and easily located by a text editor or other utility.

Available Modules

module load ctags/5.8

ctffind is a program for finding CTFs of electron micrographs

Available Modules

module load ctffind/4.1.10

cutadapt removes adapter sequences from high-throughput sequencing data. This is usually necessary when the read length of the sequencing machine is longer than the molecule that is sequenced, for example when sequencing microRNAs.

Fast and scalable long-read-based SV detection

Available Modules

module load cuteSV/2.0.2-gimkl-2020a-Python-3.8.2

Common Workflow Language tool description reference implementation

Available Modules

module load cwltool/3.0.20200317203547-gimkl-2020a-Python-3.8.2

cython + htslib == fast VCF and BCF processing

Available Modules

module load cyvcf2/0.30.11-gimkl-2020a-Python-3.8.2

Diffusion Approximation for Demographic Inference

Available Modules

module load dadi/2.3.0-gimkl-2022a-Python-3.11.3

de novo transcriptome annotator..

Available Modules

module load dammit/1.2

Tool to gather data from across NCBI databases

Available Modules

module load datasets/15.29.0

deepTools is a suite of python tools particularly developed for the efficient analysis of high-throughput sequencing data, such as ChIP-seq, RNA-seq or MNase-seq.

Available Modules

module load deepTools/3.2.1-gimkl-2018b-Python-3.7.3

R functions that simplify and expedite common tasks in package development.

Available Modules

module load devtools/2.4.4-gimkl-2022a-R-4.2.1

Efficient binary-decimal and decimal-binary conversion routines for IEEE doubles.

Available Modules

module load double-conversion/3.2.1-GCC-11.3.0

Rapid and accurate comparison and de-replication of microbial genomes

Available Modules

module load drep/3.4.2-gimkl-2022a-Python-3.10.5

DTCMP Library provides pre-defined and user-defined comparison operations to compare the values of two items which can be arbitrary MPI datatypes.

Available Modules

module load dtcmp/1.1.3-gimpi-2020a

uphold your DUP and DEL calls

Available Modules

module load duphold/0.2.3

Range of tools to support operations on Duplex Sequencing read pairs.

Available Modules

module load duplex-tools/0.2.20-gimkl-2022a-Python-3.10.5

A suite of tools to conduct metabarcoding analyses targeting any group of organisms. Includes utilities for preprocessing raw data and building your own custom reference database.

Available Modules

module load eDNA/2021.11-gimkl-2020a-Python-3.8.2

easi is a library for the Easy Initialization of models in three (or less or more) dimensional domains.

Available Modules

module load easi/1.3.0-foss-2023a

ecCodes is a package developed by ECMWF which provides an application programming interface and a set of tools for decoding and encoding messages in the following formats: WMO FM-92 GRIB edition 1 and edition 2, WMO FM-94 BUFR edition 3 and edition 4, WMO GTS abbreviated header (only decoding).

Available Modules

ecCodes is a package developed by ECMWF which provides an application programming interface and a set of tools for decoding and encoding messages in the following formats: - WMO FM-92 GRIB edition 1 and edition 2 - WMO FM-94 BUFR edition 3 and edition 4 - WMO GTS abbreviated header (only decoding) A useful set of command line tools provide quick access to the messages.

Available Modules

Standalone versatile serotyping module for Escherichia coli..

Available Modules

module load ectyper/1.0.0-gimkl-2022a-Python-3.10.5

Lightweight, super fast library for sequence alignment using edit (Levenshtein) distance.

Available Modules

module load edlib/1.3.8.post2-gimkl-2020a-Python-3.8.2

Tool for fast functional annotation of novel sequences (genes or proteins) using precomputed eggNOG-based orthology assignments

Available Modules

module load eggnog-mapper/2.1.12-gimkl-2022a

Tool for emm-typing of Streptococcus pyogenes using a de novo or complete assembly

Available Modules

module load emmtyper/0.2.0-gimkl-2022a-Python-3.11.3

C++ header-only library for numerical optimization

Available Modules

module load ensmallen/2.17.0-gimkl-2020a

an advanced method for accessing the NCBI's set of interconnected databases such as publication, sequence, structure, gene, variation, expression, etc.

Available Modules

module load entrez-direct/13.3

Generic tool for pairwise sequence comparison

Available Modules

module load exonerate/2.4.0-GCC-11.3.0

Expat is an XML parser library written in C. It is a stream-oriented parser in which an application registers handlers for things the parser might find in the XML document (like start tags)

Available Modules

module load expat/2.4.1-CrayIntel-23.02-19

module load expat/2.4.1-GCCcore-9.2.0

fastStructure is an algorithm for inferring population structure from large SNP genotype data. It is based on a variational Bayesian framework for posterior inference and is written in Python2.x.

Available Modules

module load fastStructure/1.0-gimkl-2020a-Python-2.7.18

A tool designed to provide fast all-in-one preprocessing for FastQ files.

Available Modules

module load fastp/0.23.4-GCC-11.3.0

A collection of small and efficient programs for performing some common and uncommon tasks with FASTQ files.

Available Modules

module load fastq-tools/0.8-gimkl-2017a

Format converting tool for genotype Data.

Available Modules

module load fcGENE/1.0.7

A set of tools to analyze genomic data with a focus on Next Generation Sequencing.

Available Modules

module load fgbio/2.0.2

A package for population structure inference from RAD-seq data

Available Modules

module load fineRADstructure/0.3.2r109-gimkl-2020a

Population assignment using large numbers of densely sampled genomes, including both SNP chips and sequence dat

Available Modules

module load fineSTRUCTURE/4.1.1-gimkl-2022a-Perl-5.34.1

FlatBuffers: Memory Efficient Serialization Library

Available Modules

module load flatbuffers/2.0.0-GCCcore-9.2.0

Flex (Fast Lexical Analyzer) is a tool for generating scanners. A scanner, sometimes called a tokenizer, is a program which recognizes lexical patterns in text.

Available Modules

module load flex/2.6.4-GCCcore-7.4.0

module load flex/2.6.4-GCC-7.1.0

Tool for performing hybrid correction of long read sequencing using the BWT and FM-index of short-read sequencing data

Available Modules

module load fmlrc/1.0.0-GCC-9.2.0

Formatting library providing a fast and safe alternative to C stdio and C++ iostreams.

Available Modules

module load fmt/8.0.1
Links

Fontconfig is a library designed to provide system-wide font configuration, customization and application access.

Available Modules

Arm Forge combines Arm DDT, the leading debugger for time-saving high performance application debugging, and Arm MAP, the trusted performance profiler for invaluable optimization advice.

Available Modules

module load forge/22.1.2

module load forge/22.1.2

module load forge/20.0.2

GNU Compiler Collection (GCC) based compiler toolchain, including OpenMPI for MPI support, OpenBLAS (BLAS and LAPACK support), FFTW and ScaLAPACK.

Available Modules

module load foss/2023a

FreeType 2 is a software font engine that is designed to be small, efficient, highly customizable, and portable while capable of producing high-quality output (glyph images). It can be used in graphics libraries, display servers, font conversion tools, text image generation tools, and many other products as well.

Available Modules

funcX is a distributed Function as a Service (FaaS) platform that enables flexible, scalable, and high performance remote function execution. Unlike centralized FaaS platforms, funcX allows users to execute functions on heterogeneous remote computers, from laptops to campus clusters, clouds, and supercomputers. A funcX endpoint is a persistent service launched by the user on a compute system to serve as a conduit for executing functions on that computer.

Extract sequences from a fastx (fasta or fastq) file given a subsequence.

Available Modules

module load fxtract/2.4-GCC-11.3.0

Library contains GRIB2 encoder/decoder ('C' version).

Available Modules

module load g2clib/1.6.0-GCCcore-7.4.0

Library contains GRIB2 encoder/decoder and search/indexing routines.

Available Modules

module load g2lib/3.2.0-GCC-12.3.0

A reference implementation of the GA4GH API

Available Modules

module load ga4gh/0.3.6-gimkl-2020a-Python-2.7.18

A GTK+ based text editor with with basic features of an integrated development environment.

Available Modules

module load geany/1.35

module load geany/1.35

GenomeTools: A Comprehensive Software Library for Efficient Processing of Structured Genome Annotations.

Available Modules

module load genometools/1.6.1-GCCcore-7.4.0

GNU 'gettext' is an important step for the GNU Translation Project, as it is an asset on which we may build many other steps. This package offers to programmers, translators, and even users, a well integrated set of tools and documentation

Available Modules

single fast and exhaustive tool for summary statistics and simultaneous *fa* (fasta, fastq, gfa [.gz]) genome assembly file manipulation.

Available Modules

module load gfastats/1.3.6-GCC-11.3.0

GFF/GTF parsing utility providing format conversions, region filtering, FASTA sequence extraction and more.

Available Modules

module load gffread/0.12.7-GCC-11.3.0

giflib is a library for reading and writing gif images. It is API and ABI compatible with libungif which was in wide use while the LZW compression algorithm was patented.

Available Modules

module load giflib/5.2.1-GCCcore-9.2.0

GNU Compiler Collection (GCC) based compiler toolchain with Intel MPI and MKL

Available Modules

module load gimkl/2022a

module load gimkl/2020a

GNU Compiler Collection (GCC) based compiler toolchain with Intel MPI.

Available Modules

module load gimpi/2022a

module load gimpi/2020a

Git is a free and open source distributed version control system designed to handle everything from small to very large projects with speed and efficiency.

Available Modules

module load git/2.23.3
Links

Client for the Globus Flows service.

Available Modules

module load globus-automate-client/0.16.1.post1-gimkl-2022a

Globus Compute is a distributed Function as a Service (FaaS) platform that enables flexible, scalable, and high performance remote function execution. Unlike centralized FaaS platforms, Globus Compute allows users to execute functions on heterogeneous remote computers, from laptops to campus clusters, clouds, and supercomputers. A Globus Compute endpoint is a persistent service launched by the user on a compute system to serve as a conduit for executing functions on that computer.

Available Modules

module load globus-compute-endpoint/2.19.0-gimkl-2022a-Python-3.10.5

Gmsh is a 3D finite element grid generator with a build-in CAD engine and post-processor..

Available Modules

module load gmsh/2.0.2-Miniconda3

Portable interactive, function plotting utility

Available Modules

module load gnuplot/5.4.2-GCC-7.4.0

module load gnuplot/5.4.2-GCC-7.1.0

GNU Compiler Collection (GCC) based compiler toolchain, including OpenMPI for MPI support.

Available Modules

module load gompi/2023a

An extremely memory-efficient hash_map implementation. 2 bits/entry overhead! The SparseHash library contains several hash-map implementations, including implementations that optimize for space or speed.

Available Modules

module load google-sparsehash/2.0.4

Google's C++ test framework

Available Modules

module load googletest/1.10.0-GCCcore-9.2.0

Pperfect hash function generator.

Available Modules

module load gperf/3.1

The ECMWF GRIB API is an application program interface accessible from C, FORTRAN and Python programs developed for encoding and decoding WMO FM-92 GRIB edition 1 and edition 2 messages. A useful set of command line tools is also provided to give quick access to GRIB messages. Note that JPEG and Python support have been disabled for this build.

Available Modules

module load grib_api/1.23.1-CrayIntel-23.02-19

module load grib_api/1.23.1-iccifort-2018.1.163-GCC-7.1.0

Command line tool for Google Drive.

Available Modules

module load grive2/0.5.1-GCCcore-9.2.0

Tool to sort genomic files according to a genomefile.

Available Modules

module load gsort/0.1.4

A simple C++17 wrapper for HDF5.

Available Modules

module load h5pp/1.9.0-gimpi-2020a

Detects in-sample contamination in mtDNA or WGS sequencing studies by analyzing the mitchondrial content

Available Modules

module load haplocheck/1.3.3

help2man produces simple manual pages from the '--help' and '--version' output of other commands.

Available Modules

module load help2man/1.47.8-GCCcore-7.4.0

Hifiasm: a haplotype-resolved assembler for accurate Hifi reads.

Available Modules

module load hifiasm/0.19.7-GCC-11.3.0

Spell checker and morphological analyzer library and program designed for languages with rich morphology and complex word compounding or character encoding.

Available Modules

module load hunspell/1.7.0

The Portable Hardware Locality (hwloc) software package provides a portable abstraction (across OS, versions, architectures, ...) of the hierarchical topology of modern architectures, including NUMA memory nodes, sockets, shared caches, cores and simultaneous multithreading. It also gathers various system attributes such as cache and memory information as well as the locality of I/O devices such as network interfaces, InfiniBand HCAs or GPUs. It primarily aims at helping applications with gathering information about modern computing hardware so as to exploit it accordingly and efficiently.

Available Modules

module load hwloc/2.10.0-GCC-12.3.0

Hypothesis is an advanced testing library for Python. It lets you write tests which are parametrized by a source of examples, and then generates simple and comprehensible examples that make your tests fail. This lets you find more bugs in your code with less work.

Available Modules

module load hypothesis/6.68.2-gimkl-2022a-Python-3.10.5

Intel C and C++ compilers

Available Modules

module load icc/2018.1.163-GCC-7.4.0

module load icc/2018.1.163-GCC-7.1.0
Links

Intel C, C++ & Fortran compilers

Available Modules

module load iccifort/2020.0.166

module load iccifort/2020.0.166

Intel Fortran compiler

Available Modules

module load ifort/2018.1.163-GCC-7.4.0

module load ifort/2018.1.163-GCC-7.1.0

Intel C/C++ and Fortran compilers, alongside Intel MPI.

Available Modules

module load iimpi/2022a

module load iimpi/2020a

Intel Math Kernel Library is a library of highly optimized, extensively threaded math routines for science, engineering, and financial applications that require maximum performance. Core math functions include BLAS, LAPACK, ScaLAPACK, Sparse Solvers, Fast Fourier Transforms, Vector Math, and more.

FFTW interfaces using Intel oneAPI Math Kernel Library

Available Modules

module load imkl-FFTW/2022.0.2-iimpi-2022a

A lightweight JIT compiler for flexible data access in simulation applications

Available Modules

module load impalajit/20211029-foss-2023a

Compiler toolchain including Intel compilers, Intel MPI and Intel Math Kernel Library (MKL).

Available Modules

module load intel/2022a

module load intel/2020a

Intel C, C++ & Fortran compilers (classic and oneAPI)

Available Modules

module load intel-compilers/2022.0.2

ipyrad is an interactive toolkit for assembly and analysis of restriction-site associated genomic data sets (e.g., RAD, ddRAD, GBS) for population genetic and phylogenetic studies.

Available Modules

module load ipyrad/0.9.85-gimkl-2022a-Python-3.10.5

Intel SPMD Program Compilers; An open-source compiler for high-performance SIMD programming on the CPU. ispc is a compiler for a variant of the C programming language, with extensions for 'single program, multiple data' (SPMD) programming. Under the SPMD model, the programmer writes a program that generally appears to be a regular serial program, though the execution model is actually that a number of program instances execute in parallel on the hardware.

Available Modules

module load ispc/1.15.0

module load ispc/1.9.2-GCC-7.1.0

JBIG-KIT is a software implementation of the JBIG1 data compression standard

Available Modules

module load jbigkit/2.1-GCCcore-9.2.0

Collection of Python libraries to parse bioinformatics files, or perform computation related to assembly, annotation, and comparative genomics.

Available Modules

module load jcvi/1.0.6-gimkl-2020a-Python-3.8.2

A general purpose malloc(3) implementation that emphasizes fragmentation avoidance and scalable concurrency support.

Available Modules

Lightweight and flexible command-line JSON processor.

Available Modules

module load jq/1.5
Links

JSON-C implements a reference counting object model that allows you to easily construct JSON objects in C, output them as JSON formatted strings and parse JSON formatted strings back into the C representation of JSON objects.

Available Modules

module load json-c/0.17-GCC-12.3.0

Java utilities for Bioinformatics

Available Modules

module load jvarkit/2021.10.13-Java-17

Kalign is a fast multiple sequence alignment program for biological sequences.

Available Modules

module load kalign2/3.3.1-GCCcore-9.2.0

kallisto is a program for quantifying abundances of transcripts from RNA-Seq data, or more generally of target sequences using high-throughput sequencing reads.

Available Modules

module load kallisto/0.46.2-gimkl-2020a

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters

Available Modules

module load kineto/0.4.0-GCC-11.3.0

KMA is a mapping method designed to map raw reads directly against redundant databases, in an ultra-fast manner using seed and extend.

Available Modules

module load kma/1.4.5-GCC-11.3.0
Links

libFLAME is a portable library for dense matrix computations, providing much of the functionality present in LAPACK.

Available Modules

module load libFLAME/5.2.0-GCC-11.3.0

The OpenGL Utility Library (GLU) is a computer graphics library for OpenGL.

Available Modules

module load libGLU/9.0.1-GCCcore-9.2.0

Reference implementation of OGC KML 2.2

Available Modules

module load libKML/1.3.0.2017-GCC-12.3.0

module load libKML/1.3.0.2017-GCC-9.2.0

Set of classes for creating statistical genetic programs.

Available Modules

module load libStatGen/1.0.15-GCC-11.3.0

Multi-format archive and compression library

Available Modules

module load libarchive/3.7.2-GCC-12.3.0

API for distributing embarrassingly parallel workloads using self-stabilization.

Available Modules

module load libcircle/0.3-iimpi-2020a

Heavily optimized library for DEFLATE/zlib/gzip compression and decompression.

Available Modules

module load libdeflate/1.10

Direct Rendering Manager runtime library.

Available Modules

module load libdrm/2.4.110

The DWARF Debugging Information Format is of interest to programmers working on compilers and debuggers

Available Modules

module load libdwarf/20201201-GCCcore-9.2.0

Library for handling OpenGL function pointer management

Available Modules

module load libepoxy/1.5.4-GCCcore-9.2.0

The libevent API provides a mechanism to execute a callback function when a specific event occurs on a file descriptor or after a timeout has been reached. Furthermore, libevent also support callbacks due to signals or regular timeouts.

Available Modules

module load libevent/2.1.8

module load libevent/2.1.8

The libffi library provides a portable, high level programming interface to various calling conventions. This allows a programmer to call any function specified by a call interface description at run-time.

Available Modules

Libgpg-error is a small library that defines common error values for all GnuPG components.

Available Modules

module load libgcrypt/1.9.2-GCCcore-9.2.0

GD is an open source code library for the dynamic creation of images by programmers.

Available Modules

module load libgd/2.2.5-GCC-7.1.0

Library for reading and writing coordinate system information from/to GeoTIFF files

Available Modules

libgit2 is a portable, pure C implementation of the Git core methods provided as a re-entrant linkable library with a solid API, allowing you to write native speed custom Git applications in any language which supports C bindings.

Available Modules

module load libgit2/1.6.4-GCC-12.3.0

libglvnd is a vendor-neutral dispatch layer for arbitrating OpenGL API calls between multiple vendors.

Available Modules

module load libglvnd/1.2.0

Libgpg-error is a small library that defines common error values for all GnuPG components.

Available Modules

module load libgpg-error/1.41-GCCcore-9.2.0

Arrays on GPU device memory, for Theano

Available Modules

module load libgpuarray/0.7.6-GCCcore-9.2.0-CUDA-8.0.61

ligtextutils is a dependency of fastx-toolkit and is provided via the same upstream

Available Modules

module load libgtextutils/0.7-gimkl-2017a

Libiconv converts from one character encoding to another through Unicode conversion

Available Modules

module load libiconv/1.17-GCC-12.3.0

module load libiconv/1.15-GCC-7.1.0

libjpeg-turbo is a fork of the original IJG libjpeg which uses SIMD to accelerate baseline JPEG compression and decompression. libjpeg is a library that implements JPEG image encoding, decoding and transcoding.

Available Modules

module load libjpeg-turbo/2.0.2-CrayGNU-23.02

module load libjpeg-turbo/2.1.3-GCCcore-9.2.0

GNU libmatheval is a library (callable from C and Fortran) to parse and evaluate symbolic expressions input as text.

Available Modules

module load libmatheval/1.1.11-gimkl-2017a

Generic PCI access library.

Available Modules

module load libpciaccess/0.16

Available Modules

module load libpmi/2
Links

libpng is the official PNG reference library

Available Modules

module load libpng/1.6.37-CrayGNU-23.02

module load libpng/1.6.37-GCCcore-9.2.0

The GNU Readline library provides a set of functions for use by applications that allow users to edit command lines as they are typed in. Both Emacs and vi editing modes are available. The Readline library includes additional functions to maintain a list of previously-entered command lines, to recall and perhaps reedit those lines, and perform csh-like history expansion on previous commands.

Available Modules

module load libreadline/8.1-CrayIntel-23.02-19

module load libreadline/8.0-GCCcore-7.1.0

SpatiaLite is an open source library intended to extend the SQLite core to support fully fledged Spatial SQL capabilities.

Available Modules

GNU libtool is a generic library support script. Libtool hides the complexity of using shared libraries behind a consistent, portable interface.

Available Modules

module load libtool/2.4.6

This library provides functions for manipulating Unicode strings and for manipulating C strings according to the Unicode standard.

Available Modules

module load libunistring/0.9.6-GCC-5.4.0

Define a portable and efficient C programming API to determine the call-chain of a program.

Available Modules

module load libunwind/1.5.0-GCCcore-9.2.0

libvdwxc is a general library for evaluating energy and potential for exchange-correlation (XC) functionals from the vdW-DF family that can be used with various of density functional theory (DFT) codes.

Available Modules

module load libvdwxc/0.4.0-foss-2023a

The WebM Project is dedicated to developing a high-quality, open video format for the web that's freely available to everyone.

Available Modules

module load libvpx/1.7.0-GCC-7.1.0

Libxc is a library of exchange-correlation functionals for density-functional theory. The aim is to provide a portable, well tested and reliable set of exchange and correlation functionals.

Available Modules

module load libxc/4.2.3-CrayGNU-23.02

Libxml2 is the XML C parser and toolchain developed for the Gnome project (but usable outside of the Gnome platform).

Available Modules

Libxslt is the XSLT C library developed for the GNOME project (but usable outside of the Gnome platform).

LIBXSMM is a library for small dense and small sparse matrix-matrix multiplications targeting Intel Architecture (x86).

Available Modules

Fast lossless compression algorithm.

Available Modules

module load libzstd/1.5.2-GCC-11.3.0

A web server.

Available Modules

module load lighttpd/1.4.59

Command line tools for Linux to support programmers in developing high performance multi threaded programs.

Available Modules

module load likwid/5.2.1-GCC-11.3.0

Mixed Integer Linear Programming (MILP) solver

Available Modules

The light-weight group library defines data structures and collective operations to group MPI processes as an ordered set.

Available Modules

module load lwgrp/1.0.4-iimpi-2020a

LZ4 is lossless compression algorithm, providing compression speed at 400 MB/s per core. It features an extremely fast decoder, with speed in multiple GB/s per core.

Available Modules

module load lz4/1.9.4-GCCcore-12.3.0

module load lz4/1.9.3-GCCcore-9.2.0
Links

Collection of utilities to manipulate multiple alignments in the Multiple Alignment Format

Available Modules

module load maf_stream/202005-GCC-12.3.0

The MAGMA project aims to develop a dense linear algebra library similar to LAPACK but for heterogeneous/hybrid architectures, starting with current Multicore+GPU systems.

Available Modules

module load magma/2.7.1-gimkl-2022a-CUDA-11.6.2

The makedepend package contains a C-preprocessor like utility to determine build-time dependencies.

Available Modules

module load makedepend/1.0.5-GCC-7.1.0

Manta calls structural variants (SVs) and indels from mapped paired-end sequencing reads.

Available Modules

module load manta/1.6.0-gimkl-2020a-Python-2.7.18

tracks and quantifies DNA damage patterns among ancient DNA sequencing reads generated by Next-Generation Sequencing platforms.

Available Modules

module load mapDamage/2.2.1

High performance toolkit for complete analysis of methylated RNA data.

Available Modules

module load meRanTK/1.1.1b

Medaka is a tool to create a consensus sequence from nanopore sequencing data.

Tool to extract high accuracy modified base and sequence variant calls from raw nanopore reads by anchoring the information rich basecalling neural network output to a reference genome/transcriptome.

Available Modules

module load megalodon/2.5.0-gimkl-2022a-Python-3.10.5

Flexible pipeline for genome-resolved metagenomic data analysis.

Available Modules

module load metaWRAP/1.2.1-Miniconda3

Completely overhauled tool which discovers microRNA genes by analyzing sequenced RNAs

Available Modules

module load miRDeep2/0.1.3-gimkl-2020a

The Met Office library is a collection of routines written in IDL at the Met Office, originally written in PV-WAVE in the Hadley Centre in the early 1990s, with development continuing to the present day. Its main purpose is to support analysis and visualisation of PP data produced by the Unified Model (UM).

Available Modules

module load midl/20180128-IDL-8.7

mimalloc is a general purpose allocator with excellent performance characteristics.

Available Modules

module load mimalloc/2.0.6-GCC-11.3.0

faster and more accurate reimplementation of BUSCO.

Available Modules

module load miniBUSCO/0.2.1-gimkl-2022a

Fast OLC-based de novo assembler for noisy long reads.

Available Modules

module load miniasm/0.3-20191007-GCC-11.3.0

Minimap2 is a fast sequence mapping and alignment program that can find overlaps between long noisy reads, or map long reads or their assemblies to a reference genome optionally with detailed alignment (i.e. CIGAR). At present, it works efficiently with query sequences from a few kilobases to ~100 megabases in length at an error rate ~15%. .

Aligns a protein sequence against a genome with affine gap penalty, splicing and frameshift..

Available Modules

module load miniprot/0.11-GCC-11.3.0

The mjpeg programs are a set of tools that can do recording of videos and playback, simple cut-and-paste editing and the MPEG compression of audio and video under Linux.

Available Modules

module load mjpegtools/2.1.0-GCC-7.1.0

Fast, and flexible C++ machine learning library with bindings to other languages

Available Modules

module load mlpack/3.4.2-gimkl-2020a

The Met Office library is a collection of routines written in IDL at the Met Office, originally written in PV-WAVE in the Hadley Centre in the early 1990s, with development continuing to the present day. Its main purpose is to support analysis and visualisation of PP data produced by the Unified Model (UM).

Available Modules

module load mo_tidl/20130812-IDL-8.7-GCC-7.1.0

A program to aggregate modified base counts stored in a modified-base BAM file to a bedMethyl file.

Available Modules

module load modbam2bed/0.9.4-GCC-11.3.0

Tool for working with modified bases from Oxford Nanopore

Available Modules

module load modkit/0.2.5-GCC-12.3.0

Fast BAM/CRAM depth calculation for WGS, exome, or targeted sequencing

Available Modules

module load mosdepth/0.3.4-GCC-11.3.0

This package provides Python bindings for the Message Passing Interface (MPI) standard. It is implemented on top of the MPI-1/2/3 specification and exposes an API which grounds on the standard MPI-2 C++ bindings.

Available Modules

module load mpi4py/3.0.0-gimpi-2018b-Anaconda3-5.2.0

MPI-Based File Utilities For Distributed Systems

Available Modules

module load mpifileutils/0.11-gimpi-2020a

muParser is an extensible high performance math expression parser library written in C++. It works by transforming a mathematical expression into bytecode and precalculating constant parts of the expression.

Available Modules

nano is a text editor.

Available Modules

module load nano/3.1

module load nano/3.1

Create fastQC-like plots for Oxford Nanopore sequencing data.

Available Modules

module load nanoQC/0.9.4-gimkl-2022a-Python-3.10.5

Filtering and trimming of long read sequencing data.

Available Modules

module load nanofilt/2.6.0-gimkl-2020a-Python-3.8.2

Functions to extract information from Oxford Nanopore sequencing data and alignments

Available Modules

A few simple math function for other Oxford Nanopore processing scripts

Available Modules

Software package for signal-level analysis of Oxford Nanopore sequencing data.

Available Modules

module load nanopolish/0.13.3-gimkl-2020a-Python-3.9.9

The SRA Toolkit and SDK from NCBI is a collection of tools and libraries for using data in the INSDC Sequence Read Archives.

Available Modules

module load ncbi-vdb/2.11.2-gimpi-2022a

nccmp compares two NetCDF files bitwise, semantically or with a user defined tolerance (absolute or relative percentage). Parallel comparisons are done in local memory without requiring temporary files. Highly recommended for regression testing scientific models or datasets in a test-driven development environment.

Available Modules

module load nccmp/1.8.5.0-GCC-7.1.0

The Ncurses (new curses) library is a free software emulation of curses in System V Release 4.0, and more. It uses Terminfo format, supports pads and color and multiple highlights and forms characters and function-key mapping, and has all the other SYSV-curses enhancements over BSD Curses.

Available Modules

module load ncurses/6.2-CrayIntel-23.02-19

module load ncurses/6.1

Visual browser for netCDF format files.

Available Modules

module load ncview/2.1.7-gimpi-2020a

ne is a free (GPL'd) text editor based on the POSIX standard that runs (we hope) on almost any UN*X machine. ne is easy to use for the beginner, but powerful and fully configurable for the wizard, and most sparing in its resource usage.

Available Modules

module load ne/3.0.1
Links

NeSI nearline client provides 'End User' access to NeSI's archive filesystem Nearline which is connected to a Tape library.

Available Modules

module load nearline/1.1.0.22

module load nearline/1.1.0.22

NetCDF (network Common Data Form) is a set of software libraries and machine-independent data formats that support the creation, access, and sharing of array-oriented scientific data.

NetCDF (network Common Data Form) is a set of software libraries and machine-independent data formats that support the creation, access, and sharing of array-oriented scientific data.

Available Modules

module load netCDF-C++/4.2-intel-2020a

NetCDF (network Common Data Form) is a set of software libraries and machine-independent data formats that support the creation, access, and sharing of array-oriented scientific data.

Available Modules

module load netCDF-C++4/4.3.1-iimpi-2020a

module load netCDF-C++4/4.3.0-iccifort-2018.1.163-GCC-7.1.0

NetCDF (network Common Data Form) is a set of software libraries and machine-independent data formats that support the creation, access, and sharing of array-oriented scientific data.

Node.js is a platform built on Chrome's JavaScript runtime for easily building fast, scalable network applications. Node.js uses an event-driven, non-blocking I/O model that makes it lightweight and efficient, perfect for data-intensive real-time applications that run across distributed devices.

Available Modules

Used to mask nucleic acid sequences

Available Modules

module load nseg/1998-GCC-9.2.0

nsync is a C library that exports various synchronization primitives, such as mutexes

Available Modules

module load nsync/1.24.0-GCCcore-9.2.0

Reads to report pipeline for bacterial isolate NGS data.

Available Modules

module load nullarbor/2.0.20191013

The numactl program allows you to run your application program on specific cpu's and memory nodes. It does this by supplying a NUMA memory policy to the operating system before running your program. The libnuma library provides convenient ways for you to add NUMA memory policies into your own program.

Available Modules

Data processing toolkit that contains the Oxford Nanopore Technologies' basecalling algorithms, and several bioinformatic post-processing features

Available Modules

module load ont-guppy-gpu/6.5.7

Prokaryotic Antiviral Defence LOCator

Available Modules

module load padloc/1.1.0-gimkl-2022a-R-4.2.1

CLI tools to process mapped Hi-C data

Available Modules

module load pairtools/1.0.2-gimkl-2022a-Python-3.10.5

A pangenome analysis pipeline.

Available Modules

module load panaroo/1.3.0-gimkl-2022a-Python-3.10.5

Almost universal document converter

Available Modules

module load pandoc/2.8.0.1

parallel fastq-dump wrapper

Available Modules

module load parallel-fastq-dump/0.6.6-gimkl-2020a-Python-3.8.2

parasail is a SIMD C (C99) library containing implementations of the Smith-Waterman (local), Needleman-Wunsch (global), and semi-global pairwise sequence alignment algorithms.

Available Modules

module load parasail/2.4.3

PatchELF is a small utility to modify the dynamic linker and RPATH of ELF executables.

Available Modules

module load patchelf/0.14.5-GCC-11.3.0

Tools for plotting Oxford Nanopore and other long-read data.

Available Modules

module load pauvre/0.2.3-gimkl-2022a-Python-3.10.5

pFUnit is a unit testing framework enabling JUnit-like testing of serial and MPI-parallel software written in Fortran.

Available Modules

module load pfunit/3.2.9-CrayIntel-23.02-19

PanGenome Graph Builder(pggb)

Available Modules

module load pggb/0.5.3-Miniconda3

pangenome graph evaluator

Available Modules

module load pgge/2021-Miniconda3

phyx performs phylogenetics analyses on trees and sequences.

Available Modules

module load phyx/1.3-gimkl-2022a

A set of tools (in Java) for working with next generation sequencing data in the BAM format.

Available Modules

module load picard/2.26.10-Java-11.0.4

parallel implementation of gzip,

Available Modules

module load pigz/2.7

Pixman is a low-level software library for pixel manipulation, providing features such as image compositing and trapezoid rasterization. Important users of pixman are the cairo graphics library and the X server.

Available Modules

module load pixman/0.42.2-GCC-12.3.0

File format for storing nanopore dna data in an easily accessible way.

Available Modules

module load pod5/0.3.2-gimkl-2022a

Places query sequences on a fixed reference phylogenetic tree to maximize phylogenetic likelihood or posterior probability according to a reference alignment

Available Modules

module load pplacer/1.1.alpha19

Software for predicting library complexity and genome coverage in high-throughput sequencing.

Available Modules

module load preseq/3.2.0-GCC-11.3.0

Prodigal (Prokaryotic Dynamic Programming Genefinding Algorithm) is a microbial (bacterial and archaeal) gene finding program developed at Oak Ridge National Laboratory and the University of Tennessee.

Available Modules

module load prodigal/2.6.3-GCCcore-7.4.0

A fork of Prodigal meant to improve gene calling for giant viruses and viruses that use alternative genetic codes.

Available Modules

module load prodigal-gv/2.9.0-GCC-11.3.0

Prokka is a software tool for the rapid annotation of prokaryotic genomes.

Available Modules

module load prokka/1.14.5-GCC-11.3.0

Google Protocol Buffers.

Infers population size history from a diploid sequence using the PSMC model.

Available Modules

module load psmc/0.6.5-gimkl-2018b

pstoedit translates PostScript and PDF graphics into other vector formats.

Available Modules

module load pstoedit/3.75-GCCcore-9.2.0

module load pstoedit/3.70-GCC-7.1.0

Utility program for extracting sequences from a fasta/fastq file

Available Modules

module load pullseq/1.0.2

purge haplotigs and overlaps in an assembly based on read depth

Available Modules

module load purge_dups/1.2.6-gimkl-2022a-Python-3.10.5

Pipeline to help with curating heterozygous diploid genome assemblies

Available Modules

module load purge_haplotigs/1.1.2-gimkl-2022a-Perl-5.34.1

Monitors the progress of data through a unix pipeline.

Available Modules

module load pv/1.6.6
Links

Whole-genome classification using Average Nucleotide Identity

Available Modules

module load pyani/0.2.12-gimkl-2022a-Python-3.10.5

Computes metrics and generates interactive QC plots for Oxford Nanopore technologies sequencing data.

Available Modules

module load pycoQC/2.5.2-gimkl-2020a-Python-3.8.2

PyMOL (open source version) molecular visualization system.

Available Modules

module load pymol-open-source/2.5.0

python implementation of SEER(Sequence element enrichment analysis)

Available Modules

module load pyseer/1.3.11-gimkl-2022a-Python-3.11.3

Python bindings to spoa.

Available Modules

module load pyspoa/0.0.8-gimkl-2018b-Python-3.8.1

Command-line tool for demultiplexing Oxford Nanopore reads from FASTQ files

Available Modules

module load qcat/1.1.0-gimkl-2020a-Python-3.8.2

rDock is a fast and versatile Open Source docking program that can be used to dock small molecules against proteins and nucleic acids. It is designed for High Throughput Virtual Screening (HTVS) campaigns and Binding Mode prediction studies. rDock is mainly written in C++ and accessory scripts and programs are written in C++, perl or python languages.

Available Modules

module load rDock/2013.1-gimkl-2018b

Minimum free energy of folding randomization test software

Available Modules

module load randfold/2.0.1-GCC-9.2.0

Randomly subsample sequencing reads to a specified coverage.

Available Modules

module load rasusa/0.6.1-GCCcore-9.2.0

Tool for mapping millions of short genomic reads onto a reference genome.

Available Modules

module load razers3/3.5.8

Rclone is a command line program to sync files and directories to and from a variety of online storage services

Available Modules

module load rclone/1.62.2

re2c is a free and open-source lexer generator for C and C++. Its main goal is generating fast lexers: at least as fast as their reasonably optimized hand-coded counterparts. Instead of using traditional table-driven approach, re2c encodes the generated finite state automata directly in the form of conditional jumps and comparisons.

Available Modules

module load re2c/1.3

A common set of C++ infrastructure and CMake utilities used by various components of Intel® oneAPI Rendering Toolkit.

Available Modules

module load rkcommon/1.6.0-GCCcore-9.2.0

Tool for evaluating RNA-Seq assemblies using reference genome and gene database

Available Modules

module load rnaQUAST/2.2.2-gimkl-2022a

rsync is an open source utility that provides fast incremental file transfer. rsync is freely available under the GNU General Public License

Available Modules

module load rsync/3.2.7

FM-index Long Read Corrector (Rust implementation)

Available Modules

module load rust-fmlrc/0.1.7-GCC-11.3.0

samblaster is a fast and flexible program for marking duplicates in read-id grouped paired-end SAM files. It can also optionally output discordant read pairs and/or split read mappings to separate SAM files, and/or unmapped/clipped reads to a separate FASTQ file. When marking duplicates, samblaster will require approximately 20MB of memory per 1M read pairs.

Available Modules

module load samblaster/0.1.26-GCC-11.3.0

Filter SAM file for soft and hard clipped alignments.

Available Modules

module load samclip/0.4.0-GCC-11.3.0

Interface to various variant calling formats.

Available Modules

module load savvy/2.1.0-GCC-11.3.0

sbt is a build tool for Scala, Java, and more.

Available Modules

module load sbt/1.7.2-Java-1.8.0_144
Links

Bioconductor bundle for single-cell RNA-Seq Data analysis

Available Modules

module load sc-RNA/2023-07-gimkl-2022a-R-bundle-Bioconductor-3.15-R-4.2.1

Pipeline that screens for presence of genes of interest (GOI) in bacterial assemblies.

Available Modules

module load screen_assembly/1.2.8-gimkl-2022a-Python-3.11.3

Seqmagick is a utility built in the spirit of imagemagick to expose the file format conversion in Biopython in a convenient way. Instead of having a big mess of scripts, there is one that takes arguments.

Available Modules

module load seqmagick/0.8.4-gimkl-2020a-Python-3.8.2

Seqtk is a fast and lightweight tool for processing sequences in the FASTA or FASTQ format. It seamlessly parses both FASTA and FASTQ files which can also be optionally compressed by gzip.

Available Modules

module load seqtk/1.4-GCC-11.3.0

A std::streambuf wrapper for compression formats.

Available Modules

module load shrinkwrap/1.2.0-GCC-11.3.0

Shumlib is the collective name for a set of libraries which are used by the UM; the UK Met Office's Unified Model, that may be of use to external tools or applications where identical functionality is desired. The hope of the project is to enable developers to quickly and easily access parts of the UM code that are commonly duplicated elsewhere, at the same time benefiting from any improvements or optimisations that might be made in support of the UM itself.

A general-purpose genome simulator

Available Modules

module load simuG/1.0.1-GCC-11.3.0

Simulation of In Silico Multi-Omic Networks R package.

Available Modules

module load sismonr/2.0.0-gimkl-2020a-R-4.1.0

Toolkit for converting (FAST5 <-> SLOW5), compressing, viewing, indexing and manipulating data in SLOW5 format.

Available Modules

module load slow5tools/0.8.0-gimpi-2022a

Smafa attempts to align or cluster pre-aligned biological sequences, handling sequences which are all the same length.

Available Modules

module load smafa/0.5.0-GCC-11.3.0

simplifies and speeds calling and genotyping SVs for short reads.

Available Modules

module load smoove/0.2.8-Miniconda3

SNAPHU is an implementation of the Statistical-cost, Network-flow Algorithm for Phase Unwrapping proposed by Chen and Zebker

Available Modules

module load snaphu/2.0.4-GCCcore-9.2.0

Snappy is a compression/decompression library. It does not aim for maximum compression, or compatibility with any other compression library; instead, it aims for very high speeds and reasonable compression.

Available Modules

module load snappy/1.1.9-GCCcore-9.2.0

Finds SNP sites from a multi-FASTA alignment file.

Available Modules

module load snp-sites/2.5.1-GCC-11.3.0

SnpEff is a variant annotation and effect prediction tool. It annotates and predicts the effects of genetic variants (such as amino acid changes).

Available Modules

module load snpEff/5.0-Java-11.0.4

extract informative sites, evaluate relatedness, and perform quality-control on BAM/CRAM/BCF/VCF/GVCF

Available Modules

module load somalier/0.2.16

Stand-alone program that maps and aligns a set of cDNA or protein sequences onto a whole genomic sequence in a single job.

Available Modules

module load spaln/2.4.13f-GCC-11.3.0

Fast C++ logging library.

Available Modules

module load spdlog/1.10.0-GCCcore-11.3.0

Splat deploys (builds and installs) the components of heterogenous systems such as cylc suites, described by splat config files, into target directory trees.

Available Modules

module load splat/1.0.5

module load splat/1.0.5

c++ implementation of the partial order alignment (POA) algorithm which is used to generate consensus sequences

Available Modules

module load spoa/4.0.7-GCC-9.2.0

The SRA Toolkit, and the source-code SRA System Development Kit (SDK), will allow you to programmatically access data housed within SRA and convert it from the SRA format

Available Modules

module load sratoolkit/3.0.2

Provides an mpirun script which merely wraps the srun command.

Available Modules

module load srun-wrapper/1.0

module load srun-wrapper/1.0
Links

Text editor with GUI

Available Modules

module load sublime/3

module load sublime/3

A robust and fast clustering method for amplicon-based studies. The purpose of swarm is to provide a novel clustering algorithm that handles massive sets of amplicons. Results of traditional clustering algorithms are strongly input-order dependent, and rely on an arbitrary global clustering threshold. swarm results are resilient to input-order changes and rely on a small local linking threshold d, representing the maximum number of differences between two amplicons.

Available Modules

module load swarm/3.1.4-GCC-11.3.0

Perl module for reading and writing UniProtKB data in plain text format.

Available Modules

module load swissknife/1.80-GCC-9.2.0-Perl-5.30.1

Transfer RNA detection

Available Modules

module load tRNAscan-SE/2.0.7-GCC-9.2.0

Generic indexer for TAB-delimited genome position files

Available Modules

module load tabix/0.2.6-GCCcore-9.2.0

C++ wrapper to tabix indexer

Available Modules

module load tabixpp/1.1.2-GCC-11.3.0

Intel(R) Threading Building Blocks (Intel(R) TBB) lets you easily write parallel C++ programs that take full advantage of multicore performance, that are portable, composable and have future-proof scalability.

Available Modules
Links

Command-line program that automates the creation of sequence records for submission to GenBank

Available Modules

module load tbl2asn/20200519-linux64

tmux is a terminal multiplexer. It lets you switch easily between several programs in one terminal, detach them (they keep running in the background) and reattach them to a different terminal.

Available Modules

module load tmux/2.8

Locates tandem repeats in DNA sequences.

Available Modules

module load trf/4.09
Links

Tool for automated alignment trimming in large-scale phylogenetic analyses

Available Modules

module load trimAl/1.4.1-GCC-11.3.0

um2netcdf converts Unified Model output files in fieldsfile format into netCDF format.

Available Modules

module load um2netcdf/3.2.2-GCC-7.1.0

Fork of minimap2 optimized for assembly-to-reference alignment.

Available Modules

module load unimap/0.1-GCC-11.3.0

RAR is a powerful archive manager.

Available Modules

module load unrar/6.0.1-GCCcore-9.2.0

Set of Linux utilities

Available Modules

module load util-linux/2.38

Genetic variant detector designed to find polymorphisms smaller than the length of a short-read sequencing alignment.

Available Modules

module load vcflib/1.0.1-GCC-9.2.0

Hybrid genome assembly pipeline developed for telomere-to-telomere assembly of PacBio HiFi and Oxford Nanopore reads

Available Modules

module load verkko/2.0-Miniconda3

variation graph data structures, interchange formats, alignment, genotyping, and variant calling methods

Available Modules

module load vg/1.46.0
Links

Wgsim is a small tool for simulating sequence reads from a reference genome.

Available Modules

module load wgsim/20111017-GCC-11.3.0

A built-package format for Python.

Available Modules

module load wheel/0.31.1-gimkl-2020a-Python-2.7.18

de novo sequence assembler for long noisy reads produced by PacBio or Oxford Nanopore Technologies.

Available Modules

module load wtdbg/2.5-GCC-9.2.0

widget toolkit and tools library for creating graphical user interfaces for cross-platform applications.

Available Modules

module load wxWidgets/3.1.4-GCC-9.2.0

x264 is a free software library and application for encoding video streams into the H.264/MPEG-4 AVC compression format, and is released under the terms of the GNU GPL.

Available Modules

x265 is a free software library and application for encoding video streams into the H.265 AVC compression format, and is released under the terms of the GNU GPL.

Available Modules

module load x265/3.5-GCC-11.3.0

keyboard keymap compiler and support library

Available Modules

module load xkbcommon/0.10.0

xtb - An extended tight-binding semi-empirical program package.

Available Modules

module load xtb/6.6.1-foss-2023a
Links

Chimeric Read Detector for long reads

Available Modules

module load yacrd/1.0.0-GCC-11.3.0

Yet Another JSON Library. Why does the world need another C library for parsing JSON? Good question.

Available Modules

module load yajl/2.1.0-GCCcore-9.2.0

Yet another k-mer analyzer

Available Modules

module load yak/0.1
Links

YAML parser and emitter in C++

Available Modules

zlib is designed to be a free, general-purpose, legally unencumbered -- that is, not covered by any patents -- lossless data-compression library for use on virtually any computer hardware and operating system.

Zstandard is a real-time compression algorithm, providing high compression ratios. It offers a very wide range of compression/speed trade-off, while being backed by a very fast decoder. It also offers a special mode for small data, called dictionary compression, and can create dictionaries from any sample set.

Available Modules

module load zstd/1.5.5-GCCcore-12.3.0

module load zstd/1.5.2-GCCcore-9.2.0

If the application you are looking for is not here you can make software installation request or try compiling it yourself.

You can help contribute to this list by tagging software here.