Skip to content

Kraken2

Kraken2 is a system for assigning taxonomic labels to short DNA sequences, usually obtained through metagenomic studies.

Kraken2 is available as a module on Apocrita.

Usage

To run the default installed version of Kraken2, simply load the kraken2 module:

module load kraken2

For usage documentation, run kraken2 -h.

Example job

Do not use --memory-mapping

Kraken2 has a --memory-mapping flag which is not suitable for cluster use and puts enormous strain on the storage infrastructure, as it avoids loading your database into RAM and instead uses it directly from disk.

Please do not use it and instead let Kraken2 fully load your database into RAM (the default behaviour).

Please check your database size and ensure you request enough RAM (plus an additional allowance to allow for overhead) to load the entire database into RAM before Kraken2 runs.

Serial job

Here is a Kraken2 example job running on 1 core and 5GB of memory:

#!/bin/bash
#$ -cwd
#$ -j y
#$ -pe smp 1
#$ -l h_rt=1:0:0
#$ -l h_vmem=5G

module load kraken2

kraken2 --db example2_db \
        --threads ${NSLOTS} \
        input_file

References