Multi-Locus Sequence Typing#

In this practical we will perform the genome typing by MLST analysis on the assembly of Klebsiella pneumoniae, using the contigs/scaffolds produced by the hybrid assembly in a previous tutorial.

Prepare our computing environment#

We will first run the appropriate srun command to book the computing cores (cpus) on the cluster.

Tip

You need to ask the teacher which partition to use !

srun -p SELECTED_PARTITION --cpus-per-task 2 --pty bash -i

You are now on a computing node, with computing 2 cpus reserved for you. That way, you can run commands interactively.

If you want to exit the srun interactive mode, press CTRL+D or type exit

Run the MLST program#

We should now identify our fasta file containing the contigs from the Unicycler hybrid assembly, produced from long and short reads. This will be the input file for mlst.

cd results
ls -l

Now we will run the MLST tool:

module load system/singularity/3.6.0
singularity run /path/to/mlst_singularity_container ...

mlst K2_unicycler_scaffolds.fasta > K2_unicycler_scaffolds_mlst.tsv

Tip

Ask the teacher for the MLST container path.

Once finished, you can exit the container by typing exit or press CTRL+D

Question

Check the output file. What is the sequence type of this strain?

Question

If the (hybrid) assemblies of the other Klebsiella pneumoniae strains are provided, try to make a loop to run the MLST analysis on all.