Models
To avoid unnecessary storage use, we maintain readonly versions of popular models in /opt/nesi/models, if you can use this please do.
If you need a model that is not listed here, please Contact our Support Team with the model name, source, and a brief description of your use case.
Available models¶
| Model | Licence | Path | Slurm |
|---|---|---|---|
| Llama 3.1 | Meta Llama 3.1 | |
|
|
|
||
| DeepSeek-R1 | MIT | |
|
|
|
||
|
|
||
| Qwen3 | Apache 2.0 | |
|
|
|
||
| Qwen2.5 | Apache 2.0 | |
|
|
|
||
| Gemma 3 | Gemma | |
|
The Slurm column shows the minimum GPU flags required, your actual throughput will depend on the queue size. See Hardware for a full list of available GPUs.
L4 GPUs have no double-precision floating point
The L4 is an inference-optimised GPU. It is suitable for running quantised models but should not be used for model training or workflows that require FP64 precision.
Related¶
- [Ollama](../Software/Available_Applications/ollama.md).
- [Hardware](../Batch_Computing/Hardware.md).