GGUF
English
llama.cpp
unsloth
conversational

Devstral Small 2505 - Deepseek v3.2 Speciale Distill

This model was trained on a non-reasoning (reasoning traces were removed) dataset of DeepSeek v3.2 Speciale.

  • 🧬 Datasets:

    • TeichAI/deepseek-v3.2-speciale-OpenCodeReasoning-3k
    • TeichAI/deepseek-v3.2-speciale-1000x
    • TeichAI/deepseek-v3.2-speciale-openr1-math-3k
  • 🏗 Base Model:

    • unsloth/Devstral-Small-2505
  • ⚡ Use cases:

    • Coding
    • Math
    • Chat
    • Deep Research

Downloads last month
1,988
GGUF
Model size
24B params
Architecture
llama
Hardware compatibility
Log In to add your hardware

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for TeichAI/Devstral-Small-2505-Deepseek-V3.2-Speciale-Distill-GGUF

Datasets used to train TeichAI/Devstral-Small-2505-Deepseek-V3.2-Speciale-Distill-GGUF