Qwen3-4B-2507-Claude-4.6-Opus-Reasoning-Distilled-GGUF : GGUF
This model was finetuned and converted to GGUF format using Unsloth.
Example usage:
- For text-only LLMs:
./llama.cpp/llama-cli -hf Jackrong/Qwen3-4B-2507-Claude-4.6-Opus-Reasoning-Distilled-GGUF --jinja
- For multimodal models:
./llama.cpp/llama-mtmd-cli -hf Jackrong/Qwen3-4B-2507-Claude-4.6-Opus-Reasoning-Distilled-GGUF --jinja
Available Model files:
- qwen3-4b-thinking-2507.Q8_0.gguf
- qwen3-4b-thinking-2507.Q4_K_M.gguf
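To load one specific quantization rather than the default, llama.cpp accepts an optional `:quant` suffix on the `-hf` repository argument. The sketch below assumes llama.cpp has been built under `./llama.cpp` (as in the usage example) and that the suffix resolves to the files listed above. The variable names are illustrative only.

```shell
# Repository name is taken from the usage example above.
REPO="Jackrong/Qwen3-4B-2507-Claude-4.6-Opus-Reasoning-Distilled-GGUF"
# Assumption: the quant tag matches a listed file (Q8_0 or Q4_K_M).
QUANT="Q8_0"
# Assumption: llama.cpp binaries live in ./llama.cpp, as in the usage example.
LLAMA_CLI="./llama.cpp/llama-cli"

if [ -x "$LLAMA_CLI" ]; then
  # Download (if needed) and run the selected quantization.
  "$LLAMA_CLI" -hf "${REPO}:${QUANT}" --jinja
else
  echo "llama-cli not found at $LLAMA_CLI; build llama.cpp first"
fi
```

Q8_0 is the larger, higher-precision file. Setting `QUANT="Q4_K_M"` selects the smaller 4-bit file instead.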
Ollama
An Ollama Modelfile is included for easy deployment.
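A minimal sketch of using the bundled Modelfile follows. It assumes Ollama is installed and that the Modelfile and the GGUF file it references have been downloaded into the current directory. The local model name `qwen3-4b-distilled` is a hypothetical choice, not one defined by the repository.

```shell
# Hypothetical local name for the model inside Ollama.
MODEL_NAME="qwen3-4b-distilled"
# Assumption: the repository's Modelfile sits in the current directory.
MODELFILE="Modelfile"

if command -v ollama >/dev/null 2>&1 && [ -f "$MODELFILE" ]; then
  # Register the model with Ollama from the Modelfile.
  ollama create "$MODEL_NAME" -f "$MODELFILE"
  # Send a single prompt; omit the prompt to start an interactive session.
  ollama run "$MODEL_NAME" "Summarize what GGUF quantization does."
else
  echo "ollama or $MODELFILE not found; skipping"
fi
```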
This model was trained 2x faster with Unsloth.
