Video is unavailable for watching
Show in Telegram
⚡️ Introducing DeepThought-8B: Transparent reasoning model built on LLaMA-3.1 with test-time compute scaling.
- JSON-structured thought chains & controllable inference paths.
- ~16GB VRAM, competitive w/ 70B models.
- Open model weights, and inference scripts.
https://huggingface.co/ruliad/deepthought-8b-llama-v0.01-alpha
@opendatascience
- JSON-structured thought chains & controllable inference paths.
- ~16GB VRAM, competitive w/ 70B models.
- Open model weights, and inference scripts.
https://huggingface.co/ruliad/deepthought-8b-llama-v0.01-alpha
@opendatascience