Playing with DeepSeek R1 Distill Qwen 1.5B:
🚀 Playing with DeepSeek R1 Distill Qwen 1.5B: So I tried out DeepSeek R1, the distilled 1.5B small version because resources. lol. Its a tiny yet powerful 1.5B parameter Non quantized model, using Group Relative Policy Optimization (GRPO) for reinforcement learning. All of DeepSeek’s models are open source, and DeepSeek has been making news lately about how they managed to pull off powerful models using little resources and even skipping steps everyone thought were necessary to develop powerful models. They did all this with the smallest budgets and not-so-powerful GPUs. And the fact that it’s open source is a whole other thing, because before all this, the only powerful open-source models were Meta’s lineup of LLaMA models. So, having a new player that’s just as powerful, costs a tenth of what industry leaders charge, and is open source is a really big deal. 🔍 Here are some of my takeaways from the hands-on experience: And remember, this is the Tiny Tiny version, only 1.5B. ...