This article highlights the advancements in small language models, specifically those with under 7 billion parameters, which can now run on consumer GPUs or even laptops. It emphasizes that these models are now capable of performing tasks that were previously only achievable by much larger models, thanks to improvements in training data quality, distillation techniques, and architectural innovations like Mixture-of-Experts (MoE). The article provides a curated list of the best small language models available on Hugging Face, along with their capabilities and benchmark scores.
入选理由:Small language models under 7 billion parameters are now capable of performing complex tasks previously reserved for much larger models.