Meta AI Releases New Quantized Versions of Llama 3.2 (1B & 3B): Delivering Up To 2-4x Increases in Inference Speed and 56% Reduction in Model Size

October 24, 2024

The rapid growth of large language models (LLMs) has brought significant advancements across various sectors, but it has also presented considerable challenges. Models such as Llama 3 have made impressive strides in natural language understanding and generation, yet their size and computational requirements have often limited their practicality. High energy costs, lengthy training times, and the need for expensive hardware are barriers to accessibility for many organizations and researchers. These challenges not only impact the environment but also widen the gap between tech giants and smaller entities trying to leverage AI capabilities. Meta AI’s Quantized Llama 3.2 Models (1B and