Meta Releases Quantized Llama 3.2 with 4x Inference Speed on Android Phones

October 25, 2024

Meta has introduced quantized versions of its Llama 3.2 models, enhancing on-device AI performance with up to four times faster inference.

Search

RECENT PRESS RELEASES