Back to Newsroom
August 27, 2024
PALO ALTO, CALIFORNIA

Zyphra is excited to release Zamba2-mini, a state-of-the-art small language model for on-device applications.

Zamba2-mini achieves highly competitive evaluation scores and performance numbers and fits in a tiny memory footprint of <700MB at 4bit quantization.

7x drop in params for same performance ; Zamba2- mini (1.2B) ~ Llama2 7B

Authors
Paolo Glorioso, Quentin Anthony, Yury Tokpanov, James Whittington, Jonathan Pilault, Beren Millidge
Collaborators
Daniel A Roberts (Sequoia Capital & MIT), Andrey Gromov (Meta FAIR), Kushal Tirumala (Meta FAIR) and Hassan Shapourian (Cisco)