We show that BlackMamba performs competitively against both Mamba and transformer baselines, and outperforms them in inference and training FLOPs. We fully train and open-source 340M/1.5B and 630M/2.8B BlackMamba models.