Google just re-entered the game 馃敟馃敟
They want to take the crown 馃憫 back from Chinese open source AI.
And... Gemma 4 is FINALLY Apache 2.0 aka real-open-source-licensed.
From what I've seen it's going to be a pretty significant model. But give it a try yourself today:
brew install llama.cpp --HEAD
if you have at least 24GB of RAM or VRAM, run the (very good) 26B MOE:
llama-server -hf ggml-org/gemma-4-26B-A4B-it-GGUF:Q4_K_M
if you have 16GB of RAM or VRAM, run the dense E4B:
llama-server -hf ggml-org/gemma-4-E4B-it-GGUF:Q8_0

