Collection of Arxiv papers of the Gemma model family

Gen AI
A collection of Arxiv papers describing main innovations of Gemma model family.
Author

Rafa Sanchez

Published

June 12, 2024

This post is also published in Medium and LinkedIn.

Gemma is not just a model, it’s a family of models and tools built from the same research and technology used to create the Gemini models of Google.

Gemma was released just 10 months ago, and the amount of innovations released by Google related to this model is just impressive.

Here is a collection of Arxiv papers showing the main innovations introduced by Google on Gemma in the last 10 months.

This is a summary of all the Arxiv papers describing the variety of innovations introduced in Gemma:

Non-Google researchers have also contributed to the Gemma ecosystem with papers:

Deploying open models should not be difficult and has never been so easy with a cloud platform like Vertex AI, which allows deployment, fine-tuning, evaluation and many other tasks on Gemma.

Additionally, you can also find pre-trained models and code on Hugging Face and Kaggle.

If you have read or published a paper on Gemma not included above, would appreciate if you can include it in the comments.

Blog posts in reverse chronological order

[1] Blog post Gemma 1 announcement

[2] Blog post New Gemma 1 variants

[3] Blog post PaliGemma 1, Gemma 2, and an Upgraded Responsible AI Toolkit

[4] Blog post Gemma 1 explained: Gemma models family architectures

[5] Blog post Gemma 1 explained: RecurrentGemma Architectures

[6] Blog post Gemma 1 explained: PaliGemma Architectures

[7] Blog post Gemma 1 explained: What’s new in Gemma 2

[8] Blog post Advancing Multilingual AI with Gemma 2 and a $150K Challenge

[9] Blog post ShieldGemma and Gemma Scope

[10] Blog post DataGemma

[11] Blog post PaliGemma 2