Google DeepMind | GSoC Organizations

@anxkhn (Anas Khan)

This project will evaluate Google's Gemini (2.5 Pro, 2.0 Flash) and Gemma (3, CodeGemma) models by extending multiple open-source coding benchmarks...

Adarsh J. Dubey

This project is about building a user-friendly web interface using Gradio that makes fine-tuning Gemma models simple and accessible for everyone—even...

Adel Muursepp

The project will close the documentation and evaluation gap for Google’s Gemini models by contributing structured onboarding guides and benchmarking...

Amrit Rai

This project aims to rigorously evaluate Google’s Gemini multimodal model on real-world tasks. We plan to address the limitations of current...

Andy L

This project will elevate Gemini API support across widely used open-source agent frameworks like LangChain, LlamaIndex, CrewAI, and...

AndyC

A majority of the Gemma chat applications on Hugging Face Spaces do not allow the user to adjust generation settings or system prompts, giving the...

andycandy

This proposal aims to upgrade and expand existing open-source tutorials and examples to support the new unified Gemini SDKs for JavaScript/TypeScript...

Aniket.Saxena

This project aims to create a Gemini Workspace in Postman for interacting with the Gemini APIs and providing a central hub for exploration,...

Aryan Saboo

EchoGem introduces a novel batching engine designed to answer multiple questions about the same source in parallel, substantially reducing response times....

Chen-Hao Wu

Gemma is a lightweight, open-source large language model by Google DeepMind. This project aims to build an intuitive web interface for fine-tuning...

Green Code

The goal of this project is to develop a multi-modal and open-source benchmark with which to evaluate Gemini 2.0. Open-source benchmarks are an...

Guan-Ming (Wesley), Chiu

This proposal modernizes Gemini SDK learning resources by migrating outdated examples to JavaScript / TypeScript, developing new tutorials, and...

Haibo Yang

This project aims to develop a comprehensive Postman workspace for Google's Gemini API, providing a central hub for exploration, integration, and...

Hailey Cheng (Cheng Hei Lam)

This project will develop a robust, extensible benchmarking suite designed for Google's Gemma models to address the lack of standardized evaluation...

Indominus

Google DeepMind’s Gemini is a powerful multimodal model that excels at language comprehension, visual processing, and tool interaction. However,...

Jeet Dekivadia

HALO is an innovative open-source framework designed to optimize Gemini API usage for long-context video analysis. By leveraging hierarchical...

Jet Chiang

Fine-tuning LLMs like Gemma requires deep technical understanding and complex configuration, hindering rapid prototyping and broader application....

Jevon Mao

This project proposes the creation of a comprehensive Postman Workspace tailored for Google’s Gemini API suite. It will offer developers a robust,...

krishnaagrawal

A VS Code (or JetBrains) extension to provide AI-powered coding assistance using Google’s Gemini API. This tool enhances the developer experience by...

Lorenzo Drudi

The goal of this project is to create a developer-friendly Postman Workspace for interacting with the Gemini API. This workspace will serve as a...

Lucas Martins

This proposal aims to develop the Gemma LLM Garage, a full-stack interface to manage datasets and fine-tune Gemma models. Its main goal is to...

lucas-maes

This project aims to develop a more rigorous and focused evaluation testbed than that used by Garrido et al. (2025), with the specific goal of...

Matthew Nguyen

As multimodal agents become increasingly integrated into real-world applications, ensuring their safe and reliable tool-use behavior is paramount. We...

Mikhail Sinitcyn

This project aims to develop a Python library to support Xarray data (labeled multi-dimensional array library supported by DeepMind) with JAX...

msaadg

This project aims to enhance Gemini API integrations in open-source software (OSS) agent tools, specifically LangChain and LlamaIndex, by addressing...

Myan (My Anh) Vu

OSS-Fuzz-Gen, a framework using LLMs for fuzz target generation and evaluation by Google, currently has a basic experiment report UI alongside a...

Nattaput Namchittai

Fine-tuning MLLMs to achieve test-time scaling and strong reasoning performance for visual/spatial tasks can be very expensive because of the...

Nawaf Alampara

Scientific discovery fundamentally relies on integrating and reasoning over multimodal information—text, diagrams, plots, spectra, microscopy images,...

Phillip Daniel

The aim of this project is to develop a code sample in Python that demonstrates some of the key capabilities of Google's Gemini APIs with regard to...

Pranav Agrawal

Reproducibility as Accuracy (RaA) is a benchmark which aims to evaluate how effective multimodal AI systems are in preserving information fidelity...

Preston Tjandra

When attempting to utilize Google's Gemini APIs, developers face a steep learning curve. The scattered documentation, complex parameter...

Rodrigo Sagastegui

AI models with function-calling capabilities can be powerful tools for solving real-world problems, but experimenting with them isn’t always...

Ryan Rong

GemmaEval is a comprehensive, automated benchmarking framework designed to evaluate, visualize, and compare Gemma language models against other open...

Saravan_Kumar

This project aims to advance the evaluation framework for Google’s Gemini 2.0 and Gemma 3-27B multimodal models by integrating a diverse set of...

Sean Brar

This project develops an efficient framework for analyzing educational video content using the Gemini API. The approach combines optimized batch...

Siddharth Sahu

Current manual evaluation methods for LLM-based AI applications are unsustainable and resource-intensive. While using LLMs as judges offers a...

tesims

The goal of this project is to help strengthen the Concordia framework by developing and open-sourcing a collection of new language model agent...

Ton Hoang Nguyen (Bill)

This project aims to enhance the integration of Google's Gemini AI into open-source IDE extensions, focusing on developing and improving user...

Triyan Mukherjee

The Gemini Cookbook is a set of sample applications and tutorials illustrating different functionalities of the Gemini APIs. The intent of this...

vanshksingh

This project aims to build a production-grade, open-source code sample that showcases batch question answering over long-context inputs using Google...

Vedant Kulkarni

This project aims to: (a) explore technical possibilities, enhance specifications, and find applications for specific use cases and...

Wale

This project addresses critical gaps in Gemini API integration across leading agent frameworks (LangChain, LlamaIndex, CrewAI, Composio), where...

Yang Ouyang

This project aims to design a robust, reproducible benchmark for evaluating the Gemini family of multimodal large language models (MLLMs), such as...

Yashdeep Prasad

Enable and document best practices for function calling in Gemma models, similar to the structured tool-use capabilities. In essence, function...

Zewei Wang

This project aims to develop a standalone Python SDK that provides researchers with a streamlined and well-documented API for interacting with...
