Evaluate Gemini on an Open-Source Benchmark: OpenUI Eval
@anxkhn (Anas Khan)
This project evaluates large language models on frontend coding and UI generation tasks by extending and integrating ideas from open-source coding...
Gemma Facet: End-to-End Fine-Tuning Platform for Gemma Models
Adarsh J. Dubey
Gemma Facet is a comprehensive platform that provides an end-to-end solution for fine-tuning Gemma language models through a microservices...
Improving Gemini Documentation for Open Source Model Providers Promptfoo and Weights & Biases
Adel Muursepp
The project will close the documentation and evaluation gap for Google’s Gemini models by contributing structured onboarding guides and benchmarking...
Enhance Gemini API Integrations in OSS Agents Tools
Andy L
This project will elevate Gemini API support across widely used open-source agent frameworks like LangChain, LlamaIndex, CrewAI, and...
Gemma Chat Gradio Demo
AndyC
A majority of the Gemma chat applications on Hugging Face Spaces do not allow the user to adjust generation settings or system prompts, giving the...
Unified Gemini Example Cookbook: Migrating and Modernizing Open-Source Learning Resources
andycandy
This proposal aims to upgrade and expand existing open-source tutorials and examples to support the new unified Gemini SDKs for JavaScript/TypeScript...
Develop a Gemini Workspace in Postman
Aniket.Saxena
This project aims to create a Gemini Workspace in Postman for interacting with the Gemini API’s and providing a central hub for exploration ,...
EchoGem – Teaching Gemini to Think in Batches by Prioritizing What Matters
Aryan Saboo
EchoGem introduces a novel batching engine designed to answer multiple questions about the same source parallelly to reduce response times heavily....
Gemma Model Fine-tuning UI
Chen-Hao Wu
Gemma is a lightweight, open-source large language model by Google DeepMind. This project aims to build an intuitive web interface for fine-tuning...
Creation of a Creative Thinking Benchmark
Green Code
The goal of this project is to develop a multi-modal and open-source benchmark with which to evaluate Gemini 2.0. Open-source benchmarks are an...
Develop Gemini Examples in Swift
Haibo Yang
This project aims to refactor the firebase/quickstart-ios to demonstrate the latest Firebase AI Logic features, including multimodal analysis,...
HALO: Hierarchical Abstraction for Longform Optimization
Jeet Dekivadia
HALO (Hierarchical Abstraction for Longform Optimization) is an MIT-licensed Python package for efficient large-scale video content analysis,...
Facet AI: No-code web platform to democratize small language model fine-tuning
Jet Chiang
Fine-tuning LLMs like Gemma requires deep ML expertise and complex setups, slowing down adoption of SLMs in various industries and communities...
Gemini API Developer Workspace in Postman
Jevon Mao
This project proposes the creation of a comprehensive Postman Workspace tailored for Google’s Gemini API suite. It will offer developers a robust,...
VS Code Extension to assist with coding powered by Gemini
krishnaagrawal
A VS Code (or JetBrains) extension to provide AI-powered coding assistance using Google’s Gemini API. This tool enhances the developer experience by...
Develop a Gemini Workspace in Postman
Lorenzo Drudi
The goal of this project is to create a developer-friendly Postman Workspace for interacting with the Gemini API. This workspace will serve as a...
Gemma Garage: Leveraging Gemma 3 to democratize LLM Fine-tuning
Lucas Martins
This proposal aims to develop the Gemma LLM Garage, a full-stack interface to manage datasets and fine-tune Gemma models. Its main goal is to...
Enhanced Benchmark for Evaluating Intuitive Physics Understanding in Gemma Multimodal Models
lucas-maes
This project aims to develop a more rigorous and focused evaluation testbed than that used by Garrido et al. (2025), with the specific goal of...
ATIA: A BENCHMARK FOR ADVERSARIAL TOOL INFILTRATION IN AGENTS
Matthew Nguyen
As multimodal agents become increasingly integrated into real-world applications, ensuring their safe and reliable tool-use behavior is paramount. We...
Xarray-JAX Integration Library
Mikhail Sinitcyn
This project aims to develop a Python library to support Xarray data (labeled multi-dimensional array library supported by Deepmind) with JAX...
Enhancing Gemini API Integrations in OSS Agents Tools
Muhammad Saad (msaadg)
The Gemini API, known for its multimodal capabilities and efficiency, is a powerful tool that enables advanced AI agent interactions. However, its...
Streamline experiment execution and improve report UI for OSS-Fuzz-Gen
Myan (My Anh) Vu
OSS-Fuzz-Gen, a framework using LLMs for fuzz target generation and evaluation by Google, currently has a basic experiment report UI alongside a...
Creating The First Benchmark for Evaluating LLMs Across All Five Foundational AI Agent Types
Nattaput Namchittai
With the rise in popularity of AI agents, there is an increasing need for agentic benchmarks. There are 5 foundational types of AI agents that many...
SciResearchBench: A Multimodal Benchmark for Scientific Reasoning and Discovery
Nawaf Alampara
Scientific discovery fundamentally relies on integrating and reasoning over multimodal information—text, diagrams, plots, spectra, microscopy images,...
gemini-batcher: A Python package and learning resources for efficient API calls with Gemini LLMs
Phillip Daniel
The aim of this project is to develop a comprehensive learning resource for developers working with the Gemini Python SDK. This would consist of a...
Reproducibility as Accuracy (RaA) Benchmark
Pranav Agrawal
Reproducibility as Accuracy (RaA) is a benchmark which aims to evaluate how effective multimodal AI systems are in preserving information fidelity...
Crisis Response Toolkit for Gemma Models
Rodrigo Sagastegui
This project explores how lightweight Gemma models can be applied in life-critical situations through an open-source Crisis Response Toolkit. The...
Gemma Scout: On-device AI camping & wildlife survival companion
Ryan Rong
About 88 million U.S. households now identify as campers, and ~54 million households took a trip last year National parks run thousands of...
Open-Source Multimodal Benchmarks and Adversarial Robustness Testing for Gemini 2.X models
Saravan_Kumar
This project aims to advance the evaluation framework for Google’s Gemini 2.0 and Gemma 3-27B multimodal models by integrating a diverse set of...
Batch Prediction Framework: Long Context and Context Caching for Video Analysis
Sean Brar
This project develops an efficient framework for analyzing educational video content using the Gemini API. The approach combines optimized batch...
AI Evaluations using Gemini APIs
Siddharth Sahu
Current manual evaluation methods for LLM-based AI applications are unsustainable and resource-intensive. While using LLMs as judges offers a...
Creating New Agent Architectures for Concordia
tesims
The goal of this project is to help strengthen the Concordia framework by developing and open-sourcing a collection of new language model agent...
Enhancing Gemini Integration in Roo Code
Ton Hoang Nguyen (Bill)
This project improved Gemini AI integration in Roo Code VS Code extension, serving 800,000+ developers. Key deliverables included real-time Google...
Open-source Gemini Example Apps
Triyan Mukherjee
The Gemini Cookbook is a set of sample applications and tutorials illustrating different functionalities of the Gemini APIs. The intent of this...
Batch Prediction with Long Context and Context Caching
vanshksingh
This project developed an open-source code sample for batch question answering on long-form transcripts (such as lectures or documentaries) using...
Exploring & Extending Function Calling in Gemma
Vedant Kulkarni
This project aims to: (a) investigate to explore technical possibilities, enhance specifications, and find applications for specific use cases and...
Multimodal Intelligence: Supercharging Agents with Gemini
Wale
This project addresses critical gaps in Gemini API integration across leading agent frameworks (LangChain, LlamaIndex, CrewAI, Composio), where...
Self-Contained OSS-Fuzz Module for Researchers
Zewei Wang
This project aims to develop a standalone Python SDK that provides researchers with a streamlined and well-documented API for interacting with...