How do I deploy geminicli2api on Docker?

Use the provided Dockerfile or Docker Compose setup. Set environment variables like GEMINI_AUTH_PASSWORD and GEMINI_CREDENTIALS, then run the container on port 8888 or 7860 for Hugging Face compatibility, as detailed in the README.

Gemini CLI to API proxy vs using Gemini API directly – what's the difference?

The proxy adds an abstraction layer that provides OpenAI-compatible endpoints and simplifies integration, but it introduces extra latency. Direct API usage might be better for performance-critical applications, while the proxy is ideal for adapting existing OpenAI-based tools.

Can I use this with existing OpenAI client code?

Yes, by pointing your OpenAI client to the proxy's base URL and using the authentication password as the API key, as demonstrated in the Python example. It supports both streaming and non-streaming requests.

How does authentication work in geminicli2api?

It supports multiple methods including Bearer tokens, Basic auth, query parameters, and Google headers. Set GEMINI_AUTH_PASSWORD as the key, and use it in requests according to the chosen method for secure access.

Does it support image inputs for multimodal queries?

Yes, it leverages Gemini's multimodal capabilities, allowing you to send image data through the API endpoints. However, the exact format depends on whether you use the native Gemini API or OpenAI-compatible interface.

What are the performance implications of using this proxy?

It adds latency due to the proxy layer and depends on Google's API quotas. For high-volume use, you'll need to monitor performance and potentially scale the proxy server, as it lacks built-in caching or optimization features.

geminicli2api — Gemini to OpenAI API Proxy

What is geminicli2api?

Gemini CLI to API Proxy is a FastAPI-based server that converts Google's Gemini CLI tool into standard API endpoints. It provides both OpenAI-compatible and native Gemini API interfaces, allowing developers to integrate Gemini's AI capabilities into applications using familiar API patterns. The proxy enables access to Google's free Gemini API quota through a standardized interface.

Target Audience

Developers and teams building AI-powered applications who want to use Google's Gemini models through API interfaces compatible with OpenAI's format or direct Gemini API calls.

Value Proposition

It provides a drop-in replacement for OpenAI's API while leveraging Google's free Gemini quota, offers both streaming and multimodal support, and is containerized for easy self-hosting. The proxy eliminates the need to adapt tools specifically for Gemini's CLI, enabling broader integration.

Overview

Gemini CLI to API Proxy is a FastAPI-based server that converts the Gemini CLI tool into standard API endpoints. It enables developers to use Google's free Gemini API quota through familiar OpenAI API interfaces or direct Gemini API calls, making it simple to integrate Gemini capabilities into existing applications.

Key Features

OpenAI-Compatible API — Drop-in replacement for OpenAI's chat completions API, allowing seamless integration with tools expecting OpenAI's format.
Native Gemini API — Direct proxy to Google's Gemini API, supporting all its endpoints and features.
Streaming Support — Real-time streaming responses for both API formats, enabling interactive applications.
Multimodal Support — Handles text and image inputs, leveraging Gemini's multimodal capabilities.
Authentication Flexibility — Supports multiple auth methods including Bearer tokens, Basic auth, query parameters, and Google API headers.
Model Variants — Automatically creates model variants with Google Search grounding and thinking/reasoning controls.
Docker Ready — Containerized for easy deployment and includes Hugging Face Spaces compatibility.

Philosophy

The project aims to bridge the gap between Google's Gemini CLI and the broader ecosystem of AI tools by providing a standardized, easy-to-deploy API layer that maintains compatibility with popular interfaces.

Use Cases

Best For

Integrating Gemini models into applications that expect OpenAI's API format
Self-hosting a proxy to use Google's free Gemini API quota
Adding streaming AI responses to applications using Gemini models
Building multimodal applications with text and image inputs via API
Deploying an AI API proxy on Hugging Face Spaces or Docker containers
Creating a unified API layer for tools that work with both OpenAI and Gemini models

Not Ideal For

Production systems needing high-throughput, low-latency AI inference without proxy overhead
Teams using multiple AI providers beyond Google's Gemini models
Applications requiring advanced API management features like built-in rate limiting or detailed analytics

Pros & Cons

Pros

OpenAI API Compatibility

Acts as a drop-in replacement for OpenAI's chat completions API, enabling seamless integration with existing tools and libraries, as shown in the Python example using the openai client.

Native Gemini Feature Access

Proxies all Gemini API endpoints, allowing use of advanced features like thinking controls and Google Search grounding through model variants like '-search' and '-maxthinking'.

Streaming and Multimodal Support

Supports real-time streaming responses and handles text and image inputs, leveraging Gemini's capabilities for interactive and vision-based applications.

Flexible Deployment Options

Containerized with Docker and configured for Hugging Face Spaces, simplifying setup and scaling across different hosting environments.

Cons

Complex Credential Setup

Requires managing Google OAuth credentials or API keys, which adds initial configuration overhead compared to simpler authentication methods and can be error-prone.

Vendor Lock-in to Google

Tied exclusively to Google's Gemini models, limiting flexibility if you need to switch to or incorporate other AI providers like OpenAI or Anthropic.

Performance Overhead

Adds an extra network hop as a proxy layer, which can increase latency and become a bottleneck in high-demand scenarios without built-in clustering or load balancing.

geminicli2api

What is geminicli2api?

Overview

Key Features

Philosophy

Use Cases

Best For

Not Ideal For

Pros & Cons

Pros

Cons

Frequently Asked Questions

Related Projects

Found a gem we're missing?

geminicli2api

What is geminicli2api?

Overview

Key Features

Philosophy

Use Cases

Best For

Not Ideal For

Pros & Cons

Pros

Cons

Frequently Asked Questions

Related Projects

Found a gem we're missing?