Model integration · Gemini

Gemini, without the Google SDK detour.

Request google/gemini-flash-1.5 from the OpenAI client you already ship. One OpenRouter key covers Pro for the heavy work and Flash for the fast, cheap lane.

Start free How to connect

$129/month SaaS. Bring your own model keys. No inference markup.

Three steps to connect.

Add an OpenRouter key

Gemini runs through OpenRouter, the same key that covers Anthropic and Meta models. Paste it in the ProxyLLM dashboard; it is encrypted with AES-256-GCM.

Point your SDK

Set the base URL to https://api.proxyllm.ai/v1 with your ProxyLLM key as the bearer. No Google AI SDK and no Vertex setup.

Request google/ models

google/gemini-pro-1.5 for the heavy work, google/gemini-flash-1.5 for the cheap fast lane. The slash in the model name tells the gateway to dispatch via OpenRouter.

Same client, different model name.

The google/ prefix sends the request through your OpenRouter key.

client.ts

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.proxyllm.ai/v1",
  apiKey: "pk_live_…", // your ProxyLLM key
});

const r = await client.chat.completions.create({
  model: "google/gemini-flash-1.5",
  messages: [{ role: "user", content: "Summarize this thread." }],
});

Codex Hosted · the main feature

Run your AI workloads on your ChatGPT subscription.

ProxyLLM runs OpenAI's Codex for you, signed in with your own ChatGPT account. Your apps call one OpenAI-compatible endpoint and the work bills to your flat plan instead of per-token API pricing.

Get Codex Hosted How it works

$129/month · normal SaaS pricing

Skip the Vertex setup entirely.

Gemini runs on your own OpenRouter key, logged per request, no inference markup. For OpenAI-bound workloads, Codex Hosted bills them to your ChatGPT subscription instead of per-token pricing.

Start free All integrations