agentskit.js
Providers

groq

Groq — ultra-low-latency inference on custom LPU hardware. Llama + Mixtral + Gemma.

import { groq } from '@agentskit/adapters'

const adapter = groq({
  apiKey: process.env.GROQ_API_KEY!,
  model: 'llama-3.3-70b-versatile',
})

#Options

OptionTypeDefault
apiKeystringrequired
modelstringllama-3.3-70b-versatile
baseUrlstringhttps://api.groq.com/openai/v1
retryRetryOptionsinherited

#Capabilities

{ streaming: true, tools: true, usage: true } — Groq exposes a strict OpenAI-compatible surface, so the request shape matches openai({ baseUrl }).

#Why groq

  • Sub-100 ms first-token latency — best for realtime voice + chat.
  • OpenAI-compatible.

#Env

VarPurpose
GROQ_API_KEYAPI key

Explore nearby

✎ Edit this page on GitHub·Found a problem? Open an issue →·How to contribute →

On this page