
huggingface

Hugging Face Inference Endpoints + Serverless — run any HF-hosted chat model.

import { huggingface } from '@agentskit/adapters'

const adapter = huggingface({
  apiKey: process.env.HF_TOKEN!,
  model: 'meta-llama/Meta-Llama-3-70B-Instruct',
})

Options

| Option  | Type           | Default                                |
|---------|----------------|----------------------------------------|
| apiKey  | string         | required                               |
| model   | string         | required                               |
| baseUrl | string         | https://api-inference.huggingface.co   |
| fetch   | typeof fetch   | global                                 |
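Putting the options together, a sketch of a fully configured adapter. Only the option names come from the table above; the endpoint URL is a placeholder for your own dedicated Inference Endpoint, and the logging wrapper is just one example of a fetch-compatible function:

```typescript
import { huggingface } from '@agentskit/adapters'

const adapter = huggingface({
  apiKey: process.env.HF_TOKEN!,
  model: 'meta-llama/Meta-Llama-3-70B-Instruct',
  // Placeholder: point this at your own dedicated Inference Endpoint.
  baseUrl: 'https://my-endpoint.endpoints.huggingface.cloud',
  // Any fetch-compatible function works, e.g. one that logs requests.
  fetch: (input, init) => {
    console.log('HF request:', input)
    return fetch(input, init)
  },
})
```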

Env

| Var      | Purpose                               |
|----------|---------------------------------------|
| HF_TOKEN | Read token from hf.co/settings/tokens |

Notes

  • The serverless tier has cold starts: the first request after idle time can take tens of seconds while the model loads. Pin a dedicated Inference Endpoint for predictable production latency.
  • To run open weights locally, see ollama · vllm · llamacpp.
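One way to soften cold starts on the serverless tier is a retrying wrapper passed via the `fetch` option, since the serverless API answers 503 while a model is loading. This is a sketch under that assumption; the function name, retry count, and delays are illustrative choices, not library defaults:

```typescript
// Wrap a fetch-compatible function so that 503 "model loading"
// responses are retried with a growing delay before giving up.
function withColdStartRetry(
  baseFetch: typeof fetch,
  retries = 2,
  delayMs = 1000,
): typeof fetch {
  return async (input, init) => {
    for (let attempt = 0; ; attempt++) {
      const res = await baseFetch(input, init)
      // Return anything that isn't a 503, or the last attempt's response.
      if (res.status !== 503 || attempt >= retries) return res
      await new Promise((r) => setTimeout(r, delayMs * (attempt + 1)))
    }
  }
}
```

You would then pass `withColdStartRetry(fetch)` as the adapter's `fetch` option.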
