Using the base-model API
These are base models — raw next-token prediction, no chat template, no instruction tuning — served over an OpenAI-compatible /v1/completions endpoint.
What you get
- Three base models — a small one for quick tests plus two larger ones (see Models). Some are kept warm; others cold-start on first use.
- Real next-token access — arbitrary prefill/continuation,
logprobsandprompt_logprobsfor likelihood/surprisal and interpretability work,echo, and SSE streaming. - Full, honored sampling controls —
temperature,top_p,top_k,min_p, penalties, and a respectedseedfor reproducibility. - A reliable, strict API — standard OpenAI-compatible
/v1/completions, with strict parameter validation (a typo'd parameter fails loudly instead of silently defaulting) and clear, structured JSON errors. - Browser Workbench — try prompts and manage API keys without writing code.
- Per-key budgets & usage — set token caps per key and track spend (see Account).
- Community — a Discord with #feedback and #bug-reports, plus a one-click Feedback button in the Workbench.
Two ways in
- Workbench — prompt the models straight from your browser. Good for getting a feel before you write any code.
- The API (below) — for anything programmatic. Create a key from your dashboard.
Quick start
Create a key in your dashboard, then point any OpenAI-compatible client at the API:
export ACS_API_KEY="acs-bm-..." # your key
export ACS_API_BASE="https://base-models.acsresearch.org/v1"
A first request with curl:
curl -s "$ACS_API_BASE/completions" \
-H "Authorization: Bearer $ACS_API_KEY" \
-H "Content-Type: application/json" \
-d '{"model": "llama-8b", "prompt": "The capital of France is", "max_tokens": 8}'
Or with the Python SDK (pip install openai):
import os
from openai import OpenAI
client = OpenAI(
base_url=os.environ["ACS_API_BASE"],
api_key=os.environ["ACS_API_KEY"],
)
resp = client.completions.create( # completions — not chat.completions
model="llama-8b",
prompt="The capital of France is",
max_tokens=16,
logprobs=5,
)
print(resp.choices[0].text)
Worked examples
Short, copy-pasteable end-to-end snippets for each feature live under Examples — one page per topic. The curl examples assume ACS_API_KEY + ACS_API_BASE are exported (see Quick start); the Python examples use the same openai SDK client as above.
Start with logprobs if you're new — it's the smallest end-to-end example. The harder ones (with the most gotchas) are prompt_logprobs and Cold-boot recovery.