Together AI

Use Together AI with Blazen — construct the TogetherProvider and call complete().

Together AI is an OpenAI-compatible LLM provider. Construct it like any other Blazen provider — build a list of ChatMessages, call complete() (or stream()), and read back a typed ModelResponse.

At a glance

Provider idtogether
Base URLhttps://api.together.xyz/v1
Default modelmeta-llama/Llama-3.3-70B-Instruct-Turbo
API key env varTOGETHER_API_KEY
AuthAuthorization: Bearer <key>

Set TOGETHER_API_KEY in the environment and Blazen reads it automatically, or pass the key explicitly when you construct the provider.

Capabilities

CapabilitySupported
StreamingYes
Tool callingYes
Structured outputYes
VisionYes
Model listingYes
EmbeddingsYes

Usage

Construct the provider and call complete(). The default model is meta-llama/Llama-3.3-70B-Instruct-Turbo; override it with with_model / model when you need a different one.

use blazen_llm::{Model, ModelRequest, ChatMessage};
use blazen_provider_together::TogetherProvider;

// Reads TOGETHER_API_KEY from the environment, or pass the key to `new`.
let model = TogetherProvider::new(std::env::var("TOGETHER_API_KEY")?);

let resp = model
    .complete(ModelRequest::new(vec![ChatMessage::user("Hello")]))
    .await?;
println!("{}", resp.content.unwrap_or_default());
from blazen import TogetherProvider, ProviderOptions, ChatMessage

# Omit the api_key to read TOGETHER_API_KEY from the environment.
model = TogetherProvider(options=ProviderOptions(api_key="..."))

resp = await model.complete([ChatMessage.user("Hello")])
print(resp.content)
import { TogetherProvider, ChatMessage } from "blazen";

// Omit apiKey to read TOGETHER_API_KEY from the environment.
const model = TogetherProvider.create({ apiKey: "..." });

const resp = await model.complete([ChatMessage.user("Hello")]);
console.log(resp.content);

Streaming

async for chunk in model.stream([ChatMessage.user("Count to five")]):
    print(chunk.delta, end="")
await model.stream([ChatMessage.user("Count to five")], (chunk) => {
  if (chunk.delta) process.stdout.write(chunk.delta);
});

See also