> ## Documentation Index > Fetch the complete documentation index at: https://mintlify.com/pt-act/pi-mono/llms.txt > Use this file to discover all available pages before exploring further. # stream() and complete() > Core streaming and completion functions for generating assistant messages ## Overview The Pi AI toolkit provides four main functions for generating assistant messages: * **`stream()`** - Stream assistant messages with full event control * **`complete()`** - Get complete assistant message without streaming * **`streamSimple()`** - Stream with simplified reasoning options * **`completeSimple()`** - Complete with simplified reasoning options ## stream() Stream an assistant message with granular event handling. ```typescript theme={null} function stream( model: Model, context: Context, options?: ProviderStreamOptions ): AssistantMessageEventStream ``` The model to use for generation. Get models via `getModel(provider, modelId)`. The conversation context including system prompt, messages, and tools. ```typescript theme={null} interface Context { systemPrompt?: string; messages: Message[]; tools?: Tool[]; } ``` Optional provider-specific streaming options. Controls randomness (0.0 to 2.0). Lower is more deterministic. Maximum tokens to generate. Abort signal to cancel the request. API key for the provider. Falls back to environment variables. Preferred transport for providers that support multiple transports. Prompt cache retention preference. Providers map this to their supported values. Session identifier for providers that support session-based caching. Callback for inspecting provider payloads before sending. Custom HTTP headers to include in API requests. Maximum delay in milliseconds to wait for a retry when the server requests a long wait. Optional metadata to include in API requests. Providers extract the fields they understand. An async iterable stream that emits events as the assistant message is generated. Call `.result()` to get the final `AssistantMessage` after streaming completes. ### Example ```typescript theme={null} import { getModel, stream } from '@mariozechner/pi-ai'; const model = getModel('openai', 'gpt-4o-mini'); const s = stream(model, { systemPrompt: 'You are a helpful assistant.', messages: [{ role: 'user', content: 'Hello!' }] }); for await (const event of s) { switch (event.type) { case 'start': console.log(`Starting with ${event.partial.model}`); break; case 'text_delta': process.stdout.write(event.delta); break; case 'thinking_delta': console.log('[Thinking]', event.delta); break; case 'toolcall_end': console.log('Tool:', event.toolCall.name, event.toolCall.arguments); break; case 'done': console.log('\nFinished:', event.reason); break; case 'error': console.error('Error:', event.error.errorMessage); break; } } // Get final message const message = await s.result(); console.log('Tokens:', message.usage.totalTokens); console.log('Cost: $', message.usage.cost.total); ``` ## complete() Get a complete assistant message without streaming. ```typescript theme={null} async function complete( model: Model, context: Context, options?: ProviderStreamOptions ): Promise ``` The model to use for generation. The conversation context. Same options as `stream()`. The complete assistant message. ```typescript theme={null} interface AssistantMessage { role: "assistant"; content: (TextContent | ThinkingContent | ToolCall)[]; api: Api; provider: Provider; model: string; usage: Usage; stopReason: StopReason; errorMessage?: string; timestamp: number; } ``` ### Example ```typescript theme={null} import { getModel, complete } from '@mariozechner/pi-ai'; const model = getModel('anthropic', 'claude-3-5-haiku-20241022'); const response = await complete(model, { messages: [{ role: 'user', content: 'Explain TypeScript in one sentence.' }] }); for (const block of response.content) { if (block.type === 'text') { console.log(block.text); } } console.log(`Cost: $${response.usage.cost.total.toFixed(4)}`); ``` ## streamSimple() Stream with simplified reasoning/thinking options. Maps unified `reasoning` levels to provider-specific parameters. ```typescript theme={null} function streamSimple( model: Model, context: Context, options?: SimpleStreamOptions ): AssistantMessageEventStream ``` Extends `StreamOptions` with reasoning support. Unified thinking level. Automatically maps to provider-specific parameters: * OpenAI: `reasoning_effort` * Anthropic: `thinking_enabled` + `thinking_budget_tokens` * Google: `thinking.enabled` + `thinking.budgetTokens` Custom token budgets for thinking levels (token-based providers only). ```typescript theme={null} interface ThinkingBudgets { minimal?: number; low?: number; medium?: number; high?: number; } ``` ### Example ```typescript theme={null} import { getModel, streamSimple } from '@mariozechner/pi-ai'; const model = getModel('openai', 'gpt-5-mini'); const s = streamSimple(model, { messages: [{ role: 'user', content: 'Solve: 2x + 5 = 13' }] }, { reasoning: 'medium' // Maps to appropriate provider parameter }); for await (const event of s) { if (event.type === 'thinking_delta') { console.log('[Thinking]', event.delta); } else if (event.type === 'text_delta') { process.stdout.write(event.delta); } } ``` ## completeSimple() Get complete response with simplified reasoning options. ```typescript theme={null} async function completeSimple( model: Model, context: Context, options?: SimpleStreamOptions ): Promise ``` Parameters and return type are the same as `streamSimple()` and `complete()`. ### Example ```typescript theme={null} import { getModel, completeSimple } from '@mariozechner/pi-ai'; const model = getModel('anthropic', 'claude-sonnet-4-20250514'); const response = await completeSimple(model, { messages: [{ role: 'user', content: 'Calculate 25 * 18' }] }, { reasoning: 'high' }); for (const block of response.content) { if (block.type === 'thinking') { console.log('Thinking:', block.thinking); } else if (block.type === 'text') { console.log('Answer:', block.text); } } ``` ## Context The `Context` interface represents a conversation's state. ```typescript theme={null} interface Context { systemPrompt?: string; messages: Message[]; tools?: Tool[]; } ``` System-level instructions for the assistant. Conversation history. Can include `UserMessage`, `AssistantMessage`, and `ToolResultMessage`. ```typescript theme={null} type Message = UserMessage | AssistantMessage | ToolResultMessage; interface UserMessage { role: "user"; content: string | (TextContent | ImageContent)[]; timestamp: number; } ``` Available tools for the assistant to call. See [tools documentation](/api/ai/tools). ### Context Serialization Context objects are fully JSON-serializable: ```typescript theme={null} import { Context } from '@mariozechner/pi-ai'; const context: Context = { systemPrompt: 'You are helpful.', messages: [{ role: 'user', content: 'Hello', timestamp: Date.now() }] }; // Serialize const json = JSON.stringify(context); localStorage.setItem('conversation', json); // Deserialize const restored: Context = JSON.parse(localStorage.getItem('conversation')!); ``` ## Events The `AssistantMessageEventStream` emits these event types: Stream begins. Contains initial message structure. Text block starts at the given content index. Text chunk received. `delta` contains the new text. Text block complete. `content` contains the full text. Thinking block starts (for models with reasoning capabilities). Thinking chunk received. Thinking block complete. Tool call begins. Tool arguments streaming. `partial.content[contentIndex].arguments` contains partially parsed JSON. Arguments may be incomplete during `toolcall_delta`. Always check for field existence. Tool call complete. `toolCall` contains the full parsed tool call. ```typescript theme={null} interface ToolCall { type: "toolCall"; id: string; name: string; arguments: Record; thoughtSignature?: string; // Google-specific } ``` Stream complete successfully. `reason` is `"stop"`, `"length"`, or `"toolUse"`. Error occurred. `error` contains partial message and error details. ## Stop Reasons Every `AssistantMessage` has a `stopReason` field: ```typescript theme={null} type StopReason = "stop" | "length" | "toolUse" | "error" | "aborted"; ``` Normal completion - the model finished its response. Output hit the maximum token limit. Model is calling tools and expects tool results. An error occurred during generation. Check `errorMessage` field. Request was cancelled via `AbortSignal`. ## Aborting Requests Use `AbortSignal` to cancel in-progress requests: ```typescript theme={null} import { getModel, stream } from '@mariozechner/pi-ai'; const model = getModel('openai', 'gpt-4o-mini'); const controller = new AbortController(); // Abort after 2 seconds setTimeout(() => controller.abort(), 2000); const s = stream(model, { messages: [{ role: 'user', content: 'Write a long story' }] }, { signal: controller.signal }); for await (const event of s) { if (event.type === 'text_delta') { process.stdout.write(event.delta); } else if (event.type === 'error' && event.reason === 'aborted') { console.log('\nRequest aborted'); } } const response = await s.result(); if (response.stopReason === 'aborted') { console.log('Partial content:', response.content); console.log('Tokens used:', response.usage.totalTokens); } ``` Aborted messages can be added to context and continued: ```typescript theme={null} const context = { messages: [] }; // First request gets aborted const partial = await complete(model, context, { signal: abortSignal }); context.messages.push(partial); // Continue the conversation context.messages.push({ role: 'user', content: 'Please continue' }); const continuation = await complete(model, context); ```