{ "text": "Dear Alice,\n\nWelcome to Tracia! We're thrilled to have you join our community...", "spanId": "sp_abc123xyz", "traceId": "tr_session789", "promptVersion": 3, "latencyMs": 1250, "usage": { "inputTokens": 45, "outputTokens": 120, "totalTokens": 165 }, "cost": 0.0049, "finishReason": "stop", "toolCalls": null, "structuredOutput": null}
Run a prompt with variable substitution and get the generated response. This endpoint handles template rendering, makes the LLM API call, and automatically logs a span.
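A minimal sketch of calling this endpoint from TypeScript, assuming a POST route such as `/v1/prompts/run`, bearer-token auth, and request fields named `prompt` and `variables`; those names are placeholders, while the response fields mirror the example response above.

```typescript
// Sketch only: the endpoint path, auth header, and request field names
// ("prompt", "variables") are assumptions, not confirmed by this reference.
interface RunPromptResponse {
  text: string;
  spanId: string;
  traceId: string;
  promptVersion: number;
  latencyMs: number;
  usage: { inputTokens: number; outputTokens: number; totalTokens: number };
  cost: number;
  finishReason: string;
  toolCalls: unknown;        // null in the example response above
  structuredOutput: unknown; // null in the example response above
}

async function runPrompt(
  prompt: string,
  variables: Record<string, string>,
): Promise<RunPromptResponse> {
  const res = await fetch("https://api.tracia.example/v1/prompts/run", {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      Authorization: `Bearer ${process.env.TRACIA_API_KEY ?? ""}`,
    },
    // Variables are substituted into the prompt template server-side.
    body: JSON.stringify({ prompt, variables }),
  });
  if (!res.ok) throw new Error(`Run failed: ${res.status}`);
  return (await res.json()) as RunPromptResponse;
}

// Example: render a welcome-email prompt for Alice (prompt name is illustrative).
runPrompt("welcome-email", { name: "Alice" }).then((r) =>
  console.log(r.text, r.usage.totalTokens),
);
```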
Full conversation messages for multi-turn tool calling. When provided, template rendering is skipped and these messages are sent directly to the LLM. Each message has a role (system/developer/user/assistant/tool), content, and optionally a toolCallId/toolName for tool result messages.
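A sketch of such a messages array when returning a tool result: the role values and the toolCallId/toolName fields follow the description above, while the concrete tool, ids, and contents are made up for illustration.

```typescript
// Message shape as described above; the type names are illustrative.
type Role = "system" | "developer" | "user" | "assistant" | "tool";

interface Message {
  role: Role;
  content: string;
  toolCallId?: string; // only on tool result messages
  toolName?: string;   // only on tool result messages
}

// Multi-turn tool calling: the assistant asked for a tool, we append the
// tool's result, and send the whole history back. Template rendering is
// skipped because messages are provided.
const messages: Message[] = [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "What's the weather in Paris?" },
  { role: "assistant", content: "Let me check the weather for you." },
  {
    role: "tool",
    content: JSON.stringify({ tempC: 18, sky: "cloudy" }),
    toolCallId: "call_123",   // id echoed from the assistant's tool call
    toolName: "get_weather",
  },
];
```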
Full conversation messages (rendered input + assistant response) for multi-turn continuation. Pass these back in the next request’s messages field to continue the conversation.
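A sketch of the continuation loop under the same assumptions as the earlier sketch (hypothetical endpoint path and auth), reusing the Message shape defined above and assuming the response carries the conversation in a `messages` field as described.

```typescript
// Continuation sketch: send the previously returned messages plus a new
// user turn, then keep the messages from the new response for the next turn.
async function continueConversation(
  previousMessages: Message[],
  userReply: string,
): Promise<Message[]> {
  const res = await fetch("https://api.tracia.example/v1/prompts/run", {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      Authorization: `Bearer ${process.env.TRACIA_API_KEY ?? ""}`,
    },
    body: JSON.stringify({
      messages: [...previousMessages, { role: "user", content: userReply }],
    }),
  });
  if (!res.ok) throw new Error(`Run failed: ${res.status}`);
  const data = (await res.json()) as { messages: Message[] };
  return data.messages; // rendered input + assistant response, ready for the next call
}
```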