Async Python client for Codex app-server over stdio and websocket.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

emsiak

These details have not been verified by PyPI

Project description

codex-app-server-sdk

High-level async Python client for codex app-server.

It gives you a convenient conversation API over stdio or websocket without having to manage raw protocol events yourself.

Documentation: https://emsi.github.io/codex-app-server-sdk/

Highlights

simple one-shot turns with chat_once(...)
step-streaming turns with chat(...) (thinking, exec, codex, etc.), non-delta
built-in thread/turn lifecycle handling
thread-scoped config + forking via ThreadHandle
inactivity timeout continuation for long-running turns
turn cancellation with unread-step/event drain via cancel(...)
optional low-level request(...) access when needed

Install

Install uv (if needed):

curl -LsSf https://astral.sh/uv/install.sh | sh

Install the package from PyPI:

uv add codex-app-server-sdk

Or pip-compatible install in the active environment:

uv pip install codex-app-server-sdk

Documentation

Docs site: https://emsi.github.io/codex-app-server-sdk/
PyPI: https://pypi.tw.martin98.com/project/codex-app-server-sdk/

Quick start

Stdio

import asyncio
from codex_app_server_sdk import CodexClient


async def main() -> None:
    async with CodexClient.connect_stdio() as client:
        result = await client.chat_once("Hello from Python")
        print(result.final_text)


asyncio.run(main())

By default, stdio transport runs:

command: codex app-server

You can override via:

connect_stdio(command=[...])
environment variable: CODEX_APP_SERVER_CMD

Websocket

import asyncio
from codex_app_server_sdk import CodexClient


async def main() -> None:
    async with CodexClient.connect_websocket() as client:
        result = await client.chat_once("Hello over websocket")
        print(result.final_text)


asyncio.run(main())

Websocket defaults:

URL: CODEX_APP_SERVER_WS_URL or ws://127.0.0.1:8765
Bearer token: CODEX_APP_SERVER_TOKEN (optional)

Continuation on inactivity timeout

Both high-level APIs support resuming the same running turn.

import asyncio
from codex_app_server_sdk import CodexClient, CodexTurnInactiveError


async def main() -> None:
    async with CodexClient.connect_stdio(inactivity_timeout=120.0) as client:
        continuation = None
        while True:
            try:
                if continuation is None:
                    result = await client.chat_once("Do a longer task")
                else:
                    result = await client.chat_once(continuation=continuation)
                print(result.final_text)
                break
            except CodexTurnInactiveError as exc:
                continuation = exc.continuation
                idle = (
                    f"{exc.idle_seconds:.1f}s"
                    if exc.idle_seconds is not None
                    else "unknown"
                )
                print(
                    f"[warn] turn inactive for {idle}; resuming "
                    f"(thread_id={continuation.thread_id}, turn_id={continuation.turn_id})"
                )


asyncio.run(main())

Advanced thread control (cwd/instructions/model/fork)

Use explicit thread handles when you need thread-scoped configuration.

import asyncio
from codex_app_server_sdk import CodexClient, ThreadConfig, TurnOverrides


async def main() -> None:
    async with CodexClient.connect_stdio() as client:
        thread = await client.start_thread(
            ThreadConfig(
                cwd="/home/me/project",
                base_instructions="You are concise.",
                developer_instructions="Prefer rg over grep.",
                model="gpt-5",
            )
        )

        result = await thread.chat_once("Summarize the repo layout.")
        print(result.final_text)

        await thread.update_defaults(ThreadConfig(model="gpt-5.1-codex-mini"))
        forked = await thread.fork(
            overrides=ThreadConfig(
                developer_instructions="Focus on tests first.",
            )
        )

        async for step in forked.chat(
            "Run a quick diagnostics pass.",
            turn_overrides=TurnOverrides(effort="low"),
        ):
            print(step.step_type, step.text)


asyncio.run(main())

Configuration scopes and semantics

CodexClient: connection/session scope (transport, request routing, lifecycle).
ThreadHandle + ThreadConfig: thread scope (cwd, baseInstructions, developerInstructions, model, etc.).
TurnOverrides: per-turn scope (cwd, model, effort, summary, ...).

`UNSET` vs `None`

UNSET (default): omit field from request payload; keep server default/current value.
None: send JSON null explicitly (where protocol allows) to reset/clear.

Example:

from codex_app_server_sdk import ThreadConfig, UNSET

cfg = ThreadConfig(
    model=UNSET,  # omit key
    developer_instructions=None,  # send explicit null
)

Continuation constraints

When resuming with continuation=..., do not pass extra turn-start arguments in that same call. Specifically, do not pass: text, thread_id, user, metadata, thread_config, or turn_overrides.

Apply thread changes via thread.update_defaults(...) or start a new/forked thread before continuing with a new turn.

Example clients

More complete examples are under examples/.

All thread_* examples print lifecycle progress checkpoints by default so long operations are visible. Use --quiet on those scripts for minimal output.

Rich step-stream example (thinking/exec/codex blocks)

Recommended example for step-oriented API and continuation behavior.

Stdio:

uv run python examples/chat_steps_rich.py

Websocket:

uv run python examples/chat_steps_rich.py --transport websocket --url ws://127.0.0.1:8765

With extra payload summaries:

uv run python examples/chat_steps_rich.py --show-data

Cancel timed-out turns instead of auto-resume:

uv run python examples/chat_steps_rich.py --cancel-on-timeout

Common options:

--transport {stdio,websocket}
--cmd "codex app-server" (stdio mode)
--url ws://127.0.0.1:8765 (websocket mode)
--token "$CODEX_APP_SERVER_TOKEN" (websocket mode)
--prompt "..."
--user "..."
--inactivity-timeout 120
--show-data
--cancel-on-timeout

Advanced thread config + fork example

uv run python examples/thread_config_and_fork.py \
  --transport stdio \
  --cwd . \
  --base-instructions "Be concise." \
  --developer-instructions "Prioritize correctness."

Websocket:

uv run python examples/thread_config_and_fork.py \
  --transport websocket \
  --url ws://127.0.0.1:8765

Quiet mode:

uv run python examples/thread_config_and_fork.py --quiet

Resume-by-id example

uv run python examples/thread_resume_by_id.py \
  --transport stdio \
  --thread-id <existing-thread-id> \
  --prompt "Continue the previous conversation."

Quiet mode:

uv run python examples/thread_resume_by_id.py --thread-id <existing-thread-id> --quiet

Concurrent thread handles example

This example starts two new threads and runs turns concurrently on those fresh ThreadHandles over one shared client connection (it does not call thread/resume for the newly started threads).

uv run python examples/thread_concurrent_handles.py --transport stdio

Quiet mode:

uv run python examples/thread_concurrent_handles.py --quiet

Thread/model/config ops showcase

This example uses the newly exposed helper APIs:

thread/read, thread/list, thread/name/set, thread/archive
model/list
config/read
endpoint-aware summaries with explicit <not-provided> / null values
optional thread model update reporting with --set-model
config/read prints origin_entries: count of config keys that include provenance metadata (which layer/file provided that effective value)

uv run python examples/thread_ops_showcase.py \
  --transport stdio \
  --prompt "Give a 3-bullet summary." \
  --thread-name "showcase-thread"

Websocket:

uv run python examples/thread_ops_showcase.py \
  --transport websocket \
  --url ws://127.0.0.1:8765

Show model update intent and before/after thread snapshot model visibility:

uv run python examples/thread_ops_showcase.py --set-model gpt-5.3-codex

With raw payload dumps:

uv run python examples/thread_ops_showcase.py --show-data

Quiet mode:

uv run python examples/thread_ops_showcase.py --quiet

Stdio example (multi-turn, one thread)

uv run python examples/chat_session_stdio.py

Custom command and prompts:

uv run python examples/chat_session_stdio.py \
  --cmd "codex app-server" \
  --prompt "First prompt" \
  --prompt "Second prompt"

Websocket example (multi-turn, one thread)

uv run python examples/chat_session_websocket.py

With explicit endpoint/token:

uv run python examples/chat_session_websocket.py \
  --url ws://127.0.0.1:8765 \
  --token "$CODEX_APP_SERVER_TOKEN"

Or via environment:

export CODEX_APP_SERVER_WS_URL=ws://127.0.0.1:8765
export CODEX_APP_SERVER_TOKEN=your-token
uv run python examples/chat_session_websocket.py

API reference (quick)

`CodexClient` (`src/codex_app_server_sdk/client.py`)

connect_stdio(...): create a stdio-configured client (unstarted).
connect_websocket(...): create a websocket-configured client (unstarted).
start(): connect transport and start receive loop (idempotent).
initialize(params=None, timeout=None): perform JSON-RPC initialize handshake with default-merged params (protocolVersion, clientInfo, capabilities) and return normalized InitializeResult.
request(method, params=None, timeout=None): low-level JSON-RPC request helper.
start_thread(config=None): create thread and return ThreadHandle.
resume_thread(thread_id, overrides=None): resume thread and return ThreadHandle.
fork_thread(thread_id, overrides=None): fork thread and return ThreadHandle.
set_thread_defaults(thread_id, overrides): apply thread-level overrides via thread/resume.
read_thread(thread_id, include_turns=True): read one thread.
list_threads(...): list threads with optional filters.
set_thread_name(thread_id, name): rename thread.
archive_thread(thread_id) / unarchive_thread(thread_id): archive lifecycle controls.
rollback_thread(thread_id, num_turns=...): drop recent turns from thread history.
compact_thread(thread_id): request context compaction.
chat(...) (text=None, thread_id=None, user=None, metadata=None, thread_config=None, turn_overrides=None, inactivity_timeout=None, continuation=None): async iterator yielding completed non-delta step blocks.
chat_once(...) (text=None, thread_id=None, user=None, metadata=None, thread_config=None, turn_overrides=None, inactivity_timeout=None, continuation=None): send one user message and wait for completed turn.
cancel(continuation, timeout=None): interrupt running turn, return unread steps/events, and clean turn state.
steer_turn(thread_id=..., expected_turn_id=..., input_items=...): steer active turn input.
start_review(thread_id=..., target=..., delivery=None): run review mode.
list_models(...): discover available models.
exec_command(command, ...): run one command via server command API.
read_config(...), read_config_requirements(), write_config_value(...), batch_write_config(...): config APIs.
interrupt_turn(turn_id, timeout=None): low-level turn interruption request.
close(): cancel receive loop and close transport.

`Transport` and implementations (`src/codex_app_server_sdk/transport.py`)

Transport.connect/send/recv/close: abstract interface.
StdioTransport: line-delimited JSON over subprocess stdin/stdout.
WebSocketTransport: JSON messages over websocket frames.

Data models (`src/codex_app_server_sdk/models.py`)

InitializeResult: parsed initialize response (protocol_version, server_info, capabilities, raw).
ConversationStep: completed step from chat(...) (step_type, item_type, text, item_id, thread_id, turn_id, data).
ChatResult: buffered turn output (thread_id, turn_id, final_text, raw_events, assistant_item_id, completion_source).
ChatContinuation: continuation token for timed-out running turns (thread_id, turn_id, cursor, mode).
CancelResult: cancellation result with unread steps/raw_events plus terminal flags.
ThreadConfig: thread-level config for thread/start, thread/resume, thread/fork (cwd, base_instructions, developer_instructions, model, ...).
TurnOverrides: per-turn overrides forwarded to turn/start (cwd, model, effort, ...).
UNSET: sentinel for “omit this field from request payload.”
ApprovalPolicy: literal type for approval policy values (untrusted, on-failure, on-request, never).

`ThreadHandle` (`src/codex_app_server_sdk/client.py`)

thread_id: bound thread id.
defaults: local thread config snapshot.
chat_once(...): convenience one-turn call bound to this thread.
chat(...): step-streaming call bound to this thread.
update_defaults(overrides): apply thread defaults between messages.
fork(overrides=None): fork thread and get a new handle.
read(include_turns=True): low-level thread/read helper.
set_name(name), archive(), unarchive(), rollback(num_turns), compact(): thread lifecycle/history helpers.
start_review(target, delivery=None): thread-bound review API.

Exceptions (`src/codex_app_server_sdk/errors.py`)

CodexError: base exception.
CodexTransportError: transport/connectivity problems.
CodexTimeoutError: request timeout (and base for timeout-related flow).
CodexTurnInactiveError: per-turn inactivity timeout with resumable continuation.
CodexProtocolError: protocol/JSON-RPC error (optional code and data).

Behavior notes

This version does not expose token-delta streaming as a public API.
chat(...) provides async streaming of completed step blocks (non-delta) from live item/completed notifications only.
chat(...) intentionally does not merge thread/read snapshot items for the same turn, avoiding duplicate blocks when snapshot item IDs differ from live event item IDs.
chat_once(...) resolves final text from completed agentMessage items (item/completed), with thread/read(includeTurns=true) fallback.
turn_timeout is intentionally removed to avoid conflicting timeout semantics.
Turn waits are controlled by inactivity_timeout (or unbounded when None).
cancel(...) interrupts a continuation turn, returns unread buffered data, and cleans internal session state so the same thread can be reused safely.
Advanced thread-level config/fork uses protocol v2 methods (thread/start, thread/resume, thread/fork) exposed via ThreadHandle and ThreadConfig.
metadata is applied on turn/start payloads for message turns; thread-level config uses schema-aligned fields on thread methods.
preferred lifecycle is async with CodexClient.connect_*() as client:; manual start()/close() remains available for advanced control.
The client uses modern thread/turn methods (thread/start, thread/resume, turn/start, turn/interrupt).
initialize currently sends protocolVersion: "1" as handshake metadata.
Websocket transport targets websockets (>=16,<17), uses additional_headers, and disables compression by default (compression=None) for codex app-server compatibility.
After dependency changes, run uv sync to refresh the virtual environment.

Initialize handshake (`initialize()`)

initialize() performs the protocol handshake and returns InitializeResult.

chat_once(...) and chat(...) call initialize() automatically on first use.
call initialize() explicitly when you want to fail fast before first turn, inspect server metadata, or send custom init params.

Default initialize payload

When params=None, the client sends:

{
  "protocolVersion": "1",
  "clientInfo": {
    "name": "codex-app-server-sdk",
    "version": "0.1.0"
  },
  "capabilities": {
    "optOutNotificationMethods": [
      "codex/event/agent_message_content_delta",
      "codex/event/reasoning_content_delta",
      "codex/event/item_started",
      "codex/event/item_completed",
      "codex/event/task_started",
      "codex/event/task_complete"
    ]
  }
}

Custom init params (`initialize(params=...)`)

Supported/customizable keys:

protocolVersion: str
clientInfo: dict (commonly name, version, plus optional extra fields)
capabilities: dict
capabilities.optOutNotificationMethods: list[str]
any additional top-level keys are passed through unchanged

Merge rules:

the payload starts from the default block above;
caller params are shallow-merged at top level;
if caller provides capabilities as a dict and omits optOutNotificationMethods, defaults are auto-injected;
if caller provides capabilities.optOutNotificationMethods, caller value is preserved;
if caller sets capabilities to None or a non-dict value, no injection is applied.

`InitializeResult` fields

protocol_version: extracted from protocolVersion or protocol_version in server result
server_info: extracted from serverInfo or server_info
capabilities: extracted from capabilities
raw: full raw initialize result payload

Example: explicit initialize

import asyncio
from codex_app_server_sdk import CodexClient


async def main() -> None:
    async with CodexClient.connect_stdio() as client:
        init = await client.initialize(
            {
                "clientInfo": {
                    "name": "my-client",
                    "version": "0.3.0",
                },
                "capabilities": {
                    "optOutNotificationMethods": [
                        "codex/event/agent_message_content_delta"
                    ]
                },
            }
        )
        print(init.protocol_version)


asyncio.run(main())

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

emsiak

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.3.2

Feb 26, 2026

0.3.1

Feb 24, 2026

0.3.0

Feb 24, 2026

0.2.1

Feb 24, 2026

0.2.0

Feb 24, 2026

0.1.0

Feb 23, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

codex_app_server_sdk-0.3.2.tar.gz (139.2 kB view details)

Uploaded Feb 26, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

codex_app_server_sdk-0.3.2-py3-none-any.whl (55.6 kB view details)

Uploaded Feb 26, 2026 Python 3

File details

Details for the file codex_app_server_sdk-0.3.2.tar.gz.

File metadata

Download URL: codex_app_server_sdk-0.3.2.tar.gz
Upload date: Feb 26, 2026
Size: 139.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for codex_app_server_sdk-0.3.2.tar.gz
Algorithm	Hash digest
SHA256	`ab21a374922ad8d5d29904c89453edb9c0bccd69719ed06c7ec7c66e82bdbbd4`
MD5	`b0b1e6605c677e804b49b481978e0a38`
BLAKE2b-256	`cbeb2d1dee1027b639afded260f5557eda874c9876461c80e65fb817b5e8eb3a`

See more details on using hashes here.

Provenance

The following attestation bundles were made for codex_app_server_sdk-0.3.2.tar.gz:

Publisher: publish.yml on emsi/codex-app-server-sdk

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: codex_app_server_sdk-0.3.2.tar.gz
- Subject digest: ab21a374922ad8d5d29904c89453edb9c0bccd69719ed06c7ec7c66e82bdbbd4
- Sigstore transparency entry: 1000063558
- Sigstore integration time: Feb 26, 2026
Source repository:
- Permalink: emsi/codex-app-server-sdk@0257ccf7fe7fe91d0d7d8d9bd66c07ce2e45f99e
- Branch / Tag: refs/tags/v0.3.2
- Owner: https://github.com/emsi
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@0257ccf7fe7fe91d0d7d8d9bd66c07ce2e45f99e
- Trigger Event: push

File details

Details for the file codex_app_server_sdk-0.3.2-py3-none-any.whl.

File metadata

Download URL: codex_app_server_sdk-0.3.2-py3-none-any.whl
Upload date: Feb 26, 2026
Size: 55.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for codex_app_server_sdk-0.3.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f3c46357401d7c1b7a88ec421209ddbefe6b9631bf319ede0bf9899a9d7a3fd4`
MD5	`6bde779c4be57633a181933db11ade1a`
BLAKE2b-256	`94059f7c0329f086a25d4528944e02517c2ab36f434f4b8eb4d3408e1fbe3f9a`

See more details on using hashes here.

Provenance

The following attestation bundles were made for codex_app_server_sdk-0.3.2-py3-none-any.whl:

Publisher: publish.yml on emsi/codex-app-server-sdk

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: codex_app_server_sdk-0.3.2-py3-none-any.whl
- Subject digest: f3c46357401d7c1b7a88ec421209ddbefe6b9631bf319ede0bf9899a9d7a3fd4
- Sigstore transparency entry: 1000063605
- Sigstore integration time: Feb 26, 2026
Source repository:
- Permalink: emsi/codex-app-server-sdk@0257ccf7fe7fe91d0d7d8d9bd66c07ce2e45f99e
- Branch / Tag: refs/tags/v0.3.2
- Owner: https://github.com/emsi
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@0257ccf7fe7fe91d0d7d8d9bd66c07ce2e45f99e
- Trigger Event: push

codex-app-server-sdk 0.3.2

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

codex-app-server-sdk

Highlights

Install

Documentation

Quick start

Stdio

Websocket

Continuation on inactivity timeout

Advanced thread control (cwd/instructions/model/fork)

Configuration scopes and semantics

UNSET vs None

Continuation constraints

Example clients

Rich step-stream example (thinking/exec/codex blocks)

Advanced thread config + fork example

Resume-by-id example

Concurrent thread handles example

Thread/model/config ops showcase

Stdio example (multi-turn, one thread)

Websocket example (multi-turn, one thread)

API reference (quick)

CodexClient (src/codex_app_server_sdk/client.py)

Transport and implementations (src/codex_app_server_sdk/transport.py)

Data models (src/codex_app_server_sdk/models.py)

ThreadHandle (src/codex_app_server_sdk/client.py)

Exceptions (src/codex_app_server_sdk/errors.py)

Behavior notes

Initialize handshake (initialize())

Default initialize payload

Custom init params (initialize(params=...))

InitializeResult fields

Example: explicit initialize

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

`UNSET` vs `None`

`CodexClient` (`src/codex_app_server_sdk/client.py`)

`Transport` and implementations (`src/codex_app_server_sdk/transport.py`)

Data models (`src/codex_app_server_sdk/models.py`)

`ThreadHandle` (`src/codex_app_server_sdk/client.py`)

Exceptions (`src/codex_app_server_sdk/errors.py`)

Initialize handshake (`initialize()`)

Custom init params (`initialize(params=...)`)

`InitializeResult` fields