All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
0.11.0 - 2026-03-27
55+ PRs merged, 400+ files changed.
Unified tool calling across all providers — the bridge between LLM reasoning and real-world action.
- Single
tools=parameter replacesweb_search=True,x_search=True,code_execution=True. - Three tool shapes:
- Built-in tools (server-side, provider-mapped):
WebSearch(blocked_domains=["reddit.com"]),XSearch(),CodeExecution() - User-defined function tools (dict with
name+parameters):{"name": "get_weather", "parameters": WeatherParams} - Raw passthrough (dict without
name):{"type": "bash_20250124"}— escape hatch for provider-specific tools
- Built-in tools (server-side, provider-mapped):
- New types:
Tool,WebSearch,XSearch,CodeExecution,ToolCall,ToolResult,ToolDefinition - Multi-turn tool use via
ToolResult(Message)— tool results are messages in the conversation array. - Streaming tool calls — tool call deltas accumulate during streaming, reconstructed into
ToolCallobjects after stream exhaustion. - Architecture:
ToolMapperper tool type per provider (parallel toParameterMapper).ToolsMapperdispatches by tool type.ToolSupportconstraint on models declares which built-in tools are supported. - Deprecated param shims —
web_search=Truestill works withDeprecationWarning, removal date 2026-06-07. - Philosophy: Celeste is primitives, not a framework. Tools are a parameter — celeste passes schemas to providers, normalizes responses into
ToolCallobjects, and returns them. It never auto-executes. - PRs: #199, #228, #229, #232, #234, #235
- Files:
src/celeste/tools.py(new), providertools.pyfiles (6 new),tests/integration_tests/text/test_tools.py(new)
Extracted shared protocol implementations from provider-specific code. Providers now inherit protocol base classes and override only what differs.
- ChatCompletions protocol (
protocols/chatcompletions/) — shared/v1/chat/completionsimplementation. Used by: DeepSeek, Groq, HuggingFace, Mistral, Moonshot. - OpenResponses protocol (
protocols/openresponses/) — shared Responses API implementation. Used by: OpenAI, xAI, Ollama. protocol=+base_url=parameters (v0.11.0, #241) — connect to any compatible API (vLLM, Ollama, MiniMax, LocalAI) without a registered provider. Protocol is wire format identity, not company name.- ~2000 lines of provider code deduplicated across 34 files. Provider clients went from ~140 lines to ~10-30 lines.
- Each protocol provides:
client.py,config.py,parameters.py,streaming.py,tools.py. - PRs: #153, #154, #241
- Files:
src/celeste/protocols/(new directory)
Route requests through Google Vertex AI for any supported provider.
GoogleADCauth withproject_idandlocationfields.- All Google providers routable through Vertex AI (text, images, embeddings, video, TTS).
- Cross-provider support: Anthropic, Mistral, and DeepSeek on Vertex AI.
_make_poll_requestfor long-running operations (video generation).google-authmoved to optional[gcp]extra (#133) — saves ~24.5 MB for non-GCP users.- 39 unit tests for URL routing in
test_vertex_routing.py. - PR: #135
- Files:
src/celeste/auth.py, all provider clients
- HuggingFace (v0.10.2, #183) — OpenAI-compatible inference router (
router.huggingface.co), 127+ models across 15 inference providers. Uses ChatCompletions protocol. Credential:HF_TOKEN. - xAI Grok Imagine (v0.9.6, #128) — Image generation (
grok-imagine-image) and video generation (grok-imagine-video) with aspect ratios, num_images, duration, resolution. Async polling for video. - Ollama Images (v0.9.4, #119) — Local image generation via
/api/generate. Flexible parameter validation (pass-through for dynamic providers). AddsNEGATIVE_PROMPTparameter. - OpenResponses + Ollama Text (v0.9.4, #111) —
Provider.OPENRESPONSESfor Responses-compatible APIs,Provider.OLLAMAfor local inference.NoAuthfor local providers,base_url=for custom gateways. - Files:
src/celeste/providers/huggingface/,src/celeste/providers/xai/images/,src/celeste/providers/xai/videos/,src/celeste/providers/ollama/
Anthropic:
- Claude Opus 4.6 (#130) — 64K max tokens, 32K thinking budget
- Claude Sonnet 4.6 (#179) — 64K max tokens, extended thinking
Google:
- Gemini 3.1 Pro Preview (#179) — thinking levels: low/medium/high
- Gemini 3.1 Flash Lite Preview (#213)
OpenAI:
- GPT-5.4, GPT-5.4 Pro (#213) — thinking budget + verbosity
xAI:
- Grok 4.20 Beta (reasoning + non-reasoning) (#213)
BytePlus:
- Seedream 5.0 Lite (#181) — 2K/3K quality tiers, 16 resolution presets
Moonshot:
- Kimi K2.5 — with WebSearch tool support
Removed (no longer available on provider APIs, #243):
- OpenAI: GPT-4, GPT-4 Turbo, GPT-3.5 Turbo
- xAI: Grok 2 Vision 1212
- Groq: Llama 4 Maverick 17B 128E
First-class support for multi-turn conversations across all text providers.
Message(role, content, tool_calls)andRole(USER, ASSISTANT, SYSTEM, DEVELOPER) types.- All text entrypoints accept
messages=parameter for conversation history. - Provider normalization: Anthropic system messages extracted to top-level, Google system_instruction, chat vs responses arrays.
- PR: #111
- Files:
src/celeste/types.py, all text provider clients
web_search=/x_search=/code_execution=deprecated — replaced bytools=[WebSearch()]. Old boolean params work via shims withDeprecationWarninguntil 2026-06-07.google-authmoved to optional extra (#133) —pip install "celeste-ai[gcp]"required for Vertex AI. RaisesMissingDependencyErrorwith install instructions if missing.requestsremoved from runtime deps (#132) — only needed asgoogle-auth[requests]transport.- Deprecated models removed (#243) — GPT-4, GPT-4 Turbo, GPT-3.5 Turbo, Grok 2 Vision 1212, Llama 4 Maverick (all dead on provider APIs).
_parse_contentsignature change (#203) — now(self, response_data) -> Content, no**parameters. Affects custom provider subclasses.FieldMapper.nametype change (#161) — fromStrEnumtoClassVar[StrEnum]. Affects customParameterMappersubclasses.ModalityClientconstructor (#241) — gainedprotocol=andbase_url=;providerbecameProvider | None.
- BlockingPortal thread leak (#239) — sync stream iteration leaked portal thread on exception,
break, or abandonment. Rewrote__iter__as generator withtry/finallyfor guaranteed cleanup. Fixed CI hanging 19 minutes after tests. - SSE stream error detection (#192) — mid-stream errors (Anthropic
overloaded_error, OpenAIserver_error) now raiseStreamEventErrorinstead of being silently discarded. - Streaming HTTP error enrichment (#196) — consistent provider-enriched error messages across streaming and non-streaming paths.
json.loadsstrict mode (#152) — LLM-generated content with literal\nin thinking fields causedInvalid control charactererrors. Fixed withstrict=Falsein all 8 providerparse_outputmethods.UnicodeDecodeErrorin error handler (#171) — binary error bodies (e.g.0xff) escaped the catch block._parse_usagereturn type mismatch (#174) — 3 modality-level overrides returned typed objects instead ofRawUsagedicts, crashing on Gemini/Imagen.- Streaming image edits routed to wrong endpoint (#180) — OpenAI streaming edits sent to
/v1/images/generationsinstead of/v1/images/edits. - Empty/whitespace credentials (#190) — now raises
MissingCredentialsErrorinstead of sending empty API keys. - API key whitespace causing HTTP errors (#116) — strip whitespace from API keys to prevent
LocalProtocolError: Illegal header value. - Null
tool_callsin ChatCompletions streaming — DeepSeek sends"tool_calls": nullinstead of omitting the key..get("tool_calls", [])returnsNonewhen key exists with null value. Fixed withor []. - Pydantic V2.11 deprecation —
tool.model_fieldson instance changed totype(tool).model_fields(class access) in 4 tool mapper files.
extra_body=on all modalities (#126) — pass provider-specific fields into the request body on images, audio, videos, embeddings.extra_headers=on all client methods (#193) —generate,stream,analyze,speak,embed,editall acceptextra_headers: dict[str, str] | None.auth=passthrough on all namespace methods (#146) — previously only async text generation passed it through, silently dropping it elsewhere. Vertex AI auth now works across all domains.UnsupportedParameterWarning(#212) — warns when parameters are silently ignored by a provider instead of failing silently.InvalidToolError(#228) — immediate error for bad tool list items (e.g.,tools=[WebSearch]without(), strings, integers).- WebSearch config field drop warnings (#235) — warns when fields like
blocked_domainsormax_usesaren't supported by the provider. - Base64 serialization (#145) —
ser_json_bytes="base64"on Output/Chunk models ensuresmodel_dump_json()correctly base64-encodes artifact data. - Colab quickstart notebook (#115) —
notebooks/celeste-colab-quickstart.ipynb. - CONTRIBUTING.md rewrite (#131) — accurate workflow documentation.
- FieldMapper base class (#161) — replaces repeated validate-then-set pattern across 45 mapper classes with 2-line declarations.
_content_fieldsClassVar (#160) — replaces 20 identical_build_metadata()overrides with one-line declarations. -215 lines.- Generic
MediaConstraint(#176) — 6 near-identical media constraint classes (~190 lines) replaced with two generic bases. - Stream base class consolidation (#168, #172) —
_parse_chunk_content/_wrap_chunk_contenthooks. 6 text provider streams became empty classes. -521 lines total. ParameterMappergeneric on Content (#189) — PEP 695 generics, end-to-end type safety._transform_outputmoved to base_predict(#187) — prevents silent structured output failures in new providers._json_headers()helper (#162) — replaces verbatim auth + Content-Type header construction across every provider client.- Artifact validator handles base64 (#139) — removed manual
base64.b64decode()across 6 provider clients. - ChatCompletionsTextClient protocol base (#199) — modality-level protocol class that DeepSeek, Groq, HuggingFace, Mistral, Moonshot inherit from.
- Provider-level OpenResponses removed (#229) — OpenAI/xAI/Ollama now use protocol-level
OpenResponsesTextClient. -396 lines.
- +882 lines of new tests across unit and integration suites.
- Integration tests for tools: 17 parametrized runs across 4 providers (Anthropic, OpenAI, Google, xAI) — WebSearch, streaming, function tools, XSearch, multi-turn ToolResult round-trip.
- Vertex AI routing tests: 39 tests covering URL routing for all provider/modality combinations.
- Protocol base URL tests: 185 lines (
test_protocol_base_url.py). - Streaming tests expanded: +246 lines, sync iteration cleanup tests.
- Exception tests:
InvalidToolError,StreamEventError. - Client tests: unsupported parameter warnings, deprecated param shims.
- Constraint tests:
ToolSupportvalidation (5 tests). - Test count: ~478 → 527+.
- @Leo-le-jeune — HuggingFace text generation provider (#183)
- @Seluj78 and @Olaiwonismail — CONTRIBUTING.md rewrite (#131)
- @Alistorm and @XinyueZ — design input on the tool calling architecture (#150)
0.9.1 - 2026-01-17
279 files changed. Monolith rewrite — single package, modality-first architecture.
- DX first: pip install celeste-ai is the only thing most users want to remember. Requiring extras like celeste-ai[text] adds cognitive load and drops adoption.
- Lightweight anyway: importing modalities/providers doesn’t add “weight” in practice. The package stays tiny (under ~2 MB), and 90% of the code is core no matter what.
- No heavyweight vendor SDKs: Celeste uses lightweight HTTP clients, so “bundling providers” doesn’t drag in massive dependencies.
- Faster + cleaner: removing the registry/entry‑points means fewer indirections, no dynamic discovery, and faster startup with clearer behavior.
- Simpler maintenance: one package, one version, one release pipeline. Fewer moving parts and fewer “what do I install?” support tickets.
- Predictable behavior: providers are always available; no hidden runtime failures because a plugin wasn’t installed.
- Better UX in docs: one install step + one API surface; examples “just work.”
- Removed the legacy multi-package workspace and extras-only install paths.
- Now ships as a single PyPI package (
celeste-ai). - Updated dependency and tooling configuration to the monolith layout (tests/coverage/mypy/ruff paths now target
srcandtestsonly). - File updated:
pyproject.toml. - File updated:
Makefile. - Files deleted:
packages/(all legacy capability/provider packages and their tests).
- Added modality-first integration tests (
tests/integration_tests/**) covering generate/edit/analyze/speak and streaming flows per modality. - Added media fixtures for image/audio/video analysis in integration tests.
- Expanded unit tests for namespace routing, modality inference, stream metadata, auth/credentials registry, artifact handling, and IO validation.
- Replaced legacy package-based tests with a unified monolith test layout.
- Files added:
tests/integration_tests/**,tests/testing_guidelines.md,tests/unit_tests/utils/**,tests/__init__.py. - Files updated:
tests/unit_tests/*.py. - Files removed: legacy package-specific test suites under
packages/**.
- CI workflow now targets monolith paths only (
src/celeste,tests) for ruff, mypy, and bandit. - Publish workflow runs required integration tests from
tests/integration_testsand no longer uses package-based change detection. - Build step produces a single package (
uv build) before TestPyPI → PyPI → GitHub release. - Files updated:
.github/workflows/ci.yml,.github/workflows/publish.yml.
- New
extra_bodyparameter lets you pass provider-specific fields or arbitrary JSON into the request body. - This unblocks new provider capabilities before Celeste adds first-class parameter mapping.
- Implemented via deep-merge into the built request payload.
- File updated:
src/celeste/client.py.
- v1 moves from capability-centric clients (one method) to modality-centric clients (one output type). Modality = output type (text/images/audio/video/embeddings), operations become methods (
generate,edit,analyze,embed). - Domain is the resource you work with (e.g., videos), regardless of whether the input is text or media; modality is what comes out (output type). This clarifies why one client can expose multiple operations while still being type-safe.
- Namespaces are domain-first (the resource you work with);
create_clientis modality-first (the output type you want). Routing uses (domain, operation) → modality. - Cross-domain operations are explicit: image/audio/video analysis always routes to the text modality; embeddings always route to the embeddings modality.
- Operations exposed: text
generate/embed, imagesgenerate/edit/analyze, audiospeak/analyze, videosgenerate/analyze. - Files added:
src/celeste/modalities/(text/images/audio/videos/embeddings submodules).
- Domain namespaces (
celeste.text,celeste.images,celeste.audio,celeste.videos) provide one-line calls that route to the correct modality/operation under the hood. - Domain indicates the resource you work with, even if the input is different (e.g., video generation can take text input but stays in the videos domain). Modality indicates the output type (e.g.,
celeste.images.analyze(...)routes to the text modality because analysis returns text). - The namespace pattern is more discoverable (IDE autocomplete) and matches how people think: start from the domain, then pick the action.
- Execution modes are explicit via namespace properties: async by default, with
.syncand.streamfor blocking and streaming workflows. - The factory pattern remains available for explicit configuration and client reuse when you want full control.
- Files added:
src/celeste/namespaces/__init__.py,src/celeste/namespaces/domains.py.
- Added monolith provider layer under
src/celeste/providers/with per-provider config/parameters/streaming modules. - Provider modules include: OpenAI (responses/images/audio/videos), Google (generate_content/imagen/veo/cloud_tts/embeddings/interactions), Anthropic (messages), Cohere/Mistral/DeepSeek/Groq/Moonshot (chat), xAI (responses), ElevenLabs/Gradium (text_to_speech), BFL/BytePlus (images + videos).
- Central provider exports in
providers/__init__.pyfor import/discovery. - Added provider API references doc at
src/celeste/providers/api_references.md. - Files added:
src/celeste/providers/**andsrc/celeste/providers/__init__.py.
- Added code generation templates for contributors to easily add new modalities, providers, or parameters.
- Templates follow the monolithic architecture with proper relative imports and type patterns.
- Modality templates (
templates/modalities/): Full scaffolding for new output types including client, IO types, parameters, streaming, provider implementations, and integration tests. - Provider API templates (
templates/providers/): Scaffolding for new provider HTTP clients including config, client mixin, streaming, and parameter mappers. - Templates include:
client.py,io.py,parameters.py,streaming.py,models.py,config.py, and test files. - Placeholder conventions:
{Modality}(PascalCase),{modality}(lowercase),{Provider},{provider},{Api},{api},{Content}. - Files added:
templates/modalities/**,templates/providers/**.
- Updated public exports to v1 surface, including modalities, operations, clients, structured outputs, and namespace singletons.
- Added modality/provider client map and auto-registered modality models at import time.
- Added capability-to-(modality, operation) translation layer to keep
capabilitysupported with a deprecation warning. create_clientnow supports modality + operation arguments, infers operations when possible, and resolves models with better modality-aware errors.- File updated:
src/celeste/__init__.py.
- Updated Quick Start and multimodal examples to the namespace‑first v1 API.
- Added a Namespace API section and an “Advanced: create_client” section.
- Added “Behavior changes since v0.3.9”.
- Replaced extras installs with a single‑package install (
pip install celeste-ai/uv add celeste-ai). - Updated the PyPI badge and added the v1 beta callout banner.
- File updated:
README.md.
- Set package version to
0.9.1for the public v1 beta. - Updated development status classifier to Beta.
- Removed notebook/scraping-only runtime deps from install requirements (ipykernel, matplotlib, beautifulsoup4).
- File updated:
pyproject.toml.
- Replaced the capability-specific
Clientwith a unifiedModalityClientbase class. - Removed the capability/provider client registry and related lookup helpers.
- Added
modalityas a first-class field and use it for HTTP client selection. - Added
syncandstreamnamespace properties for modality clients. _predictnow takes explicitinputsplus optionalendpointandextra_body, and expects provider_make_requestto return a response data dict (not anhttpx.Response)._streamnow takes explicitinputsand astream_class, passesclient=self, and supportsextra_body+streaming=True.- Added
APIMixin._deep_mergeand extended_build_requestto mergeextra_bodyinto the request body. - Removed capability compatibility validation at init time.
- Updated content types to
TextContent/ genericContentand adjusted_transform_outputaccordingly. - Metadata now includes
modalityand the raw response payload. - File updated:
src/celeste/client.py.
- Renamed
APIKeytoAuthHeader(withsecret,header,prefix), keepingAPIKeyas a backwards-compatible alias. get_auth_classno longer auto-loads providers from entry points; auth types must be registered explicitly.- Replaced static provider maps with dynamic auth registry (
register_auth,get_auth_config). - Added credential fields for
MOONSHOT_API_KEY,DEEPSEEK_API_KEY, andGROQ_API_KEY. get_authnow instantiates custom auth classes or builds anAuthHeaderfrom the registry.has_credentialreturns true for auth-class providers;list_available_providersnow filters by registry + credentials.- Files updated:
src/celeste/auth.py,src/celeste/credentials.py.
- Added
get_bytes()andget_base64()primitives for content access. - Removed
_default_mime_typeandto_data_url()(moved to utilities). - Added
src/celeste/utils/mime.pywithdetect_mime_type()andbuild_image_data_url(). - MIME detection uses
filetypelibrary (magic bytes). - Added JSON serialization for
data: bytesusing base64. - Files updated:
src/celeste/artifacts.py,src/celeste/mime_types.py. - Files added:
src/celeste/utils/.
- Switched shared client registries to use
Modalityinstead ofCapability. - HTTP client now recreates the
httpx.AsyncClientif the event loop changes to avoid "Event loop is closed" errors. - Files updated:
src/celeste/http.py,src/celeste/websocket.py.
- Added sync iteration support via anyio blocking portals (
__iter__,__next__). - Added sync context manager support (
__enter__,__exit__) with portal cleanup. - Added
_build_stream_metadatahook (default: raw events). - Stream exhaustion no longer raises
StreamEmptyErrorwhen no chunks were produced. - Improved cleanup: guard
acloseduring active iteration and suppress close-time runtime errors. - File updated:
src/celeste/streaming.py.
ModelNotFoundErrornow supports modality-based messages.ClientNotFoundErrorexpanded to include modality + operation.- Added
ModalityNotFoundError. - File updated:
src/celeste/exceptions.py.
- Provider
_make_requestnow returns a response data dict; error handling is expected inside provider implementations. - Empty streams no longer raise
StreamEmptyErroron exhaustion. - Auth types are no longer auto-loaded from entry points; they must be registered explicitly.
- Related files:
src/celeste/client.py,src/celeste/streaming.py,src/celeste/artifacts.py,src/celeste/auth.py.