Squeeze every idle hardware resource. Waste not a single token.
Niuma is a compiler-driven AI task orchestration system that runs on resource-constrained hardware. A strong LLM acts as a compiler — it translates natural language tasks into typed, verifiable DAGs once. A fleet of cheap, weak LLMs act as workers — they fill in the blanks, run tests, fix failures, and iterate until every contract is satisfied. The strong model only re-engages as a reviewer, checking the final output.
The core idea: strong models are expensive per token but smart. Weak models are dumb but cheap enough to burn on trial-and-error loops. Niuma splits the work so each model does what it's best at.
Existing AI agent frameworks (AutoGPT, MetaGPT, CrewAI) assume resource abundance — 16 GB RAM minimum, unbounded token budgets, cloud-scale LLMs. Niuma is designed for the opposite: a 2 GB Ubuntu server, a leftover weak-model API subscription, and the belief that good things can come from trash hardware.
To our knowledge, no published multi-agent framework targets machines below 16 GB of RAM. Niuma explores the physical lower bound of an AI agent orchestration kernel.
User: "implement a thread-safe LRU cache"
│
▼
┌─────────────┐
│ compiler.py │ Strong model (once): task → typed DAG JSON
└──────┬──────┘
│ DAG (nodes with contracts + tests)
▼
┌─────────────┐
│ worker.py │ Weak model loop: generate → test → fix → retry
│ (sandbox) │ Subprocess isolation with resource limits
└──────┬──────┘
│ all nodes pass
▼
┌─────────────┐
│ reviewer.py │ Strong model (once): audit → PASS/FAIL
└──────┬──────┘
│ PASS
▼
┌─────────────┐
│ outputs/ │ Final code + metrics
└─────────────┘
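The worker stage in the diagram is a generate → test → fix → retry loop. A minimal sketch of that loop, with illustrative names rather than the actual `worker.py` API:

```python
# Hypothetical sketch of the weak-model retry loop; function and field
# names are illustrative, not Niuma's actual interfaces.
from dataclasses import dataclass
from typing import Callable

@dataclass
class NodeResult:
    code: str
    passed: bool
    attempts: int

def run_node(generate: Callable[[str], str],
             run_tests: Callable[[str], bool],
             task: str,
             max_retries: int = 3) -> NodeResult:
    """Generate code with the weak model, test it in the sandbox,
    and feed failures back until the tests pass or retries run out."""
    feedback = task
    code = ""
    for attempt in range(1, max_retries + 1):
        code = generate(feedback)        # weak model fills the typed blank
        if run_tests(code):              # sandboxed test run
            return NodeResult(code, True, attempt)
        # Append the failing attempt so the next generation can fix it
        feedback = f"{task}\nPrevious attempt failed:\n{code}"
    return NodeResult(code, False, max_retries)
```

The key design point is that the loop burns cheap weak-model tokens on iteration, so the strong model never sees intermediate failures.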
Each DAG node carries a function signature (typed inputs/outputs), a contract (pre/post conditions and invariants), and a test skeleton that the weak model must satisfy. The weak model never needs to understand the overall task — it only fills in typed blanks.
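A node of this shape could be sketched as a dataclass like the one below. The real definition lives in `models.py`; these field names and the example values are assumptions, not the actual schema:

```python
# Illustrative DAG node record; field names are assumptions, not the
# actual models.py dataclass.
from dataclasses import dataclass, field

@dataclass
class DAGNode:
    node_id: str
    signature: dict       # typed inputs/outputs plus target language
    contract: dict        # pre/post conditions and invariants
    test_skeleton: str    # test code the generated function must satisfy
    deps: list = field(default_factory=list)  # upstream node ids

node = DAGNode(
    node_id="lru_get",
    signature={"name": "get", "inputs": {"key": "string"},
               "output": "string | undefined", "language": "typescript"},
    contract={"pre": ["cache initialized"],
              "post": ["key moved to most-recent position"]},
    test_skeleton="expect(cache.get('a')).toBe('1');",
)
```

Because each node is self-describing, the weak model receives only the signature, contract, and test skeleton, never the full task.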
- Python 3.10+
- Node.js 18+ (for TypeScript sandbox execution)
- An LLM API key (OpenAI-compatible endpoint)
git clone https://github.com/ai-dev-dot/niuma.git
cd niuma
./niuma

That's it. The first run auto-installs dependencies, then opens the menu:
==================================================
牛马 Niuma
==================================================
1. 配置强模型 | Configure Strong Model
2. 配置弱模型 | Configure Weak Model
3. 管理项目 | Manage Projects
4. 退出 | Exit
From the menu, pick a provider (9 vendors supported: OpenAI, DeepSeek, Groq, OpenRouter, SiliconFlow, ZhipuAI, DashScope, MiniMax, Moonshot/Kimi), enter your API key, and you're done. Base URL and model list are auto-filled.
python main.py --doctor # Check prerequisites (Python, Node.js, model config)
python main.py --dry-run # Dry-run with mocked APIs to verify pipeline structure
python -m pytest system_tests/ -v # Run the system test suite

niuma/
cli.py # Interactive TUI menu (entry point)
config.py # Strong/weak model configuration management
project_manager.py # Project create/open/delete workflows
compiler.py # Strong model: natural language → typed DAG JSON
worker.py # Weak model: generate → sandbox → fix → retry loop
reviewer.py # Strong model: contract compliance audit
sandbox.py # Subprocess execution with resource limits
llm.py # OpenAI-compatible API client (exponential backoff)
metrics.py # JSONL metrics output
models.py # Shared dataclasses (DAGNode, SandboxResult, ...)
main.py # Pipeline orchestrator (compile → execute → review)
dag_schema.json # JSON Schema for DAG validation
requirements.txt # Python dependencies
niuma # One-command launcher (bash)
niuma.bat # One-command launcher (Windows)
tasks/ # Example task descriptions (internal storage)
system_tests/ # pytest suite (29 tests)
docs/ # Design docs and test plans
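The tree above notes that `llm.py` retries with exponential backoff. A minimal wrapper in that spirit might look like this; the policy shown (base delay, jitter-free doubling, cap) is an assumption, not Niuma's actual implementation:

```python
# Hypothetical exponential-backoff wrapper; retry policy is illustrative.
import time

def with_backoff(call, max_attempts=5, base_delay=1.0, max_delay=30.0,
                 sleep=time.sleep):
    """Retry `call()` on exception, doubling the delay each attempt."""
    delay = base_delay
    for attempt in range(1, max_attempts + 1):
        try:
            return call()
        except Exception:
            if attempt == max_attempts:
                raise                  # out of attempts: surface the error
            sleep(delay)
            delay = min(delay * 2, max_delay)
```

On a cheap weak-model endpoint, rate-limit errors are routine, so the worker loop treats them as just another retry rather than a failure.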
The orchestration kernel is written in Python. Generated code targets TypeScript by default
(Python also supported). The signature.language field on each DAG node selects the runtime.
Adding a new language means implementing a new sandbox runtime handler — the compiler and reviewer
don't need to change.
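A runtime handler reduces to "pick a command line for the language, run it in a resource-limited subprocess." A sketch under that assumption, using POSIX `resource.setrlimit` (the registry and limits below are illustrative, not `sandbox.py`'s API):

```python
# Hypothetical sandbox runtime-handler sketch (POSIX only); the registry
# and limit values are assumptions, not sandbox.py's actual API.
import resource
import subprocess
import sys

def _limit(cpu_seconds=10, mem_bytes=512 * 1024 * 1024):
    """Return a preexec hook that caps CPU time and address space."""
    def preexec():
        resource.setrlimit(resource.RLIMIT_CPU, (cpu_seconds, cpu_seconds))
        resource.setrlimit(resource.RLIMIT_AS, (mem_bytes, mem_bytes))
    return preexec

RUNTIMES = {
    "python": [sys.executable],
    # "typescript": a Node-based runner would go here (invocation omitted)
}

def run_in_sandbox(language: str, source_path: str, timeout: int = 30):
    cmd = RUNTIMES[language] + [source_path]
    return subprocess.run(cmd, capture_output=True, text=True,
                          timeout=timeout, preexec_fn=_limit())
```

Adding a language is then one new entry in the registry plus whatever toolchain the runner needs; the compiler and reviewer stay untouched.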
v0.3.0 — 29 system tests passing. Conversational requirement clarification, git-driven
architecture, and configurable retry limits are all implemented. The compiler → worker → reviewer
loop runs end-to-end with real LLMs. Run ./niuma, configure your models, then create a new task to get started.
- Design doc — DAG node specification, fault recovery, prompt templates, architecture review
- User guide — complete workflow, project management, Git-based review
MIT — see LICENSE.