Hardware Requirements

Sagewai runs anywhere from a laptop to a multi-node cluster. This guide covers system requirements for every deployment scenario.

Deployment Profiles

| Profile | RAM | CPU | Disk | GPU | Use Case |
|---|---|---|---|---|---|
| SDK Only | 512 MB | Any 2-core | 500 MB | None | pip install + cloud APIs |
| Lightweight Dev | 2 GB | 2-core | 2 GB | None | Postgres + Redis only |
| Full Dev Stack | 8 GB+ | 4-core | 15 GB | None | All infrastructure services |
| Production | 16 GB+ | 8-core | 50 GB+ | Optional | Self-hosted with observability |
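
Before picking a profile, it can help to check the current machine against these thresholds. The sketch below is not part of the Sagewai SDK; the `PROFILES` dict just encodes the table above, and the RAM probe uses POSIX `sysconf`, so it returns 0 on platforms where that is unavailable.

```python
import os
import shutil

# Minimum thresholds per deployment profile, taken from the table above
# (RAM and disk in MB, CPU in cores).
PROFILES = {
    "sdk-only":        {"ram_mb": 512,   "cores": 2, "disk_mb": 500},
    "lightweight-dev": {"ram_mb": 2048,  "cores": 2, "disk_mb": 2048},
    "full-dev":        {"ram_mb": 8192,  "cores": 4, "disk_mb": 15360},
    "production":      {"ram_mb": 16384, "cores": 8, "disk_mb": 51200},
}

def total_ram_mb() -> int:
    """Total physical RAM in MB (POSIX only; returns 0 when unknown)."""
    try:
        return os.sysconf("SC_PAGE_SIZE") * os.sysconf("SC_PHYS_PAGES") // 2**20
    except (ValueError, OSError, AttributeError):
        return 0

def unmet_requirements(profile: str, path: str = ".") -> list:
    """List the requirements the current machine fails for a profile."""
    req = PROFILES[profile]
    problems = []
    ram = total_ram_mb()
    if ram and ram < req["ram_mb"]:
        problems.append(f"RAM: have {ram} MB, need {req['ram_mb']} MB")
    cores = os.cpu_count() or 0
    if cores < req["cores"]:
        problems.append(f"CPU: have {cores} cores, need {req['cores']}")
    free_mb = shutil.disk_usage(path).free // 2**20
    if free_mb < req["disk_mb"]:
        problems.append(f"Disk: have {free_mb} MB free, need {req['disk_mb']} MB")
    return problems
```

An empty list from `unmet_requirements("full-dev")` means the machine clears the Full Dev Stack row.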

Per-Service Memory Breakdown

| Service | Memory | Purpose |
|---|---|---|
| PostgreSQL 15 | ~200 MB | State, workflows, fleet, audit |
| Redis 7 | ~50 MB | Cache, sessions |
| Milvus 2.3 (etcd + MinIO + standalone) | ~2 GB | Vector embeddings for RAG |
| NebulaGraph 3.6 (metad + storaged + graphd) | ~1.5 GB | Knowledge graph, relations |
| Observability (Grafana + Prometheus + OTel + Loki + Tempo) | ~1.5 GB | Dashboards, metrics, tracing |
| LocalStack | ~300 MB | S3-compatible archive (dev) |
| Total (full stack) | ~5.5 GB | |
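
Since services can be enabled independently, the memory budget for any subset is just the sum of the rows above. A small sketch (service keys are illustrative shorthand, with the GB figures rounded to binary MB):

```python
# Per-service memory figures (MB) from the table above.
SERVICE_MEMORY_MB = {
    "postgresql": 200,
    "redis": 50,
    "milvus": 2048,
    "nebulagraph": 1536,
    "observability": 1536,
    "localstack": 300,
}

def stack_memory_mb(services) -> int:
    """Estimated resident memory for a chosen subset of services."""
    return sum(SERVICE_MEMORY_MB[s] for s in services)

lightweight = stack_memory_mb(["postgresql", "redis"])  # 250 MB
full_stack = stack_memory_mb(SERVICE_MEMORY_MB)         # 5670 MB, i.e. ~5.5 GB
```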

GPU Requirements for Local Inference

| Task | VRAM | Recommended GPU | CPU Alternative |
|---|---|---|---|
| Ollama 7B (q4_K_M) | 4 GB | RTX 3060 | Yes, ~10 tok/s |
| Ollama 13B (q4_K_M) | 8 GB | RTX 3070 | Yes, ~5 tok/s |
| Ollama 70B (q4_K_M) | 40 GB | A100 / 2x RTX 3090 | Impractical |
| vLLM serving (7B-70B) | 8-80 GB | NVIDIA A10G+ | No |
| Unsloth fine-tune 7B | 6 GB | RTX 3060+ | Very slow |
| Unsloth fine-tune 13B | 12 GB | RTX 3090+ | Impractical |
| Sentence-transformers | 512 MB | Any | Yes (default) |
| GLiNER NER | 512 MB | Any | Yes (default) |
| faster-whisper (base) | 1 GB | Any | Yes |

A GPU is needed only for local inference. If you use cloud APIs (OpenAI, Anthropic, Google), no GPU is required at all.
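
The VRAM column follows roughly from parameter count times quantized bits per weight. The estimator below is a back-of-the-envelope sketch: the ~4.5 effective bits per weight for q4_K_M and the flat 0.5 GB allowance for KV cache and activations are assumptions, not measured values.

```python
def quantized_vram_gb(params_billion: float,
                      bits_per_weight: float = 4.5,
                      overhead_gb: float = 0.5) -> float:
    """Weight memory for a quantized model plus a flat allowance for
    KV cache and activations. Both defaults are rough assumptions."""
    weights_gb = params_billion * bits_per_weight / 8
    return round(weights_gb + overhead_gb, 1)

quantized_vram_gb(7)    # 4.4 — in line with the ~4 GB row for 7B q4_K_M
quantized_vram_gb(13)   # 7.8 — in line with the ~8 GB row
quantized_vram_gb(70)   # 39.9 — in line with the 40 GB row
```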

Disk Usage Reference

| Component | Size |
|---|---|
| SDK + core dependencies | ~500 MB |
| Intelligence extras (torch, transformers) | ~5 GB |
| Ollama model weights | 4-40 GB per model |
| Milvus data (per 1M vectors, 1536-dim) | ~6 GB |
| Container images (full stack) | ~8 GB |
| NebulaGraph data | ~500 MB per 1M edges |
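
The Milvus figure is just raw float32 embedding storage: 1M vectors × 1536 dimensions × 4 bytes ≈ 6 GB, before index and metadata overhead. To size other corpora:

```python
def vector_storage_gb(n_vectors: int, dim: int, bytes_per_value: int = 4) -> float:
    """Raw float32 embedding storage in GB, before index/metadata overhead."""
    return n_vectors * dim * bytes_per_value / 1e9

vector_storage_gb(1_000_000, 1536)  # 6.144 — the ~6 GB per 1M vectors above
```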

Cloud Instance Recommendations

| Profile | AWS | GCP | Azure |
|---|---|---|---|
| SDK Only | t3.small | e2-small | B2s |
| Lightweight Dev | t3.medium | e2-medium | B2ms |
| Full Dev Stack | m5.xlarge | e2-standard-4 | D4s_v5 |
| Production (no GPU) | m5.2xlarge | e2-standard-8 | D8s_v5 |
| Production (GPU) | g5.xlarge (A10G) | g2-standard-4 (L4) | NC6s_v3 (V100) |

Apple Silicon Notes

Sagewai runs natively on Apple Silicon (M1/M2/M3/M4) Macs:

  • Ollama uses Metal GPU acceleration automatically — no configuration needed
  • Sentence-transformers and GLiNER work on CPU (MPS support varies)
  • Docker runs via Docker Desktop or OrbStack with Rosetta 2 emulation for x86 images
  • Recommended: MacBook Pro M2+ with 16 GB unified memory for full dev stack
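
When a setup step depends on native-versus-emulated execution (e.g. deciding whether to pull arm64 container images), a quick check from Python works: an interpreter running under Rosetta 2 reports an x86_64 architecture, so this also distinguishes native from emulated installs.

```python
import platform
import sys

def is_apple_silicon() -> bool:
    """True when Python is running natively on an Apple Silicon Mac.
    Under Rosetta 2, platform.machine() reports "x86_64" instead."""
    return sys.platform == "darwin" and platform.machine() == "arm64"
```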

Minimum Requirements Summary

At minimum, you need Python 3.10+ and 512 MB of free RAM. Everything else is optional and scales with your needs. Sagewai degrades gracefully — without Milvus, it uses in-memory vectors; without NebulaGraph, in-memory graphs; without Redis, direct database queries.
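
The degradation path for vectors can be sketched as a factory that falls back to a brute-force in-memory store when Milvus is unreachable. The class and function names below are illustrative, not the actual Sagewai API:

```python
import math

class InMemoryVectorStore:
    """Fallback used when Milvus is unavailable: brute-force cosine search."""
    def __init__(self):
        self._vectors = {}

    def add(self, key, vector):
        self._vectors[key] = vector

    def search(self, query, top_k=5):
        def cosine(a, b):
            dot = sum(x * y for x, y in zip(a, b))
            na = math.sqrt(sum(x * x for x in a))
            nb = math.sqrt(sum(x * x for x in b))
            return dot / (na * nb) if na and nb else 0.0
        ranked = sorted(self._vectors,
                        key=lambda k: cosine(query, self._vectors[k]),
                        reverse=True)
        return ranked[:top_k]

def make_vector_store(milvus_reachable: bool):
    """Return a Milvus-backed store when available, else the fallback."""
    if milvus_reachable:
        raise NotImplementedError("wire up a Milvus client here")
    return InMemoryVectorStore()
```

The same shape applies to the other fallbacks: in-memory graphs in place of NebulaGraph, and direct database queries in place of Redis.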