Roadmap

Fintech scale, AI inference, and operating maturity now move on one roadmap.

v0.1 delivered 62,770 TPS on a 3-node DO cluster. v0.2 achieved 436K TPS sharded and 131K TPS VSR. v0.3 shipped April 19 2026 on AWS i4i.4xlarge — 233,894 TPS with live VSR failover testing. v0.3.1 integrated AI inference with Tract ONNX. v0.3.2 added multi-stream priority routing with <1ms critical latency. v0.3.3 hardened the platform with 2PC, mTLS, signed artifacts, and operator runbooks. v0.5 AI LLM verified distributed Qwen2.5-7B generation, and v0.5 AI Production now points toward large-model serving.

v0.1✓ Done62,770 TPS · DO cluster · gRPC
  • Core Rust engine with LMAX Disruptor ring buffer
  • TigerBeetle VSR consensus across three nodes
  • gRPC + Tokio UDP transport · 256 in-flight window
  • 3-node DigitalOcean cluster · $252/month · 0% error rate
v0.2✓ Done1.2M TPS local · 436K TPS sharded · 131K TPS VSR
  • Aeron IPC transport — 1,203,108 TPS peak on MacBook Air M4
  • io_uring disk writes · sharded TigerBeetle E2E integration
  • 2 shards per node · 436,351 TPS aggregate (3-node sharded)
  • 130,998 TPS with full VSR consensus · 2/3 quorum · 0% error
  • DO cluster benchmark completed April 13 2026 · SGP1
v0.3✓ Done233,894 TPS peak · AWS i4i · VSR failover tested
  • Production-grade single-shard with live VSR failover test
  • AWS i4i.4xlarge · Intel Xeon Platinum 8375C · 16 vCPU · 128 GiB · 1.9TB NVMe
  • 4 shards × dedicated TigerBeetle client · 3-node VSR cluster (loopback)
  • 233,894 TPS peak · 180,500 TPS avg · 12,421,068 events · 0% error
  • VSR failover tested: 1 replica killed at t=80s → recovered in 37s · bench continued
  • AWS Singapore · April 19 2026
v0.3 — AWS i4i.4xlarge · 233,894 TPS peak · VSR failover recovery tested · 12,421,068 events · 0% error · April 19 2026
v0.3.1 AI🔨 Implemented1,500–2,000 RPS estimate · AI inference pipeline
  • 5 production-grade datasets implemented
  • Tract ONNX runtime integration for AI inference
  • io_uring dataloader for high-throughput model serving
  • 2,291 LOC · 57 tests passing · CI 100% green
  • AI inference pipeline implemented · production benchmark pending
v0.3.2 Priority✓ DoneSame TPS · <1ms critical latency
  • Multi-stream priority routing (Critical/High/Normal)
  • Critical requests bypass queue with <1ms latency guarantee
  • 429 tests passing · 0 Clippy warnings
  • Production-ready priority scheduler deployed
v0.3.3 Hardening✓ DoneSame TPS · 2PC · mTLS · signed supply chain
  • Cross-shard 2PC via TigerBeetle pending/post/void flow
  • Kubernetes ingress + cert-manager mTLS automation
  • Prometheus PVC persistence and operational hardening
  • Syft SBOM + Cosign keyless signing in CI
  • 3 ADRs and 8 runbooks published for operators
v0.4 Fintech Scale🔮 Future1M+ TPS target · 4× i8g.16xlarge Graviton 4
  • 4× AWS i8g.16xlarge instances (Graviton 4, 64 vCPU each)
  • Sharded VSR cluster with horizontal scaling
  • Production benchmark pending · target 1M+ sustained TPS
  • Multi-region replication architecture
Target: 2027 — bare-metal NVMe Gen4 · XDP ingress · estimated 5–10M TPS sharded · 1–2M TPS VSR
v0.5 AI LLM✓ Verified32 tokens in 19.7s · distributed Qwen2.5-7B
  • Qwen2.5-7B-Instruct distributed 3-stage inference pipeline
  • Stage orchestration across Aeron IPC streams 1001/2001/2002/1002/1003
  • Language Drift issue fixed and English generation verified
  • KV cache preserved across decode steps with correct position propagation
  • Production-ready multi-token generation on CPU-only Apple M4
Qwen2.5-7B-Instruct · 3-stage distributed pipeline · 32 tokens in 19.7s on Apple M4 CPU · multi-token generation verified
v0.5 AI Production🔮 FutureTBD RPS · AI inference production benchmark
  • Cortex v1 on ClarkenAI 70B plus Blazil Super Engine
  • Cloud cost validation against commercial inference APIs
  • Ankatos runtime integration and shared transport layer
  • Production benchmark pending for large-model serving

The source is open and the benchmarks are reproducible.

Clone the repository, run the benchmark suite, and verify every number yourself.

View on GitHub →