Production RAG · Measured & regression-gated · EU-sovereign
Production GenAI is proven, not just demoed.
I build RAG and GenAI systems held to a production bar: retrieval measured against an expert-validated gold set, regression-gated, deterministic and auditable by design. When the data can't leave European jurisdiction, the stack stays sovereign end to end — self-hosted models, EU infrastructure, no US service in the path.
78% Recall@5 · enterprise gold set
0.66 MRR · enterprise gold set
5 pt regression gate
1B+ records · production ETL
Work
Selected projects
ragaws-bedrockvector-searchevaluationdata-engineering
End-to-end knowledge and retrieval layer for a production RAG assistant — a deterministic, rule-based ingestion and chunking pipeline (no ML, no LLM), structure-aware indexing for AWS Bedrock, two-stage retrieval with reranking, and a four-pillar evaluation harness.
llm-opsmulti-agentlitellmeu-sovereign
A controllable, EU-sovereign environment for agentic coding: LiteLLM tier-routing across on-device and EU-managed providers (any OpenAI-compatible backend swappable in), an adversarial reasoning/execution gate, and a single gateway guardrail.
sap-hanacode-generationcompilertesting
A spec-driven engine that generates SAP-HANA warehouse objects from YAML and parses existing ones back — bound by a shared intermediate representation and byte-stable roundtrip tests.
ai-governanceworkflow-dsldeterminismaudit
A deterministic framework that makes AI-agent software delivery reproducible, auditable, and approval-gated — a no-LLM kernel, single-shot agents, enforced invariants, and a tamper-evident audit log.
Let's talk
Building an AI system that has to hold up in production, not just in a demo?
Whether you need a RAG system proven correct and kept that way as it changes, or a sovereignty constraint solved end to end — or you're a delivery partner who needs someone who has actually shipped both — happy to talk systems. No pitch needed.
- — A RAG or retrieval system that has to prove its quality — and keep proving it as it changes
- — A production AI system that has to be auditable and reproducible, not just demoable
- — A sovereignty or compliance constraint blocking AI adoption
- — Delivery partners needing proven, production-grade AI capacity
Building AI that has to be provably correct — or that has to stay in-jurisdiction? Get in touch.