Umbra is an experimental alpha, and the fastest way to lose trust in a private inference product is to overclaim. So here is a plain snapshot of what actually works today, and what does not yet.
What works today
You can call the API. Umbra speaks the OpenAI and Anthropic wire formats.
Point your existing client at https://api.tryumbra.dev/v1, mint a key in the
console, call GET /v1/models to see what is live, and run it. Your $1 of free
signup credit covers up to ~10 million tokens at the current lowest input
rate ($0.10 per million). The exact count depends on the model and your
input/output mix. See pricing for the live per-model rates.
Developer top-up is live. You can buy prepaid inference credit with a card, processed by Stripe. Charges are real. Purchased and promotional credit is spendable on inference only. It is non-refundable and non-withdrawable.
Attestation is live at the hardware tier. The coordinator runs inside an
attested AMD SEV-SNP confidential VM, and Apple Managed Device Attestation of
provider hardware is live: the reference provider runs at the hardware trust
tier under real MDA verification. Prompts are decrypted in memory only and
zeroized after each request, never logged, never written to disk.
The provider app is published. A clean Apple-Silicon Mac can install with one
command (curl -fsSL https://tryumbra.dev/setup | sh), sign in, complete the MDM
approval, and start hosting a public Hugging Face GGUF model it selects and pulls
with its own Hugging Face key.
What is still pending
Independent code identity (code_attested) is implemented but dormant. The
APNs app-targeted nonce challenge is built and tested; it lights up once the
operator wires the APNs key and the app ships the entitlement. Until then, the
verifier caps providers at hardware, and authenticated private routing fails
closed below code_attested rather than silently downgrading.
Signed per-response receipts are not live yet. Today’s evidence covers coordinator posture and provider registration. The receipts that would bind a specific request to that evidence are still rolling out. Treat missing evidence as unavailable privacy proof.
Provider payouts are not open. The cross-border region support needed to pay providers is still being worked out, so no money is paid out during alpha. Earnings accrue for transparency in the meantime, but the amounts are projected and non-binding.
Try it
Swap your base_url, keep your SDK: quickstart. See the live
rates on pricing, or read the trust model and
threat model for how the privacy property is meant to hold.
(Umbra is an experimental alpha. Everything above reflects the state at the time of writing and will change as the alpha progresses: see the changelog.)