Early access · Paid API credit is live. New accounts still get free starter credit; provider payouts are coming soon.
Home · Blog

Umbra alpha: what works today

July 1, 2026 · An honest snapshot of the Umbra alpha: developer top-up is live, coordinator and Apple hardware attestation are live, the provider app is published, and provider payouts are still pending region support.

Umbra is an experimental alpha, and the fastest way to lose trust in a private inference product is to overclaim. So here is a plain snapshot of what actually works today, and what does not yet.

What works today

You can call the API. Umbra speaks the OpenAI and Anthropic wire formats. Point your existing client at https://api.tryumbra.dev/v1, mint a key in the console, call GET /v1/models to see what is live, and run it. Your $1 of free signup credit covers up to ~10 million tokens at the current lowest input rate ($0.10 per million). The exact count depends on the model and your input/output mix. See pricing for the live per-model rates.

Developer top-up is live. You can buy prepaid inference credit with a card, processed by Stripe. Charges are real. Purchased and promotional credit is spendable on inference only. It is non-refundable and non-withdrawable.

Attestation is live at the hardware tier. The coordinator runs inside an attested AMD SEV-SNP confidential VM, and Apple Managed Device Attestation of provider hardware is live: the reference provider runs at the hardware trust tier under real MDA verification. Prompts are decrypted in memory only and zeroized after each request, never logged, never written to disk.

The provider app is published. A clean Apple-Silicon Mac can install with one command (curl -fsSL https://tryumbra.dev/setup | sh), sign in, complete the MDM approval, and start hosting a public Hugging Face GGUF model it selects and pulls with its own Hugging Face key.

What is still pending

Independent code identity (code_attested) is implemented but dormant. The APNs app-targeted nonce challenge is built and tested; it lights up once the operator wires the APNs key and the app ships the entitlement. Until then, the verifier caps providers at hardware, and authenticated private routing fails closed below code_attested rather than silently downgrading.

Signed per-response receipts are not live yet. Today’s evidence covers coordinator posture and provider registration. The receipts that would bind a specific request to that evidence are still rolling out. Treat missing evidence as unavailable privacy proof.

Provider payouts are not open. The cross-border region support needed to pay providers is still being worked out, so no money is paid out during alpha. Earnings accrue for transparency in the meantime, but the amounts are projected and non-binding.

Try it

Swap your base_url, keep your SDK: quickstart. See the live rates on pricing, or read the trust model and threat model for how the privacy property is meant to hold.

(Umbra is an experimental alpha. Everything above reflects the state at the time of writing and will change as the alpha progresses: see the changelog.)