Prefill once. Query forever. Minnesväv caches your documents' full AI context locally — so every query is cheaper, faster, and never leaves your infrastructure.
Today, every query against your internal documents re-processes the full context: a 200-page legal brief, a chip datasheet, a patient record. You pay the full prefill cost every single time.
LLM inference has two phases: prefill, which processes the input context, and decode, which generates the answer token by token. We make prefill a one-time cost per document, not a per-query tax paid by every user, every time.
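To make the arithmetic concrete, here is a minimal sketch that counts the tokens pushed through prefill with and without a per-document cache. The function names and the example numbers (a 150,000-token document, 40-token queries, 1,000 queries) are illustrative assumptions, not Minnesväv measurements. The sketch ignores decode cost, which is paid in both cases.

```python
def prefill_tokens_without_cache(doc_tokens: int, query_tokens: int, n_queries: int) -> int:
    """Every query re-processes the whole document plus the query itself."""
    return n_queries * (doc_tokens + query_tokens)


def prefill_tokens_with_cache(doc_tokens: int, query_tokens: int, n_queries: int) -> int:
    """The document is prefilled once; each query only adds its own tokens."""
    return doc_tokens + n_queries * query_tokens


# Illustrative numbers only (assumptions, not measurements).
doc_tokens, query_tokens, n_queries = 150_000, 40, 1_000

without_cache = prefill_tokens_without_cache(doc_tokens, query_tokens, n_queries)
with_cache = prefill_tokens_with_cache(doc_tokens, query_tokens, n_queries)

print(without_cache)  # 150040000
print(with_cache)     # 190000
```

With a single query the two totals are identical, so the saving comes entirely from repeated queries against the same document.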
When a new document enters your DMS, Minnesväv runs prefill once — computing the full KV cache using your chosen open model, on your own hardware or private cloud.
The KV cache is stored in your own infrastructure — on-prem DRAM, edge SSDs, or your private cloud. It inherits the exact ACL of the source document. Nothing leaves your domain.
Every user query loads the cached context, so the model processes only the new query tokens and then decodes the answer. The document is never re-processed: prefill cost scales with the length of the query, not the length of the document already in the cache.
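The three steps above can be sketched as a toy pipeline. This is an illustration under stated assumptions, not Minnesväv's implementation: `StubModel`, `CacheStore`, `CacheEntry`, and their methods are hypothetical names, whitespace splitting stands in for a real tokenizer, and a tuple of tokens stands in for the real KV tensors.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CacheEntry:
    kv_state: tuple            # stand-in for the real KV tensors
    allowed_users: frozenset   # ACL copied from the source document


class StubModel:
    """Hypothetical stand-in for an open-weight model; counts prefill work."""

    def __init__(self) -> None:
        self.prefilled_tokens = 0

    def prefill(self, tokens: list, past: tuple = ()) -> tuple:
        # Only the new tokens are processed; `past` is reused as-is.
        self.prefilled_tokens += len(tokens)
        return past + tuple(tokens)

    def decode(self, kv_state: tuple) -> str:
        return f"answer conditioned on {len(kv_state)} cached tokens"


class CacheStore:
    """Hypothetical in-perimeter store: one cache entry per document."""

    def __init__(self, model: StubModel) -> None:
        self.model = model
        self.entries = {}

    def ingest(self, doc_id: str, text: str, allowed_users: set) -> None:
        # Step 1 and 2: prefill once, store the result with the document's ACL.
        if doc_id not in self.entries:
            kv_state = self.model.prefill(text.split())
            self.entries[doc_id] = CacheEntry(kv_state, frozenset(allowed_users))

    def query(self, user: str, doc_id: str, question: str) -> str:
        # Step 3: check the ACL, reuse the cache, process only the question.
        entry = self.entries[doc_id]
        if user not in entry.allowed_users:
            raise PermissionError(f"{user} may not read {doc_id}")
        kv_state = self.model.prefill(question.split(), past=entry.kv_state)
        return self.model.decode(kv_state)


model = StubModel()
store = CacheStore(model)
store.ingest("brief-001", "word " * 1000, {"alice"})
print(store.query("alice", "brief-001", "What is the filing deadline?"))
print(model.prefilled_tokens)  # 1005: 1000 document tokens once, plus 5 query tokens
```

Ingesting the same document again does no further prefill work, and a user outside the ACL gets a `PermissionError` before the cache is ever read.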
Minnesväv integrates with your DMS — EHR, SharePoint, Confluence, internal wikis. The cache store is fully within your perimeter. The model runs on cost-efficient open-weight inference.
Companies whose document corpus is stable and repeatedly queried see the highest ROI, because prefill-once translates directly into predictable, shrinking unit costs.
Case files, contracts, and regulatory briefs queried by dozens of attorneys. Cache each document once, serve thousands of queries — with document-level ACL enforced on every cache access.
Patient records in EHR systems queried across departments. The KV cache stays within your HIPAA perimeter. Prefill and every later query run inside that perimeter, so PHI is never sent to external APIs.
Thousands of datasheets, design specs, and technical errata queried daily by engineers. Prefill the corpus once — engineers query fast without re-processing 500-page PDFs every time.
Risk models, regulatory filings, and fund prospectuses queried by analysts. Per-document billing maps cleanly to fintech unit economics — a line item you can actually predict and budget.
Minnesväv is in private beta with select design partners in legal, healthcare, and semiconductors. Join the waitlist to get early access and shape the roadmap.