Hallucination liability
Clauses with opposite meaning — shall vs shall not — sit too close in embedding space. False-positive retrievals get cited, audited, litigated.
Enterprise Compress
Enterprise embedding compression with deterministic safety lens. Production · AWS Marketplace Ready · v1.0.0-rc18.
Per-document footprint
WP-20260113-01 · deterministic · signal-physical
At 1 billion documents
Standard Stack
With AQEA
117× smaller. 99.15% cost reduction.
Section 01
Clauses with opposite meaning — shall vs shall not — sit too close in embedding space. False-positive retrievals get cited, audited, litigated.
1B embeddings at 1024D float-32 ≈ 4 TB on disk and roughly $100,000/year in cloud hot-tier storage.
Legal, medical, financial RAG pipelines need provable controls, audit trail and zero-knowledge deployment paths from day one.
Section 02
Two layers, one deployable artefact. Drop into your existing vector-DB, keep your embedding model, your RAG framework, your prompt-chain. No retraining, no re-ingest.
┌────────────────────────┐ ┌──────────────────────┐
│ Your embedder │ → │ Lens (~45 KB) │ ← domain steering
│ (E5 / BGE / OpenAI) │ │ matrix multiply │ legal · medical · financial
└────────────────────────┘ └──────────┬───────────┘
│
▼
┌──────────────────────┐
│ Compress (117×) │ ← deterministic, signal-physical
│ 1024D → 11D │ 93–99% quality preservation
└──────────┬───────────┘
│
▼
┌────────────────────────────────┐
│ Vector-DB (Qdrant · pgvector) │
│ S3 · Azure Blob · GCS · HTTP │
└────────────────────────────────┘Section 03
LexGLUE 50k, EDGAR, ContractNLIv2 — reproducible against a live read-only Qdrant Cloud endpoint.
| Metric | Value | Source |
|---|---|---|
| Compression | 117–1,229× | WP-20260113-01 |
| Quality Preservation | 93–99% | WP-20260113-01 |
| False-Positive Rate (LexGLUE 50k) | 56% → 0% | WP-20260109-01 |
| HitC@10 (Correct Hit) | 80.0% → 83.2% | WP-20260109-01 |
| Storage (1B embeddings) | 4 TB → 35 GB | WP-20260113-01 |
| Cloud Storage Cost (1B embeddings) | $100k → $850/yr | WP-20260113-01 |
| Qdrant Live-Bench | 1024D → 11D, 6.9× faster ingest | WP-20260113-02 |
| Lens Weight Artefact | ~45 KB | WP-20260109-01 |
Section 04
1.42 GB · multi-arch · amd64 + arm64
1.42 GB
All-in-One image
amd64 + arm64
Multi-arch
8 / 8
Connectors
10 / 10
Pipeline Step-Handlers
8 Connectors
10 Pipeline Step-Handlers
Security on day one
bcrypt-12 · CSRF · hash-chained audit log · rate-limiting · API + Auth
Drops into your stack day one. Same container, same SHA-256 manifest, same multi-arch image — whether you ship to AWS, Azure, GCP, on-prem, or air-gapped.
Verify it yourself
Run the 117× compression bench against your own embeddings via our Qdrant Cloud Read-Only API. SHA-256 manifests, public reproducibility statement.
We respond to commercial inquiries within one business day.