Enterprise investigation & forensics platform

An investigator that works the case.

Black Box runs real collection tools against live public-record and OSINT sources, divides each case across a coordinated team of specialist agents, and clears every report through an integrity gate before you read a word. For enterprise and government teams it deploys into your own air-gapped network on dedicated private AI servers — integrating your enterprise tools and APIs, and running full computer, mobile-device and CCTV forensics. A chatbot answers from memory. Black Box goes and gets the evidence — on infrastructure you control.

Start with Black Box LiteContact us for EnterpriseLite — self-serve monthly subscription · Enterprise — deployed for you
Live sourcesCourtListener / RECAPOpenSanctionsSEC EDGAROpenCorporatesRDAPcrt.shWaybackusername searchreverse image
The engine

How a case actually runs

Six stages, each a distinct step in the engine — not one model improvising an answer. A lead decomposes the question, specialist agents execute it with real tools, and the findings are verified and fact-checked before they reach you.

01

Plan

A lead investigator decomposes your request into distinct, non-overlapping tasks. No single model trying to hold the whole case in its head — the work is divided before any collection begins.

planInvestigation
02

Collect

Each task is handed to its own specialist agent, which runs a real tool loop: live web search, the built-in OSINT data sources, and — behind an approval gate — the investigation browser. Reasoning is set to HIGH on every task.

runTask
03

Verify

An adversarial QC verifier reviews each task's claims and marks them supported or unsupported. It can downgrade a finding the agent called confirmed. Unsupported claims do not get to ride along quietly.

verifyTask
04

Resolve

Aliases and duplicate entities are merged into single records, each assigned a role and its relationships — so the same person under three spellings becomes one node, not three.

resolveEntities
05

Synthesize

The lead integrates the verified findings into one report, with a BLUF tagged to a single calibrated confidence term and a mandatory competing-hypotheses section. Citations are renumbered into one global sequence you can follow.

synthesizeInvestigation
06

Integrity gate

An integrity auditor reads the finished report and flags hallucinated sources or dockets, uncited claims, and miscalibrated confidence. The report clears this stage before it surfaces — the fact-check is enforced in software, not left to the operator.

factCheckInvestigation
Real collection

It executes the collection — it does not recall it

Ten built-in OSINT data sources make live calls, plus an investigation browser that drives your own authenticated Chrome session. Every tool is scoped to authorized, public-record and consented collection.

Case law & federal dockets

CourtListener search across US case law and RECAP — the archived PACER document set. (RECAP, not a live PACER login.)

courtlistener_search

Sanctions & PEP screening

OpenSanctions lookups for sanctions and politically-exposed-person hits. A screening aid for triage, not a definitive legal determination.

sanctions_search

Corporate & securities records

OpenCorporates company records and SEC EDGAR full-text search, against live registries and filings.

opencorporates / edgar_search

Infrastructure & domains

RDAP (the WHOIS successor) and crt.sh certificate-transparency enumeration to map domains, registrants, and subdomains.

rdap_lookup / crt_sh

Archive & source preservation

Wayback save-and-lookup so the evidence you cite is preserved at the moment you captured it.

webarchive / preserve_source

Username & reverse-image leads

A username sweep across ~16 platforms and a reverse-image link-builder across Lens, Yandex, TinEye, and Bing. Both are labelled heuristic leads — to be human-confirmed in the browser.

username_search / reverse_image
Human in the loop

The investigation browser, behind an approval gate

To read login-gated OSINT and social pages, an agent can drive your own authenticated Chrome profile — but never without your explicit approval, stated reason, and the risks shown first.

Your own authenticated session

The browser runs as a persistent Chrome context against your profile, so an agent reads pages exactly as you would when signed in. It is reach that link-analysis tools and consumer chatbots do not have.

browser.ts

Approval before any action

Only browser actions require approval — public-API OSINT calls do not. Each browser step pauses for a human decision that explains why it is needed and the risk, and denies on timeout rather than proceeding.

approvals.ts

A deliberate posture on high-risk sites

Driving a logged-in session into major social platforms prepends a terms-of-service and automation warning, so collection there is a conscious, warned choice — authorized, consented, and reviewable, not a way around anything.

HIGH_RISK_DOMAINS

Every navigation logged

Each browser navigation is written to the tamper-evident audit log, so the record of what was accessed, and when, is preserved alongside the case.

audit.ts
Tradecraft

Calibrated tradecraft, not confident prose

The single biggest blocker to AI in serious investigations is hallucination with no confidence signal. The engine answers that with structure — and frames its own heuristics as triage aids, not verdicts.

ICD-203 calibrated confidence

Every assessment leads with exactly one estimative-probability term — from almost certain to remote — so you read a calibrated judgment, not a vague hedge or false certainty.

estimative probability

Competing hypotheses, required

Each report carries a mandatory competing-hypotheses section: two to four alternatives with evidence for and against and a calibrated likelihood each. Uncertainty is shown, not smoothed over.

competing hypotheses

Source-reliability tiers

Sources are graded A through D (official registries down to anonymous pastes) by an Admiralty-inspired heuristic. The grade is a triage prior, not a verdict — a tier-A domain can still host a forgery.

source-tiers.ts

Disinformation flagging

Planted, fabricated, or coordinated-inauthentic material is flagged and held at low confidence rather than absorbed as fact.

runTask / factCheck

Report templates that fit the job

Due-diligence, threat-intel, litigation-support, and person-of-interest templates reorder the deliverable and add the right blocks — a risk assessment, an IOC section, an exhibit list, or an expanded subject dossier.

report.ts

Evidence indexed for retrieval

Uploaded files are RAG-indexed with OCR and EXIF extraction, with a local lexical fallback when no embedding key is set, so the case can reason over what you bring it.

embeddings / ocr / exif
The workspace

A persistent case, not a disposable thread

Close the tab and a chatbot forgets everything. Black Box keeps the case, the graph, the timeline, and the chain of evidence.

Entity map & link graph

Resolved entities become nodes with typed relationship edges you can read and export — the connections, mapped, not just described.

Notebook

Timelines & sources panel

A master timeline and a sources panel keep the sequence of events and the provenance of every claim in one workspace.

Notebook

Editable per-entity dossiers

Persistent dossiers track each subject's status (active, flagged, cleared), aliases, identifiers, relationships, and sources — built from the case and editable as it develops.

dossiers

Tamper-evident exhibits

Deliverables and exhibits get a SHA-256 hash and numbered exhibit IDs, so a recipient can detect post-hoc alteration. Chain-of-custody-style integrity — tamper-evidence, not a court-admissibility claim.

case-export.ts

Exports built for your stack

Export to JSON, Markdown, Word-openable RTF, and nodes/edges CSV for Gephi or Maltego — with optional best-effort PII redaction on the way out.

case-export.ts

Authorization & scope on every case

Each case carries authorization and scope fields, keeping the engagement framed as authorized and consented from the first action.

cases
Security & sovereignty

Your infrastructure, your keys, your isolation

Sending case data and subject PII to a vendor's shared cloud is increasingly an OPSEC and compliance liability. Black Box is built to keep the case, the keys, and the audit trail on infrastructure you own — and on Enterprise, entirely inside your own air-gapped network.

Keys never reach the browser

All model calls run server-side through an in-house gateway. Provider API keys stay on the server; the client only ever sees has-key boolean flags, never a raw secret.

config.ts / gateway

Secrets encrypted at rest (Windows DPAPI)

On Windows, secrets are sealed with DPAPI so the blob only decrypts for the same user on the same machine. On other platforms a clearly-marked non-secure fallback is used — we state that plainly.

secrets.ts

The upstream vendor is abstracted

You call public model names (geogentia-fast, -pro, -reasoning); the gateway maps them to a hidden upstream model and routes server-side. Which vendor answers is never exposed to the client.

gateway/models.ts

Hash-chained audit log

Activity is appended to a tamper-evident log where each entry hashes the previous one. Editing or removing any record breaks the chain, and the verifier reports the first broken index.

audit.ts

Per-user multi-tenant isolation

Auth.js sessions with bcrypt-hashed passwords and user/admin roles; each user sees only their own cases and searches. The first registrant becomes the owner-admin. Cloud SQL when configured, JSON-file fallback otherwise.

auth.ts / users.ts

Governance, stated honestly

Data-retention is a recorded policy surfaced on export (not automatic deletion), and export PII redaction is a configurable best-effort scrub (not guaranteed de-identification). We never train on your data.

config.ts
Where it sits

Next to i2, Maltego, and Hunchly

Legacy suites are powerful canvases that wait for you to feed them data. Black Box sits upstream — it runs the collection and writes the calibrated assessment, then exports cleanly into the tools you already own.

i2 & Maltego chart; this collects

Link-analysis suites visualize relationships you have already gathered. Black Box plans the case, executes the collection, resolves the entities, and exports nodes/edges CSV straight into your graph tool.

complement or replace

Hunchly logs; this reasons

Hunchly captures what you browse with its own integrity logging. Black Box adds the reasoning and collection on top of tamper-evident capture — it works the case, not just records your session.

on top of integrity

Consumer LLMs do neither

A chatbot neither runs the collection nor produces a sourced, confidence-tagged assessment. Black Box does both, inside an authorized, audited posture, on infrastructure you control.

the wedge
The difference

Black Box vs. a consumer LLM

Same underlying models — a different machine built around them. Here is what changes when the AI has to collect the evidence and show its work.

 Black BoxConsumer LLM (ChatGPT)
Where the answer comes fromRuns real collection tools against live sources — CourtListener/RECAP, OpenSanctions, EDGAR, OpenCorporates, RDAP, crt.sh, Wayback, plus a browser on your own sessionAnswers from frozen training memory; no live collection of fresh evidence
Authorized OSINT / security workScoped for it — runs the collection behind an authorized, consented, approval-gated postureFrequently hedges or declines authorized security and OSINT tasks
Grounding & verificationCited sources, an adversarial verify step per task, and an integrity gate that flags hallucinated dockets and uncited claims before you read itConfident prose, no built-in verification or fact-check stage you can rely on
Confidence signalICD-203 calibrated term on every assessment, mandatory competing hypotheses, source-reliability tiers (as triage, not verdicts)Vague hedging or unwarranted certainty, with no calibrated signal
Memory of the casePersistent workspace: entity graph, timelines, editable dossiers, sources, and a chain of evidenceStateless thread — forgets the case when you close the tab
ArchitectureA coordinated team of specialist agents with a verifier between them and your reportOne model producing a single response
Where your data livesSelf-hosted container, per-user isolated, hash-chained audit log, keys server-side, never trained on your data, air-gap tier availablePrompts sent to a vendor's shared cloud
What you can stand behindTamper-evident exhibits with numbered IDs and chain-of-custody-style integrity (not a court-admissibility claim)An answer taken on faith, with no exhibit record
“We did not build a smarter chatbot. We built the workflow a chatbot refuses to do: real collection, an adversarial verifier between every agent and your report, calibrated confidence on every line, and a tamper-evident record — running on infrastructure you, not a vendor, control.”
— Geogentia
Editions

Two ways to run Black Box

Black Box is the full enterprise platform — deployed into your own air-gapped or on-prem environment with dedicated private AI, full computer and mobile forensics, and large-scale evidence handling. Black Box Lite is the self-serve monthly subscription plus metered tokens: the same investigation engine, with some evidence-holding limits. Enterprise is priced per engagement — talk to us.

● Black Box Lite pricing is illustrative — final pricing on signup
Black Box Lite
Black Box Lite — Analyst
$499 per user / month + tokens
~8M tokens included

A single seat for the working investigator — the full engine, the case workspace, and the integrity gate. A monthly subscription plus metered tokens.

  • Full 6-stage engine on HIGH reasoning
  • All OSINT tools + approval-gated browser
  • Analyzes forensic reports from every major tool — EnCase, Cellebrite (UFED), Magnet AXIOM, GrayKey, Oxygen
  • Small CCTV clips, body-cam & video footage
  • Evidence-holding limits: full phone forensic images and multi-gigabyte video are not stored (those are Black Box Enterprise)
  • Tamper-evident exhibits + JSON/MD/RTF/CSV export
  • Overage billed per token at a transparent rate
Black Box
Black Box — Enterprise
Contact us tailored to your environment
Custom token allotment

The full Black Box platform for enterprise and government teams — deployed into your environment, with no evidence-holding limits. Scoped and stood up per engagement.

  • Air-gapped network & on-prem isolated deployment
  • Dedicated private AI servers — nothing leaves your perimeter
  • Integrates with your enterprise tools, systems & APIs
  • Full computer & mobile device forensics — acquire and analyze phone and disk images end-to-end
  • Large-scale investigations, including full CCTV review (not just clips) and multi-terabyte evidence
  • Custom token allotment, onboarding & deployment support
  • Retention & redaction configured to your governance
Questions

Before you commit

What's the difference between Black Box and Black Box Lite?
They share the same investigation engine. Black Box (Enterprise) is for enterprise and government teams: it deploys into your own air-gapped or on-prem network on dedicated private AI servers, integrates with your enterprise tools and APIs, runs full computer and mobile-device forensics, and handles large-scale evidence — including end-to-end CCTV review and multi-terabyte material. It's priced and scoped per engagement — you contact us. Black Box Lite is the self-serve edition: a monthly subscription plus metered tokens, with some evidence-holding limits (see below). Most individuals and small teams start on Lite; organizations that need air-gap, private AI, or heavy forensics move to Enterprise.
What evidence and forensics can each edition handle?
Black Box Lite analyzes forensic REPORTS exported from every major tool — EnCase, Cellebrite (UFED/Physical Analyzer), Magnet AXIOM, GrayKey, Oxygen — and small media like CCTV clips and body-cam footage. What Lite does NOT do is hold the heavy artifacts themselves: full forensic phone images or multi-gigabyte video are out of scope on the subscription. Black Box (Enterprise) removes those limits — it can acquire and analyze full phone and disk images end-to-end and run large-scale CCTV review and multi-terabyte evidence inside your own environment. In all cases, exhibits are tamper-evident and chain-of-custody-style; we make no guarantee of legal or evidentiary admissibility.
Is my case data private — where does it actually go?
With Black Box (Enterprise) everything runs inside your own air-gapped or on-prem environment on dedicated private AI — nothing leaves your perimeter. With Black Box Lite, it's a managed subscription: provider API keys never reach the browser (every model call runs server-side through an in-house gateway), each user is isolated so they see only their own cases, admins get a hash-chained audit log, and we never train on your data. The upstream model vendor is abstracted behind public model names either way.
How is this different from ChatGPT or Claude — we already pay for one?
A consumer LLM answers from training memory, often hedges or declines authorized OSINT work, and hands you confident prose with no source and no confidence signal. Black Box is scoped for authorized investigation: it runs real collection tools against live sources, grounds claims in cited evidence with calibrated confidence and competing hypotheses, passes every report through an integrity gate, and keeps it all in a persistent case with a tamper-evident chain of evidence. You are buying the workflow a chatbot refuses to do, not a better chatbot.
Is this legal — what keeps it to authorized use?
The product is built for authorized investigations and enforces that posture. The investigation browser uses your own authenticated profile and never acts without a human-in-the-loop approval gate that states the reason and the risk before each step; high-risk sites get an explicit terms-of-service warning. Every case carries authorization and scope fields, and admins get a hash-chained audit log of all activity. It is designed to keep collection consented, scoped, and reviewable — not to bypass anything. We do not promise legal or evidentiary outcomes; exhibits are tamper-evident and chain-of-custody-style, never described as court-admissible.
What does a token actually buy?
Tokens meter the real investigative load — every plan, collect, verify, resolve, synthesize, and fact-check step runs on HIGH reasoning and consumes tokens, so spend tracks work done rather than idle seats. On Black Box Lite each plan includes an allotment, and anything beyond it is billed per token at a transparent rate; the figures on this page are illustrative and confirmed at signup. Black Box (Enterprise) is a custom token allotment scoped to your deployment.

Stop asking a chatbot. Start running the case.

Create an account and put a coordinated team of investigators — real tools, calibrated confidence, an integrity gate, and a tamper-evident chain of evidence — to work on infrastructure you control.

Create an account