E
Epstein Suite

System Status: Active Ingestion

Live processing: OCR, AI summaries, and data indexing in progress across ~3.5 million newly released pages.

Ask Epstein Files
Ask Epstein Files Chat with the archive
Feedback Suggest improvements

Transparency

Epstein Suite is a public-source intelligence tool. Every document in this archive originates from official government releases. This page documents our data handling practices, source verification procedures, and content policies.

Redaction Policy

Victim & Minor Protection

This project maintains a strict prohibition against identifying victims or minors. Our AI analysis prompts explicitly forbid un-redacting names, reversing government redactions, or attempting to identify individuals protected by law. This applies to all automated processing including GPT-4o summaries and entity extraction.

PII Protection

Personally Identifiable Information (PII) of non-public figures is handled with care. When our systems detect sensitive personal data (phone numbers, addresses, financial details) belonging to private individuals, that content is flagged for review. We rely on government redactions as the primary protection layer and supplement with AI-assisted screening.

AI-Assisted Review

Every document in the archive passes through an AI enrichment stage (OpenAI GPT-4o) that generates summaries and extracts entities. These AI prompts are configured to respect existing redactions and avoid speculative identification. AI summaries are navigation aids and may contain inaccuracies — users should always verify against the original source document.

Manual Review Process

Content flagged by users or automated systems undergoes manual review. We prioritize reports involving exposed victim identities, incorrect redactions, or sensitive personal information. Removal requests are typically acknowledged within 72 hours.

Report a concern

If you believe content should be redacted or removed, contact: admin@kevinchamplin.com

Source Integrity

Official Government Sources

Every document in this archive originates from one of three official government sources. We do not host leaked, stolen, or unofficial materials. Each document links back to its original source URL for independent verification.

1

U.S. Department of Justice

EFTA releases (Data Sets 1–12), court records, and BOP footage via justice.gov/epstein

2

FBI Records Vault

FOIA releases from the Federal Bureau of Investigation via vault.fbi.gov

3

House Oversight Committee

Epstein Estate documents released by Congress via oversight.house.gov

6-Stage Processing Pipeline

Documents pass through a structured ingestion pipeline to make them searchable and analyzable. No content is altered — original files are preserved alongside our processed versions.

1
Source Discovery

Automated tools download and index new documents from official government portals into our database.

2
Download & OCR

Files are downloaded and processed through Tesseract OCR to extract machine-readable text from scanned documents.

3
Media Processing

Video and image files have metadata extracted and thumbnails generated for browsing.

4
AI Enrichment

GPT-4o generates plain-language summaries and extracts named entities (people, organizations, locations) from OCR text.

5
Email Extraction

Email headers (From, To, Subject, Date) are parsed from OCR text into a searchable email index.

6
Flight Analysis & Search Embeddings

Flight logs receive significance scores and all documents get vector embeddings for semantic search.

Verification

Every document page includes a link to the original government source URL. Visit our Sources page for the complete list of data origins and ingestion statistics.

Transparency Report

Live statistics from our production database, updated every 5 minutes.

67,844
Total Documents
19,275
Fully Processed
136,008
OCR Pages
3,734
AI Summaries
8,402
Entities Extracted
42,270
Local Copies
February 4, 2026 at 3:55pm UTC
Last Data Ingestion
Detailed metrics

For a full breakdown of ingestion health, dataset coverage, and processing progress, visit the Stats and Sources pages.

DMCA / Takedown Notice

Content Removal Process

Epstein Suite indexes publicly released government documents. If you are a rights holder and believe that content hosted on this site infringes your copyright or should be removed for legal reasons, we have a structured process for handling takedown requests.

How to Submit a Takedown Request

  1. Identify the content. Provide the URL(s) of the specific page(s) or document(s) you want removed.
  2. State your claim. Explain the basis for removal (copyright ownership, privacy concern, legal order, etc.).
  3. Provide contact information. Include your full name, organization (if applicable), email address, and phone number.
  4. Send your request. Email your takedown notice to the address below. Include "DMCA Takedown" or "Content Removal" in the subject line.

Response Timeline

We acknowledge all takedown requests within 72 hours of receipt. Valid requests are processed and content is removed or restricted within 10 business days. We may contact you for additional information if the request is unclear or incomplete.

Good Faith Disclaimer

This project operates in good faith as a public transparency tool. All indexed content originates from official government releases. We do not make claims beyond what is supported by linked public records. Being named in these documents is not an indication of wrongdoing.

DMCA Contact

Send takedown notices to: admin@kevinchamplin.com

Please include "DMCA Takedown" in the subject line for priority handling.