Every deal published on Discoperi passes through a 9-stage AI and human verification pipeline. No deal is published without a minimum confidence score of 0.40 across six weighted dimensions. Here is exactly how it works.
Each stage adds structured data, removes noise, and increases the confidence that what we publish is accurate. Deals can fail at any stage and are discarded rather than published with uncertain data.
Continuous monitoring of 12 distinct source channels: SEC EDGAR (8-K filings, merger agreements), Bloomberg wire, Reuters wire, PE firm press release pages (top 200 firms), M&A trade publications (Mergermarket, PEI), regional financial press, company investor relations pages, HSR antitrust filings, UK CMA public registers, EU DG COMP decisions, and court records for contested transactions. Sources are polled every 15–30 minutes depending on channel latency.
Claude (Anthropic) extracts 20 structured fields from each raw signal using a purpose-built M&A extraction prompt: acquirer name, target name, deal type, enterprise value, equity value, debt component, announcement date, close date, acquirer jurisdiction, target jurisdiction, sector, sub-sector, deal rationale, advisor names (buy-side financial, sell-side financial, legal), regulatory status, and deal conditions. Each field is extracted with an individual confidence score (0–1). Fields with confidence below 0.30 are left blank rather than guessed.
The same deal will typically appear across 3–8 sources within hours of announcement. Our deduplication engine uses fuzzy string matching on acquirer + target + value combinations (threshold: 82% similarity) to group signals referring to the same transaction. The highest-quality source for each field is selected as the canonical value. Conflicting values across sources trigger a contradiction check in Stage 4.
Where multiple sources provide different values for the same field (e.g. $18.5bn vs $18.2bn for enterprise value), the contradiction engine applies a source hierarchy: SEC filings > official press releases > major wire services > trade press > regional press. Value conflicts within 5% are resolved by averaging or selecting the SEC-filed figure. Value conflicts exceeding 5% lower the field confidence score and are flagged for the analyst review queue in Stage 8.
Each deal record is scored across six dimensions, producing a composite quality score from 0 to 1. Records scoring below 0.40 are placed in a review queue and not published.
For deals that pass the quality threshold, Claude generates a structured article: transaction overview, deal structure analysis, strategic context, and regulatory path. The article is generated from the structured fields — not from raw source text — preventing hallucination. All factual claims in the article are traceable to the verified field values. Articles are clearly labelled "AI-generated analysis" on the deal page.
Verified deal records are written to a structured Google Sheets database with 39 fields across 5 tabs: deal overview (12 fields), financial details (8 fields), parties and advisors (9 fields), regulatory and legal (6 fields), and metadata (4 fields). This staging layer allows for human review, batch corrections, and manual enrichment before publication.
Deals scoring 0.40–0.74 are flagged for analyst review before publication. A human reviewer checks: value plausibility against sector benchmarks, entity name accuracy (common failure point for international entities), advisor attribution accuracy, and date consistency. Analysts can approve, reject, or modify any field. Deals scoring ≥0.75 are auto-approved. Approximately 35% of published deals are auto-approved; 65% pass through human review.
Approved deals are published to discoperi.com via the WordPress REST API. Deal alert subscribers with matching sector, size, and geography filters receive an email notification within 30 minutes of publication. The complete 39-field record is also made available for export in CSV and JSON formats for paid subscribers.
| Source | Type | Coverage | Update frequency |
|---|---|---|---|
| SEC EDGAR | Regulatory | All US public company filings (8-K, S-4, merger agreements) | Every 15 min |
| Bloomberg wire | Wire | Global M&A coverage, deal announcements, regulatory decisions | Every 20 min |
| Reuters wire | Wire | Global M&A, PE deals, restructurings | Every 20 min |
| PE firm press releases | Official | Top 200 PE firms — direct from source, no wire latency | Every 30 min |
| HSR antitrust filings | Regulatory | US transactions above $119.5m filing threshold | Daily |
| EU DG COMP decisions | Regulatory | European Commission merger control decisions | Daily |
| UK CMA register | Regulatory | UK merger reviews, phase 1 and phase 2 decisions | Daily |
| Mergermarket / PEI | Trade press | Specialist M&A and private equity trade coverage | Every 30 min |
| Company IR pages | Official | Investor relations pages for FTSE 100, S&P 500, and select mid-caps | Every 30 min |
| Regional financial press | Press | Major regional financial publications — 40+ outlets | Hourly |
Every published deal record contains up to 39 structured fields. Fields that cannot be verified above the confidence threshold are left blank rather than estimated.