Cross-team Review 15-minute Story Screenshot-driven

Competitive Intelligence Agent

Sprint 1 Review — PM-Ready Decision Packages

Transforms scattered competitive signals into structured decision packages with source attribution, priority labels, and actionable recommendations.

Multi-source Signals Agent Orchestration Structured Decision Package Key Devs Community Watch Items
4h+
Weekly manual baseline
6+
Signal sources
90
Total Score
52s
Latency

Hanlin Liu  |  CMD SOX — Meeting Intelligence  |  2026-04-16

The Problem — Why This Agent Matters

Current Pain (Manual CI Research)
Multi-source search
Copy & consolidate
Manual compare
Late action
Core gap: PMs don't lack information — they lack decision-ready synthesis.
What Reviewers Care About
Traceability — Every claim links to a source
Recency — Signals ≤ 7 days old by default
Actionability — Priority labels + next steps
4h+
Weekly manual baseline
6+
Signal categories
3
Comparable variants
15min
Cross-team explainability

What We Built — How Users Interact

Interaction Flow
PM Input
Agent Processing
Structured Output
CapabilityInputOutput
Competitive SearchCompetitor + domainMulti-slice decision package
Benchmark EvalDataset manifestScorecard + comparison
Watch MonitorPriority configP0/P1/P2 action items
Output Preview
Key Developments output slice
Full output includes: executive summary, community voice, watch items, and PM recommendations.

How It Works — Architecture

End-to-End Pipeline

PM / Reviewer
Web / CLI
Agent Orchestration
Search + Synthesis
Benchmark Artifacts

Evaluation Pipeline

Dataset
Manifest
Scorecard
Markdown / CSV / JSON
Design principle: Keep search and evaluation logic decoupled — so each can evolve, scale, and be tested independently.

Search Output — Three Structured Slices

All three screenshots from a single search output — showing how the agent transforms scattered signals into a reviewable decision package.

1. Key Developments
Key Developments slice
Chronological facts with impact severity ratings (H/M) and source attribution.
2. Community Voice
Community Voice slice
KOL + social sentiment aggregation preserving source attribution (X / Reddit).
3. Watch Items
Watch Items slice
Prioritized actions (P0/P1/P2) with confidence labels (Fact / Rumor / Sentiment).

Together, these give PMs everything to brief leadership in ≤ 5 minutes.

Output Deep Dive — Why This Structure Works

Watch Items — Deep Dive
Watch Items deep dive
What Each Slice Answers
Key Developments → "What happened?" — Chronological facts with impact severity.
Community Voice → "What are people saying?" — KOL + social sentiment with attribution.
Watch Items → "What should I do?" — Prioritized actions with confidence tags.
Together, these three slices give a PM everything needed to brief leadership in ≤ 5 minutes — no further research required.

Results & Impact

90
Total Score
88%
Citation Survival
84%
Timeliness Hit
4.3/5
PM Usability
52s
Latency
VariantScore BandCitationTimelinessPosition
CI Agent90 Excellent88%84%Leading
Copilot CLI76 Strong72%68%Upper-middle
General Chat62 Mixed55%50%Middle
+13 pts vs prompt-only baseline

Evaluation Framework

Pipeline
Dataset
Manifest
Scorecard
Report
Objective dimensions (72%): Citation accuracy, timeliness, structure compliance
Subjective dimensions (28%): PM usability, actionability, readability
Performance Bands
BandScoreMeaning
Excellent≥ 82Ship-ready, minimal revision
Strong≥ 70Usable with minor edits
Mixed≥ 55Needs targeted improvements
Weak< 55Significant gaps

PM Expectations & Risks

Sprint 1 Expectations
Met — Structured multi-slice output with source attribution and priority labels.
Partially Met — Real-time monitoring (polling implemented, alerting deferred to Sprint 2).
Verdict: Usable tool exceeding baseline by +13 pts, meets "Excellent" band threshold.
Risk Register
P0 Source freshness decay

Search APIs may lag behind real-time events by 12–48h.

P1 LLM hallucination on niche competitors

Low-frequency entities trigger confabulation in synthesis step.

Mitigation

Citation-survival checks + confidence labels (Fact / Rumor / Sentiment) flag uncertain claims.

Roadmap

Sprint 2
Sprint 3
Sprint 4
Final
PhaseDeliverableSuccess Criteria
Sprint 2 Alert pipeline + dashboard MVP P0 alerts delivered within 1h of detection
Sprint 3 Multi-competitor comparison + trend analysis ≥ 3 competitors per query, trend accuracy ≥ 80%
Sprint 4 Self-improving eval loop + PM feedback integration Score ≥ 92, PM satisfaction ≥ 4.5/5
Final Production deployment + cross-team rollout 5+ PM teams onboarded, < 30s latency

Thank You

Q&A · Sprint 1 Review

Contact

Hanlin Liu

CMD SOX — Meeting Intelligence

Follow-up Materials

Scorecard report (Markdown)

Eval dataset + manifest