Trạng thái: QUALITY_GATE_PARTIAL | DATA_HEALTH: PARTIAL
Manifest: 7 ngày, social-first, degrade graceful. Source manifest: coding agent, agentic programming, SWE-bench, Terminal-Bench, Claude Code, OpenAI Codex, Cursor agent, OpenCode.
KPI Dashboard
Candidates
155
HN
42
GitHub
80
Papers
25
Product
8
KOL/OG Feed Watch
HN/GitHub feed có direct links; X/YT/Reddit/Facebook: N/A do collector công khai bị hạn chế trong runtime. Blocker làm DATA_HEALTH=PARTIAL, không PASS.
Trend Radar
- Hot now: harness reliability, benchmark hóa coding-agent; 42 HN items.
- Emerging: multi-agent validation loop OSS; 80 GitHub repos.
- Noise: repo mới sao thấp; nhiều repo <5 sao.
- Watchlist: Terminal-Bench/SWE-bench verified pipelines; 25 paper candidates.
Impact Coverage
- FARE: Adopt harness scorecard 2 tuần.
- NEXA: Trial auto-eval loop 1-2 tháng.
- SYNCA: Monitor multi-agent orchestration 3-6 tháng.
- Thị trường VN/Global: AI SDLC governance cần checklist 5 bước.
CTO Recommendations (4)
- Lập harness nội bộ cho coding-agent — ROI 18-25%; Risk 2/5; Owner: Eng Manager; TTV: 2 tuần; Validate: pass-rate benchmark nội bộ.
- Pilot 2 IDE agents (Cursor/Codex) — ROI 12-20%; Risk 3/5; Owner: Tech Lead; TTV: 3 tuần; Validate: lead-time PR.
- Thiết lập policy review AI code — ROI 8-15%; Risk 2/5; Owner: QA Lead; TTV: 2 tuần; Validate: defect leakage.
- Radar weekly 100+ signals — ROI 10%; Risk 1/5; Owner: CTO Office; TTV: 1 tuần; Validate: decision evidence-link count.
Source Appendix
Full dataset local: /Users/macbokk/Workspace/tmp/daily_llm_candidates.json.