Beyond Synthetic Benchmarks: Reproducible Evaluation of Engineering-Intelligence Systems for Semiconductor Design
A reproducible, anti-circular, provenance-aware framework for evaluating engineering-intelligence systems, applied to real open-source RTL with maintainer-derived ground truth. Introduces XRecall, a provenance-anchored cross-project engineering-memory architecture, and reports a candid account of findings — including a documented synthetic-to-real generalization gap — making no superiority claim over existing tools.