RemembERR

Leveraging Microprocessor Errata for Design Testing and Validation.

Our work aggregates and annotates every publicly available Intel Core and AMD microprocessor erratum between 2008 and 2022 — 2,563 raw entries in total. By resolving intra- and inter-document duplicates (2,057 Intel entries → 743 unique; 506 AMD entries → 385 unique), we distilled 1,128 unique errata in RemembERR. RemembERR turns scattered, free-form PDF listings into a cohesive, cross-vendor, machine-readable database.

The RemembERR database is available on github.

What are microprocessor errata?

After silicon ships, subtle design bugs emerge only under real-world conditions. Vendors publish these as errata, each describing:

Unfortunately, these documents lack structure, consistency, and cross-reference, making large-scale analysis and automated tooling difficult, especially before the LLM era.

How did we build RemembERR?

1. Collection and De-duplication

2. Unified Three-Tier Schema

We defined 60 abstract categories across Triggers, Contexts, and Effects, then mapped each erratum through an automated filter plus a four-eyes manual process to ensure >80 % initial agreement, resolving all mismatches iteratively.

3. Four-Eyes Classification

4. Public Artifacts

We open-source the complete RemembERR database, annotation tools, and Dockerized experiments on GitHub: https://github.com/comsec-group/rememberr

Key Insights & Gaps in Design Testing

By mining RemembERR’s annotated errata, we expose critical validation blind spots:

How can RemembERR be used?

Frequently Asked Questions

Why classify at three levels?

Concrete details ensure reproducibility; abstract categories enable pattern discovery; class-level groups support high-level validation strategy.

Is this just academic?

No — design teams can immediately integrate RemembERR into existing flows, generating targeted test cases that cover historical blind spots.

Acknowledgements

We would like to thank the anonymous reviewers for their valuable feedback. This work was supported by a Microsoft Swiss JRC grant, the Swiss State Secretariat for Education, Research and Innovation under contract number MB22.00057 (ERC-StG PROMISE), and the Swiss National Science Foundation under NCCR Automation, grant agreement 51NF40 180545.