🤖 AI Summary
To address low IPC, limited memory bandwidth, and excessive area overhead in high-end embedded applications—particularly automotive electronics—this work proposes CVA6S+, a high-performance open-source superscalar RISC-V core. Methodologically, it integrates an enhanced TAGE branch predictor, full register renaming, and multi-level operand forwarding, while tightly co-optimizing the cache subsystem with the OpenHW Core-V HPDCache for improved energy efficiency. Evaluation shows that CVA6S+ achieves a 43.5% IPC improvement over the scalar CVA6 baseline and a further 10.9% gain over the prior superscalar CVA6S; L1 DCache read/write bandwidth increases by 74.1%, with only a 9.30% area overhead. The design thus delivers a balanced trade-off across performance, power, and area (PPA), establishing a reusable, production-ready open-source foundation for automotive-grade RISC-V processor development.
📝 Abstract
Open-source RISC-V cores are increasingly adopted in high-end embedded domains such as automotive, where maximizing instructions per cycle (IPC) is becoming critical. Building on the industry-supported open-source CVA6 core and its superscalar variant, CVA6S, we introduce CVA6S+, an enhanced version incorporating improved branch prediction, register renaming and enhanced operand forwarding. These optimizations enable CVA6S+ to achieve a 43.5% performance improvement over the scalar configuration and 10.9% over CVA6S, with an area overhead of just 9.30% over the scalar core (CVA6). Furthermore, we integrate CVA6S+ with the OpenHW Core-V High-Performance L1 Dcache (HPDCache) and report a 74.1% bandwidth improvement over the legacy CVA6 cache subsystem.