MIREDO: MIP-Driven Resource-Efficient Dataflow Optimization for Computing-in-Memory Accelerator

📅 2025-10-30
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Dataflow optimization for Compute-in-Memory (CIM) architectures faces a dual challenge: an exponentially large design space coupled with stringent hardware constraints. Method: This paper proposes a systematic co-optimization framework based on Mixed-Integer Programming (MIP). It integrates hierarchical hardware abstraction and cycle-accurate latency modeling to jointly optimize DNN workload mapping, dataflow scheduling, and CIM unit-level constraints—overcoming the local optima limitation of conventional heuristic approaches. Contribution/Results: Its key innovation lies in explicitly encoding non-differentiable architectural constraints as MIP formulations and enabling cross-layer trade-offs among resources, bandwidth, and latency. Evaluated across diverse mainstream DNN models and CIM hardware configurations, the framework achieves an average 2.7× speedup (up to 3.2×), significantly narrowing the gap between theoretical peak efficiency and realized system efficiency. It establishes a verifiable, scalable, and automated optimization paradigm for practical CIM accelerator deployment.

Technology Category

Application Category

📝 Abstract
Computing-in-Memory (CIM) architectures have emerged as a promising solution for accelerating Deep Neural Networks (DNNs) by mitigating data movement bottlenecks. However, realizing the potential of CIM requires specialized dataflow optimizations, which are challenged by an expansive design space and strict architectural constraints. Existing optimization approaches often fail to fully exploit CIM accelerators, leading to noticeable gaps between theoretical and actual system-level efficiency. To address these limitations, we propose the MIREDO framework, which formulates dataflow optimization as a Mixed-Integer Programming (MIP) problem. MIREDO introduces a hierarchical hardware abstraction coupled with an analytical latency model designed to accurately reflect the complex data transfer behaviors within CIM systems. By jointly modeling workload characteristics, dataflow strategies, and CIM-specific constraints, MIREDO systematically navigates the vast design space to determine the optimal dataflow configurations. Evaluation results demonstrate that MIREDO significantly enhances performance, achieving up to $3.2 imes$ improvement across various DNN models and hardware setups.
Problem

Research questions and friction points this paper is trying to address.

Optimizing dataflow for computing-in-memory DNN accelerators
Addressing design space complexity and architectural constraints
Bridging theoretical and actual system-level efficiency gaps
Innovation

Methods, ideas, or system contributions that make the work stand out.

MIP formulation for dataflow optimization problem
Hierarchical abstraction with analytical latency model
Joint modeling of workloads and CIM constraints
🔎 Similar Papers
No similar papers found.
X
Xiaolin He
School of Computer Science and Engineering, Beihang University
C
Cenlin Duan
School of Integrated Circuit Science and Engineering, Beihang University
Y
Yingjie Qi
School of Computer Science and Engineering, Beihang University
X
Xiao Ma
School of Computer Science and Engineering, Beihang University
Jianlei Yang
Jianlei Yang
Beihang University
Deep LearningComputer ArchitectureNueromorphic ComputingSpitronicsEDA/VLSI