DxPTA: An Architecture Design Space Exploration with Optical Dataflow-guided Strategy for HW/SW Co-Design of Photonic Transformer Accelerators

📅 2026-06-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the limitations of existing photonic Transformer accelerator designs, which often neglect practical constraints such as area, power, energy, and latency while relying on inefficient manual parameter tuning that hinders scalability. To overcome these challenges, the authors propose DxPTA, a novel methodology that introduces coherent photonic dataflow-guided design space exploration for the first time. DxPTA enables automated and efficient architecture optimization by modeling photonic dataflows, analyzing architectural parameter sensitivity, and integrating a constraint-aware hardware-software co-search algorithm. Under stringent constraints of 50 mm² area, 5 W power, 50 mJ energy, and 10 ms latency, DxPTA achieves a 15.2× speedup in design space search and successfully generalizes across diverse Transformer models.
📝 Abstract
Transformer-based networks have emerged as prominent AI models with state-of-the-art performance, which potentially pave the way toward artificial general intelligence (AGI). However, their large sizes still hinder their efficient implementation, thus highlighting the need for alternate solutions to enable their energy-efficient acceleration. Recently, state-of-the-art works propose photonic transformer accelerators (PTAs) with significant speedup and energy efficiency improvements over the conventional electronic accelerators. However, their PTA architectures are developed without considering the application constraints (e.g., area, power, energy, and latency). Moreover, their manual design approach also requires huge design time to determine a suitable architecture for the targeted application, hence making this approach not scalable. To address these limitations, we propose DxPTA, a novel design space exploration methodology for enabling efficient hardware/software co-design of the appropriate PTA architecture that meets all constraints. It is achieved by (1) identifying the PTA architecture parameters based on the coherent optical dataflow; (2) analyzing the impact/significance of the parameters; and (3) leveraging this analysis for devising a constraint-aware architecture search algorithm. Experimental results show that, our DxPTA can find the appropriate PTA architectures for different transformer-based models (i.e., DeiT-T/S/B and BERT-B/L). It achieves up to 26mm^2 area, 4.8W power, 39mJ energy, and 6ms latency, for constraints of 50mm^2 area, 5W power, 50mJ energy, and 10ms latency; with 15.2x faster searching time than the exhaustive approach. These results demonstrate the potential of DxPTA methodology for enabling efficient PTA designs for diverse AGI-based applications.
Problem

Research questions and friction points this paper is trying to address.

Photonic Transformer Accelerators
Design Space Exploration
Hardware/Software Co-Design
Application Constraints
Architecture Optimization
Innovation

Methods, ideas, or system contributions that make the work stand out.

Photonic Transformer Accelerator
Design Space Exploration
Optical Dataflow
HW/SW Co-Design
Constraint-aware Architecture Search