A Cross-Chain Event-Driven Data Infrastructure for Aave Protocol Analytics and Applications

📅 2025-12-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing research is hindered by the absence of standardized, cross-chain, event-level datasets, impeding empirical analysis of decentralized lending protocols such as Aave V3. To address this, we construct the first Aave V3 event-level data infrastructure covering six EVM-compatible blockchains. Our pipeline systematically ingests and decodes eight core on-chain events, producing over 50 million structured records—each annotated with USD valuation, block timestamp, and chain identifier. Methodologically, we introduce cross-chain event alignment, full-chain synchronous decoding, and real-time foreign exchange rate mapping. We also design an open-source Python pipeline supporting dynamic batch processing, automatic sharding (≤1M rows/file), and multi-chain temporal normalization. The resulting dataset ensures temporal rigor, cryptographic verifiability, and full public availability. It enables, for the first time, reproducible research on liquid capital flow tracking, liquidation risk modeling, and cross-chain user behavior analysis—providing foundational data for empirical studies of interest rate mechanisms and systemic risk.

Technology Category

Application Category

📝 Abstract
Decentralized lending protocols, exemplified by Aave V3, have transformed financial intermediation by enabling permissionless, multi-chain borrowing and lending without intermediaries. Despite managing over $10 billion in total value locked, empirical research remains severely constrained by the lack of standardized, cross-chain event-level datasets. This paper introduces the first comprehensive, event-driven data infrastructure for Aave V3 spanning six major EVM-compatible chains (Ethereum, Arbitrum, Optimism, Polygon, Avalanche, and Base) from respective deployment blocks through October 2025. We collect and fully decode eight core event types -- Supply, Borrow, Withdraw, Repay, LiquidationCall, FlashLoan, ReserveDataUpdated, and MintedToTreasury -- producing over 50 million structured records enriched with block metadata and USD valuations. Using an open-source Python pipeline with dynamic batch sizing and automatic sharding (each file less than or equal to 1 million rows), we ensure strict chronological ordering and full reproducibility. The resulting publicly available dataset enables granular analysis of capital flows, interest rate dynamics, liquidation cascades, and cross-chain user behavior, providing a foundational resource for future studies on decentralized lending markets and systemic risk.
Problem

Research questions and friction points this paper is trying to address.

Standardized cross-chain event-level datasets for Aave V3 are lacking
The infrastructure collects and decodes eight core event types across six chains
It enables granular analysis of capital flows and systemic risk in lending
Innovation

Methods, ideas, or system contributions that make the work stand out.

Cross-chain event-driven data infrastructure for Aave V3
Open-source Python pipeline with dynamic batch sizing and sharding
Public dataset enabling granular analysis of decentralized lending
🔎 Similar Papers
No similar papers found.
Junyi Fan
Junyi Fan
University of Southern California
machine learning
L
Li Sun
Daniel J. Epstein Department of Industrial & Systems Engineering, University of Southern California, 3715 McClintock Ave GER 240, Los Angeles, 90007, California, United States.