🤖 AI Summary
The non-rivalrous nature of data creates a fundamental tension between shared value and competitive advantage: existing synthetic data methods produce symmetric utility, exacerbating value leakage. To address this, we propose the “Encapsulate–Forge” framework—the first to endow synthetic data with *exclusive utility*. It encapsulates knowledge from raw data into a designated “key” model and generates synthetic data that overfits *only* that model, thereby binding data value strictly to the authorized model. This mechanism breaks utility symmetry, enabling asymmetric value allocation while preserving strong privacy guarantees and robustness against misuse. Empirically, it achieves original-data-level performance using significantly smaller synthetic datasets. Our core contribution is transforming non-rivalrous data into a *model-level excludable*, rivalrous product—establishing a new paradigm for secure, controllable data collaboration.
📝 Abstract
The non-rival nature of data creates a dilemma for firms: sharing data unlocks value but risks eroding competitive advantage. Existing data synthesis methods often exacerbate this problem by creating data with symmetric utility, allowing any party to extract its value. This paper introduces the Encapsulation-Forging (EnFo) framework, a novel approach to generate rival synthetic data with asymmetric utility. EnFo operates in two stages: it first encapsulates predictive knowledge from the original data into a designated ``key''model, and then forges a synthetic dataset by optimizing the data to intentionally overfit this key model. This process transforms non-rival data into a rival product, ensuring its value is accessible only to the intended model, thereby preventing unauthorized use and preserving the data owner's competitive edge. Our framework demonstrates remarkable sample efficiency, matching the original data's performance with a fraction of its size, while providing robust privacy protection and resistance to misuse. EnFo offers a practical solution for firms to collaborate strategically without compromising their core analytical advantage.