📣 Send us your press release
Site updates every 15 minutes
Technology

AWS Clean Rooms adds synthetic data generation for ML model training

Amazon Web Services has launched a new feature for its AWS Clean Rooms service. This capability allows organizations to generate privacy-enhancing synthetic datasets for training machine learning (ML) models.

13 June 2026
AWS Clean Rooms adds synthetic data generation for ML model training

Amazon Web Services (AWS) has introduced a new capability within its AWS Clean Rooms service, enabling organizations and their partners to generate privacy-enhancing synthetic datasets. This feature is designed to facilitate the training of machine learning (ML) regression and classification models.

The development of ML models typically involves a tension between data utility and privacy protection. Accurate models require access to high-quality, granular data, but using individual-level data from multiple parties raises significant privacy concerns and compliance challenges.

The new functionality addresses this by allowing the creation of synthetic versions of sensitive datasets. These synthetic datasets preserve the statistical patterns of the original data without exposing original records. This opens new avenues for model training where privacy was previously a barrier.

Integrated into AWS Clean Rooms ML, the feature utilizes advanced ML techniques to produce synthetic data. It aims to enable insights into questions, such as identifying customer characteristics that indicate a high probability of conversion, without direct access to individual-level signals that might conflict with privacy policies or regulations.

Original source: aws.amazon.com