Data made by a program or model for training or testing AI.
Synthetic data is like a toy city for self-driving cars. The cars can practice without denting your mailbox.
People use it to train AI, test AI, and protect private data. But if the toy city is weird, the AI learns weird habits.
Data Augmentation
Synthetic data often adds more examples when real data is scarce.
Generative Model
A generative model can make synthetic data in large batches.
Data-privacy
Synthetic data can reduce exposure of sensitive real data.
AI-bias
If synthetic data is skewed, it can feed bias into the model.