A primary concerns is that AI models can collapse when they rely too much on synthetic data