Phi-4 LLM and why synthetic data is now preferred to web

Link. “each token generated by a language model is by definition predicted by the preceding tokens”