Understanding and Mitigating Bias Inheritance in LLM-based Data Augmentation on Downstream Tasks
概要
arXiv:2502.04419v3 Announce Type: replace-cross Abstract: Generating synthetic datasets via large language models (LLMs) has emerged as a promising approach to improve LLM performance. However, LLMs inherently reflect biases in their training data, leading to a critical challenge: when models are t…