The Hidden Influence of Data Contamination on Large Language Models
Data contamination in Large Language Models (LLMs) is a significant concern that can impact their performance on various tasks. It refers to the presence of test data from downstream tasks in the training data of LLMs. Addressing data contamination is …