#Model Generalization
Latest news articles tagged with "Model Generalization". Follow the timeline of events, related topics, and entities.
Articles (1)
-
πΊπΈ Soft Contamination Means Benchmarks Test Shallow Generalization
[USA]
arXiv:2602.12413v1 Announce Type: cross Abstract: If LLM training data is polluted with benchmark test data, then benchmark performance gives biased estimates of out-of-distribution (OOD) generalizat...
Related: #AI Evaluation, #Data Contamination