Challenges and Opportunities in Improving Worst-Group Generalization in Presence of Spurious Features

Joshi, Siddharth; Yang, Yu; Xue, Yihao; Yang, Wenhan; Mirzasoleiman, Baharan

Computer Science > Machine Learning

arXiv:2306.11957 (cs)

[Submitted on 21 Jun 2023 (v1), last revised 16 Apr 2025 (this version, v5)]

Title:Challenges and Opportunities in Improving Worst-Group Generalization in Presence of Spurious Features

Authors:Siddharth Joshi, Yu Yang, Yihao Xue, Wenhan Yang, Baharan Mirzasoleiman

View PDF HTML (experimental)

Abstract:Deep neural networks often exploit *spurious* features that are present in the majority of examples within a class during training. This leads to *poor worst-group test accuracy*, i.e., poor accuracy for minority groups that lack these spurious features. Despite the growing body of recent efforts to address spurious correlations (SC), several challenging settings remain this http URL this work, we propose studying methods to mitigate SC in settings with: 1) spurious features that are learned more slowly, 2) a larger number of classes, and 3) a larger number of groups. We introduce two new datasets, Animals and SUN, to facilitate this study and conduct a systematic benchmarking of 8 state-of-the-art (SOTA) methods across a total of 5 vision datasets, training over 5,000 models. Through this, we highlight how existing group inference methods struggle in the presence of spurious features that are learned later in training. Additionally, we demonstrate how all existing methods struggle in settings with more groups and/or classes. Finally, we show the importance of careful model selection (hyperparameter tuning) in extracting optimal performance, especially in the more challenging settings we introduced, and propose more cost-efficient strategies for model selection. Overall, through extensive and systematic experiments, this work uncovers a suite of new challenges and opportunities for improving worst-group generalization in the presence of spurious features. Our datasets, methods and scripts available at this https URL.

Comments:	Package: this https URL * - These authors contributed equally
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2306.11957 [cs.LG]
	(or arXiv:2306.11957v5 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.11957

Submission history

From: Siddharth Joshi [view email]
[v1] Wed, 21 Jun 2023 00:59:06 UTC (9,077 KB)
[v2] Fri, 29 Sep 2023 06:09:08 UTC (9,077 KB)
[v3] Thu, 3 Oct 2024 04:09:42 UTC (7,049 KB)
[v4] Sun, 13 Oct 2024 14:32:30 UTC (7,049 KB)
[v5] Wed, 16 Apr 2025 22:23:48 UTC (15,382 KB)

Computer Science > Machine Learning

Title:Challenges and Opportunities in Improving Worst-Group Generalization in Presence of Spurious Features

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Challenges and Opportunities in Improving Worst-Group Generalization in Presence of Spurious Features

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators