How does promoting the minority fraction affect generalization? a theoretical study of one-hidden-layer neural network on group imbalance

H Li, S Zhang, Y Zhang, M Wang, S Liu… - IEEE Journal of …, 2024 - ieeexplore.ieee.org
Group imbalance has been a known problem in empirical risk minimization (ERM), where
the achieved high average accuracy is accompanied by low accuracy in a minority group …

Learning on Transformers is Provable Low-Rank and Sparse: A One-layer Analysis

H Li, M Wang, S Zhang, S Liu… - 2024 IEEE 13rd Sensor …, 2024 - ieeexplore.ieee.org
Efficient training and inference algorithms, such as low-rank adaption and model pruning,
have shown impressive performance for learning Transformer-based large foundation …