How does promoting the minority fraction affect generalization? a theoretical study of one-hidden-layer neural network on group imbalance
Group imbalance has been a known problem in empirical risk minimization (ERM), where
the achieved high average accuracy is accompanied by low accuracy in a minority group …
the achieved high average accuracy is accompanied by low accuracy in a minority group …
Learning on Transformers is Provable Low-Rank and Sparse: A One-layer Analysis
Efficient training and inference algorithms, such as low-rank adaption and model pruning,
have shown impressive performance for learning Transformer-based large foundation …
have shown impressive performance for learning Transformer-based large foundation …