Structured Pruning for Efficient Transformer Model compression

E Yoo, Y Lee - Transactions on Semiconductor Engineering, 2023 - koreascience.kr
With the recent development of Generative AI technology by IT giants, the size of the
transformer model is increasing exponentially over trillion won. In order to continuously …