作者
Souvika Sarkar, Mohammad Fakhruddin Babar, Md Mahadi Hassan, Monowar Hasan, Shubhra Kanti Karmaker Santu
发表日期
2024/5/7
图书
Proceedings of the 15th ACM/SPEC International Conference on Performance Engineering
页码范围
211-222
简介
Voice-controlled systems are becoming ubiquitous in many IoT-specific applications such as home/industrial automation, automotive infotainment, and healthcare. While cloud-based voice services (\eg Alexa, Siri) can leverage high-performance computing servers, some use cases (\eg robotics, automotive infotainment) may require to execute the natural language processing (NLP) tasks offline, often on resource-constrained embedded devices. Transformer-based language models such as BERT and its variants are primarily developed with compute-heavy servers in mind. Despite the great performance of BERT models across various NLP tasks, their large size and numerous parameters pose substantial obstacles to offline computation on embedded systems. Lighter replacement of such language models (\eg DistilBERT and TinyBERT) often sacrifice accuracy, particularly for complex NLP tasks. Until now, it is …
引用总数
学术搜索中的文章
S Sarkar, MF Babar, MM Hassan, M Hasan, SKK Santu - arXiv preprint arXiv:2304.11520, 2023
S Sarkar, MF Babar, MM Hassan, M Hasan… - Proceedings of the 15th ACM/SPEC International …, 2024