Llm-based edge intelligence: A comprehensive survey on architectures, applications, security and trustworthiness

O Friha, MA Ferrag, B Kantarci… - IEEE Open Journal …, 2024 - ieeexplore.ieee.org
The integration of Large Language Models (LLMs) and Edge Intelligence (EI) introduces a
groundbreaking paradigm for intelligent edge devices. With their capacity for human-like …

The unlocking spell on base llms: Rethinking alignment via in-context learning

BY Lin, A Ravichander, X Lu, N Dziri… - The Twelfth …, 2023 - openreview.net
Alignment tuning has become the de facto standard practice for enabling base large
language models (LLMs) to serve as open-domain AI assistants. The alignment tuning …

Llm inference serving: Survey of recent advances and opportunities

B Li, Y Jiang, V Gadepally, D Tiwari - arXiv preprint arXiv:2407.12391, 2024 - arxiv.org
This survey offers a comprehensive overview of recent advancements in Large Language
Model (LLM) serving systems, focusing on research since the year 2023. We specifically …

Towards efficient generative large language model serving: A survey from algorithms to systems

X Miao, G Oliaro, Z Zhang, X Cheng, H Jin… - arXiv preprint arXiv …, 2023 - arxiv.org
In the rapidly evolving landscape of artificial intelligence (AI), generative large language
models (LLMs) stand at the forefront, revolutionizing how we interact with our data. However …

[PDF][PDF] Efficiently Programming Large Language Models using SGLang.

L Zheng, L Yin, Z Xie, J Huang, C Sun, CH Yu, S Cao… - 2023 - par.nsf.gov
Large language models (LLMs) are increasingly used for complex tasks that require multiple
generation calls, advanced prompting techniques, control flow, and structured …

[PDF][PDF] Sglang: Efficient execution of structured language model programs

L Zheng, L Yin, Z Xie, C Sun, J Huang… - arXiv preprint arXiv …, 2023 - minjiazhang.github.io
Large language models (LLMs) are increasingly used for complex tasks that require multiple
generation calls, advanced prompting techniques, control flow, and structured …

Rethinking software engineering in the era of foundation models: A curated catalogue of challenges in the development of trustworthy fmware

AE Hassan, D Lin, GK Rajbahadur, K Gallaba… - … Proceedings of the …, 2024 - dl.acm.org
Foundation models (FMs), such as Large Language Models (LLMs), have revolutionized
software development by enabling new use cases and business models. We refer to …

Configurable foundation models: Building llms from a modular perspective

C Xiao, Z Zhang, C Song, D Jiang, F Yao, X Han… - arXiv preprint arXiv …, 2024 - arxiv.org
Advancements in LLMs have recently unveiled challenges tied to computational efficiency
and continual scalability due to their requirements of huge parameters, making the …

Explore, select, derive, and recall: Augmenting llm with human-like memory for mobile task automation

S Lee, J Choi, J Lee, MH Wasi, H Choi, SY Ko… - arXiv preprint arXiv …, 2023 - arxiv.org
The advent of large language models (LLMs) has opened up new opportunities in the field
of mobile task automation. Their superior language understanding and reasoning …

Mooncake: A kvcache-centric disaggregated architecture for llm serving

R Qin, Z Li, W He, M Zhang, Y Wu, W Zheng… - arXiv preprint arXiv …, 2024 - arxiv.org
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI. It
features a KVCache-centric disaggregated architecture that separates the prefill and …