Llm-based edge intelligence: A comprehensive survey on architectures, applications, security and trustworthiness
The integration of Large Language Models (LLMs) and Edge Intelligence (EI) introduces a
groundbreaking paradigm for intelligent edge devices. With their capacity for human-like …
groundbreaking paradigm for intelligent edge devices. With their capacity for human-like …
The unlocking spell on base llms: Rethinking alignment via in-context learning
Alignment tuning has become the de facto standard practice for enabling base large
language models (LLMs) to serve as open-domain AI assistants. The alignment tuning …
language models (LLMs) to serve as open-domain AI assistants. The alignment tuning …
Llm inference serving: Survey of recent advances and opportunities
This survey offers a comprehensive overview of recent advancements in Large Language
Model (LLM) serving systems, focusing on research since the year 2023. We specifically …
Model (LLM) serving systems, focusing on research since the year 2023. We specifically …
Towards efficient generative large language model serving: A survey from algorithms to systems
In the rapidly evolving landscape of artificial intelligence (AI), generative large language
models (LLMs) stand at the forefront, revolutionizing how we interact with our data. However …
models (LLMs) stand at the forefront, revolutionizing how we interact with our data. However …
[PDF][PDF] Efficiently Programming Large Language Models using SGLang.
Large language models (LLMs) are increasingly used for complex tasks that require multiple
generation calls, advanced prompting techniques, control flow, and structured …
generation calls, advanced prompting techniques, control flow, and structured …
[PDF][PDF] Sglang: Efficient execution of structured language model programs
Large language models (LLMs) are increasingly used for complex tasks that require multiple
generation calls, advanced prompting techniques, control flow, and structured …
generation calls, advanced prompting techniques, control flow, and structured …
Rethinking software engineering in the era of foundation models: A curated catalogue of challenges in the development of trustworthy fmware
Foundation models (FMs), such as Large Language Models (LLMs), have revolutionized
software development by enabling new use cases and business models. We refer to …
software development by enabling new use cases and business models. We refer to …
Configurable foundation models: Building llms from a modular perspective
Advancements in LLMs have recently unveiled challenges tied to computational efficiency
and continual scalability due to their requirements of huge parameters, making the …
and continual scalability due to their requirements of huge parameters, making the …
Explore, select, derive, and recall: Augmenting llm with human-like memory for mobile task automation
The advent of large language models (LLMs) has opened up new opportunities in the field
of mobile task automation. Their superior language understanding and reasoning …
of mobile task automation. Their superior language understanding and reasoning …
Mooncake: A kvcache-centric disaggregated architecture for llm serving
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI. It
features a KVCache-centric disaggregated architecture that separates the prefill and …
features a KVCache-centric disaggregated architecture that separates the prefill and …