Knowledge graphs meet multi-modal learning: A comprehensive survey
Knowledge Graphs (KGs) play a pivotal role in advancing various AI applications, with the
semantic web community's exploration into multi-modal dimensions unlocking new avenues …
semantic web community's exploration into multi-modal dimensions unlocking new avenues …
Exploring the potential of large language models (llms) in learning on graphs
Learning on Graphs has attracted immense attention due to its wide real-world applications.
The most popular pipeline for learning on graphs with textual node attributes primarily relies …
The most popular pipeline for learning on graphs with textual node attributes primarily relies …
Language models are free boosters for biomedical imaging tasks
In this study, we uncover the unexpected efficacy of residual-based large language models
(LLMs) as part of encoders for biomedical imaging tasks, a domain traditionally devoid of …
(LLMs) as part of encoders for biomedical imaging tasks, a domain traditionally devoid of …
Situational Awareness Matters in 3D Vision Language Reasoning
Being able to carry out complicated vision language reasoning tasks in 3D space represents
a significant milestone in developing household robots and human-centered embodied AI …
a significant milestone in developing household robots and human-centered embodied AI …
Residual-based Language Models are Free Boosters for Biomedical Imaging Tasks
In this study we uncover the unexpected efficacy of residual-based large language models
(LLMs) as part of encoders for biomedical imaging tasks a domain traditionally devoid of …
(LLMs) as part of encoders for biomedical imaging tasks a domain traditionally devoid of …
Multivariate Time-Series Anomaly Detection based on Enhancing Graph Attention Networks with Topological Analysis
Unsupervised anomaly detection in time series is essential in industrial applications, as it
significantly reduces the need for manual intervention. Multivariate time series pose a …
significantly reduces the need for manual intervention. Multivariate time series pose a …
Lexicon3d: Probing visual foundation models for complex 3d scene understanding
Complex 3D scene understanding has gained increasing attention, with scene encoding
strategies playing a crucial role in this success. However, the optimal scene encoding …
strategies playing a crucial role in this success. However, the optimal scene encoding …
Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
In this paper, we investigate the feasibility of leveraging large language models (LLMs) for
integrating general knowledge and incorporating pseudo-events as priors for temporal …
integrating general knowledge and incorporating pseudo-events as priors for temporal …
Llm4gen: Leveraging semantic representation of llms for text-to-image generation
Diffusion Models have exhibited substantial success in text-to-image generation. However,
they often encounter challenges when dealing with complex and dense prompts that involve …
they often encounter challenges when dealing with complex and dense prompts that involve …
Adapting LLaMA Decoder to Vision Transformer
This work examines whether decoder-only Transformers such as LLaMA, which were
originally designed for large language models (LLMs), can be adapted to the computer …
originally designed for large language models (LLMs), can be adapted to the computer …