Aligning cyber space with physical world: A comprehensive survey on embodied ai

Y Liu, W Chen, Y Bai, X Liang, G Li, W Gao… - arXiv preprint arXiv …, 2024 - arxiv.org
Embodied Artificial Intelligence (Embodied AI) is crucial for achieving Artificial General
Intelligence (AGI) and serves as a foundation for various applications that bridge cyberspace …

Knowledge graphs meet multi-modal learning: A comprehensive survey

Z Chen, Y Zhang, Y Fang, Y Geng, L Guo… - arXiv preprint arXiv …, 2024 - arxiv.org
Knowledge Graphs (KGs) play a pivotal role in advancing various AI applications, with the
semantic web community's exploration into multi-modal dimensions unlocking new avenues …

Eda: Explicit text-decoupling and dense alignment for 3d visual grounding

Y Wu, X Cheng, R Zhang, Z Cheng… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract 3D visual grounding aims to find the object within point clouds mentioned by free-
form natural language descriptions with rich semantic cues. However, existing methods …

Synthesizing event-centric knowledge graphs of daily activities using virtual space

S Egami, T Ugai, M Oono, K Kitamura, K Fukuda - IEEE Access, 2023 - ieeexplore.ieee.org
Artificial intelligence (AI) is expected to be embodied in software agents, robots, and cyber-
physical systems that can understand the various contextual information of daily life in the …

Transformer-based vision-language alignment for robot navigation and question answering

H Luo, Z Guo, Z Wu, F Teng, T Li - Information Fusion, 2024 - Elsevier
The task of robot navigation and question answering, which is also known as Embodied
Question Answering (EQA), places its emphasis on empowering agents to actively explore …

S-eqa: Tackling situational queries in embodied question answering

VS Dorbala, P Goyal, R Piramuthu, M Johnston… - arXiv preprint arXiv …, 2024 - arxiv.org
We present and tackle the problem of Embodied Question Answering (EQA) with Situational
Queries (S-EQA) in a household environment. Unlike prior EQA work tackling simple queries …

Embodied-RAG: General Non-parametric Embodied Memory for Retrieval and Generation

Q Xie, SY Min, T Zhang, K Xu, A Bajaj… - arXiv preprint arXiv …, 2024 - arxiv.org
There is no limit to how much a robot might explore and learn, but all of that knowledge
needs to be searchable and actionable. Within language research, retrieval augmented …

基于形态的具身智能研究: 历史回顾与前沿进展

刘华平, 郭迪, 孙富春, 张新钰 - 自动化学报, 2023 - aas.net.cn
具身智能强调智能受脑, 身体与环境协同影响, 更侧重关注智能体与环境的“交互”. 因此,
在具身智能的研究中, 智能体的物理形态与感知, 学习, 控制的关系起到至关重要的作用. 当前 …

Model Adaptation for Time Constrained Embodied Control

J Song, M Yoo, H Woo - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
When adopting a deep learning model for embodied agents it is required that the model
structure be optimized for specific tasks and operational conditions. Such optimization can …

Piecewise convolutional neural network relation extraction with self-attention mechanism

B Zhang, L Xu, KH Liu, R Yang, MZ Li, XY Guo - Pattern Recognition, 2025 - Elsevier
The task of relation extraction in natural language processing is to identify the relation
between two specified entities in a sentence. However, the existing model methods do not …