A metaverse: Taxonomy, components, applications, and open challenges

SM Park, YG Kim - IEEE access, 2022 - ieeexplore.ieee.org
Unlike previous studies on the Metaverse based on Second Life, the current Metaverse is
based on the social value of Generation Z that online and offline selves are not different …

LATTE: Language trajectory transformer

A Bucker, L Figueredo, S Haddadin… - … on Robotics and …, 2023 - ieeexplore.ieee.org
Natural language is one of the most intuitive ways to express human intent. However,
translating instructions and commands towards robotic motion generation and deployment …

Semantic scene understanding with large language models on unmanned aerial vehicles

J De Curtò, I De Zarza, CT Calafate - Drones, 2023 - mdpi.com
Unmanned Aerial Vehicles (UAVs) are able to provide instantaneous visual cues and a
high-level data throughput that could be further leveraged to address complex tasks, such as …

Reshaping robot trajectories using natural language commands: A study of multi-modal data alignment using transformers

A Bucker, L Figueredo, S Haddadin… - 2022 IEEE/RSJ …, 2022 - ieeexplore.ieee.org
Natural language is the most intuitive medium for us to interact with other people when
expressing commands and instructions. However, using language is seldom an easy task …

GAIT: Generating aesthetic indoor tours with deep reinforcement learning

D Xie, P Hu, X Sun, S Pirk, J Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Placing and orienting a camera to compose aesthetically meaningful shots of a scene is not
only a key objective in real-world photography and cinematography but also for virtual …

DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance

Z Wang, J Jia, S Sun, H Wu, R Han… - Proceedings of the …, 2024 - openaccess.thecvf.com
Choreographers determine what the dances look like while cameramen determine the final
presentation of dances. Recently various methods and datasets have showcased the …

Camera keyframing with style and control

H Jiang, M Christie, X Wang, L Liu, B Wang… - ACM Transactions on …, 2021 - dl.acm.org
We present a novel technique that enables 3D artists to synthesize camera motions in virtual
environments following a camera style, while enforcing user-designed camera keyframes as …

Socratic video understanding on unmanned aerial vehicles

I de Zarzà, J de Curtò, CT Calafate - Procedia Computer Science, 2023 - Elsevier
In this work, we propose a system for video understanding through zero-shot reading
comprehension using Socratic Models. Specifically, we create a language-based world …

CineMPC: A Fully Autonomous Drone Cinematography System Incorporating Zoom, Focus, Pose, and Scene Composition

P Pueyo, J Dendarieta, E Montijano… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
We present CineMPC, a complete cinematographic system that autonomously controls a
drone to film multiple targets recording user-specified aesthetic objectives. Existing solutions …

Onboard view planning of a flying camera for high fidelity 3D reconstruction of a moving actor

Q Jiang, V Isler - arXiv preprint arXiv:2308.00134, 2023 - arxiv.org
Capturing and reconstructing a human actor's motion is important for filmmaking and
gaming. Currently, motion capture systems with static cameras are used for pixel-level high …