A metaverse: Taxonomy, components, applications, and open challenges
SM Park, YG Kim - IEEE access, 2022 - ieeexplore.ieee.org
Unlike previous studies on the Metaverse based on Second Life, the current Metaverse is
based on the social value of Generation Z that online and offline selves are not different …
based on the social value of Generation Z that online and offline selves are not different …
Latte: Language trajectory transformer
Natural language is one of the most intuitive ways to express human intent. However,
translating instructions and commands towards robotic motion generation and deployment …
translating instructions and commands towards robotic motion generation and deployment …
[HTML][HTML] Semantic scene understanding with large language models on unmanned aerial vehicles
Unmanned Aerial Vehicles (UAVs) are able to provide instantaneous visual cues and a high-
level data throughput that could be further leveraged to address complex tasks, such as …
level data throughput that could be further leveraged to address complex tasks, such as …
Reshaping robot trajectories using natural language commands: A study of multi-modal data alignment using transformers
Natural language is the most intuitive medium for us to interact with other people when
expressing commands and instructions. However, using language is seldom an easy task …
expressing commands and instructions. However, using language is seldom an easy task …
Gait: Generating aesthetic indoor tours with deep reinforcement learning
Placing and orienting a camera to compose aesthetically meaningful shots of a scene is not
only a key objective in real-world photography and cinematography but also for virtual …
only a key objective in real-world photography and cinematography but also for virtual …
DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance
Choreographers determine what the dances look like while cameramen determine the final
presentation of dances. Recently various methods and datasets have showcased the …
presentation of dances. Recently various methods and datasets have showcased the …
Camera keyframing with style and control
We present a novel technique that enables 3D artists to synthesize camera motions in virtual
environments following a camera style, while enforcing user-designed camera keyframes as …
environments following a camera style, while enforcing user-designed camera keyframes as …
Socratic video understanding on unmanned aerial vehicles
In this work, we propose a system for video understanding through zero-shot reading
comprehension using Socratic Models. Specifically, we create a language-based world …
comprehension using Socratic Models. Specifically, we create a language-based world …
CineMPC: A Fully Autonomous Drone Cinematography System Incorporating Zoom, Focus, Pose, and Scene Composition
P Pueyo, J Dendarieta, E Montijano… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
We present CineMPC, a complete cinematographic system that autonomously controls a
drone to film multiple targets recording user-specified aesthetic objectives. Existing solutions …
drone to film multiple targets recording user-specified aesthetic objectives. Existing solutions …
Onboard view planning of a flying camera for high fidelity 3D reconstruction of a moving actor
Capturing and reconstructing a human actor's motion is important for filmmaking and
gaming. Currently, motion capture systems with static cameras are used for pixel-level high …
gaming. Currently, motion capture systems with static cameras are used for pixel-level high …