[HTML][HTML] Exploring the frontiers of deep learning and natural language processing: A comprehensive overview of key challenges and emerging trends

W Khan, A Daud, K Khan, S Muhammad… - Natural Language …, 2023 - Elsevier
In the recent past, more than 5 years or so, DL especially the large language models (LLMs)
has generated extensive studies out of a distinctly average downturn field of knowledge …

Brain-conditional multimodal synthesis: A survey and taxonomy

W Mai, J Zhang, P Fang, Z Zhang - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
In the era of Artificial Intelligence Generated Content (AIGC), conditional multimodal
synthesis technologies (eg, text-to-image) are dynamically reshaping the natural content …

Minddiffuser: Controlled image reconstruction from human brain activity with semantic and structural diffusion

Y Lu, C Du, Q Zhou, D Wang, H He - Proceedings of the 31st ACM …, 2023 - dl.acm.org
Reconstructing visual stimuli from brain recordings has been a meaningful and challenging
task. Especially, the achievement of precise and controllable image reconstruction bears …

BrainCLIP: Bridging brain and visual-linguistic representation via CLIP for generic natural visual stimulus decoding

Y Liu, Y Ma, W Zhou, G Zhu, N Zheng - arXiv preprint arXiv:2302.12971, 2023 - arxiv.org
Due to the lack of paired samples and the low signal-to-noise ratio of functional MRI (fMRI)
signals, reconstructing perceived natural images or decoding their semantic contents from …

Visual decoding and reconstruction via eeg embeddings with guided diffusion

D Li, C Wei, S Li, J Zou, H Qin, Q Liu - arXiv preprint arXiv:2403.07721, 2024 - arxiv.org
How to decode human vision through neural signals has attracted a long-standing interest in
neuroscience and machine learning. Modern contrastive learning and generative models …

Decoding Natural Images from EEG for Object Recognition

Y Song, B Liu, X Li, N Shi, Y Wang, X Gao - arXiv preprint arXiv …, 2023 - arxiv.org
Electroencephalogram (EEG) is a brain signal known for its high time resolution and
moderate signal-to-noise ratio. Whether natural images can be decoded from EEG has been …

Fuser: An enhanced multimodal fusion framework with congruent reinforced perceptron for hateful memes detection

F Wu, B Gao, X Pan, L Li, Y Ma, S Liu, Z Liu - Information Processing & …, 2024 - Elsevier
As a multimodal form of hate speech on social media, hateful memes are more aggressive
and cryptic threats to the real life of humans. Automatic detection of hateful memes is crucial …

Emergence and reconfiguration of modular structure for artificial neural networks during continual familiarity detection

S Gu, MG Mattar, H Tang, G Pan - Science Advances, 2024 - science.org
Advances in artificial intelligence enable neural networks to learn a wide variety of tasks, yet
our understanding of the learning dynamics of these networks remains limited. Here, we …

fmri-pte: A large-scale fmri pretrained transformer encoder for multi-subject brain activity decoding

X Qian, Y Wang, J Huo, J Feng, Y Fu - arXiv preprint arXiv:2311.00342, 2023 - arxiv.org
The exploration of brain activity and its decoding from fMRI data has been a longstanding
pursuit, driven by its potential applications in brain-computer interfaces, medical diagnostics …

Mind Artist: Creating Artistic Snapshots with Human Thought

J Chen, Y Qi, Y Wang, G Pan - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Abstract We introduce Mind Artist (MindArt) a novel and efficient neural decoding
architecture to snap artistic photographs from our mind in a controllable manner. Recently …