A Survey on Safe Multi-Modal Learning Systems
In the rapidly evolving landscape of artificial intelligence, multimodal learning systems
(MMLS) have gained traction for their ability to process and integrate information from …
(MMLS) have gained traction for their ability to process and integrate information from …
Parameter Efficient Fine-tuning of Self-supervised ViTs without Catastrophic Forgetting
Artificial neural networks often suffer from catastrophic forgetting where learning new
concepts leads to a complete loss of previously acquired knowledge. We observe that this …
concepts leads to a complete loss of previously acquired knowledge. We observe that this …
Decompose and Compare Consistency: Measuring VLMs' Answer Reliability via Task-Decomposition Consistency Comparison
Despite tremendous advancements, current state-of-the-art Vision-Language Models
(VLMs) are still far from perfect. They tend to hallucinate and may generate biased …
(VLMs) are still far from perfect. They tend to hallucinate and may generate biased …
Open-Vocabulary Calibration for Vision-Language Models
Vision-language models (VLMs) have emerged as formidable tools, showing their strong
capability in handling various open-vocabulary tasks in image recognition, text-driven visual …
capability in handling various open-vocabulary tasks in image recognition, text-driven visual …
DaWin: Training-free Dynamic Weight Interpolation for Robust Adaptation
Adapting a pre-trained foundation model on downstream tasks should ensure robustness
against distribution shifts without the need to retrain the whole model. Although existing …
against distribution shifts without the need to retrain the whole model. Although existing …
Improving Network Interpretability via Explanation Consistency Evaluation
H Wu, H Jiang, K Wang, Z Tang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
While deep neural networks have achieved remarkable performance, they tend to lack
transparency in prediction. The pursuit of greater interpretability in neural networks often …
transparency in prediction. The pursuit of greater interpretability in neural networks often …
MoTE: Reconciling Generalization with Specialization for Visual-Language to Video Knowledge Transfer
Transferring visual-language knowledge from large-scale foundation models for video
recognition has proved to be effective. To bridge the domain gap, additional parametric …
recognition has proved to be effective. To bridge the domain gap, additional parametric …
Calibrating Prompt from History for Continual Vision-Language Retrieval and Grounding
In the field of machine learning, continual learning is a crucial concept that allows models to
adapt to non-stationary data distributions. However, most of the existing works focus on uni …
adapt to non-stationary data distributions. However, most of the existing works focus on uni …
Open-Vocabulary Calibration for Fine-tuned CLIP
Vision-language models (VLMs) have emerged as formidable tools, showing their strong
capability in handling various open-vocabulary tasks in image recognition, text-driven visual …
capability in handling various open-vocabulary tasks in image recognition, text-driven visual …
Meta-learning algorithms and applications
O Bohdal - 2024 - era.ed.ac.uk
Meta-learning in the broader context concerns how an agent learns about their own
learning, allowing them to improve their learning process. Learning how to learn is not only …
learning, allowing them to improve their learning process. Learning how to learn is not only …