Benchmarking large language models on cmexam-a comprehensive chinese medical exam dataset

J Liu, P Zhou, Y Hua, D Chong, Z Tian… - Advances in …, 2024 - proceedings.neurips.cc
Recent advancements in large language models (LLMs) have transformed the field of
question answering (QA). However, evaluating LLMs in the medical field is challenging due …

How easy is it to fool your multimodal llms? an empirical analysis on deceptive prompts

Y Qian, H Zhang, Y Yang, Z Gan - arXiv preprint arXiv:2402.13220, 2024 - arxiv.org
The remarkable advancements in Multimodal Large Language Models (MLLMs) have not
rendered them immune to challenges, particularly in the context of handling deceptive …

Foundation models for recommender systems: A survey and new perspectives

C Huang, T Yu, K Xie, S Zhang, L Yao… - arXiv preprint arXiv …, 2024 - arxiv.org
Recently, Foundation Models (FMs), with their extensive knowledge bases and complex
architectures, have offered unique opportunities within the realm of recommender systems …

Mm-soc: Benchmarking multimodal large language models in social media platforms

Y Jin, M Choi, G Verma, J Wang, S Kumar - arXiv preprint arXiv …, 2024 - arxiv.org
Social media platforms are hubs for multimodal information exchange, encompassing text,
images, and videos, making it challenging for machines to comprehend the information or …

Inductive Graph Alignment Prompt: Bridging the Gap between Graph Pre-training and Inductive Fine-tuning From Spectral Perspective

Y Yan, P Zhang, Z Fang, Q Long - Proceedings of the ACM on Web …, 2024 - dl.acm.org
The" Graph pre-training and fine-tuning" paradigm has significantly improved Graph Neural
Networks (GNNs) by capturing general knowledge without manual annotations for …

Better to ask in English: Cross-lingual evaluation of large language models for healthcare queries

Y Jin, M Chandra, G Verma, Y Hu… - Proceedings of the …, 2024 - dl.acm.org
Large language models (LLMs) are transforming the ways the general public accesses and
consumes information. Their influence is particularly pronounced in pivotal sectors like …

Exploring boundary of gpt-4v on marine analysis: A preliminary case study

Z Zheng, Y Chen, J Zhang, TA Vu, H Zeng… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language models (LLMs) have demonstrated a powerful ability to answer various
queries as a general-purpose assistant. The continuous multi-modal large language models …

Collage prompting: Budget-friendly visual recognition with gpt-4v

S Xu, Y Wang, D Liu, C Xu - arXiv preprint arXiv:2403.11468, 2024 - arxiv.org
Recent advancements in generative AI have suggested that by taking visual prompt, GPT-4V
can demonstrate significant proficiency in image recognition task. Despite its impressive …

Is Contrastive Learning Necessary? A Study of Data Augmentation vs Contrastive Learning in Sequential Recommendation

P Zhou, YL Huang, Y Xie, J Gao, S Wang… - Proceedings of the …, 2024 - dl.acm.org
Sequential recommender systems (SRS) are designed to predict users' future behaviors
based on their historical interaction data. Recent research has increasingly utilized …

GPT4Rec: Graph Prompt Tuning for Streaming Recommendation

P Zhang, Y Yan, X Zhang, L Kang, C Li… - arXiv preprint arXiv …, 2024 - arxiv.org
In the realm of personalized recommender systems, the challenge of adapting to evolving
user preferences and the continuous influx of new users and items is paramount …