[HTML][HTML] Small data machine learning in materials science

P Xu, X Ji, M Li, W Lu - npj Computational Materials, 2023 - nature.com
This review discussed the dilemma of small data faced by materials machine learning. First,
we analyzed the limitations brought by small data. Then, the workflow of materials machine …

Using machine learning approaches for multi-omics data analysis: A review

PS Reel, S Reel, E Pearson, E Trucco… - Biotechnology advances, 2021 - Elsevier
With the development of modern high-throughput omic measurement platforms, it has
become essential for biomedical studies to undertake an integrative (combined) approach to …

Trocr: Transformer-based optical character recognition with pre-trained models

M Li, T Lv, J Chen, L Cui, Y Lu, D Florencio… - Proceedings of the …, 2023 - ojs.aaai.org
Text recognition is a long-standing research problem for document digitalization. Existing
approaches are usually built based on CNN for image understanding and RNN for char …

[HTML][HTML] Vlp: A survey on vision-language pre-training

FL Chen, DZ Zhang, ML Han, XY Chen, J Shi… - Machine Intelligence …, 2023 - Springer
In the past few years, the emergence of pre-training models has brought uni-modal fields
such as computer vision (CV) and natural language processing (NLP) to a new era …

What does it mean for a language model to preserve privacy?

H Brown, K Lee, F Mireshghallah, R Shokri… - Proceedings of the 2022 …, 2022 - dl.acm.org
Natural language reflects our private lives and identities, making its privacy concerns as
broad as those of real life. Language models lack the ability to understand the context and …

Sequence-to-sequence contrastive learning for text recognition

A Aberdam, R Litman, S Tsiper… - Proceedings of the …, 2021 - openaccess.thecvf.com
We propose a framework for sequence-to-sequence contrastive learning (SeqCLR) of visual
representations, which we apply to text recognition. To account for the sequence-to …

Artificial intelligence for breast cancer analysis: Trends & directions

SM Shah, RA Khan, S Arif, U Sajid - Computers in Biology and Medicine, 2022 - Elsevier
Breast cancer is one of the leading causes of death among women. Early detection of breast
cancer can significantly improve the lives of millions of women across the globe. Given …

SemEval-2020 Task 8: Memotion Analysis--The Visuo-Lingual Metaphor!

C Sharma, D Bhageria, W Scott, S Pykl, A Das… - arXiv preprint arXiv …, 2020 - arxiv.org
Information on social media comprises of various modalities such as textual, visual and
audio. NLP and Computer Vision communities often leverage only one prominent modality …

Holistic analysis of hallucination in gpt-4v (ision): Bias and interference challenges

C Cui, Y Zhou, X Yang, S Wu, L Zhang, J Zou… - arXiv preprint arXiv …, 2023 - arxiv.org
While GPT-4V (ision) impressively models both visual and textual information
simultaneously, it's hallucination behavior has not been systematically assessed. To bridge …

[HTML][HTML] Opportunities and challenges of text mining in materials research

O Kononova, T He, H Huo, A Trewartha, EA Olivetti… - Iscience, 2021 - cell.com
Research publications are the major repository of scientific knowledge. However, their
unstructured and highly heterogenous format creates a significant obstacle to large-scale …