Large language model supply chain: A research agenda

S Wang, Y Zhao, X Hou, H Wang - ACM Transactions on Software …, 2024 - dl.acm.org
The rapid advancement of large language models (LLMs) has revolutionized artificial
intelligence, introducing unprecedented capabilities in natural language processing and …

What do we know about Hugging Face? A systematic literature review and quantitative validation of qualitative claims

J Jones, W Jiang, N Synovic, G Thiruvathukal… - Proceedings of the 18th …, 2024 - dl.acm.org
Background: Software Package Registries (SPRs) are an integral part of the software supply
chain. These collaborative platforms unite contributors, users, and code for streamlined …

Signing in four public software package registries: Quantity, quality, and influencing factors

TR Schorlemmer, KG Kalu, L Chigges… - … IEEE Symposium on …, 2024 - ieeexplore.ieee.org
Many software applications incorporate open-source third-party packages distributed by
public package registries. Guaranteeing authorship along this supply chain is a challenge …

Challenges and practices of deep learning model reengineering: A case study on computer vision

W Jiang, V Banna, N Vivek, A Goel, N Synovic… - Empirical Software …, 2024 - Springer
Context Many engineering organizations are reimplementing and extending deep neural
networks from the research community. We describe this process as deep learning model …

Analysis of failures and risks in deep learning model converters: A case study in the onnx ecosystem

P Jajal, W Jiang, A Tewari, E Kocinare, J Woo… - arXiv preprint arXiv …, 2023 - arxiv.org
Software engineers develop, fine-tune, and deploy deep learning (DL) models using a
variety of development frameworks and runtime environments. DL model converters move …

Exploring naming conventions (and defects) of pre-trained deep learning models in hugging face and other model hubs

W Jiang, C Cheung, GK Thiruvathukal… - arXiv preprint arXiv …, 2023 - arxiv.org
As innovation in deep learning continues, many engineers want to adopt Pre-Trained deep
learning Models (PTMs) as components in computer systems. PTMs are part of a research-to …

Peatmoss: A dataset and initial analysis of pre-trained models in open-source software

W Jiang, J Yasmin, J Jones, N Synovic… - 2024 IEEE/ACM 21st …, 2024 - ieeexplore.ieee.org
The development and training of deep learning models have become increasingly costly
and complex. Consequently, software engineers are adopting pre-trained models (PTMs) for …

Ecosystem of Large Language Models for Code

Z Yang, J Shi, P Devanbu, D Lo - arXiv preprint arXiv:2405.16746, 2024 - arxiv.org
The availability of vast amounts of publicly accessible data of source code and the advances
in modern language models, coupled with increasing computational resources, have led to …

Deep learning model reuse in the huggingface community: Challenges, benefit and trends

M Taraghi, G Dorcelus, A Foundjem, F Tambon… - arXiv preprint arXiv …, 2024 - arxiv.org
The ubiquity of large-scale Pre-Trained Models (PTMs) is on the rise, sparking interest in
model hubs, and dedicated platforms for hosting PTMs. Despite this trend, a comprehensive …

Interoperability in deep learning: a user survey and failure analysis of ONNX model converters

P Jajal, W Jiang, A Tewari, E Kocinare, J Woo… - Proceedings of the 33rd …, 2024 - dl.acm.org
Software engineers develop, fine-tune, and deploy deep learning (DL) models using a
variety of development frameworks and runtime environments. DL model converters move …