Reusing deep learning models: Challenges and directions in software engineering

JC Davis, P Jajal, W Jiang… - 2023 IEEE John …, 2023 - ieeexplore.ieee.org
Deep neural networks (DNNs) achieve state-of-the-art performance in many areas, including
computer vision, system configuration, and question-answering. However, DNNs are …

An empirical study of pre-trained model reuse in the hugging face deep learning model registry

W Jiang, N Synovic, M Hyatt… - 2023 IEEE/ACM 45th …, 2023 - ieeexplore.ieee.org
Deep Neural Networks (DNNs) are being adopted as components in software systems.
Creating and specializing DNNs from scratch has grown increasingly difficult as state-of-the …

Boms away! inside the minds of stakeholders: A comprehensive study of bills of materials for software systems

T Stalnaker, N Wintersgill, O Chaparro… - Proceedings of the 46th …, 2024 - dl.acm.org
Software Bills of Materials (SBOMs) have emerged as tools to facilitate the management of
software dependencies, vulnerabilities, licenses, and the supply chain. While significant …

Large language model supply chain: A research agenda

S Wang, Y Zhao, X Hou, H Wang - arXiv preprint arXiv:2404.12736, 2024 - arxiv.org
The rapid advancements in pre-trained Large Language Models (LLMs) and Large
Multimodal Models (LMMs) have ushered in a new era of intelligent applications …

Challenges and practices of deep learning model reengineering: A case study on computer vision

W Jiang, V Banna, N Vivek, A Goel, N Synovic… - arXiv preprint arXiv …, 2023 - arxiv.org
Many engineering organizations are reimplementing and extending deep neural networks
from the research community. We describe this process as deep learning model …

PTMTorrent: a dataset for mining open-source pre-trained model packages

W Jiang, N Synovic, P Jajal… - 2023 IEEE/ACM 20th …, 2023 - ieeexplore.ieee.org
Due to the cost of developing and training deep learning models from scratch, machine
learning engineers have begun to reuse pre-trained models (PTMs) and fine-tune them for …

Signing in four public software package registries: Quantity, quality, and influencing factors

TR Schorlemmer, KG Kalu, L Chigges, KM Ko… - arXiv preprint arXiv …, 2024 - arxiv.org
Many software applications incorporate open-source third-party packages distributed by
third-party package registries. Guaranteeing authorship along this supply chain is a …

Exploring naming conventions (and defects) of pre-trained deep learning models in hugging face and other model hubs

W Jiang, C Cheung, GK Thiruvathukal… - arXiv preprint arXiv …, 2023 - arxiv.org
As innovation in deep learning continues, many engineers want to adopt Pre-Trained deep
learning Models (PTMs) as components in computer systems. PTMs are part of a research-to …

Analysis of failures and risks in deep learning model converters: A case study in the onnx ecosystem

P Jajal, W Jiang, A Tewari, J Woo… - arXiv preprint arXiv …, 2023 - arxiv.org
Many software engineers develop, fine-tune, and deploy deep learning (DL) models. They
use DL models in a variety of development frameworks and deploy to a range of runtime …

Peatmoss: A dataset and initial analysis of pre-trained models in open-source software

W Jiang, J Yasmin, J Jones, N Synovic… - 2024 IEEE/ACM 21st …, 2024 - ieeexplore.ieee.org
The development and training of deep learning models have become increasingly costly
and complex. Consequently, software engineers are adopting pre-trained models (PTMs) for …