The art and practice of data science pipelines: A comprehensive study of data science pipelines in theory, in-the-small, and in-the-large
Increasingly larger number of software systems today are including data science
components for descriptive, predictive, and prescriptive analytics. The collection of data …
components for descriptive, predictive, and prescriptive analytics. The collection of data …
A comprehensive study on deep learning bug characteristics
Deep learning has gained substantial popularity in recent years. Developers mainly rely on
libraries and tools to add deep learning capabilities to their software. What kinds of bugs are …
libraries and tools to add deep learning capabilities to their software. What kinds of bugs are …
The prevalence of code smells in machine learning projects
Artificial Intelligence (AI) and Machine Learning (ML) are pervasive in the current computer
science landscape. Yet, there still exists a lack of software engineering experience and best …
science landscape. Yet, there still exists a lack of software engineering experience and best …
A preliminary investigation of MLOps practices in GitHub
Background. The rapid and growing popularity of machine learning (ML) applications has
led to an increasing interest in MLOps, that is, the practice of continuous integration and …
led to an increasing interest in MLOps, that is, the practice of continuous integration and …
An empirical study for common language features used in python projects
As a dynamic programming language, Python is widely used in many fields. For developers,
various language features affect programming experience. For researchers, they affect the …
various language features affect programming experience. For researchers, they affect the …
A large-scale comparative analysis of coding standard conformance in open-source data science projects
Background: Meeting the growing industry demand for Data Science requires cross-
disciplinary teams that can translate machine learning research into production-ready code …
disciplinary teams that can translate machine learning research into production-ready code …
23 shades of self-admitted technical debt: An empirical study on machine learning software
In software development, the term “technical debt”(TD) is used to characterize short-term
solutions and workarounds implemented in source code which may incur a long-term cost …
solutions and workarounds implemented in source code which may incur a long-term cost …
[HTML][HTML] The yin yang of AI: Exploring how commercial and non-commercial orientations shape machine learning innovation
E Brea - Research Policy, 2024 - Elsevier
The scale of the potential implications of machine learning (ML) has prompted discussions
on the issues of corporate control and technological openness. However, how commercial …
on the issues of corporate control and technological openness. However, how commercial …
An exploratory study on the predominant programming paradigms in Python code
Python is a multi-paradigm programming language that fully supports object-oriented (OO)
programming. The language allows writing code in a non-procedural imperative manner …
programming. The language allows writing code in a non-procedural imperative manner …
Actor concurrency bugs: a comprehensive study on symptoms, root causes, API usages, and differences
M Bagherzadeh, N Fireman, A Shawesh… - Proceedings of the …, 2020 - dl.acm.org
Actor concurrency is becoming increasingly important in the development of real-world
software systems. Although actor concurrency may be less susceptible to some …
software systems. Although actor concurrency may be less susceptible to some …