Regression with linked datasets subject to linkage error

Z Wang, E Ben-David, G Diao… - Wiley Interdisciplinary …, 2022 - Wiley Online Library
Data are often collected from multiple heterogeneous sources and are combined
subsequently. In combining data, record linkage is an essential task for linking records in …

A Novel Methodology for Improving Applications of Modern Predictive Modeling Techniques to Linked Data Sets Subject to Mismatch Error

E Ben-David, BT West, M Slawski - 2023 Big Data Meets Survey …, 2023 - ieeexplore.ieee.org
In recent years, the rise of social media platforms such as Twitter/X has provided social
scientists with a wealth of user-content data. Combining social media and survey data has …

Modernizing person-level entity resolution with biometrically linked records

M Gross, M Mueller-Smith - 2020 - matthew-gross.github.io
We propose a novel approach to person-level record linkage in administrative data, a
procedure and setting that is increasingly at the frontier of economic research. We build a …

A general framework for regression with mismatched data based on mixture modelling

M Slawski, BT West, P Bukke, Z Wang… - Journal of the Royal …, 2024 - academic.oup.com
The advent of the information age has revolutionized data collection and has led to a rapid
expansion of available data sources. Methods of data integration are indispensable when a …

Estimation in exponential family regression based on linked data contaminated by mismatch error

Z Wang, E Ben-David, M Slawski - arXiv preprint arXiv:2010.00181, 2020 - arxiv.org
Identification of matching records in multiple files can be a challenging and error-prone task.
Linkage error can considerably affect subsequent statistical analysis based on the resulting …

Criminal Justice Administrative Records System (CJARS)

K Finlay, M Mueller-Smith - Ann Arbor: University of Michigan, Institute for …, 2021 - cjars.org
The Criminal Justice Administrative Records System (CJARS) is a nationally
integrated data repository designed to transform research and policymaking on the United …

Labor Income Inequality in Thailand: the Roles of Education, Occupation and Employment History

N Wasi, SW Paweenawat, CDN Ayudhya… - … Ungphakorn Institute for …, 2019 - pier.or.th
Thailand's income inequality has reportedly declined since the mid-1990s. This paper
examines possible mechanisms underlying the dynamic patterns of the country's labor …

Record linkage in statistical sampling: Past, present, and future

B Williams - Recent advances on sampling methods and …, 2022 - Springer
Record linkage is a useful tool to match records across datasets when the datasets lack a
unique identifier. In this chapter, we examine the past, current, and future uses of …

Machine Learning based linkage of company data for economic research: Application to the EBDC Business Panels

VFM Reich - 2024 - econstor.eu
This article presents a comprehensive approach to probabilistic linkage of German company
data using Machine Learning and Natural Language Processing techniques. Here, the long …

An Evaluation of Machine Learning Approaches to Integrate Historical Farm Data

PB Guerrón, M Hallo - Baltic Journal of Modern Computing, 2022 - researchgate.net
Large datasets in agriculture are increasingly available through yearly surveys. However,
very few longitudinal datasets providing insights for farmer's decision are available. The …