Regression with linked datasets subject to linkage error
Data are often collected from multiple heterogeneous sources and are combined
subsequently. In combing data, record linkage is an essential task for linking records in …
subsequently. In combing data, record linkage is an essential task for linking records in …
A Novel Methodology for Improving Applications of Modern Predictive Modeling Techniques to Linked Data Sets Subject to Mismatch Error
E Ben-David, BT West, M Slawski - 2023 Big Data Meets Survey …, 2023 - ieeexplore.ieee.org
In recent years, the rise of social media platforms such as Twitter/X has provided social
scientists with a wealth of user-content data. Combining social media and survey data has …
scientists with a wealth of user-content data. Combining social media and survey data has …
[PDF][PDF] Modernizing person-level entity resolution with biometrically linked records
M Gross, M Mueller-Smith - 2020 - matthew-gross.github.io
We propose a novel approach to person-level record linkage in administrative data, a
procedure and setting that is increasingly at the frontier of economic research. We build a …
procedure and setting that is increasingly at the frontier of economic research. We build a …
A general framework for regression with mismatched data based on mixture modelling
The advent of the information age has revolutionized data collection and has led to a rapid
expansion of available data sources. Methods of data integration are indispensable when a …
expansion of available data sources. Methods of data integration are indispensable when a …
Estimation in exponential family regression based on linked data contaminated by mismatch error
Z Wang, E Ben-David, M Slawski - arXiv preprint arXiv:2010.00181, 2020 - arxiv.org
Identification of matching records in multiple files can be a challenging and error-prone task.
Linkage error can considerably affect subsequent statistical analysis based on the resulting …
Linkage error can considerably affect subsequent statistical analysis based on the resulting …
[PDF][PDF] Criminal justice administrative records system (cjars)
K Finlay, M Mueller-Smith - Ann Arbor: University of Michigan, Institute for …, 2021 - cjars.org
Abstract The Criminal Justice Administrative Records System (CJARS) is a nationally
integrated data repository designed to transform research and policymaking on the United …
integrated data repository designed to transform research and policymaking on the United …
[PDF][PDF] Labor Income Inequality in Thailand: the Roles of Education, Occupation and Employment History
N Wasi, SW Paweenawat, CDN Ayudhya… - … Ungphakorn Institute for …, 2019 - pier.or.th
Thailand's income inequality has reportedly declined since the mid-1990s. This paper
examines possible mechanisms underlying the dynamic patterns of the country's labor …
examines possible mechanisms underlying the dynamic patterns of the country's labor …
Record linkage in statistical sampling: Past, present, and future
B Williams - Recent advances on sampling methods and …, 2022 - Springer
Record linkage is a useful tool to match records across datasets when the datasets lack a
unique identifier. In this chapter, we examine the past, current, and present uses of …
unique identifier. In this chapter, we examine the past, current, and present uses of …
Machine Learning based linkage of company data for economic research: Application to the EBDC Business Panels
VFM Reich - 2024 - econstor.eu
This article presents a comprehensive approach to probabilistic linkage of German company
data using Machine Learning and Natural Language Processing techniques. Here, the long …
data using Machine Learning and Natural Language Processing techniques. Here, the long …
[PDF][PDF] An Evaluation of Machine Learning Approaches to Integrate Historical Farm Data
PB Guerrón, M Hallo - Baltic Journal of Modern Computing, 2022 - researchgate.net
Large datasets in agriculture are increasingly available through yearly surveys. However,
very few longitudinal datasets providing insights for farmer's decision are available. The …
very few longitudinal datasets providing insights for farmer's decision are available. The …