[图书][B] Data profiling

Z Abedjan, L Golab, F Naumann, T Papenbrock - 2019 - Springer
Data profiling refers to the activity of collecting data about data,{ie}, metadata. Most IT
professionals and researchers who work with data have engaged in data profiling, at least …

Ontology-based entity matching in attributed graphs

H Ma, M Alipourlangouri, Y Wu, F Chiang… - Proceedings of the VLDB …, 2019 - dl.acm.org
Keys for graphs incorporate the topology and value constraints needed to uniquely identify
entities in a graph. They have been studied to support object identification, knowledge …

Fact checking in knowledge graphs with ontological subgraph patterns

P Lin, Q Song, Y Wu - Data Science and Engineering, 2018 - Springer
Given a knowledge graph and a fact (a triple statement), fact checking is to decide whether
the fact belongs to the missing part of the graph. Facts in real-world knowledge bases are …

Contextual data cleaning with ontology functional dependencies

Z Zheng, L Zheng, M Alipourlangouri… - ACM Journal of Data …, 2022 - dl.acm.org
Functional Dependencies define attribute relationships based on syntactic equality, and
when used in data cleaning, they erroneously label syntactically different but semantically …

CurrentClean: Spatio-temporal cleaning of stale data

M Milani, Z Zheng, F Chiang - 2019 IEEE 35th International …, 2019 - ieeexplore.ieee.org
Data currency is imperative towards achieving up-to-date and accurate data analysis. Data
is considered current if changes in real world entities are reflected in the database. When …

Privacy-aware data cleaning-as-a-service

Y Huang, M Milani, F Chiang - Information Systems, 2020 - Elsevier
Data cleaning is a pervasive problem for organizations as they try to reap value from their
data. Recent advances in networking and cloud computing technology have fueled a new …

RTClean: Context-aware Tabular Data Cleaning using Real-time OFDs

D Del Gaudio, T Schubert… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Nowadays, machine learning plays a key role in developing plenty of applications, eg, smart
homes, smart medical assistance, and autonomous driving. A major challenge of these …

Discovery of temporal graph functional dependencies

L Noronha, F Chiang - Proceedings of the 30th ACM International …, 2021 - dl.acm.org
Temporal Graph Functional Dependencies (TGFDs) are a class of data quality rules
imposing topological, attribute dependency constraints over a period of time. To make …

Test case selection and prioritization approach for automated regression testing using ontology and COSMIC measurement

Z Sakhrawi, T Labidi - Automated Software Engineering, 2024 - Springer
Regression testing is an important activity that aims to provide information about the quality
of the software product under test when changes occur. The two primary techniques for …

Mining Keys for Graphs

M Alipourlangouri, F Chiang - Data & Knowledge Engineering, 2024 - Elsevier
Keys for graphs are a class of data quality rules that use topological and value constraints to
uniquely identify entities in a data graph. They have been studied to support object …