Investigating the quality of reverse geocoding services using text similarity techniques and logistic regression analysis

B Kilic, F Gülgen - Cartography and Geographic Information …, 2020 - Taylor & Francis
Location, usually defined by postal address information or geographic coordinate values, is
one of the leading themes in geography. Famous global mapping services such as ArcGIS …

An empirical study of Chinese name matching and applications

N Peng, M Yu, M Dredze - … of the 53rd Annual Meeting of the …, 2015 - aclanthology.org
Methods for name matching, an important component to support downstream tasks such as
entity linking and entity clustering, have focused on alphabetic languages, primarily English …

Combining Approximate String Matching Algorithms and Term Frequency In The Detection of Plagiarism

Z Balani, C Varol - International Journal of Computer Science and …, 2021 - researchgate.net
One of the key factors behind plagiarism is the availability of a large amount of data and
information on the internet that can be accessed rapidly. This increases the risk of academic …

Web-based Arabic/English duplicate record detection with nested blocking technique

A Higazy, T El Tobely, AH Yousef… - 2013 8th International …, 2013 - ieeexplore.ieee.org
Data accuracy and quality affect the success of any business intelligence and data mining
solutions. The first step to ensure the data accuracy is to make sure that each real world …

Detection of terrorism's apologies on Twitter using a new bi-lingual dataset

K Bedjou, F Azouaou - International Journal of Data Mining …, 2023 - inderscienceonline.com
A lot of terrorist apology content is being shared on social media without being detected.
Therefore, the automatic and immediate detection of these contents is essential for people's …

Cross-language personal name mapping

AH Yousef - arXiv preprint arXiv:1405.6293, 2014 - arxiv.org
Name matching between multiple natural languages is an important step in cross-enterprise
integration applications and data mining. It is difficult to decide whether or not two syntactic …

Cross language duplicate record detection in big data

AH Yousef - Big Data in Complex Systems: Challenges and …, 2015 - Springer
The importance of data accuracy and quality has increased with the explosion of data size.
This factor is crucial to ensure the success of any cross-enterprise integration applications …

A lemma based evaluator for Semitic language text summarization systems

T El-Shishtawy, F El-Ghannam - arXiv preprint arXiv:1403.5596, 2014 - arxiv.org
Matching texts in highly inflected languages such as Arabic by simple stemming strategy is
unlikely to perform well. In this paper, we present a strategy for automatic text matching …

An unsupervised entity resolution framework for English and Arabic datasets

A Ouhab, M Malki, D Berrabah… - International Journal of …, 2017 - igi-global.com
Entity resolution (ER) is an important step in data integration and in many data mining
projects; its goal is to identify records that refer to the same real-world entity. Most existing …

What's in a name? Implicit bias affects patient perception of surgeon skill

D Bhat, T Kollu, JA Ricci, A Patel - Plastic and reconstructive …, 2021 - journals.lww.com
Background: Implicit bias is the unconscious associations and beliefs held toward specific
demographic groups. Instagram is commonly used by plastic surgeons to market their …