Fast parallel set similarity joins on many-core architectures

S Ribeiro-Junior, RD Quirino, LA Ribeiro… - Journal of Information …, 2017 - periodicos.ufmg.br
Set similarity join is a core operation for text data integration, cleaning, and mining. Previous
research work on improving the performance of set similarity joins mostly focused on …

LSH SimilarityJoin Pattern in FastFlow

N Tonci, S Rivault, M Bamha, S Robert, S Limet… - International Journal of …, 2024 - Springer
Similarity joins are recognized to be among the most used data processing and analysis
operations. We introduce a C++-based high-level parallel pattern implemented on top of …

Efficient filter-based algorithms for exact set similarity join on GPUs

RD Quirino, S Ribeiro-Junior, LA Ribeiro… - … Conference, ICEIS 2017 …, 2018 - Springer
Set similarity join is a core operation for text data integration, cleaning, and mining. Most
state-of-the-art solutions rely on inherently sequential, CPU-based algorithms. In this paper …

[PDF][PDF] Sstr: Set similarity join over stream data

L Pacıfico, LA Ribeiro - Proc. 22nd International Conference on …, 2020 - scitepress.org
In modern application scenarios, large volumes of data are continuously generated over
time at high speeds. Delivering timely analysis results from such massive stream of data …

Set Similarity Joins on Heterogeneous Clusters

LRM Silva, LA Ribeiro - Journal of Information and Data …, 2023 - journals-sol.sbc.org.br
Set similarity join (SSJ) is a fundamental operation widely used in many application
scenarios, including data discovery, cleaning, and integration. As this operation is …

[PDF][PDF] Algoritmos de junção por similaridade sobre fluxo de dados

LO Pacífico - 2020 - ww2.inf.ufg.br
Na atual era de Big Data, dados são gerados e coletados em grande velocidade, o que
impõe requisitos severos de desempenho e memória para processamento desses dados …

Streaming Set Similarity Joins

L Pacıfico, LA Ribeiro - … , ICEIS 2020, Virtual Event, May 5–7 …, 2021 - books.google.com
We consider the problem of efficiently answering set similarity joins over streams. This
problem is challenging both in terms of CPU cost, because similarity matching is …

Streaming Set Similarity Joins

L Pacífico, LA Ribeiro - International Conference on Enterprise Information …, 2020 - Springer
We consider the problem of efficiently answering set similarity joins over streams. This
problem is challenging both in terms of CPU cost, because similarity matching is …

Efficiently Computing Exact Set Similarity Joins

X Wang - 2018 - unsworks.unsw.edu.au
dc. description. abstract Set similarity join, which finds all the similar set pairs from two
collections of sets, is a fundamental problem with a wide range of applications including …

[引用][C] Bwjoin: A Blockwise GPU-based Algorithm for Set Similarity Joins

RD Quirino, AM Quirino, LA Ribeiro, WS Martins - Simpósio em Sistemas …, 2023 - SBC