Fast parallel set similarity joins on many-core architectures
S Ribeiro-Junior, RD Quirino, LA Ribeiro… - Journal of Information …, 2017 - periodicos.ufmg.br
Set similarity join is a core operation for text data integration, cleaning, and mining. Previous
research work on improving the performance of set similarity joins mostly focused on …
LSH SimilarityJoin Pattern in FastFlow
Similarity joins are recognized to be among the most used data processing and analysis
operations. We introduce a C++-based high-level parallel pattern implemented on top of …
Efficient filter-based algorithms for exact set similarity join on GPUs
RD Quirino, S Ribeiro-Junior, LA Ribeiro… - … Conference, ICEIS 2017 …, 2018 - Springer
Set similarity join is a core operation for text data integration, cleaning, and mining. Most
state-of-the-art solutions rely on inherently sequential, CPU-based algorithms. In this paper …
[PDF] Sstr: Set similarity join over stream data
L Pacífico, LA Ribeiro - Proc. 22nd International Conference on …, 2020 - scitepress.org
In modern application scenarios, large volumes of data are continuously generated over
time at high speeds. Delivering timely analysis results from such massive stream of data …
Set Similarity Joins on Heterogeneous Clusters
LRM Silva, LA Ribeiro - Journal of Information and Data …, 2023 - journals-sol.sbc.org.br
Set similarity join (SSJ) is a fundamental operation widely used in many application
scenarios, including data discovery, cleaning, and integration. As this operation is …
[PDF] Algoritmos de junção por similaridade sobre fluxo de dados
LO Pacífico - 2020 - ww2.inf.ufg.br
In the current era of Big Data, data is generated and collected at high speed, which
imposes severe performance and memory requirements for processing this data …
Streaming Set Similarity Joins
L Pacífico, LA Ribeiro - … , ICEIS 2020, Virtual Event, May 5–7 …, 2021 - books.google.com
We consider the problem of efficiently answering set similarity joins over streams. This
problem is challenging both in terms of CPU cost, because similarity matching is …
Streaming Set Similarity Joins
L Pacífico, LA Ribeiro - International Conference on Enterprise Information …, 2020 - Springer
We consider the problem of efficiently answering set similarity joins over streams. This
problem is challenging both in terms of CPU cost, because similarity matching is …
Efficiently Computing Exact Set Similarity Joins
X Wang - 2018 - unsworks.unsw.edu.au
Set similarity join, which finds all the similar set pairs from two
collections of sets, is a fundamental problem with a wide range of applications including …
[CITATION] Bwjoin: A Blockwise GPU-based Algorithm for Set Similarity Joins
RD Quirino, AM Quirino, LA Ribeiro, WS Martins - Simpósio em Sistemas …, 2023 - SBC