Exploiting GPUs for fast intersection of large sets

C Bellas, A Gounaris - Information Systems, 2022 - Elsevier
The main focus of this work is on large set intersection, which is a pivotal operation in
information retrieval, graph analytics and database systems. We aim to experimentally …

A graph pattern mining framework for large graphs on GPU

L Hu, Y Lin, L Zou, MT Özsu - The VLDB Journal, 2025 - Springer
Graph pattern mining (GPM) is an important problem in graph processing. There are many
parallel frameworks for GPM, many of which suffer from low performance. GPU is a powerful …

Metricjoin: Leveraging metric properties for robust exact set similarity joins

M Widmoser, D Kocher, N Augsten… - 2023 IEEE 39th …, 2023 - ieeexplore.ieee.org
Given two collections of sets, the set similarity join reports all pairs of sets that are within a
given distance threshold. State-of-the-art solutions employ an inverted list index and several …

An empirical evaluation of exact set similarity join techniques using gpus

C Bellas, A Gounaris - Information Systems, 2020 - Elsevier
Exact set similarity join is a notoriously expensive operation, for which several solutions
have been proposed. Recently, there have been studies that present a comparative analysis …

Set similarity joins with complex expressions on distributed platforms

DJ do Carmo Oliveira, FF Borges, LA Ribeiro… - Advances in Databases …, 2018 - Springer
A set similarity join finds all similar pairs from a collection of sets. This operation is essential
for many important tasks in Big Data analytics including string data integration and cleaning …

Exact set similarity joins for large datasets in the GPGPU paradigm

C Bellas, A Gounaris - Proceedings of the 15th International Workshop …, 2019 - dl.acm.org
We investigate the problem of exact set similarity joins using a co-process CPU-GPU
scheme. We focus on large instances of the problem, ie, using datasets of> 1M entries …

[PDF][PDF] Sstr: Set similarity join over stream data

L Pacıfico, LA Ribeiro - Proc. 22nd International Conference on …, 2020 - scitepress.org
In modern application scenarios, large volumes of data are continuously generated over
time at high speeds. Delivering timely analysis results from such massive stream of data …

Set Similarity Joins on Heterogeneous Clusters

LRM Silva, LA Ribeiro - Journal of Information and Data …, 2023 - journals-sol.sbc.org.br
Set similarity join (SSJ) is a fundamental operation widely used in many application
scenarios, including data discovery, cleaning, and integration. As this operation is …

[PDF][PDF] Advanced Joins on GPUs

C Bellas - 2022 - ikee.lib.auth.gr
In the last decade, it has become widely accepted that the performance of modern
processors is no longer limited by transistor density, but by power consumption. As a result …

[引用][C] A framework for set similarity join on multi-attribute data

LA Ribeiro, FF Borges, DJ do Carmo Oliveira - Anais do XXXV Simpósio …, 2020 - SBC