Distributed data mining: a survey
Most data mining approaches assume that the data can be provided from a single source. If
data was produced from many physically distributed locations like Wal-Mart, these methods …
data was produced from many physically distributed locations like Wal-Mart, these methods …
Data mining and life sciences applications on the grid
Data mining (DM) is increasingly used in the analysis of data generated in life sciences,
including biological data produced in several disciplines such as genomics and proteomics …
including biological data produced in several disciplines such as genomics and proteomics …
A parallel distributed weka framework for big data mining using spark
AK Koliopoulos, P Yiapanis, F Tekiner… - … congress on big …, 2015 - ieeexplore.ieee.org
Effective Big Data Mining requires scalable and efficient solutions that are also accessible to
users of all levels of expertise. Despite this, many current efforts to provide effective …
users of all levels of expertise. Despite this, many current efforts to provide effective …
[PDF][PDF] Dengue fever prediction: A data mining problem
Dengue infection is vital disease caused by dengue germ, which extent in body of human by
female mosquito [1]. With indications of headache, retro orbital pain, joint-pain, muscular …
female mosquito [1]. With indications of headache, retro orbital pain, joint-pain, muscular …
Toolkit-based high-performance data mining of large data on MapReduce clusters
D Wegener, M Mock, D Adranale… - 2009 IEEE International …, 2009 - ieeexplore.ieee.org
The enormous growth of data in a variety of applications has increased the need for high
performance data mining based on distributed environments. However, standard data …
performance data mining based on distributed environments. However, standard data …
Grid-enabling data mining applications with DataMiningGrid: An architectural perspective
V Stankovski, M Swain, V Kravtsov, T Niessen… - Future Generation …, 2008 - Elsevier
The DataMiningGrid system has been designed to meet the requirements of modern and
distributed data mining scenarios. Based on the Globus Toolkit and other open technology …
distributed data mining scenarios. Based on the Globus Toolkit and other open technology …
Performance improvement of data mining in Weka through GPU acceleration
TA Engel, AS Charão, M Kirsch-Pinheiro… - Procedia Computer …, 2014 - Elsevier
Data mining tools may be computationally demanding, so there is an increasing interest on
parallel computing strategies to improve their performance. The popularization of Graphics …
parallel computing strategies to improve their performance. The popularization of Graphics …
Design and implementation of a data mining grid-aware architecture
Current business processes often use data from several sources. Data is characterized to be
heterogeneous, incomplete and usually involves a huge amount of records. This implies that …
heterogeneous, incomplete and usually involves a huge amount of records. This implies that …
Performance improvement of data mining in Weka through multi-core and GPU acceleration: opportunities and pitfalls
TA Engel, AS Charão, M Kirsch-Pinheiro… - Journal of ambient …, 2015 - Springer
Data mining tools may be computationally demanding, which leads to an increasing interest
on parallel computing strategies in order to improve their performance. While multi-core …
on parallel computing strategies in order to improve their performance. While multi-core …
Webservices oriented data mining in knowledge architecture
Massive parallelism is required for an efficient solution to data mining tasks, considering the
proliferation of data and the need for high computational effort. The DisDaMin (Distributed …
proliferation of data and the need for high computational effort. The DisDaMin (Distributed …