A comprehensive survey on cloud data mining (CDM) frameworks and algorithms

HB Barua, KC Mondal - ACM Computing Surveys (CSUR), 2019 - dl.acm.org
Data mining is used for finding meaningful information out of a vast expanse of data. With
the advent of Big Data concept, data mining has come to much more prominence …

Research on parallelization of Apriori algorithm in association rule mining

HB Wang, YJ Gao - Procedia Computer Science, 2021 - Elsevier
Aiming at the performance bottleneck of traditional Apriori algorithm when the data set is
slightly large, this paper adopts the idea of parallelization and improves the Apriori algorithm …

Short-term load forecasting with clustering–regression model in distributed cluster

J Lei, T Jin, J Hao, F Li - Cluster Computing, 2019 - Springer
This paper tackles a new challenge in power big data: how to improve the precision of short-
term load forecasting with large-scale data set. The proposed load forecasting method is …

HBase 在飞行模拟训练数据存储中的应用分析.

吕游, 周晓光, 张家, 叶子 - Ordnance Industry Automation, 2020 - search.ebscohost.com
为提升飞行员飞行模拟训练质量, 对飞行模拟训练数据化存储方式进行研究. 介绍HBase
分布式数据库原理, 飞行模拟训练数据, 对数据在HBase 分布式数据库中的存储结构进行设计 …

[PDF][PDF] An improved algorithm for mining correlation item pairs

T Li, Y Ren, Y Ren, J Xia - Computers, Materials & Continua, 2020 - cdn.techscience.cn
Apriori algorithm is often used in traditional association rules mining, searching for the mode
of higher frequency. Then the correlation rules are obtained by detected the correlation of …

Optimization of a Similarity Performance on Bounded Content of Motion Histogram by Using Distributed Model

EM Saoudi, A Adoui El Ouadrhiri… - Advances on Smart and …, 2021 - Springer
In this paper, a content-based video retrieval (CBVR) system called Bounded Coordinate of
Motion Histogram version 2 (BCMH v2) was processed on a distributed computing platform …

[HTML][HTML] Arquitectura distribuida de alta disponibilidad para la detección de fraude

A García Núñez, JL Olmedo Flores - Revista Cubana de Ciencias …, 2021 - scielo.sld.cu
La detección temprana, rápida y eficaz del fraude en el sector de las telecomunicaciones se
ha convertido en la punta de lanza para enfrentar las más complejas y diversas vías en la …

Optimal Design Method of Merging Disk Files Based on Hot Data

L Qian, M Xu, J Yu, W Li, H Yuan, J Ren… - Journal of Physics …, 2020 - iopscience.iop.org
In recent years, with the rapid development of big data technology, users are more and more
inclined to solve the problems of large amount of data and complex business scenarios with …

Grouping Association Rules Using Clustering Techniques in Big Data

MA Alasow - 2019 - search.proquest.com
Mining association rules between data items is essential in the discovery of knowledge
hidden in datasets. There are many efficient association rules mining algorithms. The …

Development of high volumes of data processing algorithm for 3D printers in Hadoop systems

K Nam, K Lee, G Kim, J Kim, S Kim… - Proceedings of the …, 2017 - koreascience.kr
하둡 시스템은 대용량의 데이터를 처리할 수 있는 클러스터 기반 개방형 소프트웨어
프레임워크이다. 이는 하둡 분산 파일시스템 (HDFS) 과 MapReduce 모델을 활용하여 …