Predicting malicious insider threat scenarios using organizational data and a heterogeneous stack-classifier
2018 IEEE International Conference on Big Data (Big Data), 2018•ieeexplore.ieee.org
Insider threats continue to present a major challenge for the information security community.
Despite constant research taking place in this area; a substantial gap still exists between the
requirements of this community and the solutions that are currently available. This paper
uses the CERT dataset r4. 2 along with a series of machine learning classifiers to predict the
occurrence of a particular malicious insider threat scenario-the uploading sensitive
information to wiki leaks before leaving the organization. These algorithms are aggregated …
Despite constant research taking place in this area; a substantial gap still exists between the
requirements of this community and the solutions that are currently available. This paper
uses the CERT dataset r4. 2 along with a series of machine learning classifiers to predict the
occurrence of a particular malicious insider threat scenario-the uploading sensitive
information to wiki leaks before leaving the organization. These algorithms are aggregated …
Insider threats continue to present a major challenge for the information security community. Despite constant research taking place in this area; a substantial gap still exists between the requirements of this community and the solutions that are currently available. This paper uses the CERT dataset r4.2 along with a series of machine learning classifiers to predict the occurrence of a particular malicious insider threat scenario - the uploading sensitive information to wiki leaks before leaving the organization. These algorithms are aggregated into a meta-classifier which has a stronger predictive performance than its constituent models. It also defines a methodology for performing pre-processing on organizational log data into daily user summaries for classification, and is used to train multiple classifiers. Boosting is also applied to optimise classifier accuracy. Overall the models are evaluated through analysis of their associated confusion matrix and Receiver Operating Characteristic (ROC) curve, and the best performing classifiers are aggregated into an ensemble classifier. This meta-classifier has an accuracy of 96.2% with an area under the ROC curve of 0.988.
ieeexplore.ieee.org
以上显示的是最相近的搜索结果。 查看全部搜索结果