作者
Gabriela F Cretu, Angelos Stavrou, Michael E Locasto, Salvatore J Stolfo, Angelos D Keromytis
发表日期
2008/5/18
研讨会论文
2008 IEEE Symposium on Security and Privacy (sp 2008)
页码范围
81-95
出版商
IEEE
简介
The efficacy of anomaly detection (AD) sensors depends heavily on the quality of the data used to train them. Artificial or contrived training data may not provide a realistic view of the deployment environment. Most realistic data sets are dirty; that is, they contain a number of attacks or anomalous events. The size of these high-quality training data sets makes manual removal or labeling of attack data infeasible. As a result, sensors trained on this data can miss attacks and their variations. We propose extending the training phase of AD sensors (in a manner agnostic to the underlying AD algorithm) to include a sanitization phase. This phase generates multiple models conditioned on small slices of the training data. We use these "micro- models" to produce provisional labels for each training input, and we combine the micro-models in a voting scheme to determine which parts of the training data may represent attacks …
引用总数
20082009201020112012201320142015201620172018201920202021202220232024312161077121419172492831283518
学术搜索中的文章
GF Cretu, A Stavrou, ME Locasto, SJ Stolfo… - 2008 IEEE Symposium on Security and Privacy (sp …, 2008