[PDF][PDF] Research progress in machine-learning-based text classification techniques

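Su Jinshu, Zhang Bofeng, Xu Xin - Journal of Software (软件学报), 2006 - Citeseer
Automatic text classification is a research hotspot and core technology in information retrieval and data mining, and in recent years it has received
wide attention and developed rapidly. For the complex applications that machine-learning-based text classification faces, such as Internet content processing, the paper presents the …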

Web page classification: Features and algorithms

X Qi, BD Davison - ACM computing surveys (CSUR), 2009 - dl.acm.org
Classification of Web page content is essential to many tasks in Web information retrieval
such as maintaining Web directories and focused crawling. The uncontrolled nature of Web …

Large-scale hierarchical text classification with recursively regularized deep graph-cnn

H Peng, J Li, Y He, Y Liu, M Bao, L Wang… - Proceedings of the …, 2018 - dl.acm.org
Text classification to a hierarchical taxonomy of topics is a common and practical problem.
Traditional approaches simply use bag-of-words and have achieved good results. However …

Large-scale multi-label text classification—revisiting neural networks

J Nam, J Kim, E Loza Mencía, I Gurevych… - Machine Learning and …, 2014 - Springer
Neural networks have recently been proposed for multi-label classification because they are
able to capture and model label dependencies in the output layer. In this work, we …

A survey of hierarchical classification across different application domains

CN Silla, AA Freitas - Data mining and knowledge discovery, 2011 - Springer
In this survey we discuss the task of hierarchical classification. The literature about this field
is scattered across very different application domains and for that reason research in one …

[Book][B] Modern information retrieval

R Baeza-Yates, B Ribeiro-Neto - 1999 - people.ischool.berkeley.edu
Information retrieval (IR) has changed considerably in recent years with the expansion of the
World Wide Web and the advent of modern and inexpensive graphical user interfaces and …

[Book][B] Introduction to information retrieval

H Schütze, CD Manning, P Raghavan - 2008 - cs.hacettepe.edu.tr
Introduction to Information Retrieval. IIR 19: Web Search.
Recap; Big picture; Ads; Duplicate detection; Spam; Web IR; Size of the web; Introduction …

Wafer map failure pattern recognition and similarity ranking for large-scale data sets

MJ Wu, JSR Jang, JL Chen - IEEE Transactions on …, 2014 - ieeexplore.ieee.org
Wafer maps can exhibit specific failure patterns that provide crucial details for assisting
engineers in identifying the cause of wafer pattern failures. Conventional approaches of …

Hierarchical taxonomy-aware and attentional graph capsule RCNNs for large-scale multi-label text classification

H Peng, J Li, S Wang, L Wang, Q Gong… - … on Knowledge and …, 2019 - ieeexplore.ieee.org
CNNs, RNNs, GCNs, and CapsNets have shown significant insights in representation
learning and are widely used in various text mining tasks such as large-scale multi-label text …

Statistical topic models for multi-label document classification

TN Rubin, A Chambers, P Smyth, M Steyvers - Machine learning, 2012 - Springer
Machine learning approaches to multi-label document classification have to date
largely relied on discriminative modeling techniques such as support vector machines. A …