[PDF][PDF] 基于机器学习的文本分类技术研究进展
苏金树, 张博锋, 徐昕[1 - 软件学报, 2006 - Citeseer
文本自动分类是信息检索与数据挖掘领域的研究热点与核心技术, 近年来得到了广泛的关注和
快速的发展. 提出了基于机器学习的文本分类技术所面临的互联网内容信息处理等复杂应用的 …
快速的发展. 提出了基于机器学习的文本分类技术所面临的互联网内容信息处理等复杂应用的 …
Web page classification: Features and algorithms
X Qi, BD Davison - ACM computing surveys (CSUR), 2009 - dl.acm.org
Classification of Web page content is essential to many tasks in Web information retrieval
such as maintaining Web directories and focused crawling. The uncontrolled nature of Web …
such as maintaining Web directories and focused crawling. The uncontrolled nature of Web …
[图书][B] Introduction to information retrieval
CD Manning - 2008 - diglib.globalcollege.edu.et
Introduction to Information Retrieval is the first textbook with a coherent treatment of classical
and web information retrieval, including web search and the related areas of text …
and web information retrieval, including web search and the related areas of text …
[图书][B] The text mining handbook: advanced approaches in analyzing unstructured data
Text mining is a new and exciting area of computer science research that tries to solve the
crisis of information overload by combining techniques from data mining, machine learning …
crisis of information overload by combining techniques from data mining, machine learning …
[图书][B] Introduction to information retrieval
Introduction to Information Retrieval ` `%%%`#`&12_`__~~~ alse [0.5cm] IIR 19: Web Search
Page 1 Recap Big picture Ads Duplicate detection Spam Web IR Size of the web Introduction …
Page 1 Recap Big picture Ads Duplicate detection Spam Web IR Size of the web Introduction …
Flat refractive geometry
While the study of geometry has mainly concentrated on single viewpoint (SVP) cameras,
there is growing attention to more general non-SVP systems. Here, we study an important …
there is growing attention to more general non-SVP systems. Here, we study an important …
Fast webpage classification using URL features
We demonstrate the usefulness of the uniform resource locator (URL) alone in performing
web page classification. This approach is faster than typical web page classification, as the …
web page classification. This approach is faster than typical web page classification, as the …
Towards the self-annotating web
The success of the Semantic Web depends on the availability of ontologies as well as on the
proliferation of web pages annotated with metadata conforming to these ontologies. Thus, a …
proliferation of web pages annotated with metadata conforming to these ontologies. Thus, a …
Web classification using support vector machine
In web classification, web pages from one or more web sites are assigned to pre-defined
categories according to their content. Since web pages are more than just plain text …
categories according to their content. Since web pages are more than just plain text …
Web-page classification through summarization
Web-page classification is much more difficult than pure-text classification due to a large
variety of noisy information embedded in Web pages. In this paper, we propose a new Web …
variety of noisy information embedded in Web pages. In this paper, we propose a new Web …