[图书][B] The text mining handbook: advanced approaches in analyzing unstructured data

R Feldman, J Sanger - 2007 - books.google.com
Text mining is a new and exciting area of computer science research that tries to solve the
crisis of information overload by combining techniques from data mining, machine learning …

Soft set based association rule mining

F Feng, J Cho, W Pedrycz, H Fujita… - Knowledge-Based Systems, 2016 - Elsevier
Association rules, one of the most useful constructs in data mining, can be exerted to capture
interesting dependencies between variables in large datasets. Herawan and Deris initiated …

Text mining: generating hypotheses from MEDLINE

P Srinivasan - Journal of the American Society for Information …, 2004 - Wiley Online Library
Hypothesis generation, a crucial initial step for making scientific discoveries, relies on prior
knowledge, experience, and intuition. Chance connections made between seemingly …

A soft set approach for association rules mining

T Herawan, MM Deris - Knowledge-based systems, 2011 - Elsevier
In this paper, we present an alternative approach for mining regular association rules and
maximal association rules from transactional datasets using soft set theory. This approach is …

Text mining at the term level

R Feldman, M Fresko, Y Kinar, Y Lindell… - Principles of Data …, 1998 - Springer
Abstract Knowledge Discovery in Databases (KDD) focuses on the computerized
exploration of large amounts of data and on the discovery of interesting patterns within them …

Mining ontology for automatically acquiring web user information needs

Y Li, N Zhong - IEEE transactions on Knowledge and Data …, 2006 - ieeexplore.ieee.org
It is not easy to obtain the right information from the Web for a particular Web user or a group
of users due to the obstacle of automatically acquiring Web user profiles. The current …

Topcat: Data mining for topic identification in a text corpus

C Clifton, R Cooley, J Rennie - IEEE transactions on …, 2004 - ieeexplore.ieee.org
TopCat (topic categories) is a technique for identifying topics that recur in articles in a text
corpus. Natural language processing techniques are used to identify key entities in …

Mining the biomedical literature using semantic analysis and natural language processing techniques

R Feldman, Y Regev, E Hurvitz, M Finkelstein-Landau - Biosilico, 2003 - Elsevier
The information age has made the electronic storage of large amounts of data effortless. The
proliferation of documents available on the Internet, corporate intranets, news wires and …

Anchor text mining for translation of Web queries: A transitive translation approach

WH Lu, LF Chien, HJ Lee - ACM Transactions on Information Systems …, 2004 - dl.acm.org
To discover translation knowledge in diverse data resources on the Web, this article
proposes an effective approach to finding translation equivalents of query terms and …

TopCat: Data mining for topic identification in a text corpus

C Clifton, R Cooley - Principles of Data Mining and Knowledge Discovery …, 1999 - Springer
Abstract TopCat (Topic Categories) is a technique for identifying topics that recur in articles
in a text corpus. Natural language processing techniques are used to identify key entities in …