[图书][B] The text mining handbook: advanced approaches in analyzing unstructured data
Text mining is a new and exciting area of computer science research that tries to solve the
crisis of information overload by combining techniques from data mining, machine learning …
crisis of information overload by combining techniques from data mining, machine learning …
Soft set based association rule mining
Association rules, one of the most useful constructs in data mining, can be exerted to capture
interesting dependencies between variables in large datasets. Herawan and Deris initiated …
interesting dependencies between variables in large datasets. Herawan and Deris initiated …
Text mining: generating hypotheses from MEDLINE
P Srinivasan - Journal of the American Society for Information …, 2004 - Wiley Online Library
Hypothesis generation, a crucial initial step for making scientific discoveries, relies on prior
knowledge, experience, and intuition. Chance connections made between seemingly …
knowledge, experience, and intuition. Chance connections made between seemingly …
A soft set approach for association rules mining
T Herawan, MM Deris - Knowledge-based systems, 2011 - Elsevier
In this paper, we present an alternative approach for mining regular association rules and
maximal association rules from transactional datasets using soft set theory. This approach is …
maximal association rules from transactional datasets using soft set theory. This approach is …
Text mining at the term level
Abstract Knowledge Discovery in Databases (KDD) focuses on the computerized
exploration of large amounts of data and on the discovery of interesting patterns within them …
exploration of large amounts of data and on the discovery of interesting patterns within them …
Mining ontology for automatically acquiring web user information needs
It is not easy to obtain the right information from the Web for a particular Web user or a group
of users due to the obstacle of automatically acquiring Web user profiles. The current …
of users due to the obstacle of automatically acquiring Web user profiles. The current …
Topcat: Data mining for topic identification in a text corpus
TopCat (topic categories) is a technique for identifying topics that recur in articles in a text
corpus. Natural language processing techniques are used to identify key entities in …
corpus. Natural language processing techniques are used to identify key entities in …
Mining the biomedical literature using semantic analysis and natural language processing techniques
R Feldman, Y Regev, E Hurvitz, M Finkelstein-Landau - Biosilico, 2003 - Elsevier
The information age has made the electronic storage of large amounts of data effortless. The
proliferation of documents available on the Internet, corporate intranets, news wires and …
proliferation of documents available on the Internet, corporate intranets, news wires and …
Anchor text mining for translation of Web queries: A transitive translation approach
WH Lu, LF Chien, HJ Lee - ACM Transactions on Information Systems …, 2004 - dl.acm.org
To discover translation knowledge in diverse data resources on the Web, this article
proposes an effective approach to finding translation equivalents of query terms and …
proposes an effective approach to finding translation equivalents of query terms and …
TopCat: Data mining for topic identification in a text corpus
C Clifton, R Cooley - Principles of Data Mining and Knowledge Discovery …, 1999 - Springer
Abstract TopCat (Topic Categories) is a technique for identifying topics that recur in articles
in a text corpus. Natural language processing techniques are used to identify key entities in …
in a text corpus. Natural language processing techniques are used to identify key entities in …