Word sense disambiguation: A survey
R Navigli - ACM computing surveys (CSUR), 2009 - dl.acm.org
Word sense disambiguation (WSD) is the ability to identify the meaning of words in context
in a computational manner. WSD is considered an AI-complete problem, that is, a task …
in a computational manner. WSD is considered an AI-complete problem, that is, a task …
[HTML][HTML] Natural language processing methods and systems for biomedical ontology learning
K Liu, WR Hogan, RS Crowley - Journal of biomedical informatics, 2011 - Elsevier
While the biomedical informatics community widely acknowledges the utility of domain
ontologies, there remain many barriers to their effective use. One important requirement of …
ontologies, there remain many barriers to their effective use. One important requirement of …
[HTML][HTML] Inducing domain-specific sentiment lexicons from unlabeled corpora
A word's sentiment depends on the domain in which it is used. Computational social science
research thus requires sentiment lexicons that are specific to the domains being studied. We …
research thus requires sentiment lexicons that are specific to the domains being studied. We …
Distributional memory: A general framework for corpus-based semantics
Research into corpus-based semantics has focused on the development of ad hoc models
that treat single tasks, or sets of closely related tasks, as unrelated challenges to be tackled …
that treat single tasks, or sets of closely related tasks, as unrelated challenges to be tackled …
[PDF][PDF] Multiword expressions.
• San Francisco, ad hoc, by and large, Where Eagles Dare, kick the bucket, part of speech, in
step, the Oakland Raiders, trip the light fantastic, telephone box, call (someone) up, take a …
step, the Oakland Raiders, trip the light fantastic, telephone box, call (someone) up, take a …
Making sense of word embeddings
We present a simple yet effective approach for learning word sense embeddings. In contrast
to existing techniques, which either directly learn sense representations from corpora or rely …
to existing techniques, which either directly learn sense representations from corpora or rely …
Graph-based term weighting for information retrieval
A standard approach to Information Retrieval (IR) is to model text as a bag of words.
Alternatively, text can be modelled as a graph, whose vertices represent words, and whose …
Alternatively, text can be modelled as a graph, whose vertices represent words, and whose …
Name disambiguation in author citations using a k-way spectral clustering method
An author may have multiple names and multiple authors may share the same name simply
due to name abbreviations, identical names, or name misspellings in publications or …
due to name abbreviations, identical names, or name misspellings in publications or …
[图书][B] Semisupervised learning for computational linguistics
S Abney - 2007 - taylorfrancis.com
The rapid advancement in the theoretical understanding of statistical and machine learning
methods for semisupervised learning has made it difficult for nonspecialists to keep up to …
methods for semisupervised learning has made it difficult for nonspecialists to keep up to …
[PDF][PDF] Authorship attribution of micro-messages
Work on authorship attribution has traditionally focused on long texts. In this work, we tackle
the question of whether the author of a very short text can be successfully identified. We use …
the question of whether the author of a very short text can be successfully identified. We use …