Personality, gender, and age in the language of social media: The open-vocabulary approach

HA Schwartz, JC Eichstaedt, ML Kern, L Dziurzynski… - PloS one, 2013 - journals.plos.org
We analyzed 700 million words, phrases, and topic instances collected from the Facebook
messages of 75,000 volunteers, who also took standard personality tests, and found striking …

[PDF][PDF] A survey of text similarity approaches

WH Gomaa, AA Fahmy - international journal of Computer Applications, 2013 - Citeseer
Measuring the similarity between words, sentences, paragraphs and documents is an
important component in various tasks such as information retrieval, document clustering …

Gaining insights from social media language: Methodologies and challenges.

ML Kern, G Park, JC Eichstaedt, HA Schwartz… - Psychological …, 2016 - psycnet.apa.org
Abstract Language data available through social media provide opportunities to study
people at an unprecedented scale. However, little guidance is available to psychologists …

[图书][B] Handbook of natural language processing

N Indurkhya, FJ Damerau - 2010 - taylorfrancis.com
The Handbook of Natural Language Processing, Second Edition presents practical tools
and techniques for implementing natural language processing in computer systems. Along …

[PDF][PDF] Corpora and collocations

S Evert - Corpus linguistics. An international handbook, 2008 - stephanie-evert.de
The concept of collocations is certainly one of the most controversial notions in linguis-3 tics,
even though it is based on a compelling, widely-shared intuition that certain words 4 have a …

[PDF][PDF] The statistics of word cooccurrences: word pairs and collocations

S Evert - 2005 - academia.edu
2.1 List of special situations for the comparison of different coefficients of association
strength. The symbol ǫ in Equations B and E indicates a first-order approximation for ǫ→ …

[PDF][PDF] Multiword expressions.

T Baldwin, SN Kim - Handbook of natural language processing, 2010 - eltimster.github.io
• San Francisco, ad hoc, by and large, Where Eagles Dare, kick the bucket, part of speech, in
step, the Oakland Raiders, trip the light fantastic, telephone box, call (someone) up, take a …

Dirt@ sbt@ discovery of inference rules from text

D Lin, P Pantel - Proceedings of the seventh ACM SIGKDD international …, 2001 - dl.acm.org
In this paper, we propose an unsupervised method for discovering inference rules from text,
such as" X is author of Y≈ X wrote Y"," X solved Y≈ X found a solution to Y", and" X caused …

Discovery of inference rules for question-answering

D Lin, P Pantel - Natural Language Engineering, 2001 - cambridge.org
One of the main challenges in question-answering is the potential mismatch between the
expressions in questions and the expressions in texts. While humans appear to use …

Semi-supervised learning for natural language

P Liang - 2005 - dspace.mit.edu
Statistical supervised learning techniques have been successful for many natural language
processing tasks, but they require labeled datasets, which can be expensive to obtain. On …