[PDF][PDF] From Puppy to Maturity: Experiences in Developing Terrier.
The Terrier information retrieval (IR) platform, maintained by the University of Glasgow, has
been open sourced since 2004. Open source IR platforms are vital to the research …
been open sourced since 2004. Open source IR platforms are vital to the research …
[PDF][PDF] Language identification for creating language-specific twitter collections
Social media services such as Twitter offer an immense volume of real-world linguistic data.
We explore the use of Twitter to obtain authentic user-generated text in low-resource …
We explore the use of Twitter to obtain authentic user-generated text in low-resource …
[PDF][PDF] Using conceptual class attributes to characterize social media users
S Bergsma, B Van Durme - … of the 51st Annual Meeting of the …, 2013 - aclanthology.org
We describe a novel approach for automatically predicting the hidden demographic
properties of social media users. Building on prior work in common-sense knowledge …
properties of social media users. Building on prior work in common-sense knowledge …
[PDF][PDF] Keyword Detection Techniques: A Comprehensive Study.
ZA Shaikh - Engineering, Technology & Applied Science …, 2018 - researchgate.net
Automatic identification of influential segments from a large amount of data is an important
part of topic detection and tracking (TDT). This can be done using keyword identification via …
part of topic detection and tracking (TDT). This can be done using keyword identification via …
[PDF][PDF] Classifying short text in social media: Twitter as case study
With the huge growth of social media, especially with 500 million Twitter messages being
posted per day, analyzing these messages has caught intense interest of researchers …
posted per day, analyzing these messages has caught intense interest of researchers …
Temporal feedback for tweet search with non-parametric density estimation
This paper investigates the temporal cluster hypothesis: in search tasks where time plays an
important role, do relevant documents tend to cluster together in time? We explore this …
important role, do relevant documents tend to cluster together in time? We explore this …
On sparsity and drift for effective real-time filtering in microblogs
In this paper, we approach the problem of real-time filtering in the Twitter Microblogging
platform. We adapt an effective traditional news filtering technique, which uses a text …
platform. We adapt an effective traditional news filtering technique, which uses a text …
Engaging and maintaining a sense of being informed: Understanding the tasks motivating twitter search
D Elsweiler, M Harvey - Journal of the Association for …, 2015 - Wiley Online Library
Micro‐blogging services such as Twitter represent constantly evolving, user‐generated
sources of information. Previous studies show that users search such content regularly but …
sources of information. Previous studies show that users search such content regularly but …
Discovering filter keywords for company name disambiguation in twitter
A major problem in monitoring the online reputation of companies, brands, and other entities
is that entity names are often ambiguous (apple may refer to the company, the fruit, the …
is that entity names are often ambiguous (apple may refer to the company, the fruit, the …
Fast candidate generation for real-time tweet search with bloom filter chains
N Asadi, J Lin - ACM Transactions on Information Systems (TOIS), 2013 - dl.acm.org
The rise of social media and other forms of user-generated content have created the
demand for real-time search: against a high-velocity stream of incoming documents, users …
demand for real-time search: against a high-velocity stream of incoming documents, users …