MapReduce indexing strategies: Studying scalability and efficiency

R McCreadie, C Macdonald, I Ounis - Information Processing & …, 2012 - Elsevier
In Information Retrieval (IR), the efficient indexing of terabyte-scale and larger corpora is still
a difficult problem. MapReduce has been proposed as a framework for distributing data …

Virtualknotter: Online virtual machine shuffling for congestion resolving in virtualized datacenter

S Zou, X Wen, K Chen, S Huang, Y Chen, Y Liu, Y Xia… - Computer …, 2014 - Elsevier
Our measurements on production datacenter traffic together with recently-reported results
(Kandula et al.)[1] suggest that datacenter networks suffer from long-lived congestion …

Of Ivory and Smurfs: Loxodontan MapReduce Experiments for Web Search.

J Lin, T Elsayed, L Wang, D Metzler - TREC, 2009 - Citeseer
This paper describes Ivory, an attempt to build a distributed retrieval system around the open-
source Hadoop implementation of MapReduce. We focus on three noteworthy aspects of our …

University of Glasgow at TREC 2009: Experiments with Terrier.

R McCreadie, C Macdonald, I Ounis, J Peng… - TREC, 2009 - academia.edu
In TREC 2009, we extend our Voting Model for the faceted blog distillation, top stories
identification, and related entity finding tasks. Moreover, we experiment with our novel …

Scalable entity-based summarization of web search results using MapReduce

I Kitsos, K Magoutis, Y Tzitzikas - Distributed and Parallel Databases, 2014 - Springer
Although Web Search Engines index and provide access to huge amounts of
documents, user queries typically return only a linear list of hits. While this is often …

Large-scale music tag recommendation with explicit multiple attributes

Z Zhao, X Wang, Q Xiang, AM Sarroff, Z Li… - Proceedings of the 18th …, 2010 - dl.acm.org
Social tagging can provide rich semantic information for large-scale retrieval in music
discovery. Such collaborative intelligence, however, also generates a high degree of tags …

A fast algorithm for constructing inverted files on heterogeneous platforms

Z Wei, J JaJa - Journal of Parallel and Distributed Computing, 2012 - Elsevier
Given a collection of documents residing on a disk, we develop a new strategy for
processing these documents and building the inverted files extremely quickly. Our approach …

Scalable and distributed processing of scientific XML data

E Dede, Z Fadika, C Gupta… - 2011 IEEE/ACM 12th …, 2011 - ieeexplore.ieee.org
A seamless and intuitive search capability for the vast amount of datasets generated by
scientific experiments is critical to ensure effective use of such data by domain specific …

Tissue-type discrimination in magnetic resonance images

DY Amamoto, R Kasturi… - [1990] Proceedings. 10th …, 1990 - ieeexplore.ieee.org
A method developed for classifying each location in a set of magnetic resonance (MR)
images by tissue type is described. Three MR images of a region of interest are acquired …

Distributed indexing: performance analysis of solr, terrier and katta information retrievals

AY Aldailamy, NAWA Hamid… - Malaysian Journal of …, 2018 - ajba.um.edu.my
Information Retrieval (IR) systems are currently facing a continuous challenge due
to the increasing size of datasets. Extremely large data from different aspects is gathered …