MapReduce indexing strategies: Studying scalability and efficiency
In Information Retrieval (IR), the efficient indexing of terabyte-scale and larger corpora is still
a difficult problem. MapReduce has been proposed as a framework for distributing data …
a difficult problem. MapReduce has been proposed as a framework for distributing data …
Virtualknotter: Online virtual machine shuffling for congestion resolving in virtualized datacenter
Our measurements on production datacenter traffic together with recently-reported results
(Kandula et al.)[1] suggest that datacenter networks suffer from long-lived congestion …
(Kandula et al.)[1] suggest that datacenter networks suffer from long-lived congestion …
[PDF][PDF] Of Ivory and Smurfs: Loxodontan MapReduce Experiments for Web Search.
This paper describes Ivory, an attempt to build a distributed retrieval system around the open-
source Hadoop implementation of MapReduce. We focus on three noteworthy aspects of our …
source Hadoop implementation of MapReduce. We focus on three noteworthy aspects of our …
[PDF][PDF] University of Glasgow at TREC 2009: Experiments with Terrier.
In TREC 2009, we extend our Voting Model for the faceted blog distillation, top stories
identification, and related entity finding tasks. Moreover, we experiment with our novel …
identification, and related entity finding tasks. Moreover, we experiment with our novel …
Scalable entity-based summarization of web search results using MapReduce
I Kitsos, K Magoutis, Y Tzitzikas - Distributed and Parallel Databases, 2014 - Springer
Abstract Although Web Search Engines index and provide access to huge amounts of
documents, user queries typically return only a linear list of hits. While this is often …
documents, user queries typically return only a linear list of hits. While this is often …
Large-scale music tag recommendation with explicit multiple attributes
Social tagging can provide rich semantic information for large-scale retrieval in music
discovery. Such collaborative intelligence, however, also generates a high degree of tags …
discovery. Such collaborative intelligence, however, also generates a high degree of tags …
A fast algorithm for constructing inverted files on heterogeneous platforms
Z Wei, J JaJa - Journal of Parallel and Distributed Computing, 2012 - Elsevier
Given a collection of documents residing on a disk, we develop a new strategy for
processing these documents and building the inverted files extremely quickly. Our approach …
processing these documents and building the inverted files extremely quickly. Our approach …
Scalable and distributed processing of scientific XML data
E Dede, Z Fadika, C Gupta… - 2011 IEEE/ACM 12th …, 2011 - ieeexplore.ieee.org
A seamless and intuitive search capability for the vast amount of datasets generated by
scientific experiments is critical to ensure effective use of such data by domain specific …
scientific experiments is critical to ensure effective use of such data by domain specific …
Tissue-type discrimination in magnetic resonance images
DY Amamoto, R Kasturi… - [1990] Proceedings. 10th …, 1990 - ieeexplore.ieee.org
A method developed for classifying each location in a set of magnetic resonance (MR)
images by tissue type is described. Three MR images of a region of interest are acquired …
images by tissue type is described. Three MR images of a region of interest are acquired …
Distributed indexing: performance analysis of solr, terrier and katta information retrievals
AY Aldailamy, NAWA Hamid… - Malaysian Journal of …, 2018 - ajba.um.edu.my
Abstract Information Retrieval (IR) systems are currently facing a continuous challenge due
to the increasing size of datasets. Extremely, large data from different aspects is gathered …
to the increasing size of datasets. Extremely, large data from different aspects is gathered …