Adaptive weighing of context models for lossless data compression

MV Mahoney - 2005 - repository.fit.edu
Until recently the state of the art in lossless data compression was prediction by partial
match (PPM). A PPM model estimates the next-symbol probability distribution by combining …

[PDF][PDF] Fast and efficient log file compression

P Skibiński, J Swacha - Proc. 11th East-Eur. Conf. Adv. Databases Inf …, 2007 - ceur-ws.org
Contemporary information systems are replete with log files, created in multiple places (eg,
network servers, database management systems, user monitoring applications, system …

[PDF][PDF] Fast text compression using multiple static dictionaries

A Carus, A Mesut - Information Technology Journal, 2010 - academia.edu
We developed a fast text compression method based on multiple static dictionaries and
named this algorithm as STECA (Static Text Compression Algorithm). This algorithm is …

Obscuring information in messages using compression with site-specific prebuilt dictionary

RC Henderson, JR Hind, BY Langner, Y Li - US Patent 8,453,040, 2013 - Google Patents
Obscuring information in messages to be exchanged over a communications network. In
one aspect, the information comprises path name information and parameters for use in a …

Effective asymmetric XML compression

P Skibiński, S Grabowski… - Software: Practice and …, 2008 - Wiley Online Library
The innate verbosity of the extensible markup language (XML) remains one of its main
weaknesses, especially when large documents are concerned. This problem can be solved …

Boosting text compression with word-based statistical encoding

A Farina, G Navarro, JR Paramá - The Computer Journal, 2012 - ieeexplore.ieee.org
Semistatic word-based byte-oriented compressors are known to be attractive alternatives to
compress natural language texts. With compression ratios around 30–35%, they allow fast …

[图书][B] Statistical data reduction for streaming data

K Wu, D Lee, A Sim, J Choi - 2017 - ieeexplore.ieee.org
Bulk of the streaming data from scientific simulations and experiments consists of numerical
values, and these values often change in unpredictable ways over a short time horizon …

Combining efficient XML compression with query processing

P Skibiński, J Swacha - Advances in Databases and Information Systems …, 2007 - Springer
This paper describes a new XML compression scheme that offers both high compression
ratios and short query response time. Its core is a fully reversible transform featuring …

Grayscale true two-dimensional dictionary-based image compression

NJ Brittain, MR El-Sakka - Journal of Visual Communication and Image …, 2007 - Elsevier
Dictionary-based encoding methods are popular forms of data compression. These methods
were initially implemented to reduce the one-dimensional correlation in data, since they are …

A feature-free and parameter-light multi-task clustering framework

TN Huy, H Shao, B Tong, E Suzuki - Knowledge and information systems, 2013 - Springer
The two last decades have witnessed extensive research on multi-task learning algorithms
in diverse domains such as bioinformatics, text mining, natural language processing as well …