Article SCOPUS
PROGRESS IN ARTIFICIAL INTELLIGENCE AND PATTERN RECOGNITION, IWAIPR 2018 (2003)

Abstract

This work presents (s,c)-Dense Code, a new method for compressing natural language texts. The technique generalizes an earlier compression technique, End-Tagged Dense Code, which obtains a better compression ratio as well as simpler and faster encoding than Tagged Huffman. At the same time, (s,c)-Dense Code is a prefix code that maintains the most interesting features of Tagged Huffman Code with respect to direct search on the compressed text. (s,c)-Dense Code retains all the efficiency and simplicity of Tagged Huffman and improves its compression ratios. We formally describe (s,c)-Dense Code and show how to compute the parameters s and c that optimize compression for a specific corpus. Our empirical results show that (s,c)-Dense Code improves on End-Tagged Dense Code and Tagged Huffman Code, and reaches only 0.5% overhead over plain Huffman Code. © Springer-Verlag Berlin Heidelberg 2003.
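
The abstract does not spell out the encoding, so the following is only a minimal sketch of how a stopper/continuer code of this kind is commonly formulated, not the authors' implementation. It assumes byte-oriented codewords with s + c = 256, byte values below s acting as "stoppers" (final byte of a codeword) and the remaining c values as "continuers", and words identified by their 0-based frequency rank. The names `encode`, `decode`, and `best_s` are hypothetical, and the brute-force parameter search stands in for whatever procedure the paper actually uses.

```python
def encode(rank, s, c):
    """Codeword for the word with 0-based frequency rank `rank`.

    Assumed layout: stoppers are byte values [0, s), continuers are [s, s + c).
    """
    out = [rank % s]               # final byte is always a stopper
    x = rank // s
    while x > 0:
        x -= 1
        out.append(s + x % c)      # every earlier byte is a continuer
        x //= c
    return bytes(reversed(out))


def decode(codeword, s, c):
    """Inverse of encode: recover the rank from one complete codeword."""
    rank = 0
    for b in codeword[:-1]:        # continuer bytes
        rank = rank * c + (b - s) + 1
    return rank * s + codeword[-1]  # stopper byte


def best_s(freqs, base=256):
    """Brute-force search for the s (with c = base - s) that minimises the
    compressed size in bytes. `freqs` must be sorted in decreasing order.
    This exhaustive search is an assumption made for illustration only."""
    def size(s):
        c = base - s
        return sum(f * len(encode(i, s, c)) for i, f in enumerate(freqs))
    return min(range(1, base), key=size)
```

With s = c = 128 this scheme coincides with End-Tagged Dense Code, where the stopper/continuer distinction is carried by one flag bit per byte. Because only a stopper can end a codeword, no codeword is a prefix of another, which is the property that makes direct search on the compressed text possible.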

More information

Title according to SCOPUS:	(S,C) code for natural language text databases
Journal title:	PROGRESS IN ARTIFICIAL INTELLIGENCE AND PATTERN RECOGNITION, IWAIPR 2018
Volume:	2857
Publisher:	SPRINGER INTERNATIONAL PUBLISHING AG
Publication date:	2003
Start page:	122
End page:	136
Language:	English
URL:	http://www.scopus.com/inward/record.url?eid=2-s2.0-35248828986&partnerID=q2rCbXpz
Notes:	SCOPUS
