Skip to content

Latest commit

 

History

History
36 lines (24 loc) · 949 Bytes

Readme.md

File metadata and controls

36 lines (24 loc) · 949 Bytes

Grammar Compression modulo X (GCX)

Introduction

GCX is grammar compressor based on the induced suffix ordering technique introduced in GCIS. Our method builds on the factorization of the DC3 algorithm to create the rules of a straight line grammar for the input string.

GCX supports very fast extraction operation on the encoded grammar without the need to decompress the complete string.

In this version, the size of the *grammar rules varies at each level, this value is calculated based on the average LCP of all suffixes (ordered) in the text.

Compilation

TBD

Compression

To compress a text, execute:

./main -c <plain_file> <compressed_file>

Decompression

To decompress a text, execute:

./main -d <compressed_file> <decompressed_file> 

Extraction

To extract substrings from the compressed text, execute:

./main -e <compressed_file> <output_file> <queries_file>