Compression and GFT
Compares a redundant string and a pseudo-random string through empirical entropy.
PT mathematics / GFT
Compression as extracting what persists and rejecting what is entropic.
Compressing a file means removing what repeats or what does not help reconstruct the essential part. In PT language: one looks for what persists.
This is one of the best entry points into GFT: structured information is kept, noise is costly, and the total budget imposes a limit.
Efficient compression increases the usable share of structure relative to a raw representation. It does not create information; it reorganizes the budget.
PT can present compression as a concrete case of the $D_{KL}+H$ partition: structure detectable against uniformity on one side, irreducible entropy on the other.
The pt-compress project can serve as a laboratory: measuring entropy, redundancy, divergence from uniformity, and reconstruction cost.
The canonical mathematical point remains GFT. The performance of a particular compressor is an experimental validation matter.
GitHub repository to publish: Igrekess/pt-compress; monograph ch04_gft, ch_PM.
The links below point to public resources or planned GitHub repositories. No local working path is exposed to the reader.
GitHub repository to publish before this can become a download link.
Compares a redundant string and a pseudo-random string through empirical entropy.