markw at mohawksoft.com wrote:
>
> When you look at a ZIP compressed file, it does not compress well using
> standard stream compressors. This is because there are few if any
> repeating patterns. The fact that it has few repeating patterns makes this
> stream interesting. It means that it is less likely in a universe of
> streams. If it is less likely, then it can be represented with fewer bits.

I don't see why it matters how "likely" the stream is. Even if the stream
I want to compress is extremely unlikely, there might be some very large
number of other streams that are equally unlikely.

If your recompression engine can say "aha, even though this stream does not
have any 'repeating patterns' that a ZIP compressor can recognize, the data
in the stream could be reconstituted with the following formula...", then
the engine could write out some abbreviation of that formula, and the
corresponding uncompression engine could read the abbreviation and
regenerate the stream.

Of course, you might as well apply that pattern-recognition technology to
the original document, rather than the ZIPped version, right? And once you
reach the point where the formula to reconstitute the stream takes up just
as many bits as the original stream, then you lose.

--
"Remember that "freedom" is not just a word. It is a squishy, concept-lite
word which invokes positive feelings." --Fafblog
// seth gordon // sethg at ropine.com // http://dynamic.ropine.com/yo/ //
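To make the point concrete, here is a minimal Python sketch (my own
illustration, not from the thread): it compresses some highly repetitive
data once, then feeds the compressed stream, which like a ZIP member has
few repeating patterns left, through the same compressor again, and then
prints the counting argument behind "then you lose".

import zlib

# Highly repetitive input: a stream compressor should do very well here.
text = b"all work and no play makes jack a dull boy. " * 2000

first_pass = zlib.compress(text, 9)         # big win: lots of repeating patterns
second_pass = zlib.compress(first_pass, 9)  # no win: the output already looks "random"

print(len(text), len(first_pass), len(second_pass))
# Typical result: ~88000 bytes -> a few hundred bytes -> slightly *more* bytes,
# since the second pass only adds header overhead to a pattern-free stream.

# The counting argument: there are 2**n distinct n-bit streams but only
# 2**n - 1 streams shorter than n bits, so no lossless scheme can shrink
# every input; any scheme that shrinks some streams must grow others.
print(2**8, sum(2**k for k in range(8)))    # 256 possible inputs, 255 shorter outputs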