Lossy compression
From Wikipedia, the free encyclopedia
A lossy compression method is one where compressing data and then decompressing it retrieves data that may well be different from the original, but is close enough to be useful in some way. Lossy compression is most commonly used to compress multimedia data (audio, video, still images), especially in applications such as streaming media and internet telephony. By contrast, lossless compression is required for text and data files, such as bank records, text articles, etc.
Lossy compression formats suffer from generation loss: repeatedly compressing and decompressing the file will cause it to progressively lose quality. This is in contrast with lossless data compression.
Information-theoretical foundations for lossy data compression are provided by rate-distortion theory. Much like the use of probability in optimal coding theory, rate-distortion theory heavily draws on Bayesian estimation and decision theory in order to model perceptual distortion and even aesthetic judgment.
Contents |
[edit] Types
There are two basic lossy compression schemes:
- In lossy transform codecs, samples of picture or sound are taken, chopped into small segments, transformed into a new basis space, and quantized. The resulting quantized values are then entropy coded.
- In lossy predictive codecs, previous and/or subsequent decoded data is used to predict the current sound sample or image frame. The error between the predicted data and the real data, together with any extra information needed to reproduce the prediction, is then quantized and coded.
In some systems the two techniques are combined, with transform codecs being used to compress the error signals generated by the predictive stage.
[edit] Lossy versus lossless
The advantage of lossy methods over lossless methods is that in some cases a lossy method can produce a much smaller compressed file than any known lossless method, while still meeting the requirements of the application.
Lossy methods are most often used for compressing sound, images or videos. This is because these types of data are intended for human interpretation where the mind can easily "fill in the blanks" or see past very minor errors or inconsistencies – ideally lossy compression is transparent (imperceptible), which can be verified via an ABX test.
[edit] Transparency
- Further information: Transparency (data compression)
When a user acquires a lossily compressed file, (for example, to reduce download time) the retrieved file can be quite different from the original at the bit level while being indistinguishable to the human ear or eye for most practical purposes. Many compression methods focus on the idiosyncrasies of human physiology, taking into account, for instance, that the human eye can see only certain wavelengths of light. The psychoacoustic model describes how sound can be highly compressed without degrading perceived quality. Flaws caused by lossy compression that are noticeable to the human eye or ear are known as compression artifacts.
[edit] Compression ratio
The compression ratio (that is, the size of the compressed file compared to that of the uncompressed file) of lossy video codecs is nearly always far superior to that of the audio and still-image equivalents.
- Video can be compressed immensely (e.g. 300:1) with little visible quality loss;[citation needed]
- Audio can often be compressed at 10:1 with imperceptible loss of quality;[citation needed]
- Still images are often lossily compressed at 10:1, as with audio, but the quality loss is more noticeable, especially on closer inspection.[citation needed]
[edit] Transcoding and Editing
-
For more details on this topic, see Transcoding.
An important caveat about lossy compression is that converting (formally, transcoding) or editing lossily compressed files causes digital generation loss from the re-encoding. This can be avoided by only producing lossy files from (lossless) originals, and only editing (copies of) original files, such as images in raw image format instead of JPEG.
[edit] Lossless editing
Some lossless editing of lossily compressed files is possible, which consists of modifying the compressed data directly, without decoding and re-encoding.
[edit] JPEG
The primary programs for lossless editing of JPEGs are jpegtran, and the derived exiftran (which also preserves Exif information), and Jpegcrop (which provides a Windows interface).
These allow one to
- crop,
- rotate, flip, and flop, or
- convert to grayscale (by dropping the chrominance channel).
JPEGjoin allows one to join different JPEG images (which have the same encoding), without re-encoding. (See also: New jpegtran features.)
One can also make some changes to the compression without re-encoding:
- optimize the compression (so it takes less space),
- convert between progressive and non-progressive encoding,
There is also the freeware Windows-only IrfanView, which has some lossless JPEG operations in its JPG_TRANSFORM plugin.
[edit] MP3
- Splitting and joining
- Mp3splt and Mp3wrap (or AlbumWrap) allow one to split an MP3 file into pieces or join them loselessly. These are analogous to
splitandcat.[1] - Gain
- Various Replay Gain programs such as MP3gain allow one to modify the gain (overall volume) of MP3 files losslessly.
[edit] Metadata
One can generally modify or remove metadata, such as ID3 tags, Vorbis comments, or Exif information, without modifying the underlying media.
[edit] Downsampling / compressed representation scalability
One may wish to downsample or otherwise decrease the resolution of the represented source signal and the quantity of data used for its compressed representation without re-encoding, as in bitrate peeling, but this functionality is not supported in all designs, as not all codecs encode data in a form that allows less important detail to simply be dropped.
Some well known designs that have this capability include JPEG 2000 for still images and H.264/MPEG-4 AVC based Scalable Video Coding for video. Actually such schemes have also been standardized for older designs as well, such as JPEG images with progressive encoding, and MPEG-2 and MPEG-4 Part 2 video, although those prior schemes had limited success in terms of adoption into real-world common usage.
Without this capacity, which is often the case in practice, to produce a representation with lower resolution or lower fidelity than a given one, one needs to start with the original source signal and encode, or start with a compressed representation and then decompress and re-encode it (transcoding), thought this latter tends to cause digital generation loss.
On a related point, some audio formats feature a combination of a lossy format and a lossless correction; this allows stripping the correction to easily obtain a lossy file, though whether the lossy portion itself can be further stripped is a separate question. Such formats include MPEG-4 SLS (Scalable to Lossless), WavPack, and OptimFROG DualStream.
[edit] Methods
[edit] Graphics
[edit] Image
- Cartesian Perceptual Compression: Also known as CPC
- DjVu
- Fractal compression
- HAM, hardware compression of color information used in Amiga computers
- ICER, used by the Mars Rovers: related to JPEG 2000 in its use of wavelets
- JPEG
- JPEG 2000, JPEG's successor format that uses wavelets.
- JBIG2
- PGF, Progressive Graphics File (lossless or lossy compression)
- Wavelet compression
- S3TC texture compression for 3D computer graphics hardware
[edit] Video
- H.261
- H.263
- H.264
- MNG (supports JPEG sprites)
- Motion JPEG
- MPEG-1 Part 2
- MPEG-2 Part 2
- MPEG-4 Part 2 and Part 10 (AVC)
- Ogg Theora (noted for its lack of patent restrictions)
- Sorenson video codec
- VC-1
[edit] Audio
[edit] Music
- AAC
- ADPCM
- ATRAC
- Dolby AC-3
- MP2
- MP3
- Musepack
- Ogg Vorbis (noted for its lack of patent restrictions)
- WMA
[edit] Speech
- CELP
- G.711
- G.726
- Harmonic and Individual Lines and Noise (HILN)
- AMR (used by GSM cell carriers, such as T-Mobile)
- Speex (noted for its lack of patent restrictions)
[edit] Other data
Researchers have (semi-seriously) performed lossy compression on text by either using a thesaurus to substitute short words for long ones, or generative text techniques [2], although these sometimes fall into the related category of lossy data conversion.
[edit] See also
[edit] Notes
- ^ Though the wrap programs do more, encoding the divisions between the original files.
- ^ I. H. WITTEN, et al.. "Semantic and Generative Models for Lossy Text Compression" (PDF). The Computer Journal. Retrieved on 2007-10-13.
[edit] External links
- Lossy audio formats, comparing the speed and compression strength of five lossy audio formats.
- lossy PNG image compression (research)
- using lossy GIF/PNG compression for the web (article)





