Last night I wrote a lossy image compression algorithm. It is based on an idea from
this paper. First I trained a set of 256 6x6 color filters on the CIFAR-10 image dataset. To train the filters, I used the k-means algorithm on 400,000 randomly selected image patches. Here are the resulting filters:

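In code, the patch sampling and k-means step looks roughly like the sketch below. It assumes scikit-learn's MiniBatchKMeans and CIFAR-10 already loaded as a NumPy array; the helper names are mine, and preprocessing details (normalization, whitening) are left out:

    import numpy as np
    from sklearn.cluster import MiniBatchKMeans

    PATCH = 6        # 6x6 color patches
    N_FILTERS = 256  # size of the filter set (codebook)
    N_PATCHES = 400_000

    def sample_patches(images, n_patches, patch=PATCH, seed=0):
        """Cut n_patches random patch-by-patch crops out of a stack of HxWx3 images."""
        rng = np.random.default_rng(seed)
        n, h, w, _ = images.shape
        out = np.empty((n_patches, patch * patch * 3), dtype=np.float32)
        for i in range(n_patches):
            img = images[rng.integers(n)]
            y = rng.integers(h - patch + 1)
            x = rng.integers(w - patch + 1)
            out[i] = img[y:y + patch, x:x + patch].reshape(-1)
        return out

    # images: (N, 32, 32, 3) float32 CIFAR-10 array (loading code not shown)
    patches = sample_patches(images, N_PATCHES)
    kmeans = MiniBatchKMeans(n_clusters=N_FILTERS, batch_size=1024).fit(patches)
    filters = kmeans.cluster_centers_.astype(np.float32)  # (256, 108) codebook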
I then used the filters to compress the following image:

Using the maximum compression level, JPEG compresses the image to 7.1 KiB:

My compression algorithm compresses the image to 5.6 KiB:

My compressed version of the image is smaller and looks better than the JPEG version.
I did the compression with a raster scan of the image, selecting the best filter for each image patch. I then losslessly compressed the filter selections using paq8l.
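The encoder and decoder look roughly like the sketch below. The function names are mine, and the paq8l step is an external program, so it is not shown; the encoder just emits one byte per patch for paq8l to compress:

    import numpy as np

    def compress(image, filters, patch=6):
        """Raster scan: for each non-overlapping patch, record the index
        of the filter with the smallest squared error."""
        h, w, _ = image.shape
        indices = []
        for y in range(0, h - patch + 1, patch):
            for x in range(0, w - patch + 1, patch):
                block = image[y:y + patch, x:x + patch].reshape(-1)
                err = ((filters - block) ** 2).sum(axis=1)
                indices.append(int(err.argmin()))
        # 256 filters fit in one byte per patch; this byte stream is what
        # gets compressed losslessly with paq8l.
        return np.asarray(indices, dtype=np.uint8)

    def decompress(indices, filters, h, w, patch=6):
        """Rebuild an approximation by pasting each chosen filter back in."""
        out = np.zeros((h, w, 3), dtype=np.float32)
        it = iter(indices)
        for y in range(0, h - patch + 1, patch):
            for x in range(0, w - patch + 1, patch):
                out[y:y + patch, x:x + patch] = filters[next(it)].reshape(patch, patch, 3)
        return out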
The obvious improvement to this algorithm would be to use a combination of several filters for each image patch instead of selecting the single best one. Using several filters would take more space to encode, but would produce a much better image approximation. Another idea I plan to try is applying the same algorithm to lossy video compression.
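For the several-filters idea, one standard approach would be greedy matching pursuit: repeatedly pick the filter most correlated with the remaining residual and subtract its contribution. This is only a sketch of one possible direction, not something I have implemented; the coefficients would also need to be quantized and encoded, which is where the extra space goes:

    import numpy as np

    def match_k_filters(block, filters, k=3):
        """Greedy matching pursuit: approximate one patch as a weighted
        sum of k filters, returning (index, coefficient) pairs."""
        residual = block.astype(np.float64).copy()
        picks = []
        for _ in range(k):
            scores = filters @ residual            # correlation with the residual
            j = int(np.abs(scores).argmax())       # most correlated filter
            coef = scores[j] / float(filters[j] @ filters[j])
            residual -= coef * filters[j]          # remove its contribution
            picks.append((j, coef))
        return picks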