tensorflow
diff --git a/‎README.md
Lines changed: 11 additions & 3 deletions b/‎README.md
Lines changed: 11 additions & 3 deletions
diff --git a/‎results/image_compression/README.md
Lines changed: 174 additions & 0 deletions b/‎results/image_compression/README.md
Lines changed: 174 additions & 0 deletions
diff --git a/‎results/image_compression/kodak/MS-SSIM_sRGB_RGB/balle-2017-iclr.txt
Lines changed: 21 additions & 0 deletions b/‎results/image_compression/kodak/MS-SSIM_sRGB_RGB/balle-2017-iclr.txt
Lines changed: 21 additions & 0 deletions
diff --git a/‎results/image_compression/kodak/MS-SSIM_sRGB_RGB/balle-2018-iclr.txt
Lines changed: 21 additions & 0 deletions b/‎results/image_compression/kodak/MS-SSIM_sRGB_RGB/balle-2018-iclr.txt
Lines changed: 21 additions & 0 deletions
diff --git a/‎results/image_compression/kodak/MS-SSIM_sRGB_RGB/bpg420.txt
Lines changed: 64 additions & 0 deletions b/‎results/image_compression/kodak/MS-SSIM_sRGB_RGB/bpg420.txt
Lines changed: 64 additions & 0 deletions
diff --git a/‎results/image_compression/kodak/MS-SSIM_sRGB_RGB/bpg444.txt
Lines changed: 64 additions & 0 deletions b/‎results/image_compression/kodak/MS-SSIM_sRGB_RGB/bpg444.txt
Lines changed: 64 additions & 0 deletions
@@ -243,10 +243,18 @@ pip uninstall tensorflow-compression
 To build packages for Darwin (and potentially other platforms), you can follow
 the same steps, but the Docker image should not be necessary.
 
+## Evaluation
+
+We provide evaluation results for several image compression methods in terms of
+different metrics in different colorspaces. Please see the
+[results subdirectory](https://tensorflow.github.io/compression/results/readme/image_compression/README.md)
+for more information.
+
 ## Authors
 
-Johannes Ballé (github: [jonycgn](https://github.com/jonycgn)), Sung Jin Hwang
-(github: [ssjhv](https://github.com/ssjhv)), and Nick Johnston (github:
-[nmjohn](https://github.com/nmjohn))
+* Johannes Ballé (github: [jonycgn](https://github.com/jonycgn))
+* Sung Jin Hwang (github: [ssjhv](https://github.com/ssjhv))
+* Nick Johnston (github: [nmjohn](https://github.com/nmjohn))
+* David Minnen (github: [minnend](https://github.com/minnend))
 
 Note that this is not an officially supported Google product.
@@ -0,0 +1,174 @@
+# Rate-distortion data for image compression
+
+Subdirectories contain CSV files with rate-distortion (RD) data for different
+image compression methods. We include data for standard codecs (JPG, J2K, WebP,
+etc.) and many learning-based methods. Quality is measured by PSNR and MS-SSIM.
+
+Note that not all combinations of compression methods, quality metrics, and
+evaluation data sets are covered.
+
+### Table of Contents
+
+* [Image Compression Methods](#image_compression_methods)
+* [Quality Metrics](#quality_metrics)
+* [Data Sets for Evaluation](#data_sets_for_evaluation)
+
+## Image Compression Methods
+
+--------------------------------------------------------------------------------
+
+### Standard (Hand-Engineered) Codecs
+
+*   JPEG (4:2:0)
+*   JPEG 2000 ([OpenJPEG](https://www.openjpeg.org) and
+               [Kakadu](https://kakadusoftware.com/))
+*   [WebP](https://developers.google.com/speed/webp)
+*   [BPG](https://bellard.org/bpg/) (4:4:4 and 4:2:0)
+
+### Learning-based Methods
+
+1.   [Context-adaptive Entropy Model for End-to-end Optimized Image Compression]
+    (https://openreview.net/forum?id=HyxKIiAqYQ) \
+    Jooyoung Lee, Seunghyun Cho, and Seung-Kwon Beack \
+    Int. Conf. on Learning Representations (ICLR) 2019
+
+2.  [Joint autoregressive and hierarchical priors for learned image
+    compression]
+    (https://arxiv.org/abs/1809.02736) \
+    David Minnen, Johannes Ballé, and George Toderici \
+    Advances in Neural Information Processing Systems (NeurIPS) 2018
+
+3.  [Learning a Code-Space Predictor by Exploiting Intra-Image-Dependencies]
+    (http://bmvc2018.org/contents/papers/0491.pdf) \
+    Jan P. Klopp, Yu-Chiang Frank Wang, Shao-Yi Chien, and Liang-Gee Chen \
+    British Machine Vision Conference (BMVC) 2018
+
+4.  [Variational Image Compression with a Scale Hyperprior]
+    (https://arxiv.org/abs/1802.01436) \
+    Johannes Ballé, David Minnen, Saurabh Singh, Sung Jin Hwang, and Nick
+    Johnston \
+    Int. Conf. on Learning Representations (ICLR) 2018
+
+5.  [Image-dependent local entropy models for image compression with deep
+    networks]
+    (https://arxiv.org/abs/1805.12295) \
+    David Minnen, George Toderici, Saurabh Singh, Sung Jin Hwang, and Michele
+    Covell \
+    Int. Conf. on Image Processing (ICIP) 2018
+
+6.  [Improved Lossy Image Compression With Priming and Spatially Adaptive Bit
+    Rates for Recurrent Networks]
+    (https://arxiv.org/abs/1703.10114) \
+    Nick Johnston, Damien Vincent, David Minnen, Michele Covell, Saurabh Singh,
+    Troy Chinen, Sung Jin Hwang, Joel Shor, and George Toderici \
+    IEEE Conf. on Computer Vision and Pattern Recognition (CVPR) 2018
+
+7.  [Real-Time Adaptive Image Compression]
+    (https://arxiv.org/abs/1705.05823) \
+    Oren Rippel and Lubomir Bourdev \
+    International Conference on Machine Learning (ICML) 2017
+
+8.  [End-to-end Optimized Image Compression]
+    (https://arxiv.org/abs/1611.01704) \
+    Johannes Ballé, Valero Laparra, and Eero P. Simoncelli \
+    Int. Conf. on Learning Representations (ICLR) 2017
+
+9.  [Lossy Image Compression with Compressive Autoencoders]
+    (https://openreview.net/forum?id=rJiNwv9gg) \
+    Lucas Theis, Wenzhe Shi, Andrew Cunningham, and Ferenc Huszár \
+    Int. Conf. on Learning Representations (ICLR) 2017
+
+10. [Spatially adaptive image compression using a tiled deep network]
+    (https://arxiv.org/abs/1802.02629) \
+    David Minnen, George Toderici, Michele Covell, Troy Chinen, Nick Johnston,
+    Joel Shor, Sung Jin Hwang, Damien Vincent, and Saurabh Singh \
+    Int. Conference on Image Processing (ICIP) 2017
+
+11. [Full Resolution Image Compression with Recurrent Neural Networks]
+    (https://arxiv.org/abs/1608.05148) \
+    George Toderici, Damien Vincent, Nick Johnston, Sung Jin Hwang, David
+    Minnen, Joel Shor, and Michele Covell \
+    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017
+
+## Quality Metrics
+
+--------------------------------------------------------------------------------
+
+### Peak Signal-to-Noise Ratio (PSNR)
+
+According to
+[wikipedia](https://en.wikipedia.org/wiki/Peak_signal-to-noise_ratio):
+
+> Peak signal-to-noise ratio, often abbreviated PSNR, is an engineering term for
+> the ratio between the maximum possible power of a signal and the power of
+> corrupting noise that affects the fidelity of its representation. Because many
+> signals have a very wide dynamic range, PSNR is usually expressed in terms of
+> the logarithmic decibel scale.
+
+PSNR is commonly used to measure image quality even though its correlation with
+human preferences is rather low (see the [TID 2013
+study](http://www.ponomarenko.info/tid2013.htm)). You can calculate the PSNR
+between two images using
+[tf.image.psnr()](https://www.tensorflow.org/api_docs/python/tf/image/psnr).
+
+### Multiscale Structural Similarity (MS-SSIM)
+
+Multiscale Structural Similarity (MS-SSIM) is an extension of [structural
+similarity (SSIM)](https://en.wikipedia.org/wiki/Structural_similarity) that
+adds flexibility by measuring similarity at different spatial scales. It was
+developed in 2003 by Wang, Simoncelli, and Bovik
+([PDF](https://www.cns.nyu.edu/pub/eero/wang03b.pdf)). MS-SSIM is typically
+thought to better match human preferences than PSNR although optimizing directly
+for MS-SSIM can lead to objectionable distortion, e.g. blurrier reconstructions
+around text and faces.
+
+You can calculate the MS-SSIM score between two images using
+[tf.image.ssim_multiscale()](
+https://www.tensorflow.org/api_docs/python/tf/image/ssim_multiscale). Note that
+both SSIM and MS-SSIM have a maximum score of 1.0, and very small quantitative
+differences can imply very large visual differences. For this reason, we often
+graph MS-SSIM as decibels to improve readability using: `ms_ssim_db = -10 *
+log10(1 - ms_ssim)`.
+
+### Colorspaces
+
+Many research papers on learned image compression report image quality results
+(distortion) averaged over the RGB channels. While mathematically valid, this
+approach does not match the sensitivity of the human visual system (e.g. we're
+more sensitive to green than blue) and is **not** in line with common practice
+in the image processing community.
+
+We provide RGB evaluation results to facilitate comparing against older papers,
+but we **strongly recommend** that future papers report results only the
+luminance channel (`Y'` in `Y'CbCr`) or by using a 6:1:1 weighted average over
+`YCbCr`.
+
+## Data Sets for Evaluation
+
+--------------------------------------------------------------------------------
+
+### Kodak
+
+The Kodak data set is a collection of 24 images with resolution 768x512 (or
+512x768). The images are available as PNG files here:
+[http://r0k.us/graphics/kodak](http://r0k.us/graphics/kodak)
+
+    @misc{kodak,
+      title="Kodak Lossless True Color Image Suite ({PhotoCD PCD0992})",
+      author="Eastman Kodak",
+      url = {http://r0k.us/graphics/kodak},
+    }
+
+### Tecnick
+
+The Tecnick data set contains 100 1200x1200 images. It is available for download
+here (511 MB):
+[https://sourceforge.net/projects/testimages/files/OLD/OLD_SAMPLING/testimages.zip](https://sourceforge.net/projects/testimages/files/OLD/OLD_SAMPLING/testimages.zip)
+
+    @inproceedings{tecnick,
+      author = {N. Asuni and A. Giachetti},
+      title = {{TESTIMAGES}: A large-scale archive for testing visual devices and basic image processing algorithms {(SAMPLING 1200 RGB set)}},
+      year = {2014},
+      booktitle = {{STAG}: Smart Tools and Apps for Graphics}
+      url = {https://sourceforge.net/projects/testimages/files/OLD/OLD_SAMPLING/testimages.zip},
+    }
@@ -0,0 +1,21 @@
+# Aggregate rate-distortion data for "Ballé 2017 (ICLR)" on kodak.
+# The first column contains bits per pixel (bpp) values.
+# The second column contains MS-SSIM/sRGB/R'G'B' values.
+#
+# Notes:
+#  1. Aggregate values were calculated by averaging over a constant
+#     lambda value.
+#  2. We often graph MS-SSIM values in dB for visual clarity using:
+#     ms_ssim_db = -10 * log10(1 - ms_ssim).
+#
+# If you have questions or corrections, please contact:
+#  David Minnen ([email protected]) or George Toderici ([email protected]).
+
+0.119752, 0.903700
+0.194591, 0.931041
+0.316000, 0.954783
+0.481060, 0.969139
+0.721303, 0.980815
+1.060841, 0.986755
+1.458681, 0.992090
+1.957564, 0.994965
@@ -0,0 +1,21 @@
+# Aggregate rate-distortion data for "Ballé 2018 (ICLR)" on kodak.
+# The first column contains bits per pixel (bpp) values.
+# The second column contains MS-SSIM/sRGB/R'G'B' values.
+#
+# Notes:
+#  1. Aggregate values were calculated by averaging over a constant
+#     lambda value.
+#  2. We often graph MS-SSIM values in dB for visual clarity using:
+#     ms_ssim_db = -10 * log10(1 - ms_ssim).
+#
+# If you have questions or corrections, please contact:
+#  David Minnen ([email protected]) or George Toderici ([email protected]).
+
+0.115239, 0.907527
+0.185698, 0.936307
+0.301804, 0.958691
+0.468972, 0.972416
+0.686378, 0.982478
+0.966864, 0.988344
+1.307441, 0.992647
+1.727503, 0.995267
@@ -0,0 +1,64 @@
+# Aggregate rate-distortion data for "BPG (4:2:0)" on kodak.
+# The first column contains bits per pixel (bpp) values.
+# The second column contains MS-SSIM/sRGB/R'G'B' values.
+#
+# Notes:
+#  1. Aggregate values were calculated by averaging over a constant QP value.
+#  2. We often graph MS-SSIM values in dB for visual clarity using:
+#     ms_ssim_db = -10 * log10(1 - ms_ssim).
+#
+# If you have questions or corrections, please contact:
+#  David Minnen ([email protected]) or George Toderici ([email protected]).
+
+0.023778, 0.784989
+0.028261, 0.800720
+0.034131, 0.816491
+0.041103, 0.832549
+0.049042, 0.846330
+0.058963, 0.860314
+0.070547, 0.872750
+0.083888, 0.885062
+0.100428, 0.896837
+0.117291, 0.906517
+0.138370, 0.916372
+0.161282, 0.924601
+0.189494, 0.933334
+0.219051, 0.939601
+0.256016, 0.947138
+0.294495, 0.952187
+0.338370, 0.957859
+0.385684, 0.961779
+0.440687, 0.966350
+0.500803, 0.970688
+0.569223, 0.974397
+0.642459, 0.977489
+0.711308, 0.979273
+0.795128, 0.981873
+0.884535, 0.983938
+0.977693, 0.985811
+1.083310, 0.987469
+1.193119, 0.989031
+1.302428, 0.990242
+1.425981, 0.991342
+1.558065, 0.992281
+1.699156, 0.993105
+1.862072, 0.993914
+2.036448, 0.994620
+2.211966, 0.995205
+2.412092, 0.995756
+2.613790, 0.996303
+2.838744, 0.996711
+3.068833, 0.997057
+3.303337, 0.997373
+3.539684, 0.997648
+3.787450, 0.997908
+4.045572, 0.998128
+4.317412, 0.998322
+4.674708, 0.998600
+4.988948, 0.998819
+5.355372, 0.999060
+5.604246, 0.999147
+5.786789, 0.999190
+5.974362, 0.999228
+6.206643, 0.999251
+6.442166, 0.999261
@@ -0,0 +1,64 @@
+# Aggregate rate-distortion data for "BPG (4:4:4)" on kodak.
+# The first column contains bits per pixel (bpp) values.
+# The second column contains MS-SSIM/sRGB/R'G'B' values.
+#
+# Notes:
+#  1. Aggregate values were calculated by averaging over a constant QP value.
+#  2. We often graph MS-SSIM values in dB for visual clarity using:
+#     ms_ssim_db = -10 * log10(1 - ms_ssim).
+#
+# If you have questions or corrections, please contact:
+#  David Minnen ([email protected]) or George Toderici ([email protected]).
+
+0.023857, 0.783939
+0.028540, 0.800621
+0.034282, 0.815880
+0.041183, 0.830978
+0.049301, 0.845919
+0.058861, 0.859465
+0.070444, 0.871920
+0.084175, 0.884832
+0.100255, 0.896600
+0.119211, 0.908020
+0.140076, 0.917528
+0.165529, 0.926932
+0.193939, 0.935368
+0.226882, 0.943184
+0.264902, 0.950550
+0.308004, 0.956814
+0.353821, 0.962287
+0.406663, 0.967124
+0.465174, 0.971309
+0.528513, 0.975060
+0.602615, 0.978421
+0.681227, 0.981216
+0.763150, 0.983577
+0.855122, 0.985761
+0.954195, 0.987516
+1.058729, 0.989023
+1.178031, 0.990455
+1.302895, 0.991701
+1.427853, 0.992658
+1.570484, 0.993543
+1.724310, 0.994321
+1.890798, 0.994994
+2.085680, 0.995657
+2.296926, 0.996264
+2.513738, 0.996777
+2.762745, 0.997260
+3.021654, 0.997676
+3.311613, 0.998029
+3.616902, 0.998321
+3.943210, 0.998584
+4.282488, 0.998796
+4.657569, 0.998987
+5.069882, 0.999142
+5.543335, 0.999276
+6.176737, 0.999429
+6.804908, 0.999547
+7.545595, 0.999672
+8.045927, 0.999713
+8.453601, 0.999736
+8.895050, 0.999755
+9.349519, 0.999765
+9.810592, 0.999770