| Platform and Nature of Decompression Test (see notes below for explanations) | Digital Cinema (24fps) 2048x857, 240 frames, W9x7, Cblk={32,32}, 10.176 Mbits/frame =5.8 bits/pixel (full XYZ) =244 Mbits/s @ 24fps | Digital Cinema (48 fps) 2048x857, 240 frames, W9x7, Cblk={32,32}, 15.088 Mbits/frame =2.9 bits/pixel (full XYZ) =244 Mbits/s @ 48fps | Big Image (lossless) 13,333x13,333 (full RGB) RPCL, W5x3, Clayers=8, Clevels=9, Cblk={64,64}, Cprecincts={256,256} | Big Image Tiled (2bpp) 13,333x13,333 (full RGB) 1024x1024 tiles, RPCL, W9x7, Clayers=5, Clevels=6, Cblk={64,64}, Cprecincts={256,256} | 4CIF Video (2bpp) 704x576 (4:2:0 YCbCr), 1200 frames, W9x7, Clevels=5, Cblk={64,64} | 4 CIF Video (0.5 bpp) 704x576 (4:2:0 YCbCr), 4800 frames, W9x7, Clevels=5, Cblk={64,64} | |
| Core 2 Duo™ 2.4GHz x86 on WOW64 [All results reported in frames/second] | v5.2.6 (expand) | 4.36 | 6.33 | 0.0370 | 0.135 | 62.50 | 142.86 |
| v6.0 (expand) | 4.92 | 7.84 | 0.0377 | 0.136 | 63.83 | 144.14 | |
| vs6.0 (expand) | 4.93 | 7.92 | 0.0377 | 0.136 | 65.93 | 162.16 | |
| v5.2.6 (render) | 0.0368 | 0.134 | |||||
| v6.0 (render) | 0.0373 | 0.142 | |||||
| vs6.0 (render) | 0.0373 | 0.146 | |||||
| v6.0 (vex_fast) | 6.00 | 10.90 | 67.85 | 179.44 | |||
| vs6.0 (vex_fast) | 5.98 | 10.87 | 70.20 | 193.09 | |||
| Core 2 Duo™ 2.4GHz x64 on WinXP64 [All results reported in frames/second] | v5.2.6 (expand) | 4.43 | 6.72 | 0.0360 | 0.131 | 61.86 | 144.14 |
| v6.0 (expand) | 4.98 | 8.28 | 0.0365 | 0.132 | 63.16 | 147.24 | |
| vs6.0 (expand) | 7.04 | 10.91 | 0.0532 | 0.181 | 90.23 | 207.79 | |
| v5.2.6 (render) | 0.0357 | 0.131 | |||||
| v6.0 (render) | 0.0360 | 0.137 | |||||
| vs6.0 (render) | 0.0522 | 0.197 | |||||
| v6.0 (vex_fast) | 5.76 | 10.42 | 66.84 | 184.06 | |||
| vs6.0 (vex_fast) | 8.43 | 15.30 | 101.59 | 295.38 | |||
| Core 2 Duo™ 2.4GHz best vs6.0 speed improvement over best v5_2_6 speed | expand (x86 or x64) | 1.59 | 1.62 | 1.44 | 1.45 | 1.44 | 1.44 |
| fastest possible (x86 or x64) | 1.90 | 2.28 | 1.44 | 1.45 | 1.63 | 2.05 | |
| Core 2 Duo™ 2.4GHz best vs6.0 speed improvement over best v6.0 speed | expand (x86 or x64) | 1.41 | 1.32 | 1.41 | 1.33 | 1.41 | 1.41 |
| fastest possible (x86 or x64) | 1.41 | 1.40 | 1.41 | 1.39 | 1.50 | 1.60 | |
| Dual Quad-Core Xeon™ 2.66GHz x64 on WinXP64 [All results reported in frames/second] | |||||||
| v6.0 (vex_fast) | 24.89 | 44.14 | |||||
| vs6.0 (vex_fast) | 34.59 | 58.18 | |||||
| Dual Quad-Core Xeon™ 2.66GHz on x86-64 Mandriva Linux (Spring 2007) [All results reported in frames/second] | |||||||
| v6.0 (vex_fast) | 24.24 | 43.17 | |||||
| vs6.0 (vex_fast) | 35.09 | 60.15 | |||||
| Notes: | 1) "vs6.0" refers to the separately licensed Kakadu "speed-pack" -- this is for sure the fastest software implementation of JPEG2000. | ||||||
| 2) "expand" means "kdu_expand" for images and "kdu_v_expand -overlapped_frames" (v5.2.6) or "kdu_v_expand -overlapped_frames -in_memory 1" (v6.0 / vs6.0) for video | |||||||
| 3) "render" means "kdu_render -cpu" for images; it measures the throughput achievable using `kdu_region_decompressor', which is at the heart of all high level Kakadu display rendering tools. Note that the special "-cpu" option is new to v6.0; we ported the option to v5.2.6 just to obtain a comparison for rendering applications. You will note that the render option is noticeably faster for large tiled images than "kdu_expand". This is because "kdu_expand" always processes a complete horizontal row of tiles concurrently, which can cause cache thrashing if an entire large image is being decompressed at once. By contrast, Kakadu's rendering objects operate on smaller numbers of tiles, writing their results to a random access display buffer. You will also notice that rendering of tiled imagery has been accelerated in version 6.0. | |||||||
| 4) "vex_fast" means "kdu_vex_fast -display" -- this is a new application to v6.0 and is also the fastest way to render video. | |||||||
| 5) Speeds on x86 MAC OS platforms are pretty much identical to those quoted for WinXP64 -- exactly the same speedup options are available on 64-bit Intel MAC's as on 64-bit Windows boxes, for all variants. | |||||||
| 6) Speeds on 64-bit Linux platforms are pretty much identical to those quoted for WinXP64 -- exactly the same speedup options are available for X86-64 Linux, 64-bit Windows, and 64-bit MAC OSx86 binaries. In fact, the results quoted above for the digital cinema case suggest that 64-bit Linux may be slightly faster in some circumstances. | |||||||
| 7) Speeds on 32-bit Linux platforms should be a little bit lower (up to about 10% slower) than those for 32-bit (x86) Windows platforms, because the 32-bit block decoder acceleration code is not available. 64-bit Linux does not suffer from this disadvantage, as mentioned above. | |||||||