Google Engineer Breaks Down Steps for VP8 Optimization
Road map for VP8 codec improvements laid bare.
Google software engineer John Koleszar addressed the open source community regarding the VP8 codec and the steps needed to further optimize the codec within the WebM project.
Koleszar noted Scott LaVarnway's work in creating an x86 version of the quantizer, then asked the community for a SIMD version of the ARNR temporal filtering code. Koleszar also asked for the assembly code to be extended to newer instruction set extensions, as it currently takes advantage of only the SSE2 instruction set.
The last improvement Koleszar called for in the VP8 encoder was an exploration of alternative motion search strategies. He eventually hopes to decouple motion search entirely, leaving the motion field calculations to graphics processors.
For the decoder, Koleszar highlights the work of Jeff Muizelaar, Johan Koenig, and Tim Terriberry. While he doesn't specifically ask for help on any one item as he did with the encoder, he does highlight some of the ongoing work. Terriberry is working hard on the bool decoder, which is called multiple times for each bit in the input stream. Currently, the code uses a simple clamp on the innermost loops for checking and performs less frequent copies into a circular buffer. Terriberry's patch uses a more complex clamp and removes the circular buffer.
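To make the bool decoder discussion concrete, here is a minimal C sketch of a VP8-style boolean decoder, modeled on the reference decoder in the VP8 bitstream guide (RFC 6386) rather than on libvpx's optimized code or Terriberry's patch. The struct, field, and function names are hypothetical. The end-of-buffer check in next_byte() is a simplified stand-in for the kind of checking the article describes; it does not use a circular buffer.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Illustrative decoder state; names are hypothetical, not libvpx's. */
typedef struct {
    const uint8_t *buf;  /* next unread input byte */
    const uint8_t *end;  /* one past the last input byte */
    uint32_t value;      /* current two-byte window of the stream */
    uint32_t range;      /* stays in 128..255 between calls */
    int bit_count;       /* bits shifted in since the last byte load */
} BoolDecoder;

/* Fetch the next byte, or 0 once the input is exhausted. */
static uint32_t next_byte(BoolDecoder *d) {
    return d->buf < d->end ? *d->buf++ : 0;
}

static void bool_init(BoolDecoder *d, const uint8_t *data, size_t len) {
    d->buf = data;
    d->end = data + len;
    d->value = next_byte(d) << 8;
    d->value |= next_byte(d);
    d->range = 255;
    d->bit_count = 0;
}

/* Decode one boolean whose probability of being 0 is prob/256. */
static int bool_read(BoolDecoder *d, int prob) {
    assert(prob >= 1 && prob <= 255);
    uint32_t split = 1 + (((d->range - 1) * (uint32_t)prob) >> 8);
    uint32_t big_split = split << 8;
    int bit;

    if (d->value >= big_split) {
        bit = 1;
        d->range -= split;
        d->value -= big_split;
    } else {
        bit = 0;
        d->range = split;
    }

    /* Renormalize: this inner loop is the hot path the article refers to. */
    while (d->range < 128) {
        d->value <<= 1;
        d->range <<= 1;
        if (++d->bit_count == 8) {
            d->bit_count = 0;
            d->value |= next_byte(d);
        }
    }
    return bit;
}
```

Because bool_read() runs for every decoded symbol, even the single comparison in next_byte() is paid very often, which is why the cost of the bounds checking matters here.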
Meanwhile, Muizelaar's work has combined the IDCT and the summation with the predicted block into a single function. Doing this reduces memory transfers and therefore reduces cache pollution. Koenig is implementing Muizelaar's work on ARM processors.
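The idea of fusing the inverse transform with the prediction sum can be sketched in C as follows. This is an illustration only, not Muizelaar's code: it assumes a 4x4 block and uses the simplest possible inverse transform, the DC-only case, in which every residual equals (dc + 4) >> 3. The function names are hypothetical, and the code assumes an arithmetic right shift for negative values, as common compilers provide.

```c
#include <assert.h>
#include <stdint.h>

/* Clamp a reconstructed sample to the 8-bit pixel range. */
static uint8_t clamp_u8(int v) {
    return (uint8_t)(v < 0 ? 0 : (v > 255 ? 255 : v));
}

/* Two passes: store the residual block, then add it to the prediction. */
static void recon_two_pass(int dc, const uint8_t *pred, uint8_t *dst, int stride) {
    assert(stride >= 4);
    int16_t residual[16];          /* intermediate buffer: extra memory traffic */
    int r = (dc + 4) >> 3;         /* DC-only inverse transform (assumed) */

    for (int i = 0; i < 16; i++) {
        residual[i] = (int16_t)r;
    }
    for (int y = 0; y < 4; y++) {
        for (int x = 0; x < 4; x++) {
            dst[y * stride + x] = clamp_u8(pred[y * stride + x] + residual[y * 4 + x]);
        }
    }
}

/* Fused: compute each residual and add it to the prediction in one pass. */
static void recon_fused(int dc, const uint8_t *pred, uint8_t *dst, int stride) {
    assert(stride >= 4);
    int r = (dc + 4) >> 3;

    for (int y = 0; y < 4; y++) {
        for (int x = 0; x < 4; x++) {
            dst[y * stride + x] = clamp_u8(pred[y * stride + x] + r);
        }
    }
}
```

Both functions produce the same pixels; the fused version simply never writes the residual block to memory and reads it back, which is where the reduction in memory transfers comes from.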
Speaking of embedded processors, Koleszar ended his post with a description of the work being done on non-desktop platforms. Fritz Koenig is working to optimize the VP8 codec for the Atom platform, quite a task considering the x86 assembly code for the codec was written for out-of-order processors.
The Atom, of course, is in-order, so Koleszar and company are debating scheduling the code for Atom and then checking to see what performance issues arise on other x86 processors. Regardless, Koleszar notes that a lot of work lies ahead.
Finally, he spends some time on intrinsics and whether he and his fellow programmers should use them when trying to optimize the codec for multiple processors and platforms.
"If you have experience in dealing with a lot of assembly code across several similar-but-kinda-different platforms, these maintainability issues might be familiar to you. I hope you'll share your thoughts and experiences on the codec-devel mailing list," Koleszar said.
Comments
dgdg
They use Orc, it seems to help them a lot:
http://code.entropywave.com/projects/orc/