But, the subjective test was not good enough for me. I surfed the Web in quest for some objective stuff, but the only thing I've found is the test image at MP3bench. Of course, ALL of the VQF pages have links to some kind of a "comparison", but it's always the same useless table.
So I decided to put some real data on the Web. I've done some testing myself and here are the results:
I took a sequence of wide spectrum audio data (including
the frequencies up to 22 kHz) and I coded it with MPEG Layer3 Producer
(128 kbps Joined Stereo) and Yamaha SoundVQ Encoder 2.50b1 (48 kbps/ch.
Stereo), with the best quality offered by the coders. Then I wrote a 3D
spectrum analyzer in MATCAD, and made contour plots of the spectra before
and after coding. And here it is! Colors vary from red (peaks in power
spectra) to blue and violet (the lowest signal power). The horizontal axes
are normalized time axes (0-174 ms), and the vertical axes are normalized
frequency axes (0-22050 Hz). Well, judge by yourselves!
![]() |
![]() |
More examples:
![]() |
But, VQF is not better at everything! Here is an example of one problem with VQF which I discovered accidentally, only in spectrum analysis (I guess I didn't listen very well). It's the pre-echoes problem, which has been solved in MP3 by a technique called "window switching".
The pre-echoes appear when a quiet sequence is followed by a strong percussive sound (with short attack time) like castanets or a triangle. Such input causes relatively high instantaneous quantization errors, and transform-based coding spreads these errors over the time domain. The solution is to use short coding blocks (windows) to limit the echoes spreading, and then switch back to longer blocks after.
![]() |