Reference:
Teuvo Kohonen, Ilari T. Nieminen, and Timo Honkela. On the quantization error in SOM vs. VQ: A critical and systematic study. In José Carlos Príncipe and Risto Miikkulainen, editors, Proceedings of WSOM'09, volume 5629 of Lecture Notes in Computer Science, pages 133–144. Springer, 2009.
Abstract:
The self-organizing map (SOM) is related to the classical vector quantization (VQ). Like in the VQ, the SOM represents a distribution of input data vectors using a finite set of models. In both methods, the quantization error (QE) of an input vector can be expressed, e.g., as the Euclidean norm of the difference of the input vector and the best-matching model. Since the models are usually optimized in the VQ so that the sum of the squared QEs is minimized for the given set of training vectors, a common notion is that it will be impossible to find models that produce a smaller rms QE. Therefore it has come as a surprise that in some cases the rms QE of a SOM can be smaller than that of a VQ with the same number of models and the same input data. This effect may manifest itself if the number of training vectors per model is on the order of small integers and the testing is made with an independent set of test vectors. An explanation seems to ensue from statistics. Each model vector in the VQ is determined as the average of those training vectors that are mapped into the same Voronoi domain as the model vector. On the contrary, each model vector of the SOM is determined as a weighted average of all of those training vectors that are mapped into the "topological" neighborhood around the corresponding model. The number of training vectors mapped into the neighborhood of a SOM model is generally much larger than that mapped into a Voronoi domain around a model in the VQ. Since the SOM model vectors are then determined with a significantly higher statistical accuracy, the Voronoi domains of the SOM are significantly more regular, and the resulting rms QE may then be smaller than in the VQ. However, the effective dimensionality of the vectors must also be sufficiently high.
Suggested BibTeX entry:
@inproceedings{kohonen2009wsom,
author = {Teuvo Kohonen and Ilari T. Nieminen and Timo Honkela},
booktitle = {Proceedings of WSOM'09},
editor = {Jos\'{e} Carlos Pr\'{i}ncipe and Risto Miikkulainen},
pages = {133-144},
publisher = {Springer},
series = {Lecture Notes in Computer Science},
title = {On the Quantization Error in {SOM} vs. {VQ}: A Critical and Systematic Study},
volume = {5629},
year = {2009},
}
|