This article is from the MPEG FAQ, by Frank Gadegast phade@cs.tu-berlin.de with numerous contributions by others.
Today, there is no alternative to expensive listening tests. During the ISO-
MPEG process, a number of international listening tests have been
performed, with a lot of trained listeners. All these tests used the "triple
stimulus, hidden reference" method and the "CCIR impairment scale" to
assess the sound quality.
The listening sequence is "ABC", with A = original, BC = pair of original
/ coded signal with random sequence, and the listener has to evaluate both
B and C with a number between 1.0 and 5.0. The meaning of these values
is:
5.0 = transparent (this should be the original signal)
4.0 = perceptible, but not annoying (first differences noticable)
3.0 = slightly annoying
2.0 = annoying
1.0 = very annoying
 
Continue to: