by
Arthur M. Noxon
Acoustic Sciences Corp.
Eugene, OR 97402 U.S.A.
ABSTRACT
Coherent and incoherent reflections are very
different, both in physical and psychoacoustic properties. Perception
effects such as imaging and musicality are very sensitive to the
type and tuning of reflections off nearby surfaces. Coherent reflections
can have strong correlation coefficients and add information to
the direct signal. Incoherent reflections with random phase signals
are weak in correlation and provide strong masking effects.
INTRODUCTION
Diffusion is the process of mixing up sound.
In a 100% diffuse sound field, there is no sense of acoustic direction,
sound comes equally from all directions. Diffusion may be at times
a desirable condition for acoustic energy. It is created by a sequence
of diffusing reflections. A sound reflection can be either coherent
or incoherent. This quality is very important to be specified because
the coherence of a reflection has a significant sound masking effect
on perception.
SOUND DIFFUSERS
A device that helps to develop
the state of diffusion by increasing the scattering of sound is
called a sound diffuser. There are four types of sound diffusion
mechanisms.
1. Diffraction (sound bends around corners)
2. Refraction (turns by changing wave speeds)
3. Reflection (changing direction upon impact)
4. Resonance (resonant storage and reradiation)
The first three sound turning mechanisms
are pretty well known. They change the direction of sound but not
the time wise evolution of the waveform itself. The scattered sound
has the same sonic signature as the incident sound, they are highly
correlated and therefore a coherent diffusion process takes place.
The last process, resonance, is not usually
considered to be a sound diffuser. Incident sound on a resonator
will stimulate the build up and decay of sound in the resonator.
Resonant discharges are often practically point sources and so the
reradiated sound is well distributed in space. The sound of a ringing,
resonant decay has its own time wise evolution. The incident wavetrain
will have a pressure vs. time signature that is not followed by
the sound of the ensuing resonant decay. Correlation between the
incident waveform and the resonant discharge is very low. Resonance
forms the basis for an incoherent class of sound reflections.
TIME DELAYED REFLECTIONS
AND PERCEPTION
There are distinct time periods
that relate to the various properties of perception. Reflections
within the first few milliseconds following the direct sound belong
to localization, i.e. the perception as to where sound is coming
from. Reflections within the next 30 to 50 ms belong to fusion,
the develop- ment of sound tone recognition. Reflections outside
of 60 ms develop the impression of echo and ambience. The coherency
of reflections with respect to the direct signal may well effect
the quality of perception differently in each of these three time
regions. Once this relationship is known, it can be utilized by
recording engineers and acoustic designers to better achieve desired
performance.
Reflections of sound that follow
the direct signal within 50 ms are not distinctly heard but are
blended together, fused into a composite sound. If only one reflection
is heard, the phase add and cancel comb filter coloration effects
will be heard. If there are many reflections, randomly off set in
time, the phase add effect averages out to zero and the composite
sounds just like the direct signal. Whenever reflections do not
sound like the direct signal, the composite also does not sound
like the direct signal. The goal of this paper to introduce and
measure coherent and incoherent reflections and then to subjectively
evaluate the impact of each when audited within the 50 ms sound
fusion time window. It will be shown that incoherent reflections,
which may be acceptable in the 60 ms plus time period as ambience
or echo signals, are degrading to musical quality if perceived during
the 50 ms sound fusion period.
This summation or coloration of signals smeared
together within the 50 ms perception window is a distinct aspect
of tone recognition but not the whole picture for listening. It
does not account for the consequence of variations in the time ordered
detail of the harmonic structure in the attack transient. The accuracy
of musical quality belongs to the 20 ms attack transient. It is
the only event in which the timing and the phase alignment of the
overtones in complex signals is detectable. Over the last few years
speaker manufacturers have recognized and accommodated both tune
and phase alignment in the design of speakers. It is no longer sufficient
to know how the sound level of each of each partial varies with
time, we must also have correct time alignment and phase of the
partials. One technique that measures in this area of psychoacoustic
perception is the correlation test.
COHERENT REFLECTIONS
Correlation
is a measure of how similar one signal is to another. If a direct
signal (Figure la) causes a simple time delayed reflection (Figure
Ib) the correlation factor between the two signals (Figure Ic) is
zero everywhere in time except at the time delay and then their
correlation is 100%.
If the specular reflection is splintered,
scattered out in time, correlation will still exist, but spread
out over the range of time over which the multiple reflections take
place. Two types of splintered reflection systems were tested and
both show correlation to exist over a longer time period than that
of the single flat wall bounce.
In Figure 2a is shown a reflection/absorption
diffusion grid that is composed of alternating depth of reflecting
surfaces interspersed with sound absorbing segments. In this system,
every other reflector is curved to backscatter over a wider angle
than the adjacent flat reflecting strips.
The Figure 2b shows the ETC for the reflection
of 400 to 20K. The multiple reflections are spread over a 2 ms time
period. The correlation measurement between the direct signal and
the reflection (Figure 2c) also shows a 2 ms wide correlation. Each
of the time delayed, scattered reflections is specular, a coherent
and faithful reproduction of the direct signal.
A different type of diffuser is composed
of a set of troughs at various depths, shown in Figure 3a. High
frequency sound entering these wells ricochet some number of times
depending on the angle of incidence and the well depth. The ETC
of Figure 3b shows a spread in time of the reflected signal of about
5 ms. The correlation for this diffuser using 1/3 octave noise at
3K is (Figure 3c) also spread over a 5 ms period of time. This short
wavelength reflection is coherent.
INCOHERENT REFLECTIONS
Zero correlation occurs when the reflected
signal bears little to no resemblance to the direct signal. This
can occur when the reflected signal has no amplitude because it
was absorbed. Incident sound onto 2" of medium density fiberglass
does not reflect 1/3 octave noise at 3K. Figure 4a shows the ETC
of this absorbed "reflection". The correlation test in
Figure 4b shows "zero" because the silent reflection bears
no resemblance to time wise signature of the direct signal.
There is zero correlation if the reflecting
signal is not really reflected at all but instead is an independent
sound. A whistle tone at 1K was played while the direct 1/3 octave
noise at 3K was tested. The tone bears no resemblance to the direct
signal and Figure 5 shows zero correlation.
MIXED REFLECTIONS
A series of correlation tests
were run on each of three types of reflecting surfaces. The signal
used was 1/3 octave bandwidth noise on 1/3 octave centers between
125 Hz and 3 KHz. The correlation between the direct signal and
the reflection was made for flat wall bounces, for the absorption/reflection
diffuser and for the multidepth trough diffuser. For the higher
frequencies the correlation of all three reflectors is strong with
the time window being spread out according to the degree of multi-reflections
for each.
The test set up for this sequence (Figure
1) uses an incident angle of 45° and picks up the reflection
also about 45°. There are two data collecting runs. The first
one (Figures 6 through 12) ranges in 1/3 octave increments from
125 Hz to 500 Hz using 1/3 octave pink noise. The analyzer steps
0.2 ms, just over 100 times to draw out the correlation curve. The
correlation time window is just over 20 ms and is time delayed sufficient
to catch the leading edge of the reflecting wavefront.
Because narrow band noise
is used, the correlation signature will appear as a sine wave of
the frequency that is the center frequency of the 1/3 octave noise.
The amplitude of the correlation measurement depends on the amplitude
of the received signal and how similar it is to the direct signal.
The absorption/reflection diffuser should provide some attenuation
of the correlation signal due to reduced reflecting signal strength.
The random well depth diffuser has no absorption and any loss in
correlation amplitude can be related to either an off axis concentration
of reflected sound (lobe beaming) or a signal correlation problem.
The 125 Hz through 500 Hz survey shows the
absorption/reflection diffuser to mimic the bare wall bounce faithfully
except for a full bandwidth amplitude reduction due to absorption.
The random well depth diffuser has a thinning of correlation in
the 200 Hz 1/3 octave band (Figure 8c) and again at 500 Hz (Figure
12c). The 200 Hz incoherent reflection is attributable to wood panel
resonance and the 500 Hz problem belongs to the 1/4 wavelength resonance
of the deepest wells.
A higher frequency series
(Figure 13 through 15) shows the same test except the step in 50
ms, four times faster than before. The full test window now is 5
ms. The wave form appears to be longer but only because the time
scale is shorter. The weak correlation for random well depth diffusers
still exists at 1000 Hz (Figure 13) but by the 2K octave and above
both diffusing systems have full and adequate correlation except
that the random depth wells have additional multiple reflections
(Figure 15) that stretch out over the 5 ms window.
The ETC for the random depth
well diffuser was taken with the mic in the bottom of one of the
deep wells for frequencies between 200 Hz and 20K Hz. The only indication
of possible resonance effects is (Figure 16) the rapid drop of initial
reflections followed by a resurgence of energy discharge between
4 and 7 ms after the initial reflection. The waterfall (Figure 17)
was taken to try to identify the resonance. It ranges from 50 to
500 Hz over a time period of almost 100 ms. By using the heavy time
averaging window of 40 ms, the structural resonance effects below
250 and the 1/4 wavelength at 375 Hz become evident.
There seems to be low correlation when the
reflected signal is involved with resonance even though the energy
of the reflection is high. The resonant discharge produces a tone
that has its own time wise identity, not a simple time delayed and
coherent reflection of the direct signal.
INCOHERENT REFLECTIONS
AND PERCEPTION
There is a subjective aspect
to incoherent reflections. The demonstration of this effect was
first performed in a respected hi end audio manufacturer's demo
room at the 1988 CES, Las Vegas. The audio playback system had random
depth well diffuser panels behind and between the speakers, set
up to diffuse the front wall bounce.
A CD track was played that had solo classical
guitar work. The perceived musical quality of a plucked low E guitar
string was radically affected by the reflections out of the random
well depth diffuser. All 15 people in attendance simultaneously
could repeatedly witness this effect. Its characteristic was identified
as being a "colorless note". The fundamental string tone
was present but its expected rich harmonic structure seemed to be
obscured.
When the random depth well diffuser panel
was covered over with a blanket and the musical section was replayed;
the easily recognized and all so familiar sound of a plucked acoustic
guitar string returned. High frequency string sounds did not seem
to have this "colorless" quality, only the lower frequencies,
those with substantial transient partials in the middle octaves,
250 to 750 Hz.
ATTACK TRANSIENT
FIDELITY
Correlation of continuous sound
is a straight-forward statistical sampling process. Trying to do
correlation on the attack transient of a plucked guitar string is
more difficult because of the short time period of the attack and
the long time period of the sustain. A study of the attack transient
wave form itself does show the effects of different types of reflecting
surfaces.
The signal out of a plucked electric guitar
string is shown (Figure 18) with rapid harmonic detail changes in
the first 50 ms for a 125 Hz note. The overall long term spectrum
for this pluck (Figure 19) shows strong and regular upper partials.
The signal was recorded and played back over a small, average speaker.
Its sound was reflected off of the three types of surfaces and captured
by a mic and storage scope. The first 40 ms of each bounce shows
the evolution of the attack transient into the sustain wave form.
The first pluck (Figure 20)
series shows a substantial initial 10 ms transient difference between
the random well depth diffuser and the other two. Beyond the attack
transient is seen a wave form change. With the random well depth
diffuser there is strong third harmonic detail added to the positive
peaks of the fundamental.
A second pluck at 125 Hz was recorded, this
time with more harmonic detail due to a shifted finger position.
Again the specular/absorptive reflection is very similar (Figure
21) to the wall reflected signal except for reduced amplitude. The
random depth well reflector shows again the first 10 ms attack transient
distortion. It also shows harmonic distortion in the sustain particularly
with accentuated rise times in the positive part of each fundamental
peak.
The persistent upper partial
distortion in the 3 to 400 Hz region out of the random depth diffuser
led to another test, this time at 400 Hz. Again, serious distortion
(Figure 22) in the first 50 ms of the attack transient is observed.
Also in the sustain is seen more than simple reduction of levels
as with the specular/absorptive diffuser. Here, every other cycle
is louder and sharper peaked while adjoining pulses are quieter
and more grounded than the other two reflecting surfaces.