
Re: Quad & Discrete 5.1

Date: 1998-11-29 11:27
From: Richard Dobson
Subject: Re: Quad & Discrete 5.1
So often it can be the case that people can understand quite different things
by the same term. I have not come across this explanation of the
'hole-in-the-middle' effect before - inter-speaker angle. I always understood
that the primary problem with angles above 60 degrees is poor localization.

My main reference for this, as for so many things, is F.R. Moore's book
'Elements of Computer Music' - pp 353-359, 'Intensity panning'. He refers to a
simple linear crossfade between adjacent channels:

"This effect, which has been called the 'hole-in-the-middle effect', is due to
the fact that in the midway position both loudspeakers will produce the sound
with one-half the amplitude associated with the endpoints of the path.
...
	In other words, the distance cue based on intensity would imply that the
sound is sqrt(2) = 1.414 times farther from the listener at the midpoint of
its travel than it is at either endpoint."
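Moore's arithmetic can be checked directly. A minimal sketch in Python (mine, not from the book): with a linear crossfade, each channel carries half amplitude at the midpoint, so the summed power drops to half, and since intensity falls off as 1/d^2 the implied distance ratio is sqrt(2).

```python
import math

def linear_pan(pos):
    """Linear crossfade: pos in [0, 1], 0 = hard left, 1 = hard right."""
    return 1.0 - pos, pos

# At either endpoint one speaker carries the full amplitude: power = 1.
l, r = linear_pan(0.0)
endpoint_power = l**2 + r**2          # 1.0

# At the midpoint each speaker carries half amplitude.
l, r = linear_pan(0.5)
midpoint_power = l**2 + r**2          # 0.25 + 0.25 = 0.5

# Power falls off as 1/d^2, so the implied distance ratio is sqrt(2).
implied_distance_ratio = math.sqrt(endpoint_power / midpoint_power)
print(implied_distance_ratio)         # ~1.414
```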

He then presents constant-power panning techniques as simply a more correct
way to pan sounds. Nowhere does he suggest it as a solution to the acoustic
problem of speaker placement or poor recording technique.

Curtis Roads presents the same argument in his more recent book 'The Computer
Music Tutorial', pp 459-461.

The possibility of such a simplistic linear crossfade probably never arises
for recording engineers (especially those using analogue equipment), as faders and
panpots are (probably?) never linear anyway. However, it is commonly done by
synthesists/programmers, who are thus much more prone to the H.I.M. effect.

So, in fact, constant-power panning ~increases~ signal intensity at the
mid-point, by comparison with a linear crossfade pan.
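The difference is easy to see with the usual sine/cosine equal-power law (a common formulation; the function below is my own sketch, not a Csound opcode): the midpoint gain per channel rises from 0.5 to cos(45 degrees), about 0.707, and the summed power stays at 1 across the whole pan instead of dipping to 0.5 at the centre.

```python
import math

def constant_power_pan(pos):
    """Equal-power pan: pos in [0, 1] mapped onto a quarter circle."""
    theta = pos * math.pi / 2
    return math.cos(theta), math.sin(theta)

# Midpoint: each channel at ~0.707 instead of 0.5.
l, r = constant_power_pan(0.5)
print(round(l, 3), round(r, 3))       # 0.707 0.707

# Summed power is constant at every pan position, where a linear
# crossfade would dip to 0.5 at the centre.
for pos in (0.0, 0.25, 0.5, 0.75, 1.0):
    a, b = constant_power_pan(pos)
    assert abs(a**2 + b**2 - 1.0) < 1e-12
```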

Composers using software such as Csound are creating artificial sounds which
have never existed (and perhaps could not exist!) in the physical world. So
'good recording technique' is neither a possible fault, nor a solution. On the
other hand, composers are frequently very concerned to achieve good
stereophonic or even periphonic imaging of their sounds, so the more
information we can have about how to do that, the better!

Part of the problem may be that most sounds are still nominally monophonic at
the moment of synthesis (i.e. have no distance or acoustic location at all),
and rendering into multiple channels is achieved by a combination of intensity
panning (pairwise) and reverberation. In many cases, the final localisation of
a sound is achieved not at the composition stage, but in final live projection
where the composer uses a mixing desk to direct sounds (which may be on
anything from stereo to eight-channel tape) anywhere amongst the available
speakers. The composer is thus entirely dependent on the sophistication or
otherwise of the desk (faders? joysticks?) to control location and intensity.
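The pairwise intensity panning mentioned above can be sketched as follows: a mono source at a given azimuth drives only the two adjacent speakers of a ring, with an equal-power crossfade between them. This is a minimal illustration under my own assumptions (evenly spaced speakers, equal-power pairwise law), not any particular desk or Csound implementation.

```python
import math

def pan_ring(angle, n_speakers):
    """Pairwise equal-power pan of a mono source onto n speakers
    spaced evenly on a circle; angle in radians, counter-clockwise."""
    gains = [0.0] * n_speakers
    span = 2 * math.pi / n_speakers           # angle between adjacent speakers
    pos = (angle % (2 * math.pi)) / span      # fractional speaker index
    i = int(pos) % n_speakers                 # nearer speaker of the pair
    frac = pos - int(pos)                     # position within the pair
    theta = frac * math.pi / 2
    gains[i] = math.cos(theta)
    gains[(i + 1) % n_speakers] = math.sin(theta)
    return gains

# A source halfway between speakers 0 and 1 of an 8-speaker ring:
g = pan_ring(math.pi / 8, 8)
# Only the adjacent pair is driven, and summed power is still 1.
assert abs(sum(x * x for x in g) - 1.0) < 1e-12
```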

To solve this problem at the compositional stage, composers need to define not
only the sound of an instrument, but also the acoustic environment in which
that sound will be placed, and probably trajectories along which that sound
will move in that space. This implies knowledge, to some degree, of final
speaker placement, much as some composers exploit the placement of first and
second violins at opposite sides of the orchestra, or the location of choirs
in multiple galleries in St Mark's, Venice. Of course, these placements cannot
be guaranteed, and the only base-line performance medium on which modern
electro-acoustic composers can rely is still stereo. Some composers (notably
Ambrose Field) are looking to Ambisonics as the final solution to the
'diffusion problem'; but even there, the technology is still predicated on the
rendering of a 'real' acoustic space, and electro-acoustic composers are
notorious for wanting to do 'unreal' things too... 

As a final codicil to this, I would like to mention the Microsoft proposals
for a 'wave-format-positional' soundfile format. The proposals are at the
moment nothing more than questions, eg:

  Assumption is that the listener is at the origin [0,0,0]. Is this valid?
  Is the origin between the ears, or are the ears coincident in space?
  Handle inter-aural distance, HRTF?
  Are Cartesian coordinates (x,y,z) better than spherical or cylindrical?
  How does a channel indicate it is not spatially positioned?
  Should we accommodate moving sound sources?
  Can we accommodate moving listeners?

There are others; the whole white-paper on the new proposed formats is at the
music-dsp site:
http://shoko.calarts.edu/~glmrbuy/musicdsp/sourcecode/mulchaud.rtf

For this format to materialize, Microsoft needs feedback; my contact, Martin
Puryear, is very responsive and keen to get feedback from everyone.

Whatever some people may think of Microsoft generally, I am pleased that these
initiatives are happening. It is hard to imagine Apple doing anything similar
with AIFF, given their traditional indifference to audio!

Just possibly, the development of this format might lead to new facilities for
composers, not least in Csound, to define the acoustical attributes of a sound
in a generic way (intensity, directivity, position in 3d space, etc), knowing
that the results will be diffused as accurately as the playback hardware will
allow. Then, all questions of panning algorithms can, at last, be re-delegated
to the sound engineer. At the moment, Csound requires us to be both, which is
why all these confusions have arisen in the first place. 


Richard Dobson

Xopher wrote:
> 
> I'd like to clarify some things about constant power panning and the "hole
> in the middle" problem.  In particular, constant power panning is not the
> solution to this problem.  The "hole in the middle" problem arises when the
> listener is positioned so the angle between the left speaker, the listener,
> and the right speaker is larger than about 60 degrees.  Remember that the
> stereophonic technique simply creates the illusion of a spatialized sound;
> this illusion simply breaks down when the above angle is greater than 60
> degrees.  Note that this problem also implies that one needs at least 6
> speakers to pan a sound in a circle.  Constant power panning potentially
> makes this problem worse, because it lowers the volume of sound from each
> speaker when the sound is centered.  The problem is solved by good
> loudspeaker placement.  (and good recording technique, too.)
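For reference, the six-speaker figure in the quoted message is simple arithmetic: if the inter-speaker angle seen from the listener must stay at or below 60 degrees, a full circle needs at least 360/60 speakers.

```python
import math

max_angle_deg = 60                        # quoted inter-speaker limit
min_speakers = math.ceil(360 / max_angle_deg)
print(min_speakers)                       # 6
```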