Proceedings of the International Federation of Automatic Control
June 27-29, 1995, Boston, MA
The Relationship of Binocular Convergence
and Errors in Judged Distance to Virtual Objects
Stephen R. Ellis, Urs J. Bucher and Brian M. Menges
NASA Ames Research Center
Moffett Field CA, 94035
silly@eos.arc.nasa.gov
The full paper may be viewed in PostScript format.
Abstract: Errors in judged depth
of nearby virtual objects presented via see-through helmet mounted displays
are shown to be linked to changes in binocular vergence. This effect varies
measurably across the subjects examined and correlates with the magnitude
of the subjects' individual depth judgment errors. The relationship is demonstrated
by visually superimposing computer-generated virtual objects on physical
backgrounds in a manner similar to that suggested for some practical applications.
Suggestions for improved control of virtual objects under these viewing
conditions are briefly discussed.
Keywords: see-through display, virtual
objects, stereo vision, computer-aided work, binocular convergence
1. Judged Distance to Virtual Objects in the Near
Visual Field
The perceptual cues to space have been classically divided into either
static or dynamic or classified either as monocular or binocular information
sources. More recent analyses of depth perception focusing on the behavioral
affordances of vision have usefully reclassified the classical depth cues
into three categories, those important with respect to personal space (2m
~ 1-2 eye heights), those relevant for action space (3-30 m ~ 2-20 eye heights),
and those relevant for vista space (>30m ~ >20 eye heights). (Cutting
and Vishton, in press).
This reclassification of sources of information concerning the spatial
layout surrounding a viewer is particularly useful since it focuses attention
on what vision is to be used for in each of these distinct regions. Depending
on the category of relevance, different cues to depth have varying importance.
In particular binocular convergence and accommodation play roles mainly
relevant for personal space associated with coordinated, manipulative activity.
Understanding the interaction of these physiological responses and depth
perception will have growing importance as head mounted displays of virtual
objects are introduced into the workplace.
Head or panel mounted see-through displays of conformal, computer generated
imagery have been used in aircraft cockpits for many years as heads-up displays
(Weintraub & Ensing, 1992) or as helmet mounted sights, i.e. the Honeywell
sights. But these applications have almost universally presented users with
virtual images at the far end of action space or into vista space. More
recent applications of such displays are designed to present to their users
spatially conformal computer generated virtual objects for medical and manufacturing
applications (Rolland, 1994; Janin, Mizell & Caudell, 1993; Azuma &
Bishop, 1994). This work has focused attention on precise calibration of
the displays and also on perceptual phenomena that might degrade performance
even in well calibrated systems.
Previous reports have indicated that indeed such phenomena are observable.
The present two experiments are designed to investigate their causes and
practical implications. In particular, previous observations (Ellis &
Bucher, 1994) have shown that optical superposition of a virtual object
on a physical backdrop changes its judged position. In particular, if a
physical surface is introduced at the judged depth of the stereoscopic virtual
image constituting the virtual object, the virtual object is judged t o
be closer to the observer. This effect is enhanced by slowly moving the
physical surface. Since a change in rendering to make the virtual image
completely occlude the backdrop did not affect its judged depth and since
the motion of the backdrop is like ly to have attracted visual attention
and binocular convergence (Ellis & Bucher, 1994), it was concluded that
the change in judged depth was not due to the occlusion. Rather, it was
suspected that the effect was due to an increase in binocular convergen
ce associated with the physical object. The following experiment is a direct
test of this hypothesis using an unobtrusive, nonius technique to detect
convergence.
2. Measurment of Changes in Static Convergence
During Viewing of Virtual Objects
2.1 Methods
Stimuli
A stereoscopic virtual image of an upside down, axially rotating (2 rpm)
tetrahedron was presented at a distance of 108 cm away from the subjects'
eyes by a head-mounted see-through display. This display was operated under
normal room illumination and wa s an improved, previously described device
(Ellis & Bucher, 1994). One diopter accommodative relief was provided
for the virtual image only. Display resolution was better than 5 arcmin.
The depicted size of the presented virtual object was randomly scaled
from 70 to 130% of its nominal size for each trial preventing subjects'
use of angular size as a depth cue. The wire-frame tetrahedron had a nominal
10 cm base and 5 cm height. The width of the wire frame lines was about
9 arcmin. The lines of the wire frame an all other computer generated lines
had a luminance of about 65 cd/m2. and were seen against a 2.9 cd/m2 gray
cloth background placed 2.2 m from the subject. Occasional varia tions in
the depicted depth of the tetrahedron which were not analyzed further were
also introduced to insure that the subjects did not notice that the same
depicted depth was repeated. But these additions proved to be unnecessary
due to large and variable, experimentally induced perceptual effects that
influenced the judged depth of the target.
A physical surface that was introduced along the line of sight to the
tetrahedron was provided by a rotating checkerboard made for foamcore. The
checkerboard was a disk 29 cm in diameter with 5 cm black and white checks
having either 1.3 cd/m2 or 17.8 cd/m2 luminance.
Subjects
Five men and one woman with stereo resolution of better than 1 arcmin
as measured with the stereo vision test on an Orthorater participated in
the experiment. Some subjects had vision corrected by contact lenses or
glasses and were able to wear their corrections during the experiment. Subjects'
ages ranged from late teens to late '40's and included laboratory personnel
as well as paid subjects recruited by a contractor at Ames.
Task
The first part of the subject's task was to mechanically place a yellow-green
LED (about 20 cd/m2) pointer under the nadir of the slowly rotating, wire-
frame tetrahedron, which had a size randomly selected for each trail. After
aligning the pointer, the subjects were presented with two sets of nonius
lines just flanking the tetrahedron. These lines were then adjusted to equal
visual directions on each side of the tetrahedron by moving the lower left
and right segments (see Figure 1). The second part of
the task involved a second adjustment of the pointer to the tetrahedron's
depth after a slowly, irregularly rotating (~2 rpm) opaque checkerboard
was introduced along the line of sight to the tetrahedron. The tetrahedron
was presented a second time at the same depicted depth in this new configuration
but the experimental variations generally concealed this fact from the subjects
so that they believed each trial, with or without the checkerboard, involved
a potentially different depicted depth.
After the second judgment of the tetrahedron's depth, the nonius lines
were flashed briefly (ca 250 msec) next to the tetrahedron while the subjects
fixated it. Then the subjects made a forced choice indicating whether the
upper or lower pair of the f lashed nonius lines were closer. The eye assignments
of each segment of the nonius lines were randomly selected so the that meaning
of the alternative possibilities in terms of convergence or divergence varied
randomly across the trials. The assignment of the lower part of the left
nonius line and the upper part of the right line to one eye and the other
upper-lower pair to the other eye, produced a differential effect doubling
the relative misalignment for any given vergence change. The subject reported
whether the upper or lower segments of the paired nonius lines were closer.
Three different experimental conditions were used. In the "on"
condition the checkerboard was mechanically introduced at the judged depth
of the virtual tetrahedron object. For the "in front" condition
the checker- board was introduced 30 cm in front of the judged depth. In
the control condition the second judgment was a replication of the first
judgment in that the subject made a second judgment of the depth of the
virtual image. But this time instead of aligning the nonius lines, the subject
made the forced-choice judgment of the nonius lines configuration without
the addition of the checkerboard. The control thus was identical to the
experimental conditions except the checkerboard was not introduced into
the line of sight. This control, thus, provides a check on the subjects'
judgment bias or changes of their convergence during the experiment. Each
condition was repeated 15 times for each subject in a randomized block design
in which blocks of 5 replications of each condition were repeated. The 6
p ossible orders of the 3 conditions were distributed randomly across the
6 subjects in the experiment.
The change in judged distance of the virtual object, the tetrahedron,
was analyzed in a single factor repeated measures ANOVA. Chi-square analyzes
were conducted on each individual subject's distribution of judgments of
convergence/divergence for each of the 3 experimental conditions. Taking
the control condition as a baseline, the relative strength of convergence
could be measured by a likelihood ratio of the probability of convergence
in each experimental condition divided by the probability of convergence
in the control.
2.2 Results
Single factor repeated measures analysis of the effect of superposition
of the checkerboard and virtual images replicated the previous observations
that the virtual object was moved closer to the viewer (F(2,10)= 7.549 p
< 0.01). Individual data are shown in Figure 2. This
effect was somewhat stronger for the "on" condition than for the
"in front" case and varied in strength across the 6 subjects.
One subject interestingly showed essentially no effect.
Table 1.
Frequency of convergence and divergence during depth judgments of virtual
objects |
|
Converg. |
Diverg. |
Likelihood ratio of convergence to divergence |
On |
84 |
21 |
1.58 |
In front |
70 |
35 |
1.32 |
Control |
53 |
52 |
-- |
The cause of this individual subject's variation is iluminated by considering
all subjects' tendency to reltively converge during judgment of the depth
of the virtual object in the presence of the checkerboard. This tendency
is summarized for the experime nt in Table 1 which displays the frequency
of convergence or divergence indicated by the nonius judgments for all subjects
in the three experimental conditions (Chi- square = 20.37, df=2, p< .001).
The control case shows an almost perfect 50:50 break wh ile the other two
conditions show decided convergence, the "on" condition being
somewhat stronger.
Each subject's individual likelihood ratio of converence was computed
as the ratio of the probability of convergence in an experimental condition
to the probbility of convergence in the control. These ratios are plotted
in Figure 2 for each subject. A 2X2 Chi square contingency
was computed to compare the distribution of convergence and divergence for
each experimental condition to that of the control condiion. This was done
separately for each subject. Statistically significant distributions are
indicated by asterisks in Figure 2.
2.3 Discussion
The individual subject's results in Figure 2 are
sorted by the size of the change in the judged position of the virtual object
for the "on" condition. These results can then be compared with
the likelihood ratio of relative convergence . As is clear from the figure,
the two measurements are almost perfectly correlated across the subjects.
The only subject not to show a displacement of the virtual object caused
by the checkerboard, also is the only one to show essentially no relative
con vergence. The results for the "in front" condition show a
weaker apparent displacement of the virtual image but also show a correlation
convergence. The correlation of relative convergence with magnitude of displacement
for the "on" and "in front" condit ions is across subjects
and conditions, r= 0.894, df=10; (t=6.31, p < .002). The results generally
support the supposition that the change in judged depth could be due to
a change in convergence, but the mechanism underlying this change remains
to be clarified.
One possibility is that the effect observed here is the reverse of the
inward shift of accommodation with infinity-collimated virtual image displays
discussed at length by Roscoe (eg. 1991). In his case users of these displays
do not fully accommodate to infinity, but remain focused somewhat closer.
The reflex coupling of accommodation and vergence would be expected to change
the vergence as well. In the cases we investigate in the present experiment,
subjects might be accommodating beyond 1 diopter when the small, relatively
poor quality, ~20/100 acuity, virtual object is presented. Their accommodation
and vergence could be then brought closer by the insertion of the real surface
which provides a much bigger and sharper stimulus to accommodation and to
v ergence through the accommodation vergence reflex. Another view of this
change may be that a fixation disparity present during viewing of the virtual
object is removed by the physical target.
Further experimentation will test these alternatives and examine possible
relationships to perspective vergence (Enright, 1991) or proximal vergence
(Cuiffreda, 1992) which appear to indicate that higher-level spatial interpretations
of the visual image c an simulate the vergence system. The results from
the present experiment, while indicating that there is a clear oculomotor
response associated with the error in judged depth, do not resolve the question
of causation since the vergence change could produc e the change in judged
depth through disparity reinterpretation just as easily as a perceptual
interpretation of occlusion could produce a vergence response through proximal
vergence. This question can only be experimentally resolved if a viewing
conditio n can be found that would differentiate the lower level from higher
level convergence cues.
The present results were observed with a static eye point and can be
expected to change when significant head-movement coupled motion parallax
is introduced. But it is important to realize that many of the possible
new applications of head-mounted see- th rough displays involve situations
in which viewing will be relatively static and for which weight considerations
might suggest the use of monocular displays. The results of this experiment
indicate that such displays should have a variable focus control a nd probably
should be used with a bore-sighting procedure in which focus is adjusted
to a reference target so as to correct for any errors in depth due to inappropriate
vergence. Alternatively, the computer generated targets used with such displays
could be binocularly generated with distorted disparity to correct their
spatial perception, pushing them away from the viewer.
References
Azuma, R. and Bishop, G., 1994, Improving static and dynamic registration
in an optical see-through HMD, Proceedings of SIGGRAPH '94,
July 24-29, Orlando, FL.
Cuiffreda, Kenneth J., 1992, Components of clinicalnear vergence testing,
Journal of Behavioral Optometry, 3, 1, 3-13.
Cutting, J. E. & Vishton, P. M., 1995, Perceiving layout and knowing
distances: the integration, relative potency an d contextual use of different
information about depth, in W. Epstein and S. Rogers (Eds), Handbook
of Perception and Cognition, Vol. 5., Academic Press.
Ellis, S. R., 1991, Nature and origin of virtual environments: a bibliographical
essay, Computer Systems in Engineering, 2, 4, 321-327.
Ellis, S. R., 1994, What are virtual environments? Computer Graphics
and Applications, 14, 1, 17-22.
Ellis, Stephen R. and Bucher, Urs J., 1994, Distance perception of stereoscopically
presented virtual objects superimposed by a head mounted see through display,
Proceedings, 38th Annual Meeting of the Human Factors and Ergonomics
Society, Santa Monica CA.
Enright, J. T., 1991, Paradoxical monocular stereopsis and perspective
vergence, in Ellis, S. R., Kaiser, M. K., and Grunwald, A. J. (Eds), Pictorial
Communication in Virtual and Real Environments, London: Taylor &
Francis.
Foley, J. M., 1993, Stereoscopic distance perception, in Ellis, S. R.,
Kaiser, M. K., & Grunwald, A. J. (Eds), Pictorial Communication
in Virtual and Real Environments, London: Taylor & Francis.
Janin, A.L., Mizell, D.W. and Caudell, T.P., 1993, Calibration of head-mounted
displays for augmented reality applications, in Proceedings of IEEE
VRAIS '93, Seattle, WA.
Ritter, M., 1977, Effect of disparity and viewing distance on perceived
depth, Perception and Psychophysics, 22, 4, 400-407.
Rolland, J. P., Ariely, D. and Gibson, W., 1994. Towards quantifying
depth and size perception in 3D virtual environments, to appear in Presence.
Roscoe, S., 1991, The eyes prefer real images, in Ellis, S. R., Kaiser,
M. K., & Grunwald, A. J. (Eds), Pictorial Communication in Virtual
and Real Environments, London: Taylor & Francis.
Weintraub, D. J. and Ensing, M., 1992, Human Factors Issues in
Head-up Display Design: the Book of HUD, Wright Patterson AFB, OH:
CSERIAC.
The head mounted display used in this experiment was based on a mechanical
design by Ramon Alarcon.
Figure 1. Experimental procedure illustration. Top: alignment, magnification,
and interpupilary adjustment, Middle: Pointer and nonius lines adjustment,
Bottom: A variety of testing conditions are shown, the "on" or
"in front" placement of the rotating checkerboard and the flashed
nonius lines.
Figure 2. Results for 6 subjects sorted for magnitude of change in judged
distance in the "on" condition.
[papers]||[top]
Advanced Displays and Spatial Perception Laboratory
Human Information Processing Research Branch
Moffett Field, CA 94035-1000
|