Structure & Congruency

The Effects of Congruency Between Structural & Contextual Dominance in Image Processing

Methodology

Subjects

Definition of Variables

The Contextual Dominance Variable

The Image Structural Dominance Variable

The Image Complexity Variable

The Congruency Variable

Materials

Procedure

Gathering Eye-Movement Data

Recording Eye-Movement Data

Statistical Analysis

The Within-Group analysis

Analysis of Eye-Movement Data

Determining Dominance from Rank-Order Comparisons

Determining Contextual Dominance

The Between-Group Analysis

Congruency

Determination of Structural & Contextual Effects

The Effects across Complexity

Subjects

A total of 21 subjects were used in this study drawn from a graduate class in research design at Indiana University. 13 subjects across two experimental groups received events with the same sequence of images, the verbal context being the only difference between groups. 8 subjects made-up a third control group exposed to images only.

Definition of Variables

The Contextual Dominance Variable

The verbal contextual dominance variable was defined as a spoken phrase immediately preceding exposure to an image. For each stimulus event, contextual dominance was derived from the nouns within each phrase and then given points that were interpreted in terms of the image. Due to the need to compare the verbal contextual variable to the image structural variable, the contextual dominance was described in terms of the referents appearing in the image sectors.

The Image Structural Dominance Variable

The image was described in terms of two criteria, structural dominance and degree of complexity. The first level of image description, or structural dominance, was defined in terms of prioritized sectors of the image as assigned by a panel of visual design experts.

The Image Complexity Variable

The second level of analysis was that of complexity. Complexity was determined simply be tabulating the number of structural elements visible in the image, e.g. 10 lines, 20 circles, 6 shapes, etc. In this study images were grouped into simple, medium, and complex categories. The criteria for inclusion in the simple category was having less than 10 structural elements, in the medium category by having more than 10, but less than 50 elements, and in the complex category by having over 50 structural elements in the image.

The Congruency Variable

Identifying the degree of congruency between two groups was a statistical process of shared dominance comparisons. Based on the Experts' rankings of structural dominance, phrases were constructed which either referred to this dominance or referred to less dominant sectors of the image. Congruency was defined as an independent variable with two conditions. A "Congruent Condition" existed when the contextual phrase that was heard referred to elements identified as part of the image structural dominance. The phrase which referred to less dominant elements within the image was assigned to the non-congruent condition.

[go back to the BEGINNING of THIS document]

Materials

Fifteen images were divided into three groups of five according to the above complexity criteria. Structural dominance was determined and validated by a panel of visual design experts. Two contextual phrases were written for each image with one phrase written to match the structural dominance of the image and the second phrase focusing on weaker structural elements of the image. A sample from each of the three levels of complexity follow:

Simple Complexity

Fig 2 (Image #5)

Image Structure description: Shapes: 2 circular shapes (1/5 width diameter); Colors: black and blue on white background; Textures: solid (no textures); Relational: diagonally positioned with the blue circle in the upper/left ninth of the frame and the black circle in the lower/right ninth of the frame.
Contextual Phrases: Congruent: Since blue knights are instinctively afraid of black dragons, he ran.; Non-Congruent: Since black knights are instinctively afraid of blue dragons, he ran.

Moderate Complexity

Fig 3 (Image #8)

Image Structure description: Shapes: 1 hollow triangle shape made of moderately thick lines.; Colors: black on white background; Textures: solid (no textures); Relational: Isosceles triangle fills over 1/3 of frame and is centered in frame.
Contextual Phrases: Congruent: A pyramid points to the sky.; Non-Congruent: A pyramid is heaviest at its base.

Complex Complexity

Fig 4 (Image #14)

Image Structure description: Shapes: lines and a variety of shapes but mostly rectangles, 1/4 sphere, circle sections.; Colors: grayish beige, bluish green, white, yellow.; Textures: rough (stone-like), and slick; Relational: symmetrical 3-dimensional space, white object at bottom, illustration at top of image.
Contextual Phrases
: Congruent: The center alters were usually simple with only two candles and a cross.; Non-Congruent: The biblical phrase "The Lord is my shepherd" is reflected in the decorations around the half dome.

[go back to the BEGINNING of THIS document]

Procedure

Gathering Eye-Movement Data

Subjects were asked to sit in a specialized "head-stabilization" chair (Fig. 5) A small light was place adjacent to their right eye and a video camera was trained and focused on the eyeball and a portion of the light. The strategy was to record the reflection of the light on the cornea of the subject relative to the stationary light bulb. Following a registration process, the stimulus materials were presented.

Fig 5

Recording Eye-Movement Data

The videotape was subsequently played-back one frame at a time and the motion of the reflection of the light off of the subject's cornea was documented s being in one of nine sectors of the frame.

[go back to the BEGINNING of THIS document]

Statistical Analysis

The Within-Group analysis

Groups A, B, C, and the Expert Group were included in the analysis for each of the 15 images. All data were converted to percentages so as to allow for a direct comparison between the stimulus events and the Expert rankings. A Kendall Coefficient of Concordance (W ) was computed for each group associated with each image. This resulted in a rank-ordering of the means of fixations for each sector of the image, a Chi-Square, and a significance value for each group. If the result of the Kendall test was significant, then a more rigorous test of significance, the Friedman Two-Way Analysis of Variance by Ranks, was calculated. If significance was acceptable, calculated by the Friedman test, then a multiple comparison between individual rank order means was calculated to determine the significance of a sector's dominance (Siegal, 1988).

Analysis of Eye-Movement Data

The raw data, which consisted of an ASCII file of 75 data pairs, 1,C 2,C 3,B 4,E ... 75,H , were read into a HyperCard Stack and sorted into the total number of fixations registered in each of the nine sectors.

Determining Dominance from Rank-Order Comparisons

Dominance for experimental data (that which was derived from eye movement data), as well as the ranking by the experts, is defined as those sectors which produced significance from the Comparison of the Means. These sectors are then said to represent 100% for the Dominance for that experimental group of data and are discussed in terms of the percentage which each sector claimed of that dominance. For instance, in the example below(Fig. 6), the dominance for the congruent condition of image 8 (Sectors B and E) is 89% and 11% respectively.

Determining Contextual Dominance

Contextual dominance was determined by identifying the nouns within the contextual phrases and assigning points in relation to:

their position within the sentence,
the presence of visual modifiers that were also in the image, and
whether they were a focus of attention within the sentence.

The points accumulated for each noun were then assigned to the sectors of the image to which the noun referred. The sectors to which the nouns referred represented 100% of the contextual dominance and were represented in terms of the percentage which each sector claimed of the dominance.

[go back to the BEGINNING of THIS document]

The Between-Group Analysis

Congruency

A "Congruent Group" was defined experimentally as a group which received an audible contextual stimulus which contained a referent to a structurally dominant element within the image. The assignment of an experimental group to either the Congruent data group (CG-S) or the Non Congruent data group (NCG-S) was based on a strategic decision by the experimenter to match the contextual referents heard verbally with the structurally dominant attributes within the image.

The Degree of Congruency (°ree;CG) was a post-hoc statistic used to describe the relationship between data groups in terms of the overall similarity of their shared dominance. This statistic produced a number, from 0 to 100, with the most congruent condition being 100. This procedure was executed as follows (see Fig. 7):

Corresponding sectors within each of the two data groups being compared were sequentially added together producing a "Sector Sum". Also these same numbers were subtracted from each other to produce a "Sector Difference". The process was repeated for each sector as long as there were values present within the sectors being compared. The sector sums and the sector differences were totaled, then the difference between these totals was found, and this figure was divided by 2 to produce the final result. The formula for determining the Degree of Congruency (°CG) was as follows:

(SIGMA(sector sum) - SIGMA(sector difference)) ÷ 2 = °CG

For example, comparing the EXP group with the CG-S group below:

38 +89=127 & 31+11=42 (Sector Sum)=127+42=169
89-38=51 & 31-11=20 (Sector Difference)=51+20=71
(169-71)÷2= 98÷2= 49 = °CG

Fig. 7

Determination of Structural & Contextual Effects

In order to examine the effects of the structural dominance and contextual dominance on the Congruent and Non-Congruent data groups, the sector of primary dominance of each was compared. The percentage of the dominance accrued in the sector was considered to be representative of the contextual effect for that group. The same procedure was followed for the Non-Congruent data group.

A similar approach was utilized to determine the structural effect. The sector of primary dominance for the Expert data group determined which sector of the Congruent or Non-Congruent data group represented the structural effect for each group.

The Effects across Complexity

The images were grouped into 3 levels of complexity -- Simple (images 1-5), Moderately Complex (images 6-10), and Complex (images 11-15). By plotting the contextual effects for each image, an examination of the interaction effects of complexity was undertaken.

http://silver.ucs.indiana.edu/~appelman/D_THREE.html

The Effects of Congruency Between Structural & Contextual Dominance in Image Processing

Methodology

Subjects

Definition of Variables

The Contextual Dominance Variable

The Image Structural Dominance Variable

The Image Complexity Variable

The Congruency Variable

Materials

Procedure

Gathering Eye-Movement Data

Recording Eye-Movement Data

Statistical Analysis

The Within-Group analysis

Analysis of Eye-Movement Data

Determining Dominance from Rank-Order Comparisons

Determining Contextual Dominance

The Between-Group Analysis

Congruency

Determination of Structural & Contextual Effects

The Effects across Complexity

Subjects

Definition of Variables

The Contextual Dominance Variable

The Image Structural Dominance Variable

The Image Complexity Variable

The Congruency Variable

Materials

Simple Complexity

Moderate Complexity

Complex Complexity

Procedure

Gathering Eye-Movement Data

Recording Eye-Movement Data

Statistical Analysis

The Within-Group analysis

Analysis of Eye-Movement Data

Determining Dominance from Rank-Order Comparisons

Determining Contextual Dominance

The Between-Group Analysis

Congruency

(SIGMA(sector sum) - SIGMA(sector difference)) ÷ 2 = °CG

Determination of Structural & Contextual Effects

The Effects across Complexity

last updated: 8/24/96