Structural Equivalence - Eclectic Anthropology Server

Transcription

Structural Equivalence - Eclectic Anthropology Server
73
Social Networks, 1 (1978) 73-90
@Elsevier
Sequoia
S.A., Lausanne
- Printed
in the Netherlands
Structural Equivalence:
Meaning and Definition, Computation
Application*
and
Lee Douglas Sailer
University of cizlifornia, Irvine**
This paper presents a generalization of the concept of ‘Structural equivalence “* the key concept in algebraic approaches to the study of social
networks. Two points in a graph or set of relations will be called ‘Structurally related” if they are connected in the same ways to structural1.y
related points.
It is suggested that this new definition suitably weakens Lorrain and
White’s cutegorical approach, and is more appropriate. than CONCOR.
St~~t~~ra~ relatedness is compared to these approaches via several simple
examples.
Introduction
This paper is concerned with the algebraic approach to network analysis
(though
it isn’t always called that) exemplified
by White (19631, Boyd
(1969), Lorrain and White (197 l), Boyd et al. (197 l), Breiger et al. (1975),
Heil and White (1976), White et al. (1976), Boorman and White (1976),
D. White and Boyd (1977), and Boyd and Sailer (1978). These algebraic
ideas provide an underpinning for theories of social structure. These theories,
in turn, may be used to derive computational
definitions which accurately
reflect interesting aspects of the system under investigation. It is possible to
argue that the approach in this paper is consistent with the writings of Linton
(1936), Nadel(1957),
Merton (1957) and Goodenough
(1969).
The concept of “structural
equivalence” (sometimes called here SE) is
examined in four stages. First, the relationship
between structural
equivalence and standard
sociological concepts is discussed. Second, various
*In no way could this paper have been written without
the patience, persistance,
and perspicacity
of John Boyd and Douglas White. In addition, many of these ideas were a group creation of a seminar
at UC Irvine in 1977. This research was supported
in part by NSF grant #BNS 76-08386,
D. R. White,
H. Nutini, L. Brudner, Principal Investigators.
**Division of Social Sciencks, University of California, Irvine, Calif. 92717, U.S.A.
74
Lw Doughs Sailer
more detailed definitions
are contrasted.
Third, the details of one definition and a computational
procedure
for realising it are described. Fourth,
and finally, some applications are proposed.
Social structure,
roles and status
“Social structure
is the network of actually existing social relations”,
or at least so said Radcliffe-Brown
(1943: 190). “Social structure”
may be
taken to contrast with such systems as environment,
language, and beliefs.
Another common contrast is with function, content .and process. To understand my interpretation
of the quote above a careful distinction
must be
made. A “social relation” is not the tie between specific people, but rather a
set of ties in an entire population. The “network” is the patterning of social
relations over a set of persons, or positions, or groups, or organizations.
The
individual is important primarily as the vehicle for the extensive definition of
the relations in which we are interested. People are not social structure; the
interaction
of people may be structured,
but here I am concerned with the
interaction
of the interactions.
The interplay between patterns of kin ties
and patterns of economic ties, for example, is more interesting than (or at
least separate from) the relationship between two specific groups of people.
From this view, the specific actors are transitory.
They may change roles
through time even though the “social structure”
remains the same. (See
Nadel 1957: 16 - 17 for an early statement similar to this.)
Goodenough
(1969), following Linton, Merton, and Nadel, distinguishes
carefully between “status”, the rights and obligations of a role, and “social
identity”,
for Goodenough
the label for the special position occupied by one
of the actors in a specific relationship. ’ For Goodenough,
the methodological task is to find the rules which will translate informants’ beliefs about
the role structure (where “role” is the totality of status, social identity, and
other constructs) into behavior.
The approach here is different, but congruent. From data on the occurrence of interactions
(or of beliefs about them), can we define models for
which it will be easy to find rules relating the model to behavior? The social
scientist has two tasks before him then, to categorize
relations, and to
describe the relational interactions.
These tasks may entail the initial categorization of the actors in the network, i.e., by function.
Most occurrences in this paper of the technical (i.e., mathematical)
term
“relation” may be loosely replaced by the term “role”. For the purposes of
this paper, a role is a set of appropriate
behaviors exhibited by a pair of
actors in a particular context. A role may also be defined as a cover term or
gloss for certain individual attributes (which may be network attributes),
‘The terminology
used here is Goodenough’s.
The tarn
“role”
is used loosely
Structural equivalence
75
that is, a role is a function fulfilled by an individual. Important to the concept of role is the notion that a particular individual has certain connections
to other individuals, and that those others must themselves be in certain
positions.
Here it is assumed that roles are patterns of ties in observable and unobservable relations. The importance
of the reciprocal nature of roles is
thus sidestepped. The relations represent the actual occurrence of behavior,
and if subsets of the populations
are found to be reciprocal (e.g., doctors
and nurses), then let this be an external validation of the method. Certainly,
if “husband” and “wife” are relations in a data set, we would expect to find
that husbands have wives, but this is a different aspect of the problem. A
thorough treatment of this topic is necessary.
What is structural equivalence?
One of the easiest things to see in a network is who is connected to whom.
That is probably why so much effort has been expended in the development
of cluster and clique detection techniques. Cliques, however, are not to be
confused with roles. Here is an example of the difference. One would expect
the cliques in a kin network to be entities such as families, clans, lineages,
etc. The roles of interest in a kin network, though, are kin-types, e.g., father
and son. Families are certainly interesting structures in their own right, but
they are not roles. Rather, they are nodes in a higher order relation. Clusters
and cliques, as structures, correspond to such concepts as the family or clan.
The mathematical
structure which corresponds to the role, such as father or
boss, is the “block”. A “blockmodel”
is a set of such blocks and the relationships between
them.2 The cluster concept
is still relevant, however. A
“block” can be defined as a set of actors clustered together by virtue of their
structural equivalence.
Blocks are not defined by the amount of intra-role interaction,
as are
clusters, but by the intrinsic nature of the other blocks with which they
connect; e.g., judges interact with layers more than they do with other
judges; crooks interact with victims in a different way than they do with
other crooks. Notice that it is assumed that “status”, the rights and obligations of a role, is contained in or expressed by the patterns of ties labeled by
the name of the role. That is, it is exactly the actual exercise of rights and
obligations (and/or the expectations
of them) that we take as data, using
these data to discover the roles themselves.
Two people in the same role are substitutable.
This is what structural
equivalence
is, substitutability
with regard to relational ties. Relational
structure does not totally determine SE properties, though. Certainly, a great
*“Blockmodel”
and White (1976)
is used in a loose sense here, contrasted
with “clique structure”
for example.
provide an example of a very formal definition.
The spirit is the same.
Heil
76
Lee Douglas Sailer
deal of information
can be obtained from individual attributes,
but how
much information
can we tease out of relationships alone? That is the question asked here.
Definitions
of structural equivalence
Lorrain and White ( 197 1:63) label as “structurally
equivalent” any two
points that are related in the same ways to the same other points. That
is, for i, i, and k in a set N, and relations (sets of ordered pairs) R,, R,, . . . .
R, in N X N, a relation S is a structural equivalence
if iSj implies that, for
every k, iRk = jRk and kRi = kRj for each relation Ri.,
Of course, hardly anything or anybody is ever structurally
equivalent to
anybody else in the noisy and complex world of social relations. To overcome this obstacle. Lorrain and White apply what they call the “categorical
approach”.
They use various criteria to reduce the number of relations
derivable from the data, thus aggregating relationship data to enable nodes to
be blocked together.
Figure 1.
Map of the “‘algebraic approach” to the analysis of social structure.
combine
relations
DATA
-\
combine
individuals
'-._
-.fcf
-9,)
=%&..
od+.,_
I
CONCOR
B
Categorical
approach
-.
combine
individuals
-.
'. l
I
-+ Model
combine
relations
Furthermore,
they claim (p. 79) that this approach treats the blocking of
the nodes and the aggregation of the relations simultaneously,
as suggested
by the analogue of Category Theory. This, in fact, is not true. They identify
relations (morphisms in their terminology)
first, with no reference to blocks
at all, and only subsequently use this information
to identify nodes. Later in
this paper a technique
for blocking nodes first (and hence relations) that
coordinates
neatly with the so-called categorical approach is proposed. The
point is illustrated in Figure 1. Lorrain and White (hereafter LW) follow the
upper path from data, through the identification
of (clusters of) relations, to
a model. This paper takes the lower path. It is theoretically
desirable that this
diagram commute (at least conceptually,
if not computationally).
The ideal
approach takes the direct path from data to model.
Briefly, a blockmodel
is a reduced version of the original data, probably
with both fewer nodes (individuals)
and fewer relations. The task of the
investigator is to aggregate together SE nodes while preserving the meaningfulness of the relations. Breiger ct al. (1975) (hereafter BBA) rightly point
Structuralequivalence
71
out that such aggregations will rarely exist in practice. So they propose the
CONCOR algorithm as an attempt to make the best of an unclear situation.
There are several problems with CONCOR, however. Though in general
the ideas in BBA are quite sound, it is possible to come away from a reading
with a profound misunderstanding
of what CONCOR does. It is stated that
CONCOR is a blocking algorithm (as opposed to a clustering algorithm) and
that it may be applied to raw data (instead of first computing a proximity
measure of some kind). Let me explain why I think these two statements
are incorrect, beginning with the second.
In truth, CONCOR does seem to begin with raw network data, but what
does it do with it? First, it computes the Pearson product moment correlation for each pair of nodes based on their patterns of indegree. Below it is
suggested that ordinary correlation is a natural, if cumbersome,
measure of
SE, in the sense of LW, when the data are coded in a certain way. The point
is this: the first step taken by the CONCOR algorithm is to compute an
index of the structural equivalence of every pair of points. That CONCOR
works on raw data is an illusion, since any other index of structural equivalence could be input at this point, and CONCOR’s first step skipped.
CONCOR’s second (and each subsequent)
step is to compute correlations on the structural
equivalence matrix (from the previous step) until
the process converges to a partition of the nodes into two sets. What is
CONCOR doing? We may paraphrase LW - two points are SE if they are
related to the same points. In the same vein, we may paraphrase phase two
of the CONCOR algorithm - two points are SE if they are structurally
equivalent to the same points. This is totally different!
It is like saying that
lawyers are equivalent because they are equivalent to lawyers. The right
concept is that lawyers are equivalent because they are related to judges
and clients. (There is still a further generalization
of the concept described
below.)
Still, BBA is on the right track. Since SE, as defined by LW, is too strict
to be of any use, some weakening or generalization
must be found. LW
achieves this weakening by identifying relations. BBA generalizes the basic
definition, but inappropriately.
In passing, let me mention the definition of “lean fit” in BBA. Lean fit
is the notion that, for blocks, it is only important
to maximize zero submatrices in the relations, since nodes in the same block are not necessarily
connected
to each other, but the nodes to whom they are not connected
must be the same. This would be a very useful concept if it were utilized
by CONCOR (as it is by Heil and White (1976)). Notice, however, that the
claim in BBA (p. 333) is only that CONCOR seems to produce a result that
approximates
lean fit; there is no stronger logical connection between lean
fit and CONCOR in the paper.3
3Schwartz (1977) rejects CONCOR
on formal statistical
and computational
grounds aside from
the criticism of this paper. His criticism, using principal components
factor analysis, is thought to be
consistent with the ideas here.
78
Lee Lb&as
Sailer
An earlier version of the computational
formula presented below inspired
John Boyd to suggest a definition4
of SE that may be paraphrased
thus:
two points are SE if they are related in the same ways to points that are SE.
This is the proper generalization
of the LW definition
missed by BBA.
For example, two judges need not be connected
to the same crook in
order to be classified into the same block (as seems to be required by the LW
definition);
they need only each be connected to some crook, since crooks
are SE. This is why SE appears in the latter half of the definition as well as
in the first half.
Since “equivalent”
has such a well-established
meaning in mathematics,
we refer to Boyd’s concept as “structural relatedness” (SR).5 To understand
how SR works, refer to Figure 2 as you follow along. Assume that we are
only interested in arcs coming into nodes (following BBA) and that for the
moment we have only one relation, R. B is a structural relatedness relation
if and only if iBj implies that whenever there exists a k such that kRi, there
exists an m such that mRj and kBm. (This process is mirrored in the computational
version below.) Look at i and j. In order for i to be structurally
related to j the following must be true: for each k that chooses i there must
also be an m, to whom k is structurally related, that choosesj.
B has the desired property of weakening the original definition of SE. Its
main feature is that it does so in an intuitively
appealing way. Also, this
definition
can be generalized naturally to outdegree,
and to multiple relations as well. The apparent circularity of the de~nition may be avoided by
some tricky but equivalent rewording.
It also turns out that Boyd’s SR is not necessarily symmetric or transitive.
In Figs. 3(a) and (b) are simple examplesof how this might occur. In Fig. 3(a),
point 2 has a tie corresponding
to every tie that 3 has, so 3 is structurally
related to 2, but the converse does not hold. Likewise, the wavy lines in
Fig. 3(b) do not imply the existence of the broken lines, which would make
B transitive.
4Actually,
a whole array of possible definitions
has appeared,
some transitive,
some for both
in$egree and outdegree,
etc. In this paper, mainly one fairly general version is considered.
The iabel “B” sometimes refers to a relation, sometimes
to the algorithm
for deriving an SR relation, sometimes to a matrix of real numbers. It should be clear from the context.
Structural equivalence
79
Examples ofasymmetry and in transitivity.
Figure 3.
A
R
R
2.
I
R
__-- _-----
___
Rcl+:‘l
.3
B
I
‘.
t4
(a)
--__
t?
------?’B
--_ ---____---
_c--
(b)
An important
point missed by BBA is that at every iteration of any
iterative solution, the most important data, those most relevant to the task
of finding blocks in R, are still stored in the relation R. CONCOR throws
away R completely
after the first iteration, yet the information
that any
algorithm for finding blocks must utilize is contained in the ties in R. The
B-algorithm
below uses as much information
as possible at all times. Its
best guess at the structural relatedness of i and j is saved after each iteration
and used along with R to produce the next estimate.
Computing
B
There are many simple ways to estimate a sort of first-order SE. The
matching coefficient,
for example, and other measures described by Jardine
and Sibson (197 l), Sokal and Sneath (1963), and Hartigan (1975) could
possibly be interpreted
as measures of the degree to which two patterns of
connectivity
are the same. Pearson’s product
moment
correlation,
for
example, counts the number of identical connections
made by two points,
and then normalizes this count based on the number of choices made. It has
some disadvantages when applied to other than dichotomous
data, though
(Sailer 1972). None of these seems appropriate
for iteration, since it is not
clear what repeated application means. How should we compute a measure
like B? Here is a scheme which is intimately related to the definition presented above. Again, for indegree of a single relation alone, follow Figure 3
and this formula:
5
B;;’
=
max [min(Bi,,
k=lm=l
5
k=l
Rki
Rki, Rmi)]
80
Lee Douglas Sailer
where Rii is the “value” of the tie between i and j, typically a zero (0) or a
one (1). The min and max have been used so that, if the ties carry a value
from the interval [ 0, 11, instead of the set (0, I}, the definition is consistent
with elementary fuzzy set techniques (Zadeh 1965).
Every point is SR to itself, so we let B ’ = I, the identity matrix. We
assume that t iterations have been completed so that BFj is known for all i
and j. To compute Bii” , we find a k that is connected to i (a k such that
Rki > 0, otherwise min(Rki, Rnli) = 0 and nothing is summed), then find the
tn such that the path from k to m (via BL, ) to j (via R) is maximized, without
counting any m to j path in excess of the k to i path; we add the magnitude
of this path to the numerator.
In the denominator
we wish to find the best
m that could have had a tie to j, to standardize in the fuzzy case where k to
i was not very large, so that a poor m will suffice; the best possible m is k
itself, so we add min(Rki, Bik) = Rki to the denominator;
we repeat this for
every k related to i, and for every i and j.
Figure 4.
Calculate B jbr this relation.
This would be easier to follow in a simple example.
assume that every point is SR to itself, so let
B_!! =
Consider
Figure 4. We
1, ifi=j
0, otherwise
To compute B :2 we see that R 11 is nonzero, but there is no arc into point 2,
R MZ, such that min(B,,,
Rll, RmP) is nonzero. We sum a zero in the
numerator
and one in the denominator,
indicating that 1 is not SR to 2.
For R,, we sum zero in both the denominator
and the numerator. For RJ1,
we find that min(B,,, Rsl, R,,) = 1, so we sum 1 in the numerator and denominator;
this is evidence that 1 is SR to 2. For Rql we repeat step R21.
Thus
B:z =
0+0+1+0
~1+0+1+0
= 0.5
This process is repeated for i, j = 1, 2, 3, 4.
In Table l(a) are the results of the first iteration. Point 1 is SR to itself,
and to a lesser extent to 2 and 3 as well. Point 2 is slightly SR to everything.
Point 3 is completely SR to 1 and 2; and point 4 is SR to 2.
Notice that, for the calculation of Bf2, it was taken as evidence that 1 was
not SR to 2 that there was no point SR to 1 related to 2. Now, at the
second iteration, point 2 is (partially) such a point. B:, will thus be larger.
Structural equivalence
Table 1.
Intermediate
results from computation
81
of R in Figure 4.
(a) First iteration
2
3
4
1.0
0.5
0.5
0.0
0.33
1.0
0.33
0.33
1.0
1.0
1.0
0.0
I0.0
1.0
0.0
1.0
3
4
(b) Second iteration
gqqy&
(c) Third iteration
1
2
The results of the second iteration are in Table l(b). The final solution
(within 10W5) is in Table l(c).
Let me point out that in this example the matrix for each iteration is
monotonically
related to every other. If this were always true, there would
be no advantage to the iterative technique. Happily, this is not always the
case. Sometimes, when B:j is relatively very small, Bii’ ’ becomes very large,
especially for small t. For example, we might not have guessed from Table
l(a) that point 1 was totally SR to points 2 and 3, as it turned out to be by
Table l(c).
Once you understand
this mess, the rest is easy. For outdegree, reverse
all occurrences of ki and mj to ik and jm. For both indegree and outdegree,
compute the numerators
(ni and n,) and the denominators
(di and d,) as
above and then compute B:
B = (ni + n,)/(di
+ d,)
In the case of multiple relations we may compute all the ~2s and ds (one
pair for each relation), and then sum them in the obvious way.
Of course, there is a variety of options here. There are many ways to
combine the ~1sand ds into joint indexes of B and different ways of assessing
the strength of a path through two different relations. My feeling is that the
US and u’s are the building blocks. I see little difference between, say,
B = (ni + II,)/(Lfi + d,)
and
B = 0.5 f(ni/di) + (~,/d,)l
It would even be reasonable
to analyze indegree, outdegree,
and separate
relations separately,
and compare the answers individually. Current efforts
are directed this way.
There are two properties
of B to be discussed. One is the convergence
property of this B-algorithm. In a personal communication,
Steven Seidman
has provided a very simple proof of convergence.
Since the entries in the Bmatrix may never be larger than one (1 .O), and it can be shown that in each
iteration they may stay the same or get larger, but’never smaller, it is clear
that the process must eventually
stop. This is not a totally
satisfactory
solution, of course, since it is possible that the algorithm eventually converges to a matrix of all ones (especially in the sense that every human is
structurally
equivalent to every other). However, it seems likely that those
pairs of points which are really SR converge very early in the sequence, and
so it is possible to stop at some appropriate point chosen by ud i7oc criteria.6
There is also a demo~lstration
that interesting stable results can exist. For
a relation on ~1points, a stable solution for B (one that is guaranteed to repficate itself on subsequent iterations) depends directly on the ties in the relations V~Uthe solution to a set of U* simultaneous
linear equations in 11’ unknowns. For large II, operations research techniques find solutions to such
systems by iterative methods involving only II X n matrices much like the
algorithm for computing B.
The other is that CONCOR and B are each affected by the scale on which
the relations
are measured. There are various ways to handle this. For
example, it is possible to incorporate into any algorithm of this sort the technique of converting all values to ranks before computing. This is essentially
the technique
used to derive some nonparametric
statistical techniques.
Examples
and applications
The most important
question left to answer is “Does R work?“. 1 don’t
really know yet. There are reassuring results and others not so reassuring.
What I have are several examples, and a speculation.
As their first example, Lorrain and White analyze the two simple relations
in Figure S(a). I find it easy to think of P as “employs”
and P-’ as “is
61n the applications
below,
n,ax . If&+1
d. I < 0.1
[I
IJ
II
the criterion
is
Structural equivalence
Figure 5.
83
Toy example from L W: (a) the relations, (b) L W’s model, (c) B model.
-P
-p-1
1
1
A
5
A
5
A
3
2
A
3
4
4
(a)
0
%
P
295
legend
i--+j
=
iRj
i---aj =
iSj
o=
blocked together
03,4
employed by”. What kind of people are there in this system, and how are
they related? After considerable
calculation
and cogitation,
Lorrain and
White arrive at the result in Figure 5(b); there is a simple hierarchy.
Computation
of B for these two relations produces the matrix in Table 2
that is represented
in pictorial form by Figure 5(c). The wavy arrow indicates that 5 is structurally
related to 2, ie., 2 is substitutable
for 5. Point 5
is definitely
not substitutable
for 2 because 5 has no employees! Actually,
Bzs is substantially
greater than zero, so that with a lower cutoff exactly the
same result as Figure 5(b) may be obtained.7
Table 2.
B matrix for P and P-’ from Figure S(a),
5
1
2
3
4
7Since B produces
a matrix of real numbers, it is necessary to convert it somehow to a graphically
pleasing structure.
Typically,
a cutoff value is arbitrarily
chosen to transform
B into a relation which
can then be displayed as a hierarchical
tree, pre-order, or whatever.
In this example, B retained some additional info~~lation about P and P-’
that the categorical approach lost. This is remarkable because one would
assume that the formal algebraic approach is more “accurate” than the fuzzy
computational
approach.
Yet, R appears to be more informative
(in this
example), and in a way consistent with LW results. When less information
in
B is retained it produces the same result.
Lorrain and White mention (p. 64) that nodes in a cycle (such as a mutual
exchange system with more than two participants)
are isomorphic, but that
this is different from SE. They are correct that their concept of SE is not the
same as isomorphism,
but it should be. All the nodes in a simple cycle are
defil~itely playing the same role. They are sLlbstitutable back in the sense
that a role is played by pairs (or more) of actors. To substitute
1 for 2 in
F’igurc 6, for example, impbcs that after the natural permutation
has been
made an SE state has been reached.
Figure 6.
C~gcles,isonlor~~histrls,mod&, anlbi,uity.
data
/'-2\
Figure 6 illustrates one of the possible problems with B. Notice that there
are three images of the data that satisfy the definition of R, and that two of
them are in a real sense orthogonal.
The B-algorithm
finds a solution like
the one called A, which becomes solution C as cutoff thresholds are lowered.
What happens to solution B? The difference between A and R can be interpreted as separate dimensions of social space. In LW, greater direct control
is available to the investigator,
so that such problems do not occur. (This
may also mean that the investigator may be able to avoid undesirable results.)
Some analogous technique
for the conlputational
approach in this paper
wouId be useful.
The Sampson
monastery
data
Sampson (1969) has provided Lorrain and White, Breiger, Boorman, and
Arabie, D. White and Boyd, Heil and White, and this author with a body of
well-collected,
substantively
motivated
network
data
from
a failing
monastery.
BBA, HW, and LW each analyze the eight relations and replicate
some of the results obtained by Sampson by more traditional
methods.
D. White and Boyd (1978) extend the analysis using entailments instead of
the more strict equality, with equally promising results.
Here is an analysis of the small version of Sampson’s data found in LW to
illustrate the technique further, and more importantly,
to show that it in
fact makes the diagram in Figure 1 commute.
A generally positive relation, P, and a generally negative one, N, are extracted from Sampson’s data from near the end of his stay in the monastery.
A complete investigation
of P and N (and the LW analysis) is provided by
Boyd and Sailer ( 1978). Of relevance here are the comparable results using 8
in place of the categorical approach. The argument is not that these two
approaches
are different,
but that they are two sides of the SAME coin.
Their results must be the same, if the algebraic approach is to be internally
consistent.
LW follow this procedure:
Starting with two relations, P and N, they
generate all the possible distinct compositions
(e.g., PP, NN, NP, NN, PPP,
PPN, etc.). They find that there are “ . ..probably
at least several hundred”
compounds
(p_ 72), and so keep only those generated by strings of length
less than five or six Ps and Ns, leaving finally only seventy compound relations.
At this point they begin identifying
relations (which they call “morphism?)
on the basis of the relative size of their intersections,
e.g., P and
PPP share eight of the twelve possible ties and are thus defined to be equal.
Likewise, NN is identified with NNN. Because of these identifications,
the
seventy compound relations are reduced to only five distinct ones.
Mainly for expository
purposes, these five are reduced in the simplest
possible nontrivial way to two separate models of two relations each. That
is, when two relations are identified, several other pairs of relations must
also be combined in order to retain a consistent semigroup or category structure. In all cases but two, this reduced semigroup turns out to be the trivial
one with only one element. For the two reductions for which the semigroup
has two relations, there is no particular rationalization
aside from the fact
that they are easy to analyze.
Because of the identification
of many pairs of relations (and the subsequent creation of new relations in the reduced version which represent the
original relations), there is a partition induced on the original seven elements.
Within each block of this partition the individuals cannot be distinguished on
the basis of their ties in the two relations of the reduced set.
86
Lee Douglas Sailer
Figure 7.
L WModel 1.
P
N
Caley table
note:
Figure 8.
N connects to everything
L WModel 2.
P
N
N:--+
These two models, their partitions and relations, are shown in Figures 7
and 8-a
As discussed above, the categorical approach, despite its elegance, is combinatorially,
and hence computationally,
intractable.
Even for this small
example LW must severely limit the growth of the “category”
to strings of
length five or six9 and, more importantly,
their sociometric
strategies are
ad hoc at best. In the blockmodeling
work of White et al. (1976), and Boorman and White (1976), we see further tacit admission that this approach is
too cumbersome.
There they turn to CONCOR for relief. We now consider
the result of using B here in an analogous way.
*Notice that the labels “I”’ and “N” are definitely
not the original P and N. Rather, the set of all
relations
which has been identified
with P has induced the relation labeled “E” in Figures 7 and 8,
and then “P” was chosen as a reasonable label. (Likewise for “N”.)
9As LW points out (p. 90), it really does not make any difference
to the final result in this case,
though it could do in general.
Table 3.
B-matrix, computed for P and N
1
2
3
4
1
2
3
4
5
6
7
1
0.96
0.92
0.74
1
0.55
0.90
0.93
1
0.87
0.91
0.85
0.86
0.87
0.88
0.66
0.91
0.50
0.86
0.91
0.84
0.95
0.71
0.93
1
0.95
0.74
0.85
5
0.89
0.60
0.91
0.88
1
0.62
0.90
6
0.92
0.93
0.91
0.92
0.94
1
0.96
7
0.94
0.81
0.90
0.88
0.94
0.83
1
The first step is to compute B for the two relations P and N. This result
is shown in Table 3. Here are three points:
(1) As explained above, the B-matrix in Table 1 could be considered THE
model of the social roles in the system generated by P and N. This matrix
alone contains explicitly the various substitutabilities
of pairs of actors and
hence their positions in the relation data. B contains no info~ation
not
contained in P and N, and, if humans were only matrix perceivers, we would
need to go no further in our analysis.
(2) B is non-transitive and non-symmetric
(in Zadeh’s (1965) fuzzy sense).
This is not surprising since we know that the ties are somewhat amorphous.
When the system is more strictly structured, say in a formal organization, we
would expect B to be closer to a (fuzzy) equivalence relation.
(3) The values in B are all close to one (1.0). This is caused, again, by the
relative homogeneity
of the ties in this particular case.‘O
Since we are not able to decode the type of data structure in Table 3, we
reduce it to a less complicated
one. Two points are equivalent to the extent
that they are mutually related, which we can compute from B by replacing
each element by the smaller of itself and its reciprocal pair. Likewise, we can
then dichotomize
on some appropriate cutoff value (e.g., 0.95) and perform
a transitive closure. Call this relation“p”.
The blocks in the partition associated with the resulting equivalence relation are (1,3, 5, 7), (4), and (2,6).
This is similar to the second partition found by LW, the blocks of items
3 and 4 having been switched. Suppose we look at the relations P/o and N/o
in Figure 9.
Even though the partitions are slightly different, the semigroups are the
same! In the same way that P and N interact in the LW semigroup, so do
they here. To the extent that semigroups provide a reasonable model of
social structure in the monastery, B captures the same structure as the “categorical” approach, as expected.”
%‘here is co ncern over this tendency of the values in B to become so large. Various ways to control
this are being considered. Notice, though, that it doesn’t really matter! Don’t be misled by the
absolute size of the values; their relative size is important.
“Boorman and White (1976) provide a detailed explanation of why semigroups might provide
interesting models of social structure.
88
Lee Douglas
Figure 9.
Sailer
Quotient relations induced hv 0.
P:,
N: ---*
But look at the relative computational
complexity
associated with these
two approaches.
The LW approach is at least an order of magnitude more
expensive, with the cost-ratio increasing rapidly with the size of the problem.
This is why H. White and his colleagues must repeatedly resort to the use of
CONCOR.
Incorporating
attribute
irzfornzatiorl
All actors are substitutable
for themselves. This is axiomatic. Recall that
in the algorithm for computing B, B’ is initiallized by the identity matrix.
What other initial structures make sense?
This relational approach is academic in the sense that it arbitrarily rejects
any information
provided by individual attributes. The fact that two people
are rich might be as important
in establishing them in a structurally
equivalent position as the fact that they both have employees. It is possible to
convert attributes
to relations and to use these derived relations as you
would ordinary ones, but it is not necessary to do so. Suppose we have an
initial guess at the structural relatedness of two individuals based on the
similarity of their individual attributes. For example, for each pair we “correlate” their age, mode of dress, income, race, sex, etc. If we use these data
gains.
as our initial B1, instead of I, we might accomplish two noteworthy
First, the algorithm might converge more rapidly, having been given a head
start, so to speak. Second, by starting from a B based on otherwise unused
data, we might be able to control the kind of ambiguous solutions discussed
in the section on the Toy example in Figure 6.
This section raises another comforting thought, as well. To the extent that
the individual similarity is a useful analytical principle, it seems that we have
been using the concept of structural equivalence all along, and in a form that
is quite naturally integrated into the present one.
Structural equivalence
89
Conclusions
I think of the matrix of real numbers between zero and one produced by
B as the real model of structural relatedness. It captures a great deal of information about the role structure reflected in the relations. It is not reduced
enough to make the information
it contains available to the human eye,
however. Hierarchical
clusters, multidimensional
scales, trees, graphs, semilattices, and other structures might each be appropriate to represent the role
structure of a particular domain. But it is desirable that the data structure
used for reduction be kept separate from the concept of structural equivalence itself. That is a major part of what I have tried to do here.
It is hoped that B, or more likely some computationally
economic approximation of it, will provide a method of preprocessing a large variety of network
data sets before applying nearly any other analyses. The concept that B and
its variants attempt to capture pervades the quantitative
analysis of social
structure. The B-matrix provides an attractive starting point for more elaborate analysis (see Boyd and Sailer 1978).
Finally, it is clear that any study of actually occurring social networks is
bound to require the use of computers. We must therefore consciously direct
our theory building apparatus toward theories that are in some sense computable. Such an approach is assumed here. The very existence of structural
equivalence as a concept depends on the notion of processing large amounts
of data in a routine way.
References
Boorman,
1976
S. A. and H. C. White
“Social
structure
from multiple
networks,
II. Role structures”.
American Journal of
Sociology 81:1385 - 1446.
Boyd, J. P.
“The algebra of group kinship”. Journal of Mathematical Psychology 6:139 - 167.
1969
Boyd, J. P. and L. Sailer
1978
A Pre-order Model for Role Structures: Finding Topological Clusters, Xerox. Irvine:
University of California.
Boyd, J. P., J. H. Haehl and L. D. Sailer
Journal of Mathematical Sociology 2137 - 61.
1971
“Kinship systems and inverse semigroups”.
Breiger, R. L., S. A. Boorman and P. Arabie
1975
“An algorithm
for clustering
relational data with applications
to social network analysis
and comparison
with multidimensional
scaling”. Journal of Mathematical Psychology 12:
328 - 383.
Goodenough,
W. H.
1969
“Rethinking
‘status’ and ‘role’: Toward a general model of the cultural organization
of
social relationships.”
In S. A. Tyler (ed.), Cognitive Anthropology :3 11 - 330. New York:
Holt, Rinehart and Winston.
Hartigan, J. A.
1975
Clustering Techniques. New York: Wiley.
Heil, G. H. and H. C. White
1976
“An algorithm
for finding simultaneous
homomorphic
correspondences
between graphs
and their image
Behavioral Science 21:26 - 35.
_ graphs”.
- _
Jardine, N. and R. Sibson
1971
Mathematical,Taxonomy.
New York: Wiley.
Linton, R.
The Study ofMon. New York: Appleton-Century-Crofts
Inc.
1936
Lorrain, I?. and H. C. White
“Structural
equivalence
of individual in social networks”.
Journal of Mathematical Soci171
ology I :49 - 80.
Merton,
R. K.
1957
Social 7?tcory and Social Structure. Revised and enlarged ed. Glencoe, Illinois: Free Press.
Nadel, S. F.
The Theory of Social Structure. Melbourne:
University Press.
1957
Radcliffe-Brown,
A. R.
1940
Sfructure and Functiun in Primitive Society. Neu: York: Free Press.
Sailer, L. D.
1972
IBYU Users Manual: For the Analysis of Affinity Data, Ancilloe Scientiarum I. Irvine:
School of Social Sciences, University of California.
1978
Algebraic Role Theoi-y: A Generalization of Structural Lquivolence as a Tool for the Network Analysis of Social Structure. Irvine: School of Social Sciences, University
of
California.
Sampson, S. F.
Crisis in a Cloister. No. 69-5775. Ann Arbor, Michigan: University Microfilms.
1969
Schwartz, .I. E.
“An examination
of CONCOR and related methods
for blocking
sociometric
data”. In
1976
D. R. Heise (ed.), Sociologica methodology 1977; 255 - 282, San Francisco:
Jossey-Bass.
Sokal. R. R. and P. H. A. Sneath
1963
Principles of ~~rn~rical Taxonomy. London: Freeman.
White, D. R. and J. P. Boyd
1977
Mediativify as a Principle for Studying the Core of Social Networks. Xerox.Irvine: School
of Social Sciences, University of California.
White, H. C.
The Anatomy of Kinship. Englewood Cliffs, New Jersey: Prentice-Hall.
1963
White, H. C., S. A. Boorman and R. L. Breiger
“Social structure from multiple networks, I. Blockmodels
of roles and positions”.
American
I976
Journalof Sociology RI :730 - 780.
Zadch, L.
1965
“Fuzzy
Sets”. Information ond Cuntrof 8:338 - 353.