SlideShare ist ein Scribd-Unternehmen logo
1 von 27
Statistical mechanics of SCOTUS

             Edward D. Lee1,2
           Chase P. Broedersz1,2
             William Bialek1,2

                  1Department     of Physics, Princeton U.
      2Lewis-Sigler Institute for Integrative Genomics, Princeton U.
From complex to simple
Group behavior of social organisms can
manifest complex patterns at the group
level even though they might reduce
down to simple rules for individuals
[1โ€”3]. For example, starlings seem to
interact only with local neighbors, and
these local interactions produce global
patterns like on the right. We call this
emergent collective phenomena
because such behavior is not explicitly
encoded in the behavior of individuals,
but arises from interactions.
Is it the case that decisions by groups
of people, despite apparent higher         Flocks of starlings form complex patterns
order behaviors, are likewise reducible    (http://webodysseum.com/videos/spectacu
to simpler rules?                          lar-starling-flocks-video-murmuration/).
US Supreme Court (SCOTUS)
We investigate the voting behavior of the SCOTUS. We explore the structure
of decisions to gain insight into how these decisions are reached. We show
results for the second Rehnquist Court (1994โ€”2005, ๐‘ = 895 cases) and
discuss other โ€œnatural courtsโ€โ€”periods of time when member remain
constantโ€”where relevant.
SCOTUS facts:
โ€ข highest court in the US government         โ€ข write a majority and a minority
โ€ข nine Justices appointed for life             opinion, legally clarifying their
โ€ข vote on constitutionality of legislative     decisions, which can be
  and executive actions                        supplemented with separate opinions
โ€ข usually hears appeals from lower           โ€ข Justices must ultimately render a
  courts' decisions, which Justices            binary decision
  affirm or reverse                          โ€ข the second Rehnquist Court is
โ€ข sometimes Justices are recused for           typically considered as 4 liberals and
  conflict of interest, sickness, etc.         5 conservatives
โ€ข decisions by majority vote
Previous approaches to SCOTUS voting
Although there is a long history of scholarly study of SCOTUS, nearly all approaches rely on
important assumptionsโ€”even though they may be justified. Some assume Justices vote
independently given ideological preferences [5]. Others, if including interactions, do not
include them in a general model of voting or posit their structure instead of deriving it from
data [6]. Most importantly, many draw on complex, underlying cognitive frameworks like
rationality or expression of internal beliefs, which are validated by predictions made of the
data [5โ€”8]. However, it is impossible to validate all aspects of these complex models. Is
there a way to construct accurate models while abstaining from introducing complexities?

Characteristics of
previous approaches
                            โ€ข Decision-making
                              framework [7โ€”8]
        โ€ข Independent       โ€ข Ideological liberal vs.    โ€ข Externally posited
          voters given        conservative axis [5,7โ€”      interaction [6]
          ideological         8]
          preferences [5]   โ€ข Subset of votes deemed                            How do we approach
                              relevant [5]                                      the system with
                            โ€ข Not generally predictive                          minimal
                       Attitudinal           Game theoretical                   assumptions?
Back to the data
The data, published by [4],
immediately reveals strong
structure in Court behavior.
For example, we see that in
the second Rehnquist Court,
44% of votes were unanimous.
Overall, when considering the
natural courts shown on the
right, 36% of votes are
unanimous on average. Only
10% of votes fall along the
liberal vs. conservative divide.
Does an independent model
support this observation?          Distribution of voting data for natural courts starting in
                                   given year. Blue, 0 dissenting votes; red, 1; yellow, 2;
                                   green, 3; black, 4. Terms are the number of years that
                                   the same members remained on the Court. The number
                                   of votes on record for each set of years is in gray.
The simplest model
                        Independent voters
Each Justice ๐‘– has probability ๐‘ ๐‘– of voting to affirm, so the probability of ๐‘˜ votes in
the majority out of ๐‘› Justices is
                           ๐‘› ๐‘˜            ๐‘›โˆ’๐‘˜
                                                     ๐‘›
                 ๐‘ƒ ๐‘˜ =        ๐‘ 1โˆ’ ๐‘          +            ๐‘ ๐‘›โˆ’๐‘˜ 1 โˆ’ ๐‘ ๐‘˜
                           ๐‘˜                       ๐‘›โˆ’ ๐‘˜
An independent model fails to explain the distribution of votes in the majority.
This is not so surprising because weโ€™ve
assumed that all higher order behaviors are
encoded within the first moments. For
example, for a unanimous vote (๐‘˜ = 9) to
occur in the independent model, all Justices
would happen to vote the same way, but this
happens much more rarely than observed,
yielding 0.5% of the observed value. Indeed,
interactions are crucial to SCOTUS voting
behavior.
More evidence for interaction
Justices take at least two votes for a case: an initial secret vote and a final
decision. Justices may attempt to persuade each other in between, but it is
difficult to measure such interactions partly because the first vote is secret.

According to data available on the Waite Court (1874โ€”1887), 9% of final
votes had at least one dissenting vote while 40% had at least one in the initial
vote [5]. In general, Justices more often switch to the majority than the
reverse, suggestive of consensus-promoting interaction. Maltzmann et al.
show from memos that Justices strategically manipulate their communication
to attempt to influence the vote and written opinions of the Court [6].

Nonetheless, most models either treat the Justices as independent or do not
explicitly include interactions in a predictive framework. Indeed, it is difficult
to devise the right structure for interactions!

How do we account for interactions in a principled fashion?
Including interactionsโ€ฆ
Given that Justice ๐‘– can vote in two ways, we represent his or her vote as
 ๐œŽ ๐‘– โˆˆ โˆ’1,1 . Then, the independent model is the simplest model that fits
average voting records { ๐œŽ ๐‘– } and all higher order correlations,
{ ๐œŽ ๐‘– ๐œŽ๐‘— , ๐œŽ ๐‘– ๐œŽ๐‘— ๐œŽ ๐‘˜ โ€ฆ ๐œŽ ๐‘– ๐œŽ๐‘— โ€ฆ ๐œŽ ๐‘› }, are reducible to { ๐œŽ ๐‘– }. In fact, all higher
order statistics are as random as possible given the individual means, so there
is no reason any higher order correlations should match that of the data. We
can generalize this idea to a ๐‘šth order model that fits all correlations up to
order ๐‘š yet generates all > ๐‘š order correlations randomly. Since these
distributions are as random as possible given what is fit, that also means that
we make no further assumptions than what is given in the fitted correlations
or about how these distributions are generated.

With SCOTUS, we might expect that we need to account for the bloc behavior
(5 vs. 4) and unanimous behavior by including terms of the 4th, 5th and 9th
orders explicitly. However, let us take only the next step of fitting both ๐œŽ ๐‘–
and ๐œŽ ๐‘– ๐œŽ๐‘— .
โ€ฆas maximizing entropy
The formalization for generating these distributions is called the principle of maximum
entropy [9]. Entropy is a measure of the randomness of a distribution. The entropy of
a probability distribution ๐‘ƒ(๐œŽ) of the votes of a set of ๐‘› voters ๐œŽ = {๐œŽ1 , โ€ฆ , ๐œŽ ๐‘› } is
                              ๐‘† ๐‘ƒ ๐œŽ    =โˆ’            ๐‘ƒ ๐œŽ log ๐‘ƒ(๐œŽ)
                                                ๐œŽ
which we maximize while constraining ๐œŽ ๐‘– and ๐œŽ ๐‘– ๐œŽ๐‘—
                                                ๐‘›                     ๐‘›
                                                                1
                    ๐‘† ๐‘ƒ ๐œŽ , โ„Ž ๐‘– , ๐ฝ ๐‘–๐‘— = ๐‘† โˆ’         โ„Ž ๐‘– ๐œŽ๐‘– โˆ’               ๐ฝ ๐‘–๐‘— ๐œŽ ๐‘– ๐œŽ๐‘—
                                                                2
                                               ๐‘–=1                  ๐‘–,๐‘—=1
with Lagrange multipliers โ„Ž ๐‘– , ๐ฝ ๐‘–๐‘— . The resulting model is known as the Ising model
                                                  1
                                         ๐‘ƒ ๐œŽ = ๐‘’ โˆ’๐ป(๐œŽ)
                                                   ๐‘
                                            ๐‘›          1   ๐‘›
                        ๐ป ๐œŽ =โˆ’                โ„Ž ๐‘– ๐œŽ๐‘– โˆ’          ๐ฝ ๐‘–๐‘— ๐œŽ ๐‘– ๐œŽ๐‘—
                                           ๐‘–=1         2  ๐‘–,๐‘—=1


with a normalizing constant, the partition function ๐‘, and Hamiltonian ๐ป(๐œŽ).
Ising model
                                 1 โˆ’๐ป(๐œŽ)
                        ๐‘ƒ ๐œŽ = ๐‘’
                                  ๐‘
                           ๐‘›          1          ๐‘›
                 ๐ป ๐œŽ =โˆ’      โ„Ž ๐‘– ๐œŽ๐‘– โˆ’                    ๐ฝ ๐‘–๐‘— ๐œŽ ๐‘– ๐œŽ๐‘—
                          ๐‘–=1         2          ๐‘–,๐‘—=1


The โ„Ž ๐‘– loosely refer to the โ€œmean biasโ€ of each voter ๐œŽ ๐‘– , and the ๐ฝ ๐‘–๐‘—
loosely refer to the interaction between them, or โ€œcouplings.โ€ Since
votes, or โ€œstatesโ€ ๐œŽ, with a smaller ๐ป are more probable, โ„Ž ๐‘– > 0
implies that ๐œŽ ๐‘– is more likely to take value 1. Also, ๐ฝ ๐‘–๐‘— > 0 implies that
 ๐œŽ ๐‘– and ๐œŽ๐‘— are more likely to take the same value.

We can solve for the parameters โ„Ž ๐‘– and ๐ฝ ๐‘–๐‘— such that our model fits the
given moments โŸจ๐œŽ ๐‘– โŸฉ and โŸจ๐œŽ ๐‘– ๐œŽ๐‘— โŸฉ.
Mapping spins
We have yet to define how the values of ๐œŽ ๐‘– correspond to
actual vote. It is not as simple as calling one value affirm and
the other reverse: the outcome of affirming or reversing
depends on how the case is posed. It is entirely possible that
affirming one case is a liberal decision and conservative in
another. What is the right dimension along which to orient the
 ๐œŽ ๐‘– ? We abstain from making a choice, and introducing
external bias, by symmetrizing the up and down votes such
that โˆ’1 and 1 are equivalent.
This keeps ๐œŽ ๐‘– ๐œŽ๐‘— the same and fixes ๐œŽ ๐‘– = 0.
Correspondingly, โ„Ž ๐‘– = 0. We find that absence of a bias is a
reasonable assumption because bias is not the dominant term
for judicial voting behavior.
Model fit
Remarkably, the Ising model fits the
data well. One measure of the fit is to
consider the difference in entropy of the
  ๐‘šth order model with the data
 ๐ผ ๐‘š = ๐‘† ๐‘š โˆ’ ๐‘†data [2]. As we increase ๐‘š,
we capture more correlation and the
entropy of our models monotonically
decreases to that of the data, where
 ๐‘† ๐‘› = ๐‘†data . The furthest distance ๐ผ1 is
called the multi-information. Our model
captures 90% of the multi-information
(right).

Thus, it nearly captures all the structure
in the data. It also follows โˆ’ log ๐‘ƒ ๐œŽ โˆ     The model
 ๐ป(๐œŽ) for the most frequent states. The      captures 90%
least fit states only appear one or twice    of the multi-
on average in a bootstrap sample of the      information.
data.
Implications of Ising model fit
The fit by the Ising model shows that higher order behaviors
like ideological blocs and unanimity can emerge from lower
order behaviors at the level of pairwise interactions between
individuals. Including higher order terms will result in a
marginal improvement in the fit.

This result is surprising because it suggests that higher level
coordination is not the dominant explanation of voting
behavior. Previously, scholars have pointed to the high level of
consensus in the Court to as evidence for a โ€œnorm of
consensus,โ€ which seems analogous to an effective ninth
order term for behavior [6]. Our results point to a different
sort of decision-making structure.
Found coupling network




 ๐ถ ๐‘–๐‘— = ๐œŽ ๐‘– ๐œŽ๐‘— โˆ’ ๐œŽ ๐‘– ๐œŽ๐‘— and ๐ฝ ๐‘–๐‘— graphs. Justices with a liberal voting record are
colored blue whereas those with a conservative are colored red. Positive edges
are red and negative blue. Widths are proportional to magnitude. All ๐ถ ๐‘–๐‘— are
positive whereas some ๐ฝ ๐‘–๐‘— are negative. Justices are initialed: John Stevens (JS),
Ruth Ginsburg (RG), David Souter (DS), Steven Breyer (SB), Sandra Oโ€™Connor
(SO), Anthony Kennedy (AK), William Rehnquist (WR), Antonin Scalia (AS),
Clarence Thomas (CT).
Understanding couplings
As a simple check, we see that the average ๐ฝ ๐‘–๐‘— within
ideological blocs (blue to blue or red to red) are positive
while the average between (blue to red) is negative
(previous slide). The corresponding averages of ๐ถ ๐‘–๐‘— also
show this relative change although all ๐ถ ๐‘–๐‘— are positive,
obscuring the antagonistic tendency.

To better understand the distribution of ๐ฝ ๐‘–๐‘— , we consider
the effective field on ๐œŽ ๐‘– from its neighbors.
                                  ๐‘›
                              1
                     โ„Žeff =
                       ๐‘–             ๐ฝ ๐‘–๐‘— ๐œŽ๐‘—
                              2
                              ๐‘—=1
Note that it depends on the state of neighbors ๐œŽ๐‘— . Since
this distribution over all ๐œŽ is symmetric around 0, we
                                                              Distributions ๐‘ƒ โ„Žeff ๐œŽ . Red histogram is
only show the positive half (right).
                                                                                  ๐‘–

We fix ๐œŽ ๐‘– = 1 and compare the shifts in the distributions    distribution of fields from only conservative
                                                              Justices. Ordered from most liberal to most
of โ„Žeff of its neighbors ๐œŽ , which we measure by taking
    ๐‘—                     ๐‘—                                   conservative voting record from left to right,
the mean over standard deviation ๐œ‡/ฮฃ ๐‘—๐‘– of ๐‘ƒ โ„Žeff . In
                                                  ๐‘–           top to bottom. The more conservatively a
the absence of such fixing, ๐œ‡/ฮฃ ๐‘—๐‘– = 0 (next slide).          Justice votes, the more the mean field due to
                                                              conservatives marches to the right.
Shifts in โ„Ž eff
                                                             ๐‘–
                                      ๐œ‡
                                          = 0.8                              Average shifts in
                                     ฮฃ ๐‘—๐‘–                                    distributions of
 ๐œ‡                                                                           โ„Žeff over
     = 4.7                                                                    ๐‘–
ฮฃ ๐‘—๐‘–                    Liberals             Conservatives        ๐œ‡         ideological blocs
                                                                      = 4.3 when holding one
                                                                 ฮฃ ๐‘—๐‘–       member of a bloc,
                                                                             ๐‘–, at 1 at a time.
     Average shift in liberals (๐‘—)    ๐œ‡
                 when holding             = 0.8
         conservatives fixed (๐‘–)     ฮฃ ๐‘—๐‘–
As expected, ideological neighbors are much more affected by fixing ๐œŽ ๐‘– = 1
by a factor of 5-6, showing that ideological blocs are a natural division of the
Court. Overall, the Court always shifts in the same direction as the
perturbation. Thus, we find that higher order behavior as ideological blocs
and general unanimity are reflected in the couplings.

O'Connor and Kennedy shift conservatives (liberals) to ๐œ‡/ฮฃ ๐‘—๐‘– = 2.6 (1.4) and
 ๐œ‡/ฮฃ ๐‘—๐‘– = 3.1 (1.1), reaffirming their moderate credentials. Stevens, however,
has overall weakest connections with ๐œ‡/ฮฃ ๐‘—๐‘– = 0.36 (2.72) as if more isolated.
Caveats with couplings
We must be careful not to interpret the ๐ฝ ๐‘–๐‘— literally as corresponding to
behavioral interaction on the Court. The distinction that we cannot
make, which is indeed impossible with this data set, is to explain the
underlying mechanism for correlations. We may find two Justices that
vote together too much for chance, but it could be the case that either
they collaborate to a large extent or that their perspectives have been
shaped by a similar background. The latter involves a hidden third
actor, but it is indistinguishable from the other with only the voting
record. In many ways, possible confounding factors that contribute to
 ๐ฝ ๐‘–๐‘— reflect fundamental limitations of the data.

Our guiding principle is that we refrain from assuming anything beyond
what is already given from the data; other models do not have the
same claim minimal assumptions. Furthermore, we know from
anecdotal evidence that Justices persuade each other, so there are
certainly interactions captured by the ๐ฝ ๐‘–๐‘— .
Probing influence
With this model of voting behavior, we can probe the
behavior of the system under perturbations.

The quantity of interest here is the majority outcome of
the court
                                 ๐‘
                                ๐‘– ๐œŽ๐‘–
                        ๐›พ=
                            | ๐‘–๐‘ ๐œŽ ๐‘– |
because this is the decision rendered.

How sensitive is the average decision ๐œธ to a small
changes in the average behavior of a Justice ๐ˆ ๐’Š ?
Probing influence
Formally, this is the susceptibility of โŸจ๐›พโŸฉ
                       1 ๐œ•โŸจ๐›พโŸฉ 1
                 ๐œ“๐‘– =             =       ๐›พ๐œŽ ๐‘– โˆ’ ๐›พ ๐œŽ ๐‘–
                       ๐œ’ ๐‘– ๐œ•โ„Ž ๐‘–      ๐œ’๐‘–
which we have normalized over
                                      ๐œ•โŸจ๐œŽ ๐‘– โŸฉ
                                ๐œ’๐‘– =
                                        ๐œ•โ„Ž ๐‘–
to compare the Justices equally with respect to changes in
their averages. The values we find are
               SO    AK    WR    DS    SB    AS    RG    CT    JS    mean
        ๐๐’Š      0.834 0.809 0.719 0.650 0.644 0.623 0.616 0.608 0.421 0.658
95% confidence
                  0.001 0.002 0.002 0.002 0.002 0.002 0.002 0.002 0.002
         interval
   ๐ ๐’Š โˆ’ โŸจ๐โŸฉ      0.176 0.151 0.061 -0.008 -0.014 -0.035 -0.042 -0.050 -0.237
Are ideological medians influential?
The typical wisdom in the political science literature is that these
ideological medians are the most influential for Court decisions.
Basically, the argument is that voters who sit in the middle of a
unidimensional, symmetric preference space will be predictive of the
majority [10]. The relevant space is liberal vs. conservative as we have
confirmed with โ„Žeff . The Justices to whom the outcome is most
                   ๐‘–
sensitive here are the ideological medians SO and AK, in agreement
with the claim.

However, real systems may be complicated by interactions that
constrain how such a voter may cast her vote or how a majority forms
initially and persists. We do not find that it is the ideological medians
to whom the outcome of the court is most sensitive in general.
Importantly, our results are derived under minimal assumptions. We
do not assume ideological behaviorโ€”which though generally visible,
still has to be imposed by the observerโ€”and we account for
interactions.
Isolating interactions
We can further inquire about the nature of influence by leveraging our found
interactions. A Justice could affect the outcome in two ways:
1. Change own vote
2. Impact on colleaguesโ€™ votes through interactions

How do we distinguish between this two kinds of impact accounted for by
 ๐ ๐’Š ? We simulate how ๐œŽ ๐‘– might increase pressure on colleagues through
couplings by
1. increasing the average coupling with ๐œŽ ๐‘– such that neighborsโ€™ effective
       fields break symmetry around 0 to โŸจโ„Žeff โŸฉ = ๐ฝ ๐‘–๐‘— ๐œ–. However, this will also
                                             ๐‘—
       incur a shift to ๐œŽ ๐‘– โ‰  0, so
2. we add a compensating field โ„Žโ€ฒ๐‘– to โ„Žeff to fix ๐œŽ ๐‘– = 0.
                                           ๐‘–

The latter step ensures that we do not allow an average shift in an individualโ€™s
vote to affect the outcome. We denote the resulting change from pushing on
 ๐œŽ ๐‘– โ€™s neighbors ๐›ฟ๐›พ ๐‘– .
Isolating interactions
               AK    SO    DS    SB    RG    WR AS       CT    JS    mean
        ๐œน๐œธ ๐’Š    0.348 0.340 0.296 0.276 0.245 0.231 0.195 0.138 0.130 0.244
95% confidence
                0.001 0.001 0.001 0.001 0.001 0.002 0.003 0.003 0.003
interval
    ๐œน๐œธ ๐’Š โˆ’ โŸจ๐œน๐œธโŸฉ 0.104 0.095 0.051 0.032 0.001 -0.014 -0.049 -0.106 -0.114
Comparing with ๐œ“ ๐‘– โ€ฆ
AK and SO switch order and are relatively closer. Interactions may
differentiate between Justices for whom interactions are important. It
is not the case that Justices highest by ๐œ“ ๐‘– are also highest by ๐›ฟ๐›พ ๐‘–
across all natural courts although it is here.

WR falls from 1st to 6th place. WR is the Chief Justice who is responsible
for enforcing procedural rules and has prerogative for assigning
opinions. Interestingly, WR is consistently low by ๐›ฟ๐›พ ๐‘– but rises in rank
by ๐œ“ ๐‘– only being appointed Chief Justice.
Isolating interactions
               AK    SO    DS    SB    RG    WR AS       CT    JS    mean
        ๐œน๐œธ ๐’Š    0.348 0.340 0.296 0.276 0.245 0.231 0.195 0.138 0.130 0.244
95% confidence
                0.001 0.001 0.001 0.001 0.001 0.002 0.003 0.003 0.003
interval
    ๐œน๐œธ ๐’Š โˆ’ โŸจ๐œน๐œธโŸฉ 0.104 0.095 0.051 0.032 0.001 -0.014 -0.049 -0.106 -0.114
Comparing with ๐œ“ ๐‘– โ€ฆ
CT and JS are relatively much closer. CT and JS are the most extreme voters on
the conservative and liberal ends of the spectrum. The outcome is similarly
least sensitive to their couplings even though CT votes with the Court 80% of
the time and JS 72%. Thus, ideological hardliners are identified by a certain
voting pattern distinguishing between agreement with or concurrence with
the majority.

CT and AS are similarly biased ideologically, but AS seems to be more strongly
embedded in the interaction network.

All ๐›ฟ๐›พ ๐‘– > 0, reflecting the general tendency to consensus.
Conclusion
We propose deriving behavior from data instead of testing a hypothesized
framework. With this approach, we show that SCOTUS voting behavior can be
explained as behavior that emerges from pairwise interaction even though
higher order behaviors are manifest. This model shows the higher order
structures like ideological blocs and unanimity quite clearly through the
parameters.

We show how one can exploit the model of voting behavior by considering
the susceptibility of โŸจ๐›พโŸฉ to shifts in average voting behavior. We also isolate
the shifts in โŸจ๐›พโŸฉ specific to interactions and distinguish between Justices
similar by ๐œ“ ๐‘– along that second dimension of ๐›ฟ๐›พ ๐‘– .

However suggestive our results, the correspondence of parameters to real
behavior remains unclear. We hope to soon start a collaboration with political
scientists investigate whether an interpretable correspondence can be
established.
Works cited
1.  W. Bialek, A. Cavagna, et al., PNAS 109, 4786 (2012).
2.  E. Schneidman, M. Berry, et al., Nature 440, 1007 (2006).
3.  I. Couzin, J. Krause, et al., Nature 433, 7025 (2005).
4.  H. J. Spaeth, L. Epstein, et al., Supreme Court Database (2011).
5.  A.D. Martin & K. M. Quinn, Pol. Anal. 10, 134 (2002).
6.  L. Epstein, J. A. Segal, et al., Am. J. of Pol. Sci. 83, 557 (2001).
7.  S. Brenner & R. H. Dorff, J. of Th. Pol. 4, 2 (1992).
8.  F. Maltzmann & J. F. Spriggs II, et al., Crafting law on the Supreme
    Court (2000).
9. E. T. Jaynes, Phy. Rev. 106, 620 (1957).
10. D. Black, J. of Pol. Econ. 56, 23 (1948).
Further bibliography
Acknowledgements
            Funding from
       NSF grant CCF-0939370
Dept. of Physics, Princeton University

              Thanks to
 Sigma Xi for hosting this showcase.

Weitere รคhnliche Inhalte

Andere mochten auch

Jigsawbeings principles
Jigsawbeings principlesJigsawbeings principles
Jigsawbeings principlesGlobal
ย 
Twitter web application
Twitter web applicationTwitter web application
Twitter web applicationMiki Takashi
ย 
Alzheimers: a mini hack
Alzheimers: a mini hackAlzheimers: a mini hack
Alzheimers: a mini hackFrank Kelly
ย 
Media Evaluation
Media EvaluationMedia Evaluation
Media Evaluationrebeccatp45
ย 
Supervisor session
Supervisor sessionSupervisor session
Supervisor sessioniowusu
ย 
C48 d fd01
C48 d fd01C48 d fd01
C48 d fd01Felix Muaya
ย 
Critical
CriticalCritical
Criticaliowusu
ย 
Video color correction and more
Video color correction and moreVideo color correction and more
Video color correction and moreJoe Nasr
ย 
La Galerie D'oc organise le Vernissage de tibili
La Galerie D'oc organise le Vernissage de tibiliLa Galerie D'oc organise le Vernissage de tibili
La Galerie D'oc organise le Vernissage de tibiliFranรงoise Cassan
ย 
Contratiempotreatment
ContratiempotreatmentContratiempotreatment
ContratiempotreatmentJesus Barajas
ย 
Prostate Cancer Treatment - Active Surveillance
Prostate Cancer Treatment - Active SurveillanceProstate Cancer Treatment - Active Surveillance
Prostate Cancer Treatment - Active SurveillanceLindsay Rosenwald
ย 
Johnson & Johnson Announces Investment in Protagonist Therapeutics
Johnson & Johnson Announces Investment in Protagonist TherapeuticsJohnson & Johnson Announces Investment in Protagonist Therapeutics
Johnson & Johnson Announces Investment in Protagonist TherapeuticsLindsay Rosenwald
ย 
Adobe Premiere Pro Main Updates
Adobe Premiere Pro Main UpdatesAdobe Premiere Pro Main Updates
Adobe Premiere Pro Main UpdatesJoe Nasr
ย 
Bagaimanakah prinsip islam hahari
Bagaimanakah prinsip islam hahariBagaimanakah prinsip islam hahari
Bagaimanakah prinsip islam hahariL Adul Lalut Adul
ย 

Andere mochten auch (20)

Powerpoint act1
Powerpoint act1Powerpoint act1
Powerpoint act1
ย 
Jigsawbeings principles
Jigsawbeings principlesJigsawbeings principles
Jigsawbeings principles
ย 
Twitter web application
Twitter web applicationTwitter web application
Twitter web application
ย 
Alzheimers: a mini hack
Alzheimers: a mini hackAlzheimers: a mini hack
Alzheimers: a mini hack
ย 
Media Evaluation
Media EvaluationMedia Evaluation
Media Evaluation
ย 
Smoking
SmokingSmoking
Smoking
ย 
Smoking
SmokingSmoking
Smoking
ย 
L univers presentacio
L univers presentacioL univers presentacio
L univers presentacio
ย 
La lluna
La llunaLa lluna
La lluna
ย 
Supervisor session
Supervisor sessionSupervisor session
Supervisor session
ย 
C48 d fd01
C48 d fd01C48 d fd01
C48 d fd01
ย 
Critical
CriticalCritical
Critical
ย 
Absentee voting ofw
Absentee voting   ofwAbsentee voting   ofw
Absentee voting ofw
ย 
Video color correction and more
Video color correction and moreVideo color correction and more
Video color correction and more
ย 
La Galerie D'oc organise le Vernissage de tibili
La Galerie D'oc organise le Vernissage de tibiliLa Galerie D'oc organise le Vernissage de tibili
La Galerie D'oc organise le Vernissage de tibili
ย 
Contratiempotreatment
ContratiempotreatmentContratiempotreatment
Contratiempotreatment
ย 
Prostate Cancer Treatment - Active Surveillance
Prostate Cancer Treatment - Active SurveillanceProstate Cancer Treatment - Active Surveillance
Prostate Cancer Treatment - Active Surveillance
ย 
Johnson & Johnson Announces Investment in Protagonist Therapeutics
Johnson & Johnson Announces Investment in Protagonist TherapeuticsJohnson & Johnson Announces Investment in Protagonist Therapeutics
Johnson & Johnson Announces Investment in Protagonist Therapeutics
ย 
Adobe Premiere Pro Main Updates
Adobe Premiere Pro Main UpdatesAdobe Premiere Pro Main Updates
Adobe Premiere Pro Main Updates
ย 
Bagaimanakah prinsip islam hahari
Bagaimanakah prinsip islam hahariBagaimanakah prinsip islam hahari
Bagaimanakah prinsip islam hahari
ย 

ร„hnlich wie Statistical Mechanics of SCOTUS

Module-2_Notes-with-Example for data science
Module-2_Notes-with-Example for data scienceModule-2_Notes-with-Example for data science
Module-2_Notes-with-Example for data sciencepujashri1975
ย 
CHI-SQUARE TEST
CHI-SQUARE TESTCHI-SQUARE TEST
CHI-SQUARE TESTNathanBaloloy
ย 
Introduction of โ€œFairness in Learning: Classic and Contextual Banditsโ€
Introduction of โ€œFairness in Learning: Classic and Contextual Banditsโ€Introduction of โ€œFairness in Learning: Classic and Contextual Banditsโ€
Introduction of โ€œFairness in Learning: Classic and Contextual Banditsโ€Kazuto Fukuchi
ย 
Descriptive statistics-Skewness-Kurtosis-Correlation.ppt
Descriptive statistics-Skewness-Kurtosis-Correlation.pptDescriptive statistics-Skewness-Kurtosis-Correlation.ppt
Descriptive statistics-Skewness-Kurtosis-Correlation.pptRama Krishna
ย 
Quantum Models of Cognition and Decision
Quantum Models of Cognition and DecisionQuantum Models of Cognition and Decision
Quantum Models of Cognition and DecisionCatarina Moreira
ย 
Quantitative Methods for Lawyers - Class #14 - Power Laws, Hypothesis Testing...
Quantitative Methods for Lawyers - Class #14 - Power Laws, Hypothesis Testing...Quantitative Methods for Lawyers - Class #14 - Power Laws, Hypothesis Testing...
Quantitative Methods for Lawyers - Class #14 - Power Laws, Hypothesis Testing...Daniel Katz
ย 
03-Data-Analysis-Final.pdf
03-Data-Analysis-Final.pdf03-Data-Analysis-Final.pdf
03-Data-Analysis-Final.pdfSugumarSarDurai
ย 
Seventy years of RCTs
Seventy years of RCTsSeventy years of RCTs
Seventy years of RCTsStephen Senn
ย 
Statistical Analysis with R- III
Statistical Analysis with R- IIIStatistical Analysis with R- III
Statistical Analysis with R- IIIAkhila Prabhakaran
ย 
Statistics Chapter 01[1]
Statistics  Chapter 01[1]Statistics  Chapter 01[1]
Statistics Chapter 01[1]plisasm
ย 
ECONOMETRICS I ASA
ECONOMETRICS I ASAECONOMETRICS I ASA
ECONOMETRICS I ASAAdel Abouhana
ย 
Statistics78 (2)
Statistics78 (2)Statistics78 (2)
Statistics78 (2)Persikla Yousaf
ย 
Lecture 4 - probability distributions (2).pptx
Lecture 4 - probability distributions (2).pptxLecture 4 - probability distributions (2).pptx
Lecture 4 - probability distributions (2).pptxSinimol Aniyankunju
ย 

ร„hnlich wie Statistical Mechanics of SCOTUS (20)

Module-2_Notes-with-Example for data science
Module-2_Notes-with-Example for data scienceModule-2_Notes-with-Example for data science
Module-2_Notes-with-Example for data science
ย 
CHI-SQUARE TEST
CHI-SQUARE TESTCHI-SQUARE TEST
CHI-SQUARE TEST
ย 
Introduction of โ€œFairness in Learning: Classic and Contextual Banditsโ€
Introduction of โ€œFairness in Learning: Classic and Contextual Banditsโ€Introduction of โ€œFairness in Learning: Classic and Contextual Banditsโ€
Introduction of โ€œFairness in Learning: Classic and Contextual Banditsโ€
ย 
Statistics (recap)
Statistics (recap)Statistics (recap)
Statistics (recap)
ย 
Descriptive statistics-Skewness-Kurtosis-Correlation.ppt
Descriptive statistics-Skewness-Kurtosis-Correlation.pptDescriptive statistics-Skewness-Kurtosis-Correlation.ppt
Descriptive statistics-Skewness-Kurtosis-Correlation.ppt
ย 
Quantum Models of Cognition and Decision
Quantum Models of Cognition and DecisionQuantum Models of Cognition and Decision
Quantum Models of Cognition and Decision
ย 
Quantitative Methods for Lawyers - Class #14 - Power Laws, Hypothesis Testing...
Quantitative Methods for Lawyers - Class #14 - Power Laws, Hypothesis Testing...Quantitative Methods for Lawyers - Class #14 - Power Laws, Hypothesis Testing...
Quantitative Methods for Lawyers - Class #14 - Power Laws, Hypothesis Testing...
ย 
03-Data-Analysis-Final.pdf
03-Data-Analysis-Final.pdf03-Data-Analysis-Final.pdf
03-Data-Analysis-Final.pdf
ย 
T test
T test T test
T test
ย 
Seventy years of RCTs
Seventy years of RCTsSeventy years of RCTs
Seventy years of RCTs
ย 
Statistical Analysis with R- III
Statistical Analysis with R- IIIStatistical Analysis with R- III
Statistical Analysis with R- III
ย 
Crosstabs
CrosstabsCrosstabs
Crosstabs
ย 
Statistics Chapter 01[1]
Statistics  Chapter 01[1]Statistics  Chapter 01[1]
Statistics Chapter 01[1]
ย 
Diss Pres
Diss PresDiss Pres
Diss Pres
ย 
Lecture 4
Lecture 4Lecture 4
Lecture 4
ย 
Probability
ProbabilityProbability
Probability
ย 
Chi square
Chi squareChi square
Chi square
ย 
ECONOMETRICS I ASA
ECONOMETRICS I ASAECONOMETRICS I ASA
ECONOMETRICS I ASA
ย 
Statistics78 (2)
Statistics78 (2)Statistics78 (2)
Statistics78 (2)
ย 
Lecture 4 - probability distributions (2).pptx
Lecture 4 - probability distributions (2).pptxLecture 4 - probability distributions (2).pptx
Lecture 4 - probability distributions (2).pptx
ย 

Statistical Mechanics of SCOTUS

  • 1. Statistical mechanics of SCOTUS Edward D. Lee1,2 Chase P. Broedersz1,2 William Bialek1,2 1Department of Physics, Princeton U. 2Lewis-Sigler Institute for Integrative Genomics, Princeton U.
  • 2. From complex to simple Group behavior of social organisms can manifest complex patterns at the group level even though they might reduce down to simple rules for individuals [1โ€”3]. For example, starlings seem to interact only with local neighbors, and these local interactions produce global patterns like on the right. We call this emergent collective phenomena because such behavior is not explicitly encoded in the behavior of individuals, but arises from interactions. Is it the case that decisions by groups of people, despite apparent higher Flocks of starlings form complex patterns order behaviors, are likewise reducible (http://webodysseum.com/videos/spectacu to simpler rules? lar-starling-flocks-video-murmuration/).
  • 3. US Supreme Court (SCOTUS) We investigate the voting behavior of the SCOTUS. We explore the structure of decisions to gain insight into how these decisions are reached. We show results for the second Rehnquist Court (1994โ€”2005, ๐‘ = 895 cases) and discuss other โ€œnatural courtsโ€โ€”periods of time when member remain constantโ€”where relevant. SCOTUS facts: โ€ข highest court in the US government โ€ข write a majority and a minority โ€ข nine Justices appointed for life opinion, legally clarifying their โ€ข vote on constitutionality of legislative decisions, which can be and executive actions supplemented with separate opinions โ€ข usually hears appeals from lower โ€ข Justices must ultimately render a courts' decisions, which Justices binary decision affirm or reverse โ€ข the second Rehnquist Court is โ€ข sometimes Justices are recused for typically considered as 4 liberals and conflict of interest, sickness, etc. 5 conservatives โ€ข decisions by majority vote
  • 4. Previous approaches to SCOTUS voting Although there is a long history of scholarly study of SCOTUS, nearly all approaches rely on important assumptionsโ€”even though they may be justified. Some assume Justices vote independently given ideological preferences [5]. Others, if including interactions, do not include them in a general model of voting or posit their structure instead of deriving it from data [6]. Most importantly, many draw on complex, underlying cognitive frameworks like rationality or expression of internal beliefs, which are validated by predictions made of the data [5โ€”8]. However, it is impossible to validate all aspects of these complex models. Is there a way to construct accurate models while abstaining from introducing complexities? Characteristics of previous approaches โ€ข Decision-making framework [7โ€”8] โ€ข Independent โ€ข Ideological liberal vs. โ€ข Externally posited voters given conservative axis [5,7โ€” interaction [6] ideological 8] preferences [5] โ€ข Subset of votes deemed How do we approach relevant [5] the system with โ€ข Not generally predictive minimal Attitudinal Game theoretical assumptions?
  • 5. Back to the data The data, published by [4], immediately reveals strong structure in Court behavior. For example, we see that in the second Rehnquist Court, 44% of votes were unanimous. Overall, when considering the natural courts shown on the right, 36% of votes are unanimous on average. Only 10% of votes fall along the liberal vs. conservative divide. Does an independent model support this observation? Distribution of voting data for natural courts starting in given year. Blue, 0 dissenting votes; red, 1; yellow, 2; green, 3; black, 4. Terms are the number of years that the same members remained on the Court. The number of votes on record for each set of years is in gray.
  • 6. The simplest model Independent voters Each Justice ๐‘– has probability ๐‘ ๐‘– of voting to affirm, so the probability of ๐‘˜ votes in the majority out of ๐‘› Justices is ๐‘› ๐‘˜ ๐‘›โˆ’๐‘˜ ๐‘› ๐‘ƒ ๐‘˜ = ๐‘ 1โˆ’ ๐‘ + ๐‘ ๐‘›โˆ’๐‘˜ 1 โˆ’ ๐‘ ๐‘˜ ๐‘˜ ๐‘›โˆ’ ๐‘˜ An independent model fails to explain the distribution of votes in the majority. This is not so surprising because weโ€™ve assumed that all higher order behaviors are encoded within the first moments. For example, for a unanimous vote (๐‘˜ = 9) to occur in the independent model, all Justices would happen to vote the same way, but this happens much more rarely than observed, yielding 0.5% of the observed value. Indeed, interactions are crucial to SCOTUS voting behavior.
  • 7. More evidence for interaction Justices take at least two votes for a case: an initial secret vote and a final decision. Justices may attempt to persuade each other in between, but it is difficult to measure such interactions partly because the first vote is secret. According to data available on the Waite Court (1874โ€”1887), 9% of final votes had at least one dissenting vote while 40% had at least one in the initial vote [5]. In general, Justices more often switch to the majority than the reverse, suggestive of consensus-promoting interaction. Maltzmann et al. show from memos that Justices strategically manipulate their communication to attempt to influence the vote and written opinions of the Court [6]. Nonetheless, most models either treat the Justices as independent or do not explicitly include interactions in a predictive framework. Indeed, it is difficult to devise the right structure for interactions! How do we account for interactions in a principled fashion?
  • 8. Including interactionsโ€ฆ Given that Justice ๐‘– can vote in two ways, we represent his or her vote as ๐œŽ ๐‘– โˆˆ โˆ’1,1 . Then, the independent model is the simplest model that fits average voting records { ๐œŽ ๐‘– } and all higher order correlations, { ๐œŽ ๐‘– ๐œŽ๐‘— , ๐œŽ ๐‘– ๐œŽ๐‘— ๐œŽ ๐‘˜ โ€ฆ ๐œŽ ๐‘– ๐œŽ๐‘— โ€ฆ ๐œŽ ๐‘› }, are reducible to { ๐œŽ ๐‘– }. In fact, all higher order statistics are as random as possible given the individual means, so there is no reason any higher order correlations should match that of the data. We can generalize this idea to a ๐‘šth order model that fits all correlations up to order ๐‘š yet generates all > ๐‘š order correlations randomly. Since these distributions are as random as possible given what is fit, that also means that we make no further assumptions than what is given in the fitted correlations or about how these distributions are generated. With SCOTUS, we might expect that we need to account for the bloc behavior (5 vs. 4) and unanimous behavior by including terms of the 4th, 5th and 9th orders explicitly. However, let us take only the next step of fitting both ๐œŽ ๐‘– and ๐œŽ ๐‘– ๐œŽ๐‘— .
  • 9. โ€ฆas maximizing entropy The formalization for generating these distributions is called the principle of maximum entropy [9]. Entropy is a measure of the randomness of a distribution. The entropy of a probability distribution ๐‘ƒ(๐œŽ) of the votes of a set of ๐‘› voters ๐œŽ = {๐œŽ1 , โ€ฆ , ๐œŽ ๐‘› } is ๐‘† ๐‘ƒ ๐œŽ =โˆ’ ๐‘ƒ ๐œŽ log ๐‘ƒ(๐œŽ) ๐œŽ which we maximize while constraining ๐œŽ ๐‘– and ๐œŽ ๐‘– ๐œŽ๐‘— ๐‘› ๐‘› 1 ๐‘† ๐‘ƒ ๐œŽ , โ„Ž ๐‘– , ๐ฝ ๐‘–๐‘— = ๐‘† โˆ’ โ„Ž ๐‘– ๐œŽ๐‘– โˆ’ ๐ฝ ๐‘–๐‘— ๐œŽ ๐‘– ๐œŽ๐‘— 2 ๐‘–=1 ๐‘–,๐‘—=1 with Lagrange multipliers โ„Ž ๐‘– , ๐ฝ ๐‘–๐‘— . The resulting model is known as the Ising model 1 ๐‘ƒ ๐œŽ = ๐‘’ โˆ’๐ป(๐œŽ) ๐‘ ๐‘› 1 ๐‘› ๐ป ๐œŽ =โˆ’ โ„Ž ๐‘– ๐œŽ๐‘– โˆ’ ๐ฝ ๐‘–๐‘— ๐œŽ ๐‘– ๐œŽ๐‘— ๐‘–=1 2 ๐‘–,๐‘—=1 with a normalizing constant, the partition function ๐‘, and Hamiltonian ๐ป(๐œŽ).
  • 10. Ising model 1 โˆ’๐ป(๐œŽ) ๐‘ƒ ๐œŽ = ๐‘’ ๐‘ ๐‘› 1 ๐‘› ๐ป ๐œŽ =โˆ’ โ„Ž ๐‘– ๐œŽ๐‘– โˆ’ ๐ฝ ๐‘–๐‘— ๐œŽ ๐‘– ๐œŽ๐‘— ๐‘–=1 2 ๐‘–,๐‘—=1 The โ„Ž ๐‘– loosely refer to the โ€œmean biasโ€ of each voter ๐œŽ ๐‘– , and the ๐ฝ ๐‘–๐‘— loosely refer to the interaction between them, or โ€œcouplings.โ€ Since votes, or โ€œstatesโ€ ๐œŽ, with a smaller ๐ป are more probable, โ„Ž ๐‘– > 0 implies that ๐œŽ ๐‘– is more likely to take value 1. Also, ๐ฝ ๐‘–๐‘— > 0 implies that ๐œŽ ๐‘– and ๐œŽ๐‘— are more likely to take the same value. We can solve for the parameters โ„Ž ๐‘– and ๐ฝ ๐‘–๐‘— such that our model fits the given moments โŸจ๐œŽ ๐‘– โŸฉ and โŸจ๐œŽ ๐‘– ๐œŽ๐‘— โŸฉ.
  • 11. Mapping spins We have yet to define how the values of ๐œŽ ๐‘– correspond to actual vote. It is not as simple as calling one value affirm and the other reverse: the outcome of affirming or reversing depends on how the case is posed. It is entirely possible that affirming one case is a liberal decision and conservative in another. What is the right dimension along which to orient the ๐œŽ ๐‘– ? We abstain from making a choice, and introducing external bias, by symmetrizing the up and down votes such that โˆ’1 and 1 are equivalent. This keeps ๐œŽ ๐‘– ๐œŽ๐‘— the same and fixes ๐œŽ ๐‘– = 0. Correspondingly, โ„Ž ๐‘– = 0. We find that absence of a bias is a reasonable assumption because bias is not the dominant term for judicial voting behavior.
  • 12. Model fit Remarkably, the Ising model fits the data well. One measure of the fit is to consider the difference in entropy of the ๐‘šth order model with the data ๐ผ ๐‘š = ๐‘† ๐‘š โˆ’ ๐‘†data [2]. As we increase ๐‘š, we capture more correlation and the entropy of our models monotonically decreases to that of the data, where ๐‘† ๐‘› = ๐‘†data . The furthest distance ๐ผ1 is called the multi-information. Our model captures 90% of the multi-information (right). Thus, it nearly captures all the structure in the data. It also follows โˆ’ log ๐‘ƒ ๐œŽ โˆ The model ๐ป(๐œŽ) for the most frequent states. The captures 90% least fit states only appear one or twice of the multi- on average in a bootstrap sample of the information. data.
  • 13. Implications of Ising model fit The fit by the Ising model shows that higher order behaviors like ideological blocs and unanimity can emerge from lower order behaviors at the level of pairwise interactions between individuals. Including higher order terms will result in a marginal improvement in the fit. This result is surprising because it suggests that higher level coordination is not the dominant explanation of voting behavior. Previously, scholars have pointed to the high level of consensus in the Court to as evidence for a โ€œnorm of consensus,โ€ which seems analogous to an effective ninth order term for behavior [6]. Our results point to a different sort of decision-making structure.
  • 14. Found coupling network ๐ถ ๐‘–๐‘— = ๐œŽ ๐‘– ๐œŽ๐‘— โˆ’ ๐œŽ ๐‘– ๐œŽ๐‘— and ๐ฝ ๐‘–๐‘— graphs. Justices with a liberal voting record are colored blue whereas those with a conservative are colored red. Positive edges are red and negative blue. Widths are proportional to magnitude. All ๐ถ ๐‘–๐‘— are positive whereas some ๐ฝ ๐‘–๐‘— are negative. Justices are initialed: John Stevens (JS), Ruth Ginsburg (RG), David Souter (DS), Steven Breyer (SB), Sandra Oโ€™Connor (SO), Anthony Kennedy (AK), William Rehnquist (WR), Antonin Scalia (AS), Clarence Thomas (CT).
  • 15. Understanding couplings As a simple check, we see that the average ๐ฝ ๐‘–๐‘— within ideological blocs (blue to blue or red to red) are positive while the average between (blue to red) is negative (previous slide). The corresponding averages of ๐ถ ๐‘–๐‘— also show this relative change although all ๐ถ ๐‘–๐‘— are positive, obscuring the antagonistic tendency. To better understand the distribution of ๐ฝ ๐‘–๐‘— , we consider the effective field on ๐œŽ ๐‘– from its neighbors. ๐‘› 1 โ„Žeff = ๐‘– ๐ฝ ๐‘–๐‘— ๐œŽ๐‘— 2 ๐‘—=1 Note that it depends on the state of neighbors ๐œŽ๐‘— . Since this distribution over all ๐œŽ is symmetric around 0, we Distributions ๐‘ƒ โ„Žeff ๐œŽ . Red histogram is only show the positive half (right). ๐‘– We fix ๐œŽ ๐‘– = 1 and compare the shifts in the distributions distribution of fields from only conservative Justices. Ordered from most liberal to most of โ„Žeff of its neighbors ๐œŽ , which we measure by taking ๐‘— ๐‘— conservative voting record from left to right, the mean over standard deviation ๐œ‡/ฮฃ ๐‘—๐‘– of ๐‘ƒ โ„Žeff . In ๐‘– top to bottom. The more conservatively a the absence of such fixing, ๐œ‡/ฮฃ ๐‘—๐‘– = 0 (next slide). Justice votes, the more the mean field due to conservatives marches to the right.
  • 16. Shifts in โ„Ž eff ๐‘– ๐œ‡ = 0.8 Average shifts in ฮฃ ๐‘—๐‘– distributions of ๐œ‡ โ„Žeff over = 4.7 ๐‘– ฮฃ ๐‘—๐‘– Liberals Conservatives ๐œ‡ ideological blocs = 4.3 when holding one ฮฃ ๐‘—๐‘– member of a bloc, ๐‘–, at 1 at a time. Average shift in liberals (๐‘—) ๐œ‡ when holding = 0.8 conservatives fixed (๐‘–) ฮฃ ๐‘—๐‘– As expected, ideological neighbors are much more affected by fixing ๐œŽ ๐‘– = 1 by a factor of 5-6, showing that ideological blocs are a natural division of the Court. Overall, the Court always shifts in the same direction as the perturbation. Thus, we find that higher order behavior as ideological blocs and general unanimity are reflected in the couplings. O'Connor and Kennedy shift conservatives (liberals) to ๐œ‡/ฮฃ ๐‘—๐‘– = 2.6 (1.4) and ๐œ‡/ฮฃ ๐‘—๐‘– = 3.1 (1.1), reaffirming their moderate credentials. Stevens, however, has overall weakest connections with ๐œ‡/ฮฃ ๐‘—๐‘– = 0.36 (2.72) as if more isolated.
  • 17. Caveats with couplings We must be careful not to interpret the ๐ฝ ๐‘–๐‘— literally as corresponding to behavioral interaction on the Court. The distinction that we cannot make, which is indeed impossible with this data set, is to explain the underlying mechanism for correlations. We may find two Justices that vote together too much for chance, but it could be the case that either they collaborate to a large extent or that their perspectives have been shaped by a similar background. The latter involves a hidden third actor, but it is indistinguishable from the other with only the voting record. In many ways, possible confounding factors that contribute to ๐ฝ ๐‘–๐‘— reflect fundamental limitations of the data. Our guiding principle is that we refrain from assuming anything beyond what is already given from the data; other models do not have the same claim minimal assumptions. Furthermore, we know from anecdotal evidence that Justices persuade each other, so there are certainly interactions captured by the ๐ฝ ๐‘–๐‘— .
  • 18. Probing influence With this model of voting behavior, we can probe the behavior of the system under perturbations. The quantity of interest here is the majority outcome of the court ๐‘ ๐‘– ๐œŽ๐‘– ๐›พ= | ๐‘–๐‘ ๐œŽ ๐‘– | because this is the decision rendered. How sensitive is the average decision ๐œธ to a small changes in the average behavior of a Justice ๐ˆ ๐’Š ?
  • 19. Probing influence Formally, this is the susceptibility of โŸจ๐›พโŸฉ 1 ๐œ•โŸจ๐›พโŸฉ 1 ๐œ“๐‘– = = ๐›พ๐œŽ ๐‘– โˆ’ ๐›พ ๐œŽ ๐‘– ๐œ’ ๐‘– ๐œ•โ„Ž ๐‘– ๐œ’๐‘– which we have normalized over ๐œ•โŸจ๐œŽ ๐‘– โŸฉ ๐œ’๐‘– = ๐œ•โ„Ž ๐‘– to compare the Justices equally with respect to changes in their averages. The values we find are SO AK WR DS SB AS RG CT JS mean ๐๐’Š 0.834 0.809 0.719 0.650 0.644 0.623 0.616 0.608 0.421 0.658 95% confidence 0.001 0.002 0.002 0.002 0.002 0.002 0.002 0.002 0.002 interval ๐ ๐’Š โˆ’ โŸจ๐โŸฉ 0.176 0.151 0.061 -0.008 -0.014 -0.035 -0.042 -0.050 -0.237
  • 20. Are ideological medians influential? The typical wisdom in the political science literature is that these ideological medians are the most influential for Court decisions. Basically, the argument is that voters who sit in the middle of a unidimensional, symmetric preference space will be predictive of the majority [10]. The relevant space is liberal vs. conservative as we have confirmed with โ„Žeff . The Justices to whom the outcome is most ๐‘– sensitive here are the ideological medians SO and AK, in agreement with the claim. However, real systems may be complicated by interactions that constrain how such a voter may cast her vote or how a majority forms initially and persists. We do not find that it is the ideological medians to whom the outcome of the court is most sensitive in general. Importantly, our results are derived under minimal assumptions. We do not assume ideological behaviorโ€”which though generally visible, still has to be imposed by the observerโ€”and we account for interactions.
  • 21. Isolating interactions We can further inquire about the nature of influence by leveraging our found interactions. A Justice could affect the outcome in two ways: 1. Change own vote 2. Impact on colleaguesโ€™ votes through interactions How do we distinguish between this two kinds of impact accounted for by ๐ ๐’Š ? We simulate how ๐œŽ ๐‘– might increase pressure on colleagues through couplings by 1. increasing the average coupling with ๐œŽ ๐‘– such that neighborsโ€™ effective fields break symmetry around 0 to โŸจโ„Žeff โŸฉ = ๐ฝ ๐‘–๐‘— ๐œ–. However, this will also ๐‘— incur a shift to ๐œŽ ๐‘– โ‰  0, so 2. we add a compensating field โ„Žโ€ฒ๐‘– to โ„Žeff to fix ๐œŽ ๐‘– = 0. ๐‘– The latter step ensures that we do not allow an average shift in an individualโ€™s vote to affect the outcome. We denote the resulting change from pushing on ๐œŽ ๐‘– โ€™s neighbors ๐›ฟ๐›พ ๐‘– .
  • 22. Isolating interactions AK SO DS SB RG WR AS CT JS mean ๐œน๐œธ ๐’Š 0.348 0.340 0.296 0.276 0.245 0.231 0.195 0.138 0.130 0.244 95% confidence 0.001 0.001 0.001 0.001 0.001 0.002 0.003 0.003 0.003 interval ๐œน๐œธ ๐’Š โˆ’ โŸจ๐œน๐œธโŸฉ 0.104 0.095 0.051 0.032 0.001 -0.014 -0.049 -0.106 -0.114 Comparing with ๐œ“ ๐‘– โ€ฆ AK and SO switch order and are relatively closer. Interactions may differentiate between Justices for whom interactions are important. It is not the case that Justices highest by ๐œ“ ๐‘– are also highest by ๐›ฟ๐›พ ๐‘– across all natural courts although it is here. WR falls from 1st to 6th place. WR is the Chief Justice who is responsible for enforcing procedural rules and has prerogative for assigning opinions. Interestingly, WR is consistently low by ๐›ฟ๐›พ ๐‘– but rises in rank by ๐œ“ ๐‘– only being appointed Chief Justice.
  • 23. Isolating interactions AK SO DS SB RG WR AS CT JS mean ๐œน๐œธ ๐’Š 0.348 0.340 0.296 0.276 0.245 0.231 0.195 0.138 0.130 0.244 95% confidence 0.001 0.001 0.001 0.001 0.001 0.002 0.003 0.003 0.003 interval ๐œน๐œธ ๐’Š โˆ’ โŸจ๐œน๐œธโŸฉ 0.104 0.095 0.051 0.032 0.001 -0.014 -0.049 -0.106 -0.114 Comparing with ๐œ“ ๐‘– โ€ฆ CT and JS are relatively much closer. CT and JS are the most extreme voters on the conservative and liberal ends of the spectrum. The outcome is similarly least sensitive to their couplings even though CT votes with the Court 80% of the time and JS 72%. Thus, ideological hardliners are identified by a certain voting pattern distinguishing between agreement with or concurrence with the majority. CT and AS are similarly biased ideologically, but AS seems to be more strongly embedded in the interaction network. All ๐›ฟ๐›พ ๐‘– > 0, reflecting the general tendency to consensus.
  • 24. Conclusion We propose deriving behavior from data instead of testing a hypothesized framework. With this approach, we show that SCOTUS voting behavior can be explained as behavior that emerges from pairwise interaction even though higher order behaviors are manifest. This model shows the higher order structures like ideological blocs and unanimity quite clearly through the parameters. We show how one can exploit the model of voting behavior by considering the susceptibility of โŸจ๐›พโŸฉ to shifts in average voting behavior. We also isolate the shifts in โŸจ๐›พโŸฉ specific to interactions and distinguish between Justices similar by ๐œ“ ๐‘– along that second dimension of ๐›ฟ๐›พ ๐‘– . However suggestive our results, the correspondence of parameters to real behavior remains unclear. We hope to soon start a collaboration with political scientists investigate whether an interpretable correspondence can be established.
  • 25. Works cited 1. W. Bialek, A. Cavagna, et al., PNAS 109, 4786 (2012). 2. E. Schneidman, M. Berry, et al., Nature 440, 1007 (2006). 3. I. Couzin, J. Krause, et al., Nature 433, 7025 (2005). 4. H. J. Spaeth, L. Epstein, et al., Supreme Court Database (2011). 5. A.D. Martin & K. M. Quinn, Pol. Anal. 10, 134 (2002). 6. L. Epstein, J. A. Segal, et al., Am. J. of Pol. Sci. 83, 557 (2001). 7. S. Brenner & R. H. Dorff, J. of Th. Pol. 4, 2 (1992). 8. F. Maltzmann & J. F. Spriggs II, et al., Crafting law on the Supreme Court (2000). 9. E. T. Jaynes, Phy. Rev. 106, 620 (1957). 10. D. Black, J. of Pol. Econ. 56, 23 (1948).
  • 27. Acknowledgements Funding from NSF grant CCF-0939370 Dept. of Physics, Princeton University Thanks to Sigma Xi for hosting this showcase.