Stochastic transitivity

Stochastic transitivity models^[1]^[2]^[3]^[4] are stochastic versions of the transitivity property of binary relations studied in mathematics. Several models of stochastic transitivity exist and have been used to describe the probabilities involved in experiments of paired comparisons, specifically in scenarios where transitivity is expected but empirical observations of the binary relation are probabilistic. For example, players' skills in a sport might be expected to be transitive, i.e. "if player A is better than B and B is better than C, then player A must be better than C"; however, in any given match, a weaker player might still win with positive probability. Closely matched players are more likely to produce such an inversion, while players with large differences in skill should see inversions only rarely. Stochastic transitivity models formalize such relations between the probabilities (e.g. of an outcome of a match) and the underlying transitive relation (e.g. the skills of the players).

A binary relation ${\textstyle \succsim }$ on a set ${\mathcal {A}}$ is called transitive, in the standard non-stochastic sense, if $a\succsim b$ and $b\succsim c$ implies $a\succsim c$ for all members $a,b,c$ of ${\mathcal {A}}$ .

Stochastic versions of transitivity include:

Weak Stochastic Transitivity (WST): $\mathbb {P} (a\succsim b)\geq {\tfrac {1}{2}}$ and $\mathbb {P} (b\succsim c)\geq {\tfrac {1}{2}}$ implies $\mathbb {P} (a\succsim c)\geq {\tfrac {1}{2}}$ , for all $a,b,c\in {\mathcal {A}}$ ;^[5]^: 12^[6]^: 43rg
Strong Stochastic Transitivity (SST): $\mathbb {P} (a\succsim b)\geq {\tfrac {1}{2}}$ and $\mathbb {P} (b\succsim c)\geq {\tfrac {1}{2}}$ implies $\mathbb {P} (a\succsim c)\geq \max\{\mathbb {P} (a\succsim b),\mathbb {P} (b\succsim c)\}$ , for all $a,b,c\in {\mathcal {A}}$ ;^[5]^: 12
Linear Stochastic Transitivity (LST): $\mathbb {P} (a\succsim b)=F(\mu (a)-\mu (b))$ , for all $a,b\in {\mathcal {A}}$ , where $F:\mathbb {R} \to [0,1]$ is some increasing and symmetric function, i.e. $F(-x)=1-F(x)$ (called a comparison function), and $\mu :{\mathcal {A}}\to \mathbb {R}$ is some mapping from the set ${\mathcal {A}}$ of alternatives to the real line (called a merit function).
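
The WST and SST conditions above can be checked mechanically on a finite set of alternatives. The following is a minimal sketch, not taken from the cited literature: it assumes the probabilities are stored in a square matrix `P` with `P[a][b]` standing for $\mathbb {P} (a\succsim b)$ and `P[a][a] = 0.5`. The function names, the tolerance `tol` and the three example matrices are all hypothetical, chosen here for illustration.

```python
from itertools import product


def satisfies_wst(P, tol=1e-12):
    """Check Weak Stochastic Transitivity on a pairwise-probability matrix."""
    n = len(P)
    for a, b, c in product(range(n), repeat=3):
        # Premise: a beats b and b beats c with probability at least 1/2.
        if P[a][b] >= 0.5 - tol and P[b][c] >= 0.5 - tol:
            if P[a][c] < 0.5 - tol:
                return False
    return True


def satisfies_sst(P, tol=1e-12):
    """Check Strong Stochastic Transitivity on a pairwise-probability matrix."""
    n = len(P)
    for a, b, c in product(range(n), repeat=3):
        if P[a][b] >= 0.5 - tol and P[b][c] >= 0.5 - tol:
            if P[a][c] < max(P[a][b], P[b][c]) - tol:
                return False
    return True


# Hypothetical example: satisfies WST but violates SST (0.6 < max(0.9, 0.9)).
P_WEAK_ONLY = [
    [0.5, 0.9, 0.6],
    [0.1, 0.5, 0.9],
    [0.4, 0.1, 0.5],
]

# Hypothetical example: a "rock-paper-scissors" cycle, which violates WST.
P_CYCLE = [
    [0.5, 0.6, 0.4],
    [0.4, 0.5, 0.6],
    [0.6, 0.4, 0.5],
]

# Hypothetical example: satisfies both WST and SST.
P_STRONG = [
    [0.5, 0.7, 0.9],
    [0.3, 0.5, 0.8],
    [0.1, 0.2, 0.5],
]

if __name__ == "__main__":
    for name, P in [("weak only", P_WEAK_ONLY), ("cycle", P_CYCLE), ("strong", P_STRONG)]:
        print(name, "WST:", satisfies_wst(P), "SST:", satisfies_sst(P))
```

The brute-force loop visits every ordered triple, so it is only meant for small sets of alternatives.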

A toy example

The marble game - Assume two kids, Billy and Gabriela, collect marbles. Billy collects blue marbles and Gabriela green marbles. When they get together they play a game where they mix all their marbles in a bag and sample one randomly. If the sampled marble is green, then Gabriela wins and if it is blue then Billy wins. If $B$ is the number of blue marbles and $G$ is the number of green marbles in the bag, then the probability $\mathbb {P} ({\text{Billy}}\succsim {\text{Gabriela}})$ of Billy winning against Gabriela is

$\mathbb {P} ({\text{Billy}}\succsim {\text{Gabriela}})={\frac {B}{B+G}}={\frac {e^{\ln(B)}}{e^{\ln(B)}+e^{\ln(G)}}}={\frac {1}{1+e^{\ln(G)-\ln(B)}}}$ .

In this example, the marble game satisfies linear stochastic transitivity, where the comparison function $F:\mathbb {R} \to [0,1]$ is given by $F(x)={\frac {1}{1+e^{-x}}}$ and the merit function $\mu :{\mathcal {A}}\to \mathbb {R}$ is given by $\mu (M)=\ln(M)$ , where $M$ is the number of marbles of the player. This game happens to be an example of a Bradley–Terry model.^[7]
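
The identity behind the marble game can be checked numerically. The sketch below compares the direct ratio $B/(B+G)$ with the LST form $F(\mu (B)-\mu (G))$ using the logistic comparison function and the logarithmic merit function given above. The function names and the marble counts are hypothetical and serve only as an illustration.

```python
import math


def logistic(x):
    """Comparison function F(x) = 1 / (1 + exp(-x))."""
    return 1.0 / (1.0 + math.exp(-x))


def merit(marbles):
    """Merit function mu(M) = ln(M) for a player holding M > 0 marbles."""
    return math.log(marbles)


def p_billy_direct(blue, green):
    """Probability that a uniformly drawn marble is blue."""
    return blue / (blue + green)


def p_billy_lst(blue, green):
    """The same probability written in the LST form F(mu(B) - mu(G))."""
    return logistic(merit(blue) - merit(green))


if __name__ == "__main__":
    # Hypothetical marble counts (blue, green).
    for blue, green in [(3, 1), (10, 10), (2, 7)]:
        direct = p_billy_direct(blue, green)
        via_lst = p_billy_lst(blue, green)
        print(blue, green, direct, via_lst, math.isclose(direct, via_lst))
```

Both expressions agree up to floating-point rounding, which is why the comparison uses `math.isclose` rather than exact equality.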

Applications

Ranking and Rating - Stochastic transitivity models have been used as the basis of several ranking and rating methods. Examples include the Elo rating system used in chess, Go, and other classical sports, as well as Microsoft's TrueSkill, used for the Xbox gaming platform.
Models of Psychology and Rationality - Thurstonian models^[8] (see Case 5 in law of comparative judgement), Fechnerian models^[3] and Luce's choice axiom^[9] are theories founded on the mathematics of stochastic transitivity. Models of rational choice theory are likewise based on the assumption of transitivity of preferences (see Von Neumann's utility and Debreu's Theorems); these preferences, however, are often revealed with noise, in a stochastic manner.^[10]^[11]^[12]
Machine Learning and Artificial Intelligence (see Learn to Rank) - While Elo and TrueSkill rely on specific LST models, machine learning models have been developed to rank without prior knowledge of the underlying stochastic transitivity model, or under weaker-than-usual assumptions on the stochastic transitivity.^[13]^[14]^[15] Learning from paired comparisons is also of interest since it allows AI agents to learn the underlying preferences of other agents.
Game Theory - Fairness of random knockout tournaments is strongly dependent on the underlying stochastic transitivity model.^[16]^[17]^[18] Social choice theory also has foundations that depend on stochastic transitivity models.^[19]
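
As an illustration of the Ranking and Rating item above, the standard Elo expected-score formula has the LST form, with the rating playing the role of the merit function and $F(x)=1/(1+10^{-x/400})$ as the comparison function. The sketch below is a hedged example: the function name is hypothetical, the scale of 400 is the conventional chess value, and reading the expected score as a win probability assumes that draws are ignored.

```python
def elo_expected_score(rating_a, rating_b, scale=400.0):
    """Expected score of player A against player B under the Elo formula.

    Written in LST form: F(x) = 1 / (1 + 10 ** (-x / scale)) evaluated at
    x = rating_a - rating_b, so the rating acts as the merit function.
    """
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / scale))


if __name__ == "__main__":
    # Hypothetical ratings.
    print(elo_expected_score(1500, 1500))
    print(elo_expected_score(1900, 1500))
    # Symmetry of the comparison function: F(x) + F(-x) = 1.
    print(elo_expected_score(1700, 1500) + elo_expected_score(1500, 1700))
```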

Connections between models

Positive Results:

Every model that satisfies Linear Stochastic Transitivity must also satisfy Strong Stochastic Transitivity, and every model that satisfies Strong Stochastic Transitivity must also satisfy Weak Stochastic Transitivity. This is represented as: LST $\implies$ SST $\implies$ WST ;
Since the Bradley–Terry models and Thurstone's Case V model^[8] are LST models, they also satisfy SST and WST;
Due to the convenience of more structured models, a few authors^[1]^[2]^[3]^[4]^[20]^[21] have identified axiomatic justifications of linear stochastic transitivity (and other models); most notably, Gérard Debreu showed that^[10] Quadruple Condition + Continuity $\implies$ LST (see also Debreu Theorems);
Two LST models given by invertible comparison functions $F(x)$ and $G(x)$ are equivalent if and only if $F(x)=G(\kappa x)$ for some $\kappa >0.$ ^[22]
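
The chain LST $\implies$ SST $\implies$ WST stated above can be checked directly. The following is a short sketch under the added assumption that $F$ is strictly increasing and satisfies $F(-x)=1-F(x)$, so that $F(0)={\tfrac {1}{2}}$; the shorthand $x$ and $y$ is introduced here for illustration only.

Let $x=\mu (a)-\mu (b)$ and $y=\mu (b)-\mu (c)$ , so that $\mu (a)-\mu (c)=x+y$ . If $F(x)\geq {\tfrac {1}{2}}$ and $F(y)\geq {\tfrac {1}{2}}$ , then $x\geq 0$ and $y\geq 0$ , hence $x+y\geq \max\{x,y\}$ and

$\mathbb {P} (a\succsim c)=F(x+y)\geq \max\{F(x),F(y)\}=\max\{\mathbb {P} (a\succsim b),\mathbb {P} (b\succsim c)\}$ ,

which is SST. The second implication is immediate: under the SST premises, $\max\{\mathbb {P} (a\succsim b),\mathbb {P} (b\succsim c)\}\geq {\tfrac {1}{2}}$ , so $\mathbb {P} (a\succsim c)\geq {\tfrac {1}{2}}$ , which is WST.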

Negative Results:

Stochastic transitivity models are empirically unverifiable;^[4] however, they may be falsifiable;
Distinguishing between LST comparison functions $F(x)$ and $G(x)$ can be impossible even if an infinite amount of data is provided over a finite number of points;^[23]
The estimation problems for WST, SST and LST models are in general NP-hard;^[24] however, near-optimal estimation procedures computable in polynomial time are known for SST and LST models.^[13]^[14]^[15]

References

[1] Fishburn, Peter C. (November 1973). "Binary choice probabilities: on the varieties of stochastic transitivity". Journal of Mathematical Psychology. 10 (4): 327–352. doi:10.1016/0022-2496(73)90021-7. ISSN 0022-2496.
[2] Clark, Stephen A. (March 1990). "A concept of stochastic transitivity for the random utility model". Journal of Mathematical Psychology. 34 (1): 95–108. doi:10.1016/0022-2496(90)90015-2.
[3] Ryan, Matthew (2017-01-21). "Uncertainty and binary stochastic choice". Economic Theory. 65 (3): 629–662. doi:10.1007/s00199-017-1033-4. ISSN 0938-2259. S2CID 125420775.
[4] Oliveira, I.F.D.; Zehavi, S.; Davidov, O. (August 2018). "Stochastic transitivity: Axioms and models". Journal of Mathematical Psychology. 85: 25–35. doi:10.1016/j.jmp.2018.06.002. ISSN 0022-2496.
[5] Donald Davidson and Jacob Marschak (Jul 1958). Experimental tests of a stochastic decision theory (PDF) (Technical Report). Stanford University.
[6] Michel Regenwetter and Jason Dana and Clintin P. Davis-Stober (2011). "Transitivity of Preferences" (PDF). Psychological Review. 118 (1): 42–56. doi:10.1037/a0021150. PMID 21244185.
[7] Bradley, Ralph Allan; Terry, Milton E. (December 1952). "Rank Analysis of Incomplete Block Designs: I. The Method of Paired Comparisons". Biometrika. 39 (3/4): 324. doi:10.2307/2334029. JSTOR 2334029.
[8] Thurstone, L. L. (1994). "A law of comparative judgment". Psychological Review. 101 (2): 266–270. doi:10.1037/0033-295X.101.2.266. ISSN 0033-295X.
[9] Luce, R. Duncan (Robert Duncan) (2005). Individual choice behavior: a theoretical analysis. Mineola, N.Y.: Dover Publications. ISBN 0486441369. OCLC 874031603.
[10] Debreu, Gerard (July 1958). "Stochastic Choice and Cardinal Utility" (PDF). Econometrica. 26 (3): 440–444. doi:10.2307/1907622. ISSN 0012-9682. JSTOR 1907622.
[11] Regenwetter, Michel; Dana, Jason; Davis-Stober, Clintin P. (2011). "Transitivity of preferences". Psychological Review. 118 (1): 42–56. doi:10.1037/a0021150. ISSN 1939-1471. PMID 21244185.
[12] Cavagnaro, Daniel R.; Davis-Stober, Clintin P. (2014). "Transitive in our preferences, but transitive in different ways: An analysis of choice variability". Decision. 1 (2): 102–122. doi:10.1037/dec0000011. ISSN 2325-9973.
[13] Shah, Nihar B.; Balakrishnan, Sivaraman; Guntuboyina, Adityanand; Wainwright, Martin J. (February 2017). "Stochastically Transitive Models for Pairwise Comparisons: Statistical and Computational Issues". IEEE Transactions on Information Theory. 63 (2): 934–959. arXiv:1510.05610. doi:10.1109/tit.2016.2634418. ISSN 0018-9448.
[14] Chatterjee, Sabyasachi; Mukherjee, Sumit (June 2019). "Estimation in Tournaments and Graphs Under Monotonicity Constraints". IEEE Transactions on Information Theory. 65 (6): 3525–3539. arXiv:1603.04556. doi:10.1109/tit.2019.2893911. ISSN 0018-9448. S2CID 54740089.
[15] Oliveira, Ivo F.D.; Ailon, Nir; Davidov, Ori (2018). "A New and Flexible Approach to the Analysis of Paired Comparison Data". Journal of Machine Learning Research. 19: 1–29.
[16] Israel, Robert B. (December 1981). "Stronger Players Need not Win More Knockout Tournaments". Journal of the American Statistical Association. 76 (376): 950–951. doi:10.2307/2287594. ISSN 0162-1459. JSTOR 2287594.
[17] Chen, Robert; Hwang, F. K. (December 1988). "Stronger players win more balanced knockout tournaments". Graphs and Combinatorics. 4 (1): 95–99. doi:10.1007/bf01864157. ISSN 0911-0119. S2CID 44602228.
[18] Adler, Ilan; Cao, Yang; Karp, Richard; Peköz, Erol A.; Ross, Sheldon M. (December 2017). "Random Knockout Tournaments". Operations Research. 65 (6): 1589–1596. arXiv:1612.04448. doi:10.1287/opre.2017.1657. ISSN 0030-364X. S2CID 1041539.
[19] Sen, Amartya (January 1977). "Social Choice Theory: A Re-Examination". Econometrica. 45 (1): 53–89. doi:10.2307/1913287. ISSN 0012-9682. JSTOR 1913287.
[20] Blavatskyy, Pavlo R. (2007). Stochastic utility theorem. Inst. for Empirical Research in Economics. OCLC 255736997.
[21] Dagsvik, John K. (October 2015). "Stochastic models for risky choices: A comparison of different axiomatizations". Journal of Mathematical Economics. 60: 81–88. doi:10.1016/j.jmateco.2015.06.013. ISSN 0304-4068.
[22] Yellott, John I. (April 1977). "The relationship between Luce's Choice Axiom, Thurstone's Theory of Comparative Judgment, and the double exponential distribution". Journal of Mathematical Psychology. 15 (2): 109–144. doi:10.1016/0022-2496(77)90026-8. ISSN 0022-2496.
[23] Rockwell, Christina; Yellott, John I. (February 1979). "A note on equivalent Thurstone models". Journal of Mathematical Psychology. 19 (1): 65–71. doi:10.1016/0022-2496(79)90006-3. ISSN 0022-2496.
[24] deCani, John S. (December 1969). "Maximum Likelihood Paired Comparison Ranking by Linear Programming". Biometrika. 56 (3): 537–545. doi:10.2307/2334661. ISSN 0006-3444. JSTOR 2334661.

