- Get link
- X
- Other Apps
Here's a short summary of the relationship between common models used in baseball, chess, psychology and education. The starting point for examining the connections between various extended models in these areas. The next steps include multiple attempts, guessing, ordinal and multinomial outcomes, uncertainty and volatility, multiple categories and interactions. There are also connections to standard optimization algorithms (neural nets, simulated annealing).
Baseball
Common in baseball and other sports, the log5 method provides an estimate for the probability p of participant 1 beating participant 2 given respective success probabilities p1,p2. Also let q∗=1−p∗ in the following. The log5 estimate of the outcome is then:
p=p1q2p1q2+q1p2=p1/q1p1/q1+p2/q2pq=p1q1⋅q2p2
The final form uses the odds ratio, pq. Additional factors can be easily chained using this form to provide more complex estimates. For example, let pe be an environmental factor, then:
pq=p1q1⋅q2p2⋅qepe
Chess
The most common rating system in chess is the Elo rating system. This has also been adopted for various other uses, e.g. hot or not'' websites. This system assigns ratings R1,R2 to players 1 and 2 such that the probability of player 1 beating player 2 is approximately:
p=eR1/CeR1/C+eR2/C
Here C is just a scaling factor (typically 400/ln10 ). The Elo rating is connected to log5 via setting eR/C=p/q. We then recover:
pq=eR/Cp=eR/C1+eR/CR=C⋅ln(p/q)
Note that p is also the probability of this player beating another player with Elo rating 0. The Elo system generally includes enhancements accounting for ties, first-move advantage and also an online algorithm for updating ratings. We'll revisit these features later.
Psychology
The Bradley-Terry-Luce (BTL) model is commonly used in psychology. Given two items, the probability p that item 1 is ranked over item 2 is approximately:
p=Q1Q1+Q2
In this context Q∗ typically reflects the amount of a certain quality. That this model is equivalent to the previous models is immediate:
Q=eR/C=p/qR=C⋅ln(Q)=C⋅ln(p/q)p=Q1+Q
Psychometrics
The dichotomous (two-response) Rasch and item response models are commonly used in psychometrics. For the Rasch model, let r1 represent a measurement of ability and r2 the difficulty of the test item. The Rasch model estimates the probability of correct response p as:
p=er1−r21+er1−r2
The one-parameter item response model estimates:
p=11+er2−r1
These are clearly equivalent to each other and to the previous models.
Baseball
Common in baseball and other sports, the log5 method provides an estimate for the probability p of participant 1 beating participant 2 given respective success probabilities p1,p2. Also let q∗=1−p∗ in the following. The log5 estimate of the outcome is then:
p=p1q2p1q2+q1p2=p1/q1p1/q1+p2/q2pq=p1q1⋅q2p2
The final form uses the odds ratio, pq. Additional factors can be easily chained using this form to provide more complex estimates. For example, let pe be an environmental factor, then:
pq=p1q1⋅q2p2⋅qepe
Chess
The most common rating system in chess is the Elo rating system. This has also been adopted for various other uses, e.g. hot or not'' websites. This system assigns ratings R1,R2 to players 1 and 2 such that the probability of player 1 beating player 2 is approximately:
p=eR1/CeR1/C+eR2/C
Here C is just a scaling factor (typically 400/ln10 ). The Elo rating is connected to log5 via setting eR/C=p/q. We then recover:
pq=eR/Cp=eR/C1+eR/CR=C⋅ln(p/q)
Note that p is also the probability of this player beating another player with Elo rating 0. The Elo system generally includes enhancements accounting for ties, first-move advantage and also an online algorithm for updating ratings. We'll revisit these features later.
Psychology
The Bradley-Terry-Luce (BTL) model is commonly used in psychology. Given two items, the probability p that item 1 is ranked over item 2 is approximately:
p=Q1Q1+Q2
In this context Q∗ typically reflects the amount of a certain quality. That this model is equivalent to the previous models is immediate:
Q=eR/C=p/qR=C⋅ln(Q)=C⋅ln(p/q)p=Q1+Q
Psychometrics
The dichotomous (two-response) Rasch and item response models are commonly used in psychometrics. For the Rasch model, let r1 represent a measurement of ability and r2 the difficulty of the test item. The Rasch model estimates the probability of correct response p as:
p=er1−r21+er1−r2
The one-parameter item response model estimates:
p=11+er2−r1
These are clearly equivalent to each other and to the previous models.
Comments
Post a Comment