- Get link
- X
- Other Apps
Let's say we have a game that can be reasonably modeled as two independent Poisson processes with team i having parameter λi. If one team wins in regulation with team i scoring ni, then it's well-known we have the MLE estimate ^λi=ni. But what if the game ends in a tie in regulation with each team scoring n goals and we have sudden-death overtime? How does this affect the MLE estimate for the winning and losing teams?
Assuming without loss of generality that team 1 is the winner in sudden-death overtime. As we have two independent Poisson processes, the probability of this occurring is λ1λ1+λ2. Thus, the overall likelihood we'd like to maximize is L=e−λ1λ1nn!e−λ2λ2nn!λ1λ1+λ2. Letting l=logL we get l=−λ1+nlogλ1−λ2+nlogλ2−2logn!+logλ1−log(λ1+λ2). This gives ∂l∂λ1=−1+nλ1+1λ1+1λ1+λ2∂l∂λ2=−1+nλ2+1λ1+λ2. Setting both partials equal to 0 and solving, we get (n−^λ1)(^λ1+^λ2)+^λ2=0(n−^λ2)(^λ1+^λ2)−^λ2=0, and so ^λ1=(n+1)2n2n+1^λ2=n2n2n+1. For example, if both teams score 3 goals in regulation and team 1 wins in sudden-death overtime, our MLE estimates are ^λ1=337,^λ2=247.
Intuitively this makes sense, because 2n goals were scored in regulation time, hence we "expect" that the overtime goal occurred around a fraction 12n of regulation, so team 1 scored n+1 goals in about 2n+12n regulation periods and team 2 scored n goals in about 2n+12n regulation periods. The standard Poisson process MLE estimates here coincide with the estimates we derived above.
Does this work in practice? Yes! I tested it on my NCAA men's lacrosse model, and it increased the out-of-sample testing accuracy by 0.5%. Surprisingly large for such a small change!
Assuming without loss of generality that team 1 is the winner in sudden-death overtime. As we have two independent Poisson processes, the probability of this occurring is λ1λ1+λ2. Thus, the overall likelihood we'd like to maximize is L=e−λ1λ1nn!e−λ2λ2nn!λ1λ1+λ2. Letting l=logL we get l=−λ1+nlogλ1−λ2+nlogλ2−2logn!+logλ1−log(λ1+λ2). This gives ∂l∂λ1=−1+nλ1+1λ1+1λ1+λ2∂l∂λ2=−1+nλ2+1λ1+λ2. Setting both partials equal to 0 and solving, we get (n−^λ1)(^λ1+^λ2)+^λ2=0(n−^λ2)(^λ1+^λ2)−^λ2=0, and so ^λ1=(n+1)2n2n+1^λ2=n2n2n+1. For example, if both teams score 3 goals in regulation and team 1 wins in sudden-death overtime, our MLE estimates are ^λ1=337,^λ2=247.
Intuitively this makes sense, because 2n goals were scored in regulation time, hence we "expect" that the overtime goal occurred around a fraction 12n of regulation, so team 1 scored n+1 goals in about 2n+12n regulation periods and team 2 scored n goals in about 2n+12n regulation periods. The standard Poisson process MLE estimates here coincide with the estimates we derived above.
Does this work in practice? Yes! I tested it on my NCAA men's lacrosse model, and it increased the out-of-sample testing accuracy by 0.5%. Surprisingly large for such a small change!
Comments
Post a Comment