February 20, 2025

Building Your Own Men's AFL Game Score Progression Simulator

February 20, 2025/ Tony Corke

In this (long) blog I’ll walk you through the concepts and R code behind the creation of a fairly simple score progression simulator.

(There’s a link for you to download the entire code yourself at the end of the blog.)

All we’ll be interested in are “events” - period starts, period ends, goals and behinds - and the algorithm will determine for us, given the event that’s just occurred, what the next event is, and how far away in time it will take place.

To be able to do that, the first thing we’re going to need is some data about the typical time between events based on historical games, which we can obtain using the Swiss Army knife of footy data, the fitzRoy() R package.

January 24, 2025

How Many Disposals Do You Need to Get the Umpires' Attention?

January 24, 2025/ Tony Corke

In the previous blog we investigated the nature of the relationships between various player metrics and the Coaches’ Votes attracted by those players.

Today we’ll be doing the same, but for umpires’ Brownlow Votes.

January 23, 2025

How Many Disposals Do You Need to Get the Coaches' Attention?

January 23, 2025/ Tony Corke

In the previous blog we investigated the differences between coaches and umpires in the player statistics they appear to take most notice of when casting their respective player-related votes.

We found some similarities (both are very influenced by disposal counts), and some differences (coaches are more influenced by whether the player is on the winning or losing team), but one thing we didn’t investigate was the specific nature of the relationships between individual player metrics and voting behaviour. For example, we know that disposals are an important metric in determining Brownlow and Coaches’ votes, but we don’t know exactly how the number of votes that a player receives varies as the disposal count changes.

January 13, 2025

Do Umpires and Coaches Notice Different Things In Assigning Player Votes?

January 13, 2025/ Tony Corke

At the conclusion of each game in the men’s AFL home and away season, umpires and coaches are asked to vote on who they saw as the best players in the game. Umpires assign 3 votes to the player they rate as best, 2 votes to the next best, 1 vote to the third best, and (implicitly) 0 votes to every other player. It is these votes that are used to award the Brownlow Medal at the end of the season.

Similarly, the coaches of both teams are asked to independently cast 5-4-3-2-1 votes for the players they see as the five best, meaning that each player can end up with anywhere between 0 and 10 Coaches’ votes.

The question for today is: to what extent can available player game statistics data tell us whether and how coaches and umpires differ in how they arrive at their votes.

(Note that we’ll not be getting into the issue of individual umpire or coach quirks, snubs, or biases, and instead be looking at the data across all voting umpires and coaches.)

January 04, 2025

Measuring Strength of Schedule in Terms of Expected Wins

January 04, 2025/ Tony Corke

A few weeks back I analysed the men’s 2025 AFL schedule with a view to determining which teams had secured relatively easier overall fixtures, and which had secured relatively more difficult overall fixtures.

We investigated various approaches there and reached some conclusions about relative team fixture difficulty, but none of the methods provided an intuitive way to interpret the outputs.

On a related note, this week I had a kind email from a reader who suggested that there might be an opportunity to continuously update teams’ ‘fixture difficulty rating’ (which is just another term for strength of schedule) during the season, as this service was frequently provided by various fantasy leagues for English Football and other sports.

All of which got me to revisiting my strength of schedule methodology.

January 01, 2025

Simulation Replicates and Returns to a Perfect Model

January 01, 2025/ Tony Corke

The Situation

We’ve built a model designed to estimate the probability of a binary event (say, for example, the probability that the home team wins on the line market in the AFL).

It’s a good model - very good, in fact, because it is perfectly calibrated. In other words, when the true probability of an event is X% it’s average estimate of the probability of that event is X%.

Those probability estimates, however, are the result of running some simulation replicates with a stochastic element, which means that those estimates will diverge from X% to an extent determined by how many replicates we run.

December 06, 2024

An Analysis of Strength of Schedule for the Men's 2025 AFL Season

December 06, 2024/ Tony Corke

The men’s AFL fixture for 2025 was recently released and, as is tradition here, we’ll analyse it to see which teams we think have an easier or harder schedule.

December 01, 2024

We Need to Talk About MoSHBODS ...

December 01, 2024/ Tony Corke

Last year’s men’s seasons results for MoSHBODS and MoSSBODS - as forecasters and as opinion-sources for wagering - were at odds with what had gone before.

Other analyses have suggested that the MoS twins might have been a bit unlucky in the extent to which 2024 was different from bookmaker expectations, and I’ve never been one for knee-jerk reactions to single events, but the performance has nonetheless made me think more deeply about the algorithms underpinning the two Rating Systems, more details on which were provided in this blog from 2020, and from the blogs to which it links.

October 12, 2024

What if Squiggle Used xScore?

October 12, 2024/ Tony Corke

Over the past few blogs (here and here) I’ve been investigating different methods for untangling skill from luck in forecasting game margins and, in this blog, we’ll try another approach, this time using what are called xScores.

One source of randomness in the AFL is how well a team converts the scoring opportunities it creates into goals versus behinds. Given enough data, analysts far cleverer than I can estimate how often a shot of a particular type taken from a particular point of the field under particular conditions should result in a goal, a behind, or no score at all.

So, we can adjust for that randomness in conversion by replacing the result of every scoring opportunity by the average score that we would expect an average player to generate from that opportunity given its specific characteristics. By summing the expected score associated with every scoring opportunity for a team in a given game we can come up with an expected score, or xScore, for that team.

For this blog, I’ll be using the xScores created by Twitter’s @AFLxScore for the years 2017 to 2020, and those created by Twitter’s @WheeloRatings for the years 2021 to 2024.

Let’s look firstly at the season-by-season Squiggle results of using, as a game’s margin, the xScore margin instead of the actual margin.

October 11, 2024

Squiggle Performances Revisited: Alternative Sources of Truth

October 11, 2024/ Tony Corke

In the previous blog, I compared Squiggle forecasters’ actual margin prediction MAE results with a distribution of potential MAE outcomes from the same forecasts across 10,000 simulated 2024 season as one way of untangling the skill and luck portions of those actual results.

Those simulations require us to select “ground truth” for the underlying expected margin in each game. In the previous blog we used bookmaker data with an added random component of a Normal variable with mean 0 and standard deviation 8 as that ground truth.

October 09, 2024

Eight Years of Squiggle Performance

October 09, 2024/ Tony Corke

The Squiggle website is a place where forecasters can post their forecasts for the winning team and winning margin, and provide probability estimates for upcoming games of men’s AFL football, and see how well or otherwise they perform relative to other forecasters. The only criteria for posting there is that the forecasters must have a history of performing “reasonably” well, and must not include any human-related inputs such as bookmaker prices in their models.

It’s been running since 2017 and, since 2018, has included a derived forecater, named s10, which is a weighted average of the 10 best Squiggle models, based on mean absolute margin error, from the previous season. The MoS model had been included in s10 in every year from 2018 to 2024, but will be absent in 2025 due to a relatively lowly 22nd place finish.

In this blog, among other things, I want to get a sense of the extent to which that apparently below-average performance might be attributed to skill versus luck.

July 11, 2024

Is Favourite-Longshot Bias Evident in Bookmaker Data for the AFL?

July 11, 2024/ Tony Corke

More than once here on the MoS website we’ve looked at the topic of favourite-longshot bias (FLB), which asserts that bookmakers apply a higher profit margin to the prices of underdogs than they do to favourites. In one MoS piece (15 years ago!) I had more of a cursory look and found some evidence for FLB using 2006 to 2008 data, and, in another piece, a few years later I had a more detailed look and found only weak to moderate evidence using opening TAB data from 2006 to 2010.

At this point I think it’s fair to say that the jury is still out on FLB’s existence, and waiting for more convincing evidence either way (and very unhappy at having been sequestered for 13 years in the meantime).

July 09, 2024

Removing Vig from Bookmaker Prices

July 09, 2024/ Tony Corke

Bookmakers, love them or lose to them, are good at their basic job, which is accurately estimating the probability of outcomes, and they give clues about their probability estimates in the prices they set. The problem is, those clues are cloaked in profit.

July 08, 2024

The Relationship Between Expected Victory Margins and Estimated Win Probabilities

July 08, 2024/ Tony Corke

There are no doubt a number of viable ways of doing this, but one obvious approach is to fit a logistic equation of the form shown at right.

This provides an S-shaped mapping where estimated win probabilities respond most to changes in expected margins when those margins are near zero. It also ensures that all estimated probabilities lie between 0 and 1, which they must.

I’ve used this form of mapping for many years with values of k in the 0.04 to 0.05 range, and have found it to be very serviceable. I’ve also previously fitted it to bookmaker data and found that it generally provides an excellent fit.

May 19, 2024

Are V/AFL Scores (Still) Like Snowflakes?

May 19, 2024/ Tony Corke

Almost 10 years ago I wrote a blog that, among other things, noted that the score progressions - the goals.behinds numbers at the end of each quarter for both teams - were unique for every game ever played, regardless of the order in which you considered the two teams’ score progressions, home first then away, or away first and then home, choosing at random for every game. At that point, the statement was true for 14,490 games.

It seemed pretty startling then but, as of the end of 2024’s Round 9, the statement is STILL true, and that’s now for 16,487 games. V/AFL games remain as snowflake-like as ever.

February 11, 2024

From One Season to the Next

February 11, 2024/ Tony Corke

With the 2024 Men’s AFL season just weeks away, I thought it timely to look at different perspectives of how teams have historically performed in home and away seasons from one season to the next.

November 24, 2023

An Extra Slice of An Analysis of Strength of Schedule for the Men's 2024 AFL Season

November 24, 2023/ Tony Corke

I was thinking about the Strength of Schedule metric used in this blog from yesterday, and it struck me that, rather than using the raw values of the opponent team’s MoSHBODS rating and (for some metrics) the net Venue Performance Values (VPVs) for a game, we could, instead, convert these numbers into a win probability, which might make the resulting aggregate Strength of Schedule value more readilly interpretable.

November 23, 2023

An Analysis of Strength of Schedule for the Men's 2024 AFL Season

November 23, 2023/ Tony Corke

The men’s AFL fixture for 2024 was recently released and, once again this year, we’ll analyse it to see what it means for all 18 teams.

December 14, 2022

An Analysis of Strength of Schedule for the Men's 2023 AFL Season

December 14, 2022/ Tony Corke

The men’s AFL fixture for 2023 was released earlier this week, and tradition requires that the MoS website publishes its assessment of which teams fared best and which worst in that fixture given what the MoS models think about relative team strengths and venue effects.

May 19, 2022

Goals and Behinds: How Correlated are They?

May 19, 2022/ Tony Corke

This week I’ve been investigating the use of the Skellam Distribution in modelling AFL scores. That distribution can be derived as the difference between two, correlated, Poisson variables, which potentially makes it useful in an AFL context for modelling differences between team metrics.

Statistical Analyses