mscroggs.co.uk
mscroggs.co.uk

subscribe

Blog

A 20,000-to-1 baby?

 2018-03-23 
This morning, I heard about Arnie Ellis on the Today programme. Arnie is the first baby boy to be born in his family in five generations, following ten girls. According to John Humphrys, there is a 20,000-to-1 chance of this happening. Pretty quickly, I started wondering where this number came from.
After a quick Google, I found that this news story had appeared in many of today's papers, including the Sun and the Daily Mail. They all featured this 20,000-to-1 figure, which according to The Sun originally came from Ladbrokes.

What is the chance of this happening?

If someone is having a child, the probability of it being a girl is 0.5. The probability of it being a boy is also 0.5. So the probaility of having ten girls followed by a boy is
$$\left(\tfrac12\right)^{10}\times\tfrac12=\frac1{2048}.$$
If all 11 children were siblings, then this would be the chance of this happening—and it's a long way off the 20,000-to-1. But in Arnie's case, the situation is different. Luckily in the Daily Mail article, there is an outline of Arnie's family tree.
Here, you can see that the ten girls are spread over five generations. So the question becomes: given a baby, what is the probability that the child is male and his most recently born ten relatives on their mother's side are all female?
Four of the ten relatives are certainly female—Arnie's mother, grandmother, great grandmother and great great grandmother are all definitely female. This only leaves six more relatives, so the probability of a baby being in Arnie's position is
$$\left(\tfrac12\right)^{6}\times\tfrac12=\frac1{128}.$$
This is now an awful lot lower than the 20,000-to-1 we were told. In fact, with around 700,000 births in the UK each year, we'd expect over 5,000 babies to be born in this situation every year. Maybe Arnie's not so rare after all.
This number is based on the assumption that the baby's last ten relatives are spread across five generations. But the probability will be different if the relatives are spread over a different number of generations. Calculating the probability for a baby with any arrangement of ancestors would require knowing the likelihood of each arrangement of relatives, which would require a lot of data that probably doesn't exist. But the actual anwer is probably not too far from 127-to-1.

Where did 20,000-to-1 come from?

This morning, I emailed Ladbrokes to see if they could shed any light on the 20,000-to-1 figure. They haven't got back to me yet. (Although they did accidentally CC me when sending the query on to someone who might know the answer, so I'm hopeful.) I'll update this post with an explanaation if I do hear back.
Until then, there is one possible explanation for the figure: we have looked at the probability that a baby will be in this situation, but we could instead have started at the top of the family tree and looked at the probability that Beryl's next ten decendents were girls followed by a boy. The probability of this happening will be lower, as there is a reasonable chance that Beryl could have no female children, or no children at all. Looking at the problem this way, there are more ways for the situation to not happen, so the probability of it happening is lower.
But working the actually probability out in this way would again require data about how many children are likely in each generation, and would be a complicated calculation. It seems unlikely that this is what Ladbrokes did. Let's hope they shed some light on it...
                        
(Click on one of these icons to react to this blog post)

You might also enjoy...

Comments

Comments in green were written by me. Comments in blue were not written by me.
@Steve Spivey: Nothing
Matthew
                 Reply
Any response from Ladbrokes yet?
Steve Spivey
                 Reply
 Add a Comment 


I will only use your email address to reply to your comment (if a reply is needed).

Allowed HTML tags: <br> <a> <small> <b> <i> <s> <sup> <sub> <u> <spoiler> <ul> <ol> <li> <logo>
To prove you are not a spam bot, please type "t" then "h" then "e" then "o" then "r" then "e" then "m" in the box below (case sensitive):

Archive

Show me a random blog post
 2026 

May 2026

World Cup stickers 2026

Apr 2026

A new puzzle every day
Mixing Wordle with other games

Feb 2026

Christmas (2025) is over
 2025 

Dec 2025

Christmas card 2025

Nov 2025

Christmas (2025) is coming!

Sep 2025

The partridge puzzle

Aug 2025

TMiP 2025 puzzle hunt

Jun 2025

A nonogram alphabet

Mar 2025

How to write a crossnumber

Jan 2025

Christmas (2024) is over
Friendly squares
 2024 

Dec 2024

A regular expression Christmas puzzle
Christmas card 2024

Nov 2024

Christmas (2024) is coming!

Feb 2024

Zines, pt. 2

Jan 2024

Christmas (2023) is over
 2023 
▼ show ▼
 2022 
▼ show ▼
 2021 
▼ show ▼
 2020 
▼ show ▼
 2019 
▼ show ▼
 2018 
▼ show ▼
 2017 
▼ show ▼
 2016 
▼ show ▼
 2015 
▼ show ▼
 2014 
▼ show ▼
 2013 
▼ show ▼
 2012 
▼ show ▼

Tags

polynomials propositional calculus warwick preconditioning stirling numbers games light christmas card matrix multiplication pythagoras folding paper binary fonts big internet math-off european cup tennis guest posts pi approximation day weak imposition chalkdust magazine squares crosswords coins determinants rugby errors talking maths in public hats approximation world cup wordle royal institution arrangement puzzles arithmetic draughts friendly squares bluesky braiding 24 hour maths pizza cutting python books probability wave scattering go inline code databet matrices graphs bots data crossnumber fence posts curvature kenilworth geogebra mean exponential growth harriss spiral a gamut of games latex cross stitch simultaneous equations numerical analysis asteroids gerry anderson quadrilaterals advent calendar logic live stream matrix of minors nonograms royal baby electromagnetic field frobel boundary element methods data visualisation dates newcastle bempp bubble bobble convergence map projections cambridge reuleaux polygons captain scarlet dinosaurs sound correlation triangles hexapawn inverse matrices estimation manchester science festival football ucl zines programming mathsjam php accuracy pi weather station folding tube maps signorini conditions logo craft fractals puzzles pokémon wordle matrix of cofactors countdown finite group runge's phenomenon crossnumbers menace alphabets mathslogicbot datasaurus dozen news palindromes kings golden ratio rhombicuboctahedron logs hyperbolic surfaces matt parker sorting london flexagons national lottery standard deviation recursion plastic ratio crochet game show probability video games trigonometry gather town regular expressions javascript christmas anscombe's quartet the aperiodical dataset tetris pac-man statistics phd oeis realhats youtube machine learning gaussian elimination geometry chebyshev edinburgh platonic solids mathsteroids sobolev spaces london underground bodmas computational complexity pascal's triangle people maths error bars hannah fry wool chess interpolation sport reddit rust turtles tmip raspberry pi martin gardner game of life pokémon coventry golden spiral final fantasy noughts and crosses misleading statistics manchester ternary dragon curves nine men's morris finite element method partridge puzzle numbers stickers speed graph theory thirteen radio 4

Archive

Show me a random blog post
▼ show ▼
© Matthew Scroggs 2012–2026