In 1961, famed astronomer and astrophysicist Frank Drake formulated an equation for estimating the number of extraterrestrial civilizations in our galaxy at any given time. Known as the “Drake Equation“, this formula was a probabilistic argument meant to establish some context for the Search for Extraterrestrial Intelligence (SETI). Of course, the equation was theoretical in nature and most of its variables are still not well-constrained.

For instance, while astronomers today can speak with confidence about the rate at which new stars form, and the likely number of stars that have exoplanets, they can’t begin to say how many of these planets are likely to support life. Luckily, Professor David Kipping of Columbia University recently performed a statistical analysis that indicates that a Universe teeming with life is “the favored bet.”

History has proved time and again that mathematical modelling is no substitute for a telescope (or other data collection device). Nonetheless, some theoreticians have recently put forward a statistical analysis which suggests that life is probably very rare in the universe – despite the apparent prevalence of habitable-zone exoplanets, being found by the Kepler mission and other exoplanet search techniques.

You would be right to be skeptical, given the Bayesian analysis undertaken is based on our singular experience of abiogenesis – being the origin of life from non-life, here on Earth. Indeed, the seemingly rapid abiogenesis that occurred on Earth soon after its formation is suggested to be the clinching proof that abiogenesis on habitable-zone exoplanets must be rare. Hmm…

Bayes theorem provides a basis for estimating the likelihood that a prior assumption or hypothesis (e.g. that abiogenesis is common on habitable-zone exoplanets) is correct, using whatever evidence is available. Its usage is nicely demonstrated in solving the Monty Hall problem.

Go here for the detail, but in a nutshell: There are three doors, one with a car behind it and the other two have goats. You announce which door you will pick – knowing that it carries a 1/3 probability of hiding the car. Then Monty Hall, who knows where the car is, opens another door to reveal a goat. So, now you know that door always had a zero probability of hiding the car. So, the likelihood of the remaining door hiding the car carries the remaining 2/3 probability of the system, since there was always an absolute 1/1 probability that the car was behind one of the three doors. So, it makes more sense for you to open that remaining door, instead of the first one you picked.

In this story, Monty Hall opening the door with a goat represents new data. It doesn’t allow you to definitively determine where the car is, but it does allow you to recalculate the likelihood that your prior hypothesis (that the car is behind the first door you picked) is correct.

Applying Bayesian analysis to the problem of abiogenesis on habitable-zone exoplanets is a bit of a stretch. Speigel and Turner argue that the evidence we have available to us – that life began quite soon after the Earth became habitable – contributes nothing to estimating the likelihood that life arises routinely on habitable-zone exoplanets.

They remind us that we need to acknowledge the anthropic nature of the observation we are making. We are here after 3.5 billion years of evolution – which has given us the capacity to gather together the evidence that life began here 3.5 billion years ago, shortly after the Earth first became habitable. But that is only because this is how things unfolded here on Earth. In the absence of more data, the apparent rapidity of abiogenesis here on Earth could just be a fluke.

This is a fair point, but a largely philosophical one. It informs the subsequent six pages of Spiegel and Turner’s Bayesian analysis, but it is not a conclusion of that analysis.

The authors seek to remind us that interviewing one person and finding that she or he likes baked beans does not allow us to conclude that all people like baked beans. Yes agree, but that’s just statistics – it’s not really Bayesian statistics.

If we are ever able to closely study an exoplanet that has been in a habitable state for 3.5 billion years and discover that either it has life, or that it does not – that will be equivalent to Monty Hall opening another door.

But for now, we might just be a fluke… or we might not be. What we need is more data.