Grand Slam Statistics
Rod
Cross June
2009
Each match played at each Grand Slam
tournament has a Match Statistics summary on the web site, similar to the one shown
here for the 2009 Australian Open menÕs singles final. The numbers contain
valuable information for both players and coaches. The numbers are also useful
for statistics calculations.
As shown by the Nadal vs Federer example,
the winner is not necessarily the one who won the most points or the most
games. The winner is the one who won the important points and the important
games and ended up with the most sets. In the long run, it makes no difference
in a 5-set match if the player loses one of those sets 6-7 or 2-6, although it
might temporarily dent his confidence to lose a set 0-6. Similarly, it makes no difference
(other than to pride and confidence level) if the player loses a game to love
or loses it after a few deuce points.

Definition of terms
1st Serve % = fraction of first serves that
went in (the other first serves
being a fault). Nadal served 175 first serves. 112 of those or 64% were good
and the other 63 were faults.
Winning % on 1st Serve = fraction of good
first serves won by the server. Nadal won 66% of his good 1st serves, or 74 out
of 112.
Winning % on 2nd Serve = fraction of second
serves won by the server (some of
those serves being lost due to a double fault). Nadal won 48% or 30 of his 63
2nd serves.
Nadal won 74 + 30 = 104 points while he was
serving, out of a possible 175, so he lost 71 points while he was serving.
ThatÕs why Federer won 71 out of 175 points when he was receiving.
Similarly, Nadal won 69 out of 172 points
while he was receiving, so he won 104 + 69 = 173 points altogether.
Nadal had 16 opportunities to break
FedererÕs serve, and was successful on 7 occasions. He obviously tried hard to
win those points and was successful 44% of the time. When he was receiving, he
won only 40% of the time on average.
On the other hand, Federer won 41% of the points when he was receiving,
but he won only 32% of the time when he had a break point opportunity. Either
he wasnÕt trying hard enough, or he tried too hard and hit the ball out too
often or Nadal decided that he wanted the point more than Federer wanted it.
Whatever the case, Federer lost the match because he was unable to convert
enough break points and because Nadal converted too many. A whole match can
depend, in retrospect, on just one critical break point.
Probability
of winning a point
The outcome of a match, in a statistical
sense, depends to a large extent on the probability, p, that the server wins a point when he
(or she) is serving. The formula for p is
p = fw + (1 - f) s
where f = 1st Serve %, w = Winning % on 1st
Serve and s = Winning % on 2nd
Serve
For Nadal, f = 0.64, w = 0.66 and s = 0.48 so
p(Nadal) = 0.64 x 0.66 + 0.36 x 0.48 = 0.595. For Federer, f = 0.52, w = 0.73
and s = 0.45 so p(Federer) = 0.52 x 0.73 + 0.48 x 0.45 = 0.596. Both players
had an essentially equal chance of winning a point when they were serving, so
that had an equal chance of winning (or losing) a game when they were serving.
Federer had a bad serving day. He won 73% of his first serves when they went
in, but he got only 52% of them in.
For any given match, the point probability,
p, of the server winning a point can also be calculated from the total number
of service points won by each player. For example, suppose that player A wins
W1 of his N1 good first serves
during the whole match and also wins W2 of his N2 second serves. Then
p(A)
= (W1 + W2)/(N1 + N2)
f =
N1/(N1 + N2), w
= W1/N1 and s = W2/N2.
Probability
of winning a game
How many times did Nadal step up to serve,
and how many times did he win the game when he was serving? We can work this
out from the Table. The total number of games in the match, from the set score,
was 50, not counting the tie-break game. So they each stepped up to serve 25
times. Federer broke NadalÕs serve 6 times (the 6 conversions) and Nadal broke
FedererÕs serve 7 times (his 7 conversions). Nadal won 25 games in total (not
counting the tie break game), so he won 18 of his own service games plus the 7
games he won when he broke FedererÕs serve. Federer also won 25 games, so he won 19 of his own service
games plus the 6 he won when he broke NadalÕs serve. NadalÕs chance of winning
one of his own service games was therefore 18/25 = 0.72, while FedererÕs chance
was 19/25 = 0.76. Nadal won the same number of games as Federer by winning the
tie-break game.
Grand
Slam Parameters
Every match is described by a different set
of parameters for each winner and loser. I looked at each of four Grand Slam
tournaments (in 2008 and 2009) and at the last 31 matches in each tournament
from Round 3 to the final. The average values for each tournament, for both men
and women singles, are shown in Table 1, where A = winner and B = loser of each
match.

g = probability of server winning his or her serve game.
r
= 1 - p = probability of
server losing a point when serving = probability of the receiver winning the
point.
C = Break Point Conversion % = fraction or
percentage of break point opportunities won (converted) by the receiver.
TB = fraction of sets that end in a tie
break.
Table 1 lists average quantities, averaged over the N winners
and the N losers of each match. For women, N was 31 for the French Open and
Wimbledon, 28 at the Australian Open
(since three of the matches were not completed) and 30 at the US Open since the
statistics were not correctly recorded in one match.
Some observations from this table are as follows:
The 1st Serve % (f) is about
0.62 at all tournaments for both winners and losers, for both men and women.
Men at the Australian Open averaged 0.67 in 2009.
Winning women players win about 70% of
their good 1st serves at all 4 tournaments (wA ~ 0.70), while losing players
win only about 58% of their good 1st serves (wB ~ 0.58). Men do better, with wA
~ 0.78 and wB ~ 0.66.
Winning women players win about 50% of
their 2nd serves at all 4 tournaments (sA ~ 0.50), while losing players win
only about 40% of their 2nd serves (sB ~ 0.40). Men do better, with sA ~ 0.57
and sB ~ 0.47.
The probability of winning a point when
serving is about 0.62 for women winners at all 4 tournaments (pA ~ 0.62), and
about 0.50 for losers. Men do better, with pA ~ 0.70 and pB ~ 0.58.
The probability of the server winning a
game (when serving) is about 0.80 for women winners at all 4 tournaments (gA ~
0.80), and about 0.50 for losers, although both figures are higher at Wimbledon
and lower at the French Open, especially for the loser. The losing woman player
can expect to lose serve more than half the time at the French Open, and about
half the time at the Australian and US Open. For men, gA ~ 0.90 and gB ~ 0.68.
The serve is more dominant in the men's game.
The winning woman player can expect to win
about half the points when receiving serve (rA ~ 0.5), but the losing player
wins only about 37% of the points on average when receiving (rB ~ 0.37). For
men, rA ~ 0.40 and rB ~ 0.30, with the result that there are fewer break point
opportunities.
The chance of converting a break point is
about the same as winning any other point when receiving, when averaged over
all players, and is especially difficult for the losing player at Wimbledon
where CB = 0.285 for women and CB = 0.225 for men. In any individual match, C
can be quite different to r. A break point is a critical point. Some players
handle it better than others and actually win a greater fraction of their break
points than any other point, while some players wilt under the pressure (or are
just unlucky) and are less successful when they face a break point.
Probability
of winning a game.
Since p = probability of winning a
point, p x p is the probability of
winning two points in a row when serving, 1-p = probability of losing a point,
and (1-p) x (1- p) is the probability of losing two points in a row. For
example, if p = 0.6, then the server can expect to win two points in a row 36%
of the time, he can expect to lose two points in a row 16% of the time, and he
can expect to win then lose a point or lose then win a point 2 x 0.6 x 0.4 = 48% of the time. The
chance of winning 4 points in a row is 0.6 x 0.6 x 0.6 x 0.6 = 0.13, so the
server can expect to win a game to love 13 % of the time, and can expect to
lose a game to love 2.6% of the time.
Extending this process to consider all possible
outcomes, we can work out the probability, g, of winning a game, and the answer (including deuce games) is

A graph of g vs p follows, showing that if
p = 0.5 then g = 0.5 and that the probability of the server winning the game
increases rapidly with only a small increase in p. For example, if p = 0.7 then
g = 0.90. The probability of the server winning a point depends on how well he
or she is serving and on the ability of the opponent.

Knowing the probability that player A wins
his service game, and the probability that player B wins his service game, we
can calculate the most likely outcome of the set and the probability that
player A wins 6-0 or 6-1 etc or loses by these various set scores. Some results
are shown below. There are some surprises. For example if pA = 0.6 and pB = 0.6
then both players are evenly matched. The various possible outcomes are shown
by the blue points, assuming that player A serves first. It is then more likely
that player A will win 6-1 than lose 1-6, and it is more likely that he will
lose 4-6 than win 6-4. The reason is that a 6-4 result can be obtained if
player A loses none of his serves or loses up to 4 of his serves, while a 4-6
result can be obtained if player A loses up to 5 of his serves (while B wins 1
serve and loses 4). That is, there are more ways to lose than to win.
If pA = 0.6 and pB = 0.7 then the most
likely result is that A loses 4-6, but there is also a small chance that A will
win the set. At least, that is the case if every point is won or lost with the
same probability. If player A absolutely refuses to concede a break point, or
if player B is unable to break serve, then player A will win the set. Table 1
shows that the probability of winning a break point is not necessarily the same
as winning other points. In some matches, C can be double or half the value of
r, meaning that the player wins or loses twice as many break points as would
normally be expected.
