FanPost

In depth analysis of advanced stats – part 1

Over the last few months I have seen quite a few discussions on whether or not the Islanders is a mirage. Some say good possession numbers equal a good team and those same people often say PDO is a measurement of luck. I honestly didn't know what to think. This is why I decided to take a deeper look.

I may not have been an advanced stats expert going in to this analysis, but I am pretty good with numbers, and I really find the conclusions interesting, which is why I have decided to post it.

Be aware that all graphs have (,) instead of (.). That's not a mistake, that's just the Danish way to do it.

I am not going to look at individual stats. I simply want to look at what defines a good team in terms of statistics. In order to get the best possible result, I have decided to look at every game played in the NHL going back to the season 2007/2008 and up until January 9th 2019 (13,730 games). That's a very large dataset, so the trends you will see should be well documented.

I'm looking at all strengths (not just 5v5) as I want to see what defines a great team in all situations.

So let us start by looking at records in this time frame:

Team

GP

W

L

OTL

ROW

Points

Point %

Vegas Golden Knights

128

78

39

11

73

167

0.652

Washington Capitals

910

532

274

103

481

1167

0.641

Pittsburgh Penguins

911

538

285

88

473

1164

0.639

San Jose Sharks

913

522

285

105

456

1149

0.629

Boston Bruins

911

512

288

111

460

1135

0.623

Chicago Blackhawks

913

502

301

110

444

1114

0.610

Anaheim Ducks

912

499

304

107

442

1105

0.606

St Louis Blues

909

489

323

97

432

1075

0.591

Nashville Predators

913

483

317

113

432

1079

0.591

New York Rangers

910

486

330

92

426

1064

0.585

Detroit Red Wings

913

469

315

128

411

1066

0.584

Philadelphia Flyers

911

456

331

123

417

1035

0.568

Montreal Canadiens

912

462

337

112

409

1036

0.568

Minnesota Wild

910

463

344

100

411

1026

0.564

Tampa Bay Lightning

911

461

348

101

416

1023

0.561

Winnipeg Jets

582

295

224

63

263

653

0.561

Vancouver Canucks

913

460

350

103

409

1023

0.560

Los Angeles Kings

912

460

351

100

405

1020

0.559

Dallas Stars

912

453

353

105

406

1011

0.554

New Jersey Devils

910

443

358

109

390

995

0.547

Calgary Flames

913

449

366

97

409

995

0.545

Columbus Blue Jackets

910

432

376

102

379

966

0.531

Ottawa Senators

912

426

372

113

375

965

0.529

Carolina Hurricanes

910

408

379

122

375

938

0.515

Toronto Maple Leafs

910

410

384

115

360

935

0.514

Florida Panthers

908

404

377

124

344

932

0.513

New York Islanders

909

407

387

114

349

928

0.510

Colorado Avalanche

912

420

404

87

358

927

0.508

Arizona Coyotes

910

403

395

111

344

917

0.504

Buffalo Sabres

911

388

406

117

330

893

0.490

Atlanta Thrashers

328

138

151

38

113

314

0.479

Edmonton Oilers

911

369

441

101

308

839

0.460

Not much to say about that. It's been some rough years as an Islanders fan. They are ranked no. 27 out of 32 teams (I have counted Atlanta and Winnipeg as two separate teams). It should be noted that in all the graphs below I have not included Vegas, as the dataset is too small. There's only 3 teams under .500 and the league average point% is .558.

Now we will dig a little bit deeper and look at each team's corsi numbers (ratio between shot attempts taken and shot attempts allowed) and PDO (Shooting% + Save%). There has been taken 828669 shots and the average Shooting% is 9.108 meaning the average Save% is 90.892. The goalkeepers average Save% is higher than this, since empty net goals are accounted for in the teams Save% and PDO.

The teams are in order of CF% and again we find the Islanders in the bottom part.

Team

CF%

SH%

SV%

PDO

San Jose Sharks

56.077

8.897

91.079

0.9998

Detroit Red Wings

55.867

8.933

90.666

0.9960

Vegas Golden Knights

55.710

9.674

91.013

1.0069

Chicago Blackhawks

55.170

9.286

90.944

1.0023

Los Angeles Kings

54.969

8.576

91.125

0.9970

Boston Bruins

54.117

9.087

91.850

1.0094

Carolina Hurricanes

53.651

8.429

90.381

0.9881

Pittsburgh Penguins

51.734

9.726

91.110

1.0084

St Louis Blues

51.440

9.232

90.884

1.0012

Washington Capitals

51.111

10.035

91.220

1.0126

New Jersey Devils

50.549

8.666

90.907

0.9957

Philadelphia Flyers

50.543

9.334

90.707

1.0004

Nashville Predators

50.484

9.164

91.159

1.0032

Calgary Flames

50.449

9.378

90.379

0.9976

Winnipeg Jets

50.385

9.411

90.693

1.0010

Dallas Stars

50.316

9.485

90.449

0.9993

Vancouver Canucks

50.093

9.165

91.179

1.0034

New York Rangers

49.662

8.953

91.477

1.0043

Ottawa Senators

49.409

8.993

90.761

0.9975

Columbus Blue Jackets

49.208

8.946

90.719

0.9967

Tampa Bay Lightning

49.135

9.881

90.399

1.0028

Montreal Canadiens

48.915

9.035

91.316

1.0035

Toronto Maple Leafs

48.693

9.288

90.441

0.9973

Florida Panthers

48.442

8.405

91.145

0.9955

Anaheim Ducks

48.132

9.312

91.364

1.0068

Arizona Coyotes

47.322

8.437

91.074

0.9951

New York Islanders

47.188

8.955

90.305

0.9926

Minnesota Wild

46.794

9.306

91.248

1.0055

Edmonton Oilers

44.973

9.095

90.315

0.9941

Buffalo Sabres

44.944

8.573

91.108

0.9968

Colorado Avalanche

44.538

9.104

90.601

0.9971

Atlanta Thrashers

44.130

9.615

90.231

0.9985

Now it starts to get interesting, because now we can test if possession numbers actually equal winning. So I've made a graph with point% vertical and CF% horizontal. If there's a direct correlation you should see a straight line (perfect correlation means R2 = 1).

1

The good corsi teams tend to be winning more, but there's definitely more to it than that. How is it then with PDO? Does that correlate with winning?

2

Clearly it's a good thing to have a high PDO. It will win you matches, but it's not exactly a perfect match either. The fact that good goaltending and good goalscoring wins you hockey games is probably not a revolutionairy discovery though.

So it's obviously a good thing to be high in both PDO and CF%. That is a difficult task though, since the two in many cases work against each other. If you take a lot of shots from the outside, then you will have good corsi numbers but a bad PDO. If you only take quality chances, then it will be the other way round.

The next graph shows PDO as a function of corsi, so you will want to be in the upper right corner. I've marked the great teams (above .600) with orange.

3

Up until now we have treated all shot attempts as equal, but what happens if we look at the shots in terms of danger. Natural Stat Trick divides shot attempts in to 3 categories -€” high danger, medium danger and low danger. On average it requires 7.10 high danger shot attempts to score one goal, it requires 21.35 medium danger shot attempts to score a goal and it requires 50.80 low danger shot attempts to score one goal.

Knowing this we can calculate the expected goal differential (xGDF) based on the amount of high-, medium- and low danger shot attempts for and against. It turns out, that even if we do this, there's no great correlation between the expected goal differential and the teams quality (P%).

4

Just in case you were wondering, there should be a near perfect correlation between goal differential and winning. The next graph shows just that. It's point% as a function of actual goal differential per game. The top teams outscore their opponent by more than 0.4 goals per game.

5

So even if you account for shot danger, shot attempts (corsi) is not a good way to evaluate the quality of a team. There's clearly more to it than just outshooting your opponent. So what happens if we calculate the expected goal differential based on shot danger as well as PDO.

6

Now you see a good correlation. So getting a lot of chances is not enough. You also have to get good scoring and goaltending.

This is perhaps not groundbreaking news, but it's still interesting since many people look to CF% or HDCF% for answers, when in fact those metrics only tell you half the story.

The problem with PDO is that it's difficult to predict. You can calculate an expected PDO based on career numbers, but it's not as easy as it may sound. Instead I will take a look at some numbers from Corsica.

Let us start by looking at Corsica's expected goal differential (xG+/- on the homepage). That stat is based on number of shot attempts and the quality (angle, distance, rebounds and so on) of those shot attempts. It's not based on the quality of the shooter or the quality of the goaltender.

So let us see how well Corsica's xGDF correlates to winning hockey games.

7

As we have shown earlier, you can't ignore the quality of the shooters and the quality of the goaltenders. This means that you can't compare Corsica's expected goal differential to actual goal differential. If someone does that, they are simply wrong unless goaltending and shooting is exactly average.

So what happens if we combine Corsica's xGDF with Corsica's WAR(shooting) and WAR(goaltending).

8

This is based on a conversion rate of 4.5 between WAR and GAR, which is standard. However, if we use 5.3 the result is better (graph below). This indicates that the WAR numbers are correct relative to each other, but should probably be a bit higher.

9

The last thing I will look at in this post is WAR(total). Instead of looking at xGDF combined with WAR(shooting) and WAR(goaltending) we will simply compare the WAR(total) of each team with team quality.

10

This means that WAR is actually a pretty good indicator of team Quality. It should also be noted that the average WAR(total) is not 0, it's 0.113 per game or 9.27 per season.

Conclusion:

- CF% and HDCF% are not good indicators of a team's quality. They tell half the story, but you also have to factor in quality of shooting and quality of goaltending.

- PDO does not equal luck. Looking at 11+ years of data, some teams have a good PDO and some teams have a bad PDO. If the sample size is small, then luck can play a factor.

- WAR is a pretty good indicator of team quality.

That's it for this post. I hope at least a few of you found it interesting, even though it's quite nerdy. In the next post I will look at Islanders' WAR numbers this season and compare them to expected WAR numbers based on career metrics. Spoiler alert: Islanders are outperforming their career numbers, but not by as much as one could fear.

All data is based on stats from Natural Stat Trick and Corsica.

<em>Submitted FanPosts do not necessarily reflect the views of this blog or SB Nation. If you're reading this statement, you pass the fine print legalese test. Four stars for you.</em>