What is Cricsheet?
Cricsheet is a collection of projects which collectively provide data for various aspects of cricket. The current projects provide ball-by-ball match data for Men’s and Women’s Test Matches, One-day internationals, Twenty20 Internationals, some other international T20s, various club competitions such as the Big Bash League, Indian Premier League, T20 Blast, Pakistan Super League, Women's Big Bash League, Caribbean Premier League, Super Smash, The Hundred, County Championship, Women's T20 Challenge, Sheffield Shield, Charlotte Edwards Cup, Rachael Heyhoe Flint Trophy, Mzansi Super League, Bangladesh Premier League, and Bob Willis Trophy; and a registry of people linking the identifiers used for them on various sites.
At the moment we have ball-by-ball information for 10,324 matches comprising 701 Test matches, 23 other multi-day matches, 2,238 One-day internationals, 383 other one-day matches, 1,860 T20 internationals, 329 international T20s, 472 Big Bash League matches, 876 Indian Premier League matches, 935 T20 Blast matches, 180 Pakistan Super League matches, 319 Women's Big Bash League matches, 277 Caribbean Premier League matches, 201 Super Smash matches, 64 The Hundred matches, 856 County Championship matches, 9 Women's T20 Challenge matches, 75 Sheffield Shield matches, 26 Charlotte Edwards Cup matches, 53 Rachael Heyhoe Flint Trophy matches, 56 Mzansi Super League matches, 274 Bangladesh Premier League matches, and 47 Bob Willis Trophy matches featuring 95 countries, 136 club teams, and 3 representative XIs going back as far as 2007 (for women), and 2004 (for men).
The Register consists of multiple CSV files containing entries for 10,496 people, and including 17,498 identifiers from 8 sources, along with 5,872 name variations for them.
This data allows a person to be correctly identified by their ids on various sites, such as CricketArchive, Cricinfo, CricHQ, Pulse etc., and was used for that purpose on the Cricket Scorecard Accuracy Project.
How can I help?
Spotting errors in the data
The first method of helping would be to spot any errors in the data. Ideally we won’t have any but there’s always the chance and if we can spot the errors we can fix them and write further validation to ensure that further examples don’t slip through.
Helping with missing data
The second method of helping is to help us get ball-by-ball data for our missing games. This doesn’t even have to involve finding the data, it’s possible you know a contact who may be able to shed light on some matches, or you know of someone who has the commentary for a match on tape. Even small bits of info might be enough to put us on the right track.
We do have an infrequent blog to which we occasionally post about updates to the data format, additions to the site, or random musings. The most recent entry was “Review corrections” on the 4th of December, 2021.
Getting in touch
You can contact the project by following the instructions on the Contact page. Don't worry, it just involves an email, however I do provide answers for some of the common queries I receive so you may not even need to get in touch!