From Odds to Insight: Teaching Statistical Inference with Kansas vs. Baylor Game Data
Use the Jan 16, 2026 Kansas vs. Baylor matchup as a reproducible classroom dataset to teach hypothesis testing, confidence intervals, and effect size.
Hook: Turn the frustration of opaque datasets into a hands-on classroom experiment
Students and instructors often face the same pain points: paywalled datasets, messy play-by-play files, and confusion about what a statistically significant result actually means in real-world terms. Use the recent Kansas vs. Baylor (Jan 16, 2026) matchup as a compact, motivating classroom dataset to teach hypothesis testing, confidence intervals, and effect size—with an emphasis on reproducible workflows and pedagogically clear decisions.
The value proposition: sports data as a teaching dataset in 2026
Sports matchups are ideal microcosms for statistical inference: clearly defined outcomes (points, win/loss), repeated structure (possession-level data), rich covariates (home/away, lineups, fatigue), and public interest that motivates students. In 2026, several trends make sports-based pedagogy even more powerful:
- Expanded access to play-by-play and tracking data (shot coordinates, possession windows) from open APIs and university collaborations.
- Increased adoption of reproducible research platforms (Binder, CodeOcean, GitHub with Actions) for sharing classroom notebooks and datasets.
- Wider availability of pre-built analytics packages in Python and R for bootstrapping, permutation tests, and effect-size computation.
These developments let instructors construct compact, real-world experiments students can reproduce in a single lab session.
Why Kansas vs. Baylor is a great teaching match
The Jan 16, 2026 Kansas vs. Baylor game provides a small, meaningful dataset with the following teaching advantages:
- Clear contrast: home-court (Kansas, Allen Fieldhouse) vs. visiting Baylor—useful for home advantage tests.
- Rich box score: points, fouls, turnovers, shooting splits, possessions—good for multiple hypotheses.
- Available odds: pregame betting lines supply an external expected-value comparator for forecasting models.
- Immediate narrative: students care about the outcome, increasing engagement.
Designing the classroom activity: learning objectives and dataset
Start by defining clear learning objectives. Example objectives for a 90–120 minute lab:
- Apply a two-sample t-test and a nonparametric permutation test to compare means.
- Construct and interpret 95% confidence intervals for point differentials and shooting percentages.
- Calculate and interpret an effect size (Cohen's d or rank-biserial) for the primary outcome.
- Assess reproducibility: build a small, sharable notebook and document the data provenance.
Assemble a minimal dataset for the lab: game-level box score plus a small multi-game sample (e.g., last 20 games for each team) to teach sampling and generalization. Provide both the single-game CSV and the multi-game CSV so students can see the difference between within-game inference and general team effects.
Step-by-step classroom workflow
1. Data collection and provenance (15–20 minutes)
Demonstrate how to collect and document the data. For example, use box score and basic play-by-play scraped from a public source or an approved sports API. Emphasize:
- Source citation: URL, date accessed, API key usage, and any license terms.
- Version control: include a data version tag or commit SHA in the notebook metadata.
- Minimal metadata: team names, date, location (home/away), betting line, and final score.
Teaching tip: provide a small pre-cleaned CSV for beginners and a raw CSV for an optional advanced cleaning exercise.
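As a concrete sketch, the provenance metadata can live in the notebook's first cell as a small dictionary. The field names and URL below are illustrative placeholders, not a required schema; adapt them to your course template:

```python
from datetime import date

# Illustrative provenance record; adapt the fields to your course template.
provenance = {
    "source_url": "https://example.org/boxscore",   # placeholder, not a real source
    "date_accessed": date(2026, 1, 17).isoformat(),
    "data_version": "v1.0",                          # or a git commit SHA
    "teams": ["Kansas", "Baylor"],
    "game_date": "2026-01-16",
    "location": "Allen Fieldhouse (Kansas home)",
}

print(provenance["date_accessed"])
```

Committing this dictionary alongside the CSV makes the "source citation" and "version tag" requirements checkable in an automated grading workflow.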
2. Exploratory data analysis (15 minutes)
Guide students to compute and visualize simple summaries:
- Mean points per game and standard deviation for each team (over the last 20 games).
- Shot distribution heatmaps or histograms for field-goal attempts.
- Time-series of margin-of-victory to discuss serial correlation.
Keep visualizations simple—boxplots for distribution comparisons and difference-in-means plots for the primary outcome.
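The summary step takes only a few lines of standard-library Python. The scoring samples below are simulated stand-ins for the real 20-game files:

```python
import random
import statistics

random.seed(0)
# Simulated 20-game scoring samples (stand-ins for the real multi-game CSVs).
kansas_pts = [round(random.gauss(78, 8)) for _ in range(20)]
baylor_pts = [round(random.gauss(74, 9)) for _ in range(20)]

for name, pts in [("Kansas", kansas_pts), ("Baylor", baylor_pts)]:
    print(f"{name}: mean = {statistics.mean(pts):.1f}, sd = {statistics.stdev(pts):.1f}")
```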
3. Hypothesis testing: from odds to inference (30 minutes)
Frame the main hypothesis using the betting line and the public expectation:
Null hypothesis (H0): The home-court effect on point differential is zero. Alternative hypothesis (H1): Home-court yields a non-zero effect.
Suggested classroom tests:
- Two-sample t-test comparing mean point differentials for Kansas (home) vs Baylor (away) across a multi-game sample. Discuss assumptions: normality, equal variances.
- Permutation test for the same comparison—emphasize it's assumption-light and intuitive: shuffle the home/away labels across games and compute the null distribution.
- One-sample test comparing the observed Kansas-Baylor margin in the Jan 16 game to the distribution of expected margins from the model or historical differences.
Practical note: with small samples, permutation and bootstrap methods often give more trustworthy uncertainty measures than parametric approximations.
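A minimal permutation-test sketch, using made-up per-game point differentials (substitute the class sample; the numbers here only illustrate the mechanics of shuffling labels):

```python
import random
import statistics

random.seed(42)
# Made-up per-game point differentials for home vs. away games.
home_diffs = [6, 12, 3, 9, -2, 14, 7, 5, 10, 1]
away_diffs = [2, -4, 5, -1, 3, -6, 4, 0, -3, 2]

observed = statistics.mean(home_diffs) - statistics.mean(away_diffs)

pooled = home_diffs + away_diffs
n_home = len(home_diffs)
n_perm = 10_000
extreme = 0
for _ in range(n_perm):
    random.shuffle(pooled)  # shuffle the home/away labels
    stat = statistics.mean(pooled[:n_home]) - statistics.mean(pooled[n_home:])
    if abs(stat) >= abs(observed):  # two-sided test
        extreme += 1

p_value = extreme / n_perm
print(f"observed diff = {observed:.2f}, permutation p = {p_value:.4f}")
```

The loop makes the null hypothesis tangible: if home/away labels are exchangeable, how often does a random relabeling produce a difference this large?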
4. Confidence intervals: estimation and interpretation (15 minutes)
Teach confidence intervals as interval estimates, not probability statements about parameters. Classroom activities:
- Compute a 95% CI for the mean point differential between Kansas and Baylor using the multi-game sample (e.g., each team's last 20 games; substitute a similar sample if 20 is unavailable).
- Use bootstrap resampling to produce percentile CIs for medians and proportions (e.g., free-throw percentage difference).
- Compare parametric CIs (t-based) with bootstrap CIs and discuss differences.
Emphasize interpretation: "If we repeated the sampling process many times, ~95% of the constructed intervals would contain the true mean difference."
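A percentile-bootstrap sketch for the mean point differential, again with synthetic margins standing in for the real sample:

```python
import random
import statistics

random.seed(1)
# Synthetic Kansas-minus-Baylor margins over 20 games (illustrative only).
diffs = [5, -3, 8, 2, 11, -1, 6, 4, 9, -2, 7, 3, 0, 10, 5, -4, 6, 2, 8, 1]

n_boot = 5_000
boot_means = sorted(
    statistics.mean(random.choices(diffs, k=len(diffs)))  # resample with replacement
    for _ in range(n_boot)
)
lo = boot_means[int(0.025 * n_boot)]   # 2.5th percentile
hi = boot_means[int(0.975 * n_boot)]   # 97.5th percentile
print(f"mean = {statistics.mean(diffs):.2f}, 95% bootstrap CI = ({lo:.2f}, {hi:.2f})")
```

Running the same data through a t-based interval and comparing endpoints makes the parametric-vs-bootstrap discussion concrete.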
5. Effect size: making results meaningful (20 minutes)
Statistical significance is only half the story. Teach students to compute and report effect sizes:
- Cohen's d for the standardized mean difference (use the pooled SD). Interpret small/medium/large with sports-context anchors: a 3–5 point swing in college basketball is often competitively meaningful.
- Rank-biserial or Cliff's delta for nonparametric comparisons.
- Odds ratios for binary outcomes (win/loss) or converting marginal effects into win probability differences.
Classroom exercise: compute Cohen's d for Kansas home vs away scoring, and contrast effect size with p-value significance to practice richer interpretation.
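The exercise reduces to a short function. The home/away scoring samples below are invented for illustration:

```python
import statistics

# Invented home/away scoring samples for Kansas (illustration only).
home_pts = [82, 79, 88, 75, 84, 90, 77, 81, 86, 83]
away_pts = [74, 78, 70, 80, 72, 76, 69, 75, 73, 77]

def cohens_d(a, b):
    """Standardized mean difference using the pooled standard deviation."""
    na, nb = len(a), len(b)
    pooled_var = ((na - 1) * statistics.variance(a)
                  + (nb - 1) * statistics.variance(b)) / (na + nb - 2)
    return (statistics.mean(a) - statistics.mean(b)) / pooled_var ** 0.5

d = cohens_d(home_pts, away_pts)
print(f"Cohen's d = {d:.2f}")
```

Have students report d alongside the raw point difference so the standardized and game-unit interpretations stay linked.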
Advanced strategies and 2026 trends to include
For advanced students, connect the basic lab to contemporary methods gaining traction in 2025–2026:
- Hierarchical/multilevel models to borrow strength across games and account for team-level random effects.
- Bayesian inference for direct probabilistic statements about effect size; demonstrate a simple posterior for mean point difference using a weakly informative prior.
- Permutation-based confidence intervals and robust standard errors to deal with heteroskedasticity in sports data.
- Probabilistic forecasting: calibrate odds-implied models against observed margins to teach model checking (Brier score, calibration plots).
Recent 2025–2026 developments: growing use of player-tracking data in undergraduate projects (shot selection, defensive impact) and more instructor-friendly libraries—refer students to packages like statsmodels, scikit-learn, brms (R), and the tidymodels ecosystem.
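For the Bayesian extension, a conjugate normal-normal update conveys the idea without any sampler. This sketch treats the game-to-game sd as known (a teaching simplification; in class you would fit the model with brms or a similar package), and all numbers are synthetic:

```python
import math

# Conjugate normal-normal update for the mean point differential, treating
# the game-to-game sd as known (a teaching simplification). The prior
# N(0, 10^2) is weakly informative.
prior_mean, prior_sd = 0.0, 10.0
sigma = 11.0                           # assumed known sd of a single game's margin
margins = [5, -3, 8, 2, 11, -1, 6, 4]  # synthetic data
n = len(margins)
xbar = sum(margins) / n

post_prec = 1 / prior_sd**2 + n / sigma**2   # precisions add
post_var = 1 / post_prec
post_mean = post_var * (prior_mean / prior_sd**2 + n * xbar / sigma**2)
print(f"posterior for mean margin: N({post_mean:.2f}, {math.sqrt(post_var):.2f}^2)")
```

The posterior mean sits between the prior mean and the sample mean, weighted by precision, which is exactly the "borrowing strength" intuition the hierarchical extension builds on.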
Reproducibility and data management: a compact checklist
Make reproducibility part of the assessment. Require a minimal reproducibility standard for each student notebook or report:
- Data provenance: source URL, access date, extraction code snippet.
- Environment specification: requirements.txt, renv.lock, or environment.yml.
- Executable notebook: Jupyter or RMarkdown with outputs committed; include small sample data to avoid API limits.
- License and ethical statement: note any terms of use for betting or tracking data and a short comment on responsible use.
- Short README.md explaining how to run the analysis in under 10 minutes using Binder or GitHub Codespaces.
Teaching tip: supply a template repository with the above files prepopulated. In 2026, many universities already support GitHub Classroom + automated checks to ensure submissions meet reproducibility criteria.
Assessment rubrics and grading suggestions
Grade on both technical correctness and interpretation. A suggested rubric (out of 100):
- Data provenance and reproducibility files (20)
- Correct application of tests and CIs (25)
- Effect-size calculation and practical interpretation (20)
- Visualization and EDA clarity (15)
- Discussion of limitations and ethics (10)
- Bonus: Bayesian or hierarchical extension (10 extra credit)
Sample classroom prompts and open-ended project ideas
Prompt 1: Is home-court advantage real for Kansas this season?
Use the last 20 Kansas home games and 20 away games. Compute a CI for the mean difference in points and test whether the home effect is > 0. Report Cohen's d and interpret in points and practical terms (e.g., win probability).
Prompt 2: Did pregame odds under- or over-estimate the actual margin?
Compare the bookmaker-implied expected margin for the Jan 16 game with realized margin. Build a simple calibration analysis using a multi-game sample: compute Brier scores for win probability predictions and a mean error for predicted margin.
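The Brier score itself is one line; the pregame win probabilities and outcomes below are hypothetical stand-ins for an odds-implied sample:

```python
def brier_score(probs, outcomes):
    """Mean squared error between win probabilities and 0/1 outcomes."""
    return sum((p - o) ** 2 for p, o in zip(probs, outcomes)) / len(probs)

# Hypothetical pregame win probabilities and realized outcomes.
pred = [0.70, 0.55, 0.80, 0.40, 0.65]
won  = [1,    0,    1,    0,    1]

print(f"Brier score = {brier_score(pred, won):.3f}")   # lower is better
```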
Prompt 3: What effect did turnovers have on the final margin?
Fit a simple linear model: margin ~ turnovers_diff + offensive_rebound_diff + free_throw_attempt_diff. Report coefficients with bootstrap CIs and standardized effect sizes.
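A pared-down sketch of the bootstrap-CI idea, using a single predictor (margin ~ turnovers_diff) and synthetic data; for the full three-predictor model students would use statsmodels:

```python
import random

random.seed(7)
# One-predictor simplification (margin ~ turnovers_diff) on synthetic data.
to_diff = [-3, 1, 4, -2, 0, 5, -1, 2, 3, -4]
margin  = [-6, 3, 9, -2, 1, 12, -4, 5, 8, -9]

def ols_slope(x, y):
    """Least-squares slope of y on x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sxx = sum((xi - mx) ** 2 for xi in x)
    return sxy / sxx

slope = ols_slope(to_diff, margin)

# Case (pairs) bootstrap: resample whole games for a 95% percentile CI.
boot_slopes = []
n = len(to_diff)
for _ in range(2000):
    idx = [random.randrange(n) for _ in range(n)]
    boot_slopes.append(ols_slope([to_diff[i] for i in idx],
                                 [margin[i] for i in idx]))
boot_slopes.sort()
lo, hi = boot_slopes[50], boot_slopes[1950]   # 2.5th and 97.5th percentiles
print(f"slope = {slope:.2f}, 95% bootstrap CI = ({lo:.2f}, {hi:.2f})")
```

Resampling whole games (rather than residuals) keeps each predictor-outcome pair intact, which is the simplest defensible choice at this level.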
Interpreting results responsibly: beyond p-values
Teach students to answer three practical questions:
- Is the effect statistically significant? (report p-value, but avoid over-reliance)
- Is the effect practically meaningful? (report effect size in game units)
- How reproducible/robust is the finding? (sensitivity analyses, bootstrap, out-of-sample checks)
Encourage pre-registration of classroom hypotheses to avoid data-dredging. For example, ask students to submit a one-paragraph hypothesis and analysis plan before viewing the Jan 16 box score.
Ethical considerations and classroom policy
Sports and betting data can intersect with gambling interests. Include a brief ethics section in every assignment:
- Prohibit using class analyses for live betting.
- Respect data licenses and player privacy—do not redistribute proprietary tracking data.
- Discuss biases in sports analytics (selection bias, survivorship bias) and their impact on inference.
Practical resources and reproducible starter kit (2026-ready)
Provide students with a starter kit repository that includes:
- A small CSV for Jan 16, 2026 Kansas vs. Baylor box score and a 40-game sample for both teams.
- A Jupyter notebook (Python) and an RMarkdown (R) notebook implementing the workflow: EDA, t-test, permutation test, bootstrap CI, Cohen's d, and a short linear model.
- Requirements: requirements.txt (Python) and renv.lock (R).
- Binder badge and instructions to launch the notebook in the cloud for students without local installs.
Include one fully annotated notebook as the instructor solution and one minimal stub for student completion.
Common pitfalls and troubleshooting checklist
- Small-sample inference: prefer permutation/bootstrap over asymptotic tests when n is small.
- Non-independence: consecutive game performance can be autocorrelated—consider blocking or time-series adjustments.
- Multiple comparisons: adjust or pre-register when testing many outcomes (shooting, turnovers, rebounds).
- Misinterpreting CI: avoid claiming the CI contains the parameter with X% probability—explain frequentist interpretation or use Bayesian intervals for probabilistic claims.
Takeaways and classroom-ready checklist
Implementing a sports-based statistical inference lab delivers engagement and practical skill-building. To run the Kansas vs. Baylor activity successfully, follow this checklist:
- Prepare the dataset and cite sources (box score + small multi-game sample).
- Provide two notebook templates (full solution + student stub).
- Emphasize reproducibility: include environment files and a README for Binder.
- Focus grading on interpretation and effect size, not just p-values.
- Offer advanced extensions (Bayesian, hierarchical) for motivated students.
Final reflections: why this matters in 2026
By 2026, data literacy is essential across disciplines. Sports datasets like Kansas vs. Baylor give students an emotionally engaging, compact, and reproducible environment to learn rigorous statistical thinking. The exercise bridges hypothesis testing, interval estimation, and practical effect-size interpretation—while modeling reproducible workflows that mirror modern research practices.
Actionable next steps (for instructors)
- Download the starter CSV and notebook from the shared repository (provide link in your LMS).
- Pre-register one clear hypothesis for the class and distribute the student stub before the lecture.
- Run a live demo of a permutation test and bootstrap CI to demystify resampling methods.
- Ask students to submit reproducible notebooks via GitHub Classroom or your LMS with the reproducibility checklist completed.
Call-to-action
Ready to adopt this lesson? Download the Kansas vs. Baylor starter kit (CSV + Jupyter + RMarkdown) and a reproducibility template to run this activity in your next lab. Share your class results and improvements on our educator forum to help build an open collection of sports-based teaching datasets and reproducible solutions.