Introductory Economic Statistics: A Data-Driven Approach using R

Author

Jordan Adamson

Published

30.10.2025

Preface

This Rbook introduces students to econometrics without parametric assumptions and formulas. In many ways, it is a modern version of “Introductory Econometrics: Using Monte Carlo Simulation with Microsoft Excel” by Barreto and Howland, updated to adhere to modern statistics teaching guidelines and give econometrics students the best tools for their labor market. Altogether, students learn to produce statistical analyses of economic data relevant to both the private and public sector, as well as an intuitive foundation for more advanced courses, including both nonparametric statistics and structural econometrics.

Students are introduced to the basics of statistical programming using R alongside the theoretical analysis of economic data using R. This teaches applied statistics relevant to general students, leaving the classical statistics program to mathematics departments and poor applied practices in the dust.

This Rbook is organized into three substantive parts: univariate, bivariate, and multivariate data analysis. The first part has three notable differences from a typical intro to statistics book.

1: students more deeply learn to use and interpret the Histogram, ECDF, and Boxplot (and to avoid 3D pie charts and other chart junk). They work with actual data before abstract probability theory (initially limited to simple events or intervals, with sums of random variable and transformations available optionally later)

2: students learn the basics of probability theory with real world data and computer simulations. I aimed to replace mathematical proofs with simulations whenever possible. This allows less emphasis on classical probability theory mechanics as well as fewer “t and z drills”. Confidence intervals and hypothesis tests are covered via boostrapping, for example, so students learn the conceptual approach rather than a formula.

3: students learn the theory and practice of univariate statistics before moving to bivariate statistics, rather than mixing uni-and-bivariate content. Business textbooks often introduce both types of data, then cover univariate statistics, and return to bivariate statistics much later. Math textbooks typically introduce students to probability theory long before concrete applications. This textbooks includes many practical examples, including on how to analyze data interactively and communicate results.

Parts II and III refines material from several introductory econometrics textbooks and covers linear models only from a “minimum distance” perspective. (We operate under the maxim “All models are wrong” and do not prove unbiasedness.) Also included is a novel chapter on “Data scientism” that more clearly illustrates the ways that simplistic approaches can mislead rather than illuminate. (I stress “gun safety” instead of “pull to shoot”, which is missing from many econometrics textbooks that start with “Assume \(Y=X\beta+E\)”.) Overall, there is a more humble view towards what we can infer from linear regressions that opens the door towards more advanced courses in model development and interpretation. We also cover statistical reporting using R + markdown, which research suggests is a good combination for students 1 2.


Although any interested reader may find it useful, this Rbook is primarily developed for my students.

If you use this Rbook, please cite

@book{Adamson2025_Rbook,
  title={Introductory Economic Statistics: A Data-Driven Approach using R},
  author={Adamson, Jordan},
  year={2025},
  publisher={Bookdown},
  url={https://jadamso.github.io/Rbooks/}
}

Please also report any errors or issues at https://github.com/Jadamso/Rbooks/issues.