Exploratory Data Analysis Using R
Product details
- ISBN 9781032814803
- Weight: 453g
- Dimensions: 156 x 234mm
- Publication Date: 02 Jul 2026
- Publisher: Taylor & Francis Ltd
- Publication City/Country: GB
- Product Form: Paperback
Exploratory Data Analysis Using R provides a classroom-tested introduction to exploratory data analysis (EDA), and this revised edition is accompanied by the R package ExploreTheData that implements many of the approaches described. As before, the primary focus of the book is on identifying "interesting" features - good, bad, and ugly - in a dataset, why it is important to find them, how to treat them, and more generally, the use of R to explore and explain datasets and the analysis results derived from them.
The book begins with a brief overview of exploratory data analysis using R, followed by a detailed discussion of creating various graphical data summaries in R. Then comes a thorough introduction to exploratory data analysis, and a detailed treatment of 13 data anomalies, why they are important, how to find them, and some options for addressing them. Subsequent chapters introduce the mechanics of working with external data, structured query language (SQL) for interacting with relational databases, linear regression analysis (the simplest and historically most important class of predictive models), and crafting data stories to explain our results to others. These chapters use R as an interactive data analysis platform, while Chapter 9 turns to writing programs in R, focusing on creating custom functions that can greatly simplify repetitive analysis tasks. Further chapters expand the scope to more advanced topics and techniques: special considerations for working with text data, a second look at exploratory data analysis, and more general predictive models.
The book is designed for advanced undergraduates, entry-level graduate students, and working professionals with little to no prior exposure to data analysis, modeling, statistics, or programming. It keeps the treatment relatively non-mathematical, even though data analysis is an inherently mathematical subject. Exercises are included at the end of most chapters, and an instructor's solution manual is available.
Ronald K. Pearson holds a PhD in Electrical Engineering and Computer Science from the Massachusetts Institute of Technology and has more than 40 years of professional experience in exploratory data analysis. Dr. Pearson has held industrial, business, and academic positions in the fields of industrial process control, bioinformatics, drug safety data analysis, software development, and insurance. He has authored or co-authored books including Exploring Data in Engineering, the Sciences, and Medicine (Oxford University Press, 2011) and Mining Imperfect Data with Examples in R and Python (SIAM, 2020).
