Exploratory Data Analysis Using R

Regular price €64.99
Quantity:
In stock with our UK publisher. 14-28 days
Will Deliver When Available
14 days return policy Shipping & Delivery
A01=Ronald K. Pearson
Author_Ronald K. Pearson
Category=PBT
Category=UMX
Category=UMZ
Category=UNA
Category=UNF
Category=UYF
data storytelling
eq_bestseller
eq_computing
eq_isMigrated=1
eq_isMigrated=2
eq_nobargain
eq_non-fiction
file management R
forthcoming
interactive data exploration techniques
linear models
R programming exercises
statistical inference
text data analysis

Product details

  • ISBN 9781032814803
  • Weight: 453g
  • Dimensions: 156 x 234mm
  • Publication Date: 02 Jul 2026
  • Publisher: Taylor & Francis Ltd
  • Publication City/Country: GB
  • Product Form: Paperback
Secure checkout Fast Shipping Easy returns

Exploratory Data Analysis Using R provides a classroom-tested introduction to exploratory data analysis (EDA), and this revised edition is accompanied by the R package ExploreTheData that implements many of the approaches described. As before, the primary focus of the book is on identifying "interesting" features - good, bad, and ugly - in a dataset, why it is important to find them, how to treat them, and more generally, the use of R to explore and explain datasets and the analysis results derived from them.

The book begins with a brief overview of exploratory data analysis using R, followed by a detailed discussion of creating various graphical data summaries in R. Then comes a thorough introduction to exploratory data analysis, and a detailed treatment of 13 data anomalies, why they are important, how to find them, and some options for addressing them. Subsequent chapters introduce the mechanics of working with external data, structured query language (SQL) for interacting with relational databases, linear regression analysis (the simplest and historically most important class of predictive models), and crafting data stories to explain our results to others. These chapters use R as an interactive data analysis platform, while Chapter 9 turns to writing programs in R, focusing on creating custom functions that can greatly simplify repetitive analysis tasks. Further chapters expand the scope to more advanced topics and techniques: special considerations for working with text data, a second look at exploratory data analysis, and more general predictive models.

The book is designed for both advanced undergraduate, entry-level graduate students, and working professionals with little to no prior exposure to data analysis, modeling, statistics, or programming. It keeps the treatment relatively non-mathematical, even though data analysis is an inherently mathematical subject. Exercises are included at the end of most chapters, and an instructor's solution manual is available.

Ronald K. Pearson holds a PhD in Electrical Engineering and Computer Science from the Massachussetts Institute of Technology and has more than 40 years professional experience in exploratory data analysis. Dr. Pearson has held industrial, business, and academic positions in the fields of industrial process control, bioinformatics, drug safety data analysis, software development, and insurance. He has authored or co-authored books including Exploring Data in Engineering, the Sciences, and Medicine (Oxford University Press, 2011) and Mining Imperfect Data with Examples in R and Python (SIAM, 2020).

More from this author