Foundations of Statistics for Data Scientists

Regular price €107.99
Quantity:
In stock with our UK publisher. 14-28 days
Delivery/Collection within 10-20 working days
14 days return policy Shipping & Delivery
A01=Alan Agresti
A01=Maria Kateri
Academic Statistics Department
Adjusting Error Rates
advanced data science statistics
Age Group_Uncategorized
Age Group_Uncategorized
Approximately Normal
Author_Alan Agresti
Author_Maria Kateri
automatic-update
Bayesian inference
Binary Response Variable
Carapace Width
Category1=Non-Fiction
Category=GPH
Category=PBT
Category=UNF
Classification Table
Classification Tree
CLT
cluster analysis
COP=United Kingdom
Delivery_Delivery within 10-20 working days
Delta Method
eq_bestseller
eq_computing
eq_isMigrated=2
eq_nobargain
eq_non-fiction
Exit Poll
Explanatory Variable Values
Fisher's Linear Discriminant Function
Hidden Units
Horseshoe Crabs
Inferential Statistical Methods
Language_English
Linear Discriminant Analysis
Linear Discriminant Function
Linear Probability Model
logistic regression
Normal Sampling Distribution
PA=Available
Poisson models
Practicing Data Science
Price_€50 to €100
PS=Active
regularisation techniques
Roc Curve
Sample Data Distribution
Sampling Distribution
Simple Random Sample
Simulated Sampling Distribution
softlaunch
statistical simulation

Product details

  • ISBN 9780367748456
  • Weight: 1220g
  • Dimensions: 178 x 254mm
  • Publication Date: 30 Nov 2021
  • Publisher: Taylor & Francis Ltd
  • Publication City/Country: GB
  • Product Form: Hardback
  • Language: English
Secure checkout Fast Shipping Easy returns

Foundations of Statistics for Data Scientists: With R and Python is designed as a textbook for a one- or two-term introduction to mathematical statistics for students training to become data scientists. It is an in-depth presentation of the topics in statistical science with which any data scientist should be familiar, including probability distributions, descriptive and inferential statistical methods, and linear modeling. The book assumes knowledge of basic calculus, so the presentation can focus on "why it works" as well as "how to do it." Compared to traditional "mathematical statistics" textbooks, however, the book has less emphasis on probability theory and more emphasis on using software to implement statistical methods and to conduct simulations to illustrate key concepts. All statistical analyses in the book use R software, with an appendix showing the same analyses with Python.

Key Features:

  • Shows the elements of statistical science that are important for students who plan to become data scientists.
  • Includes Bayesian and regularized fitting of models (e.g., showing an example using the lasso), classification and clustering, and implementing methods with modern software (R and Python).
  • Contains nearly 500 exercises.

The book also introduces modern topics that do not normally appear in mathematical statistics texts but are highly relevant for data scientists, such as Bayesian inference, generalized linear models for non-normal responses (e.g., logistic regression and Poisson loglinear models), and regularized model fitting. The nearly 500 exercises are grouped into "Data Analysis and Applications" and "Methods and Concepts." Appendices introduce R and Python and contain solutions for odd-numbered exercises. The book's website (http://stat4ds.rwth-aachen.de/) has expanded R, Python, and Matlab appendices and all data sets from the examples and exercises.

Alan Agresti, Distinguished Professor Emeritus at the University of Florida, is the author of seven books, including Categorical Data Analysis (Wiley) and Statistics: The Art and Science of Learning from Data (Pearson), and has presented short courses in 35 countries. His awards include an honorary doctorate from De Montfort University (UK) and Statistician of the Year from the American Statistical Association (Chicago chapter).

Maria Kateri, Professor of Statistics and Data Science at the RWTH Aachen University, authored the monograph Contingency Table Analysis: Methods and Implementation Using R (Birkhäuser/Springer) and a textbook on mathematics for economists (in German). She has long-term experience in teaching statistics courses to students of Data Science, Mathematics, Statistics, Computer Science, Business Administration, and Engineering.

More from this author