Data Science and Machine Learning: Mathematical and
Statistical Methods
This homepage accompanies the books:
D.P. Kroese, Z.I. Botev, T. Taimre, R. Vaisman. Data Science
and Machine Learning: Mathematical and Statistical Methods,
Chapman and Hall/CRC, Boca Raton, 2019.
Z.I. Botev, D.P. Kroese, T. Taimre. Data Science
and Machine Learning: Mathematical and Statistical Methods,
Second Edition,
Chapman and Hall/CRC, Boca Raton, 2025.
The purpose of DSML is to provide an accessible, yet comprehensive textbook intended for students interested in gaining a better understanding of the mathematics and statistics that underpin the rich variety of ideas and machine learning algorithms in data science.
If you wish to use this book for educational purposes or self-study, you
are welcome to download the PDF of the first edition free of charge, provided that you give
due
acknowledgement to the source and its original location (this
website).
New in the Second Edition
The second edition provides updates across key areas of statistical learning:
- Monte Carlo Methods: A new section
introducing regenerative rejection sampling - a simpler alternative to MCMC.
- Unsupervised Learning: Inclusion of two multidimensional diffusion kernel density estimators, as well as the bandwidth perturbation matching method for the optimal data-driven bandwidth selection.
- Regression: New automatic bandwidth selection for local linear regression.
- Feature Selection and Shrinkage: A new chapter introducing the klimax method for model selection in high-dimensions.
- Reinforcement Learning: A new chapter on contemporary topics such as policy iteration, temporal difference learning, and policy gradient methods, all complete with Python code.
- Appendices: Expanded treatment of
linear algebra, functional analysis, and optimization that
includes the coordinate-descent method and the novel
Majorization-Minimization method for constrained optimization.
Key Features:
- Focuses on mathematical understanding.
- Presentation is self-contained, accessible, and comprehensive.
- Extensive list of exercises and worked-out examples.
- Many concrete algorithms with Python code.
- Full color throughout and extensive indexing.
- A single-counter consecutive numbering of all theorems, definitions, equations, etc., for easier text searches.
Order Information (Hard copy and Kindle):
[
CRC Press
|
Amazon
|
Barnes
& Noble
]
Japanese
translation published by:
TOKYO KAGAKU DOZIN, Co., Ltd.
Simplified Chinese
translation published by:
China Machine Press/Huazhang.
To Dirk Kroese's homepage