Springer, 2009. Corrected 12th printing, 2017. — 745 p. — ISBN: 0387848576, 978-0387848570.
More than 100 typos found in the 2009 printing have been corrected.

During the past decade there has been an explosion in computation and information technology. With it have come vast amounts of data in a variety of fields such as medicine, biology, finance, and marketing. The challenge of understanding these data has led to the development of new tools in the field of statistics and spawned new areas such as data mining, machine learning, and bioinformatics. Many of these tools have common underpinnings but are often expressed with different terminology. This book describes the important ideas in these areas in a common conceptual framework. While the approach is statistical, the emphasis is on concepts rather than mathematics. Many examples are given, with a liberal use of color graphics. It is a valuable resource for statisticians and anyone interested in data mining in science or industry. The book's coverage is broad, from supervised learning (prediction) to unsupervised learning. The many topics include neural networks, support vector machines, classification trees, and boosting (the first comprehensive treatment of this topic in any book).
This major new edition features many topics not covered in the original, including graphical models, random forests, ensemble methods, least angle regression & path algorithms for the lasso, non-negative matrix factorization, and spectral clustering. There is also a chapter on methods for "wide" data (p bigger than n), including multiple testing and false discovery rates.
Overview of Supervised Learning
Variable Types and Terminology, Two Simple Approaches to Prediction: Least Squares and Nearest Neighbors, Statistical Decision Theory, Local Methods in High Dimensions, Statistical Models, Supervised Learning and Function Approximation, Structured Regression Models, Classes of Restricted Estimators, Model Selection and the Bias–Variance Tradeoff
Linear Methods for Regression
Linear Regression Models and Least Squares, Subset Selection, Shrinkage Methods, Methods Using Derived Input Directions, Discussion: A Comparison of the Selection and Shrinkage Methods, Multiple Outcome Shrinkage and Selection, More on the Lasso and Related Path Algorithms
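As an illustrative aside (not taken from the book): a minimal sketch of least squares versus the ridge and lasso shrinkage methods listed above, using scikit-learn on synthetic data; the dataset, penalty values, and coefficients are arbitrary demonstration choices.

```python
# Illustrative sketch only: least squares vs. ridge and lasso shrinkage
# on synthetic data (data and alpha values are arbitrary, not from the book).
import numpy as np
from sklearn.linear_model import LinearRegression, Ridge, Lasso

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 10))
beta = np.array([3.0, -2.0, 1.5] + [0.0] * 7)   # only three truly active predictors
y = X @ beta + rng.normal(scale=0.5, size=100)

for model in (LinearRegression(), Ridge(alpha=1.0), Lasso(alpha=0.1)):
    model.fit(X, y)
    # lasso tends to set the irrelevant coefficients exactly to zero
    print(type(model).__name__, np.round(model.coef_, 2))
```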
Linear Methods for Classification
Linear Regression of an Indicator Matrix, Linear Discriminant Analysis, Logistic Regression, Separating Hyperplanes
Basis Expansions and Regularization
Piecewise Polynomials and Splines, Filtering and Feature Extraction, Smoothing Splines, Automatic Selection of the Smoothing Parameters, Nonparametric Logistic Regression, Multidimensional Splines, Regularization and Reproducing Kernel Hilbert Spaces, Wavelet Smoothing
Kernel Smoothing Methods
One-Dimensional Kernel Smoothers, Selecting the Width of the Kernel, Local Regression in ℝ^p, Structured Local Regression Models in ℝ^p, Local Likelihood and Other Models, Kernel Density Estimation and Classification, Radial Basis Functions and Kernels, Mixture Models for Density Estimation and Classification
Model Assessment and Selection
Bias, Variance and Model Complexity, The Bias–Variance Decomposition, Optimism of the Training Error Rate, Estimates of In-Sample Prediction Error, The Effective Number of Parameters, The Bayesian Approach and BIC, Minimum Description Length, Vapnik–Chervonenkis Dimension, Cross-Validation, Bootstrap Methods, Conditional or Expected Test Error?
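A hedged illustration of the cross-validation idea in this chapter (not the book's code): five-fold cross-validation used to compare ridge penalties on synthetic data; the alpha grid, fold count, and data are arbitrary choices.

```python
# Illustrative sketch: 5-fold cross-validation to compare ridge penalties.
# The synthetic data and the alpha grid are arbitrary demonstration choices.
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(1)
X = rng.normal(size=(200, 20))
y = X[:, 0] - 2 * X[:, 1] + rng.normal(scale=1.0, size=200)

for alpha in (0.01, 0.1, 1.0, 10.0):
    scores = cross_val_score(Ridge(alpha=alpha), X, y, cv=5,
                             scoring="neg_mean_squared_error")
    print(f"alpha={alpha:<5} mean CV MSE={-scores.mean():.3f}")
```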
Model Inference and Averaging
The Bootstrap and Maximum Likelihood Methods, Bayesian Methods, Relationship Between the Bootstrap and Bayesian Inference, The EM Algorithm, MCMC for Sampling from the Posterior, Bagging, Model Averaging and Stacking, Stochastic Search: Bumping
Additive Models, Trees, and Related Methods
Generalized Additive Models, Tree-Based Methods, PRIM: Bump Hunting, MARS: Multivariate Adaptive Regression Splines, Hierarchical Mixtures of Experts, Missing Data
Boosting and Additive Trees
Boosting Methods, Boosting Fits an Additive Model, Forward Stagewise Additive Modeling, Exponential Loss and AdaBoost, Why Exponential Loss?, Loss Functions and Robustness, Off-the-Shelf Procedures for Data Mining, Example: Spam Data, Boosting Trees, Numerical Optimization via Gradient Boosting, Right-Sized Trees for Boosting, Regularization
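An illustrative sketch, not the book's own code: gradient boosting with shallow trees and a shrinkage (learning-rate) parameter via scikit-learn; the simulated dataset and hyperparameters are arbitrary choices.

```python
# Illustrative sketch: gradient boosting with shallow ("right-sized") trees
# and shrinkage; data and hyperparameters are arbitrary demonstration choices.
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, n_features=20, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

gbm = GradientBoostingClassifier(n_estimators=200, learning_rate=0.1,
                                 max_depth=3, random_state=0)
gbm.fit(X_tr, y_tr)
print("test accuracy:", gbm.score(X_te, y_te))
```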
Neural Networks
Projection Pursuit Regression, Neural Networks, Fitting Neural Networks, Some Issues in Training Neural Networks, Example: Simulated Data, Example: ZIP Code Data, Discussion, Bayesian Neural Nets and the NIPS 2003 Challenge
Support Vector Machines and Flexible Discriminants
The Support Vector Classifier, Support Vector Machines and Kernels, Generalizing Linear Discriminant Analysis, Flexible Discriminant Analysis, Penalized Discriminant Analysis, Mixture Discriminant Analysis
Prototype Methods and Nearest-Neighbors
Prototype Methods, k-Nearest-Neighbor Classifiers, Adaptive Nearest-Neighbor Methods
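A minimal, illustrative k-nearest-neighbor example (not from the book), assuming scikit-learn and its bundled iris data; k = 5 is an arbitrary choice.

```python
# Illustrative sketch: a k-nearest-neighbor classifier on a toy dataset;
# the choice of k and the dataset are arbitrary demonstration choices.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

knn = KNeighborsClassifier(n_neighbors=5)
knn.fit(X_tr, y_tr)
print("test accuracy:", knn.score(X_te, y_te))
```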
Unsupervised Learning
Association Rules, Cluster Analysis, Self-Organizing Maps, Principal Components, Curves and Surfaces, Non-negative Matrix Factorization, Independent Component Analysis and Exploratory Projection Pursuit, Multidimensional Scaling, Nonlinear Dimension Reduction and Local Multidimensional Scaling, The Google PageRank Algorithm
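For flavor only (not the book's code): principal components followed by k-means clustering with scikit-learn; the two-component projection, three clusters, and dataset are arbitrary demonstration choices.

```python
# Illustrative sketch: principal components followed by k-means clustering;
# the number of components/clusters and the dataset are arbitrary choices.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA

X, _ = load_iris(return_X_y=True)
Z = PCA(n_components=2).fit_transform(X)              # project onto two principal components
km = KMeans(n_clusters=3, n_init=10, random_state=0).fit(Z)
print("cluster sizes:", np.bincount(km.labels_))
```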
Random Forests
Definition of Random Forests, Details of Random Forests, Analysis of Random Forests
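An illustrative random-forest sketch (not from the book), using scikit-learn with an out-of-bag accuracy estimate; the dataset and hyperparameters are arbitrary choices.

```python
# Illustrative sketch: a random forest with out-of-bag error estimation;
# the dataset and hyperparameters are arbitrary demonstration choices.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

X, y = make_classification(n_samples=1000, n_features=20, random_state=0)

rf = RandomForestClassifier(n_estimators=300, max_features="sqrt",
                            oob_score=True, random_state=0)
rf.fit(X, y)
print("out-of-bag accuracy:", round(rf.oob_score_, 3))
```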
Ensemble Learning
Boosting and Regularization Paths, Learning Ensembles
Undirected Graphical Models
Markov Graphs and Their Properties, Undirected Graphical Models for Continuous Variables, Undirected Graphical Models for Discrete Variables
High-Dimensional Problems
When p is Much Bigger than N, Diagonal Linear Discriminant Analysis and Nearest Shrunken Centroids, Linear Classifiers with Quadratic Regularization, Linear Classifiers with L1 Regularization, Classification When Features are Unavailable, High-Dimensional Regression: Supervised Principal Components, Feature Assessment and the Multiple-Testing Problem
The website for this book is located at web.stanford.edu/~hastie/ElemStatLearn/