本書內容包括:研究特征提取與特征工程過程、評估線性回歸的性能和誤差估計、使用不同類型的算法構建數據模型并理解其工作原理、調整支持向量機(SVM)的參數、探討自然語言處理(NLP)和推薦系統(tǒng)的概念、從頭開始創(chuàng)建一個機器學習架構。
Preface
Chapter 1: A Gentle Introduction to Machine Learning
Introduction - classic and adaptive machines
Descriptive analysis
Predictive analysis
Only learning matters
Supervised learning
Unsupervised learning
Semi-supervised learning
Reinforcement learning
Computational neuroscience
Beyond machine learning - deep learning and bio-inspired adaptive
systems
Machine learning and big data
Summary
Chapter 2: Important Elements in Machine Learning
Data formats
Multiclass strategies
One-vs-all
One-vs-one
Learnability
Underfitting and overfitting
Error measures and cost functions
PAC learning
Introduction to statistical learning concepts
MAP learning
Maximum likelihood learning
Class balancing
Resampling with replacement
SMOTE resampling
Elements of information theory
Entropy
Cross-entropy and mutual information
Divergence measures between two probability distributions
Summary
Chapter 3: Feature Selection and Feature Engineering
scikit-learn toy datasets
Creating training and test sets
Managing categorical data
Managing missing features
Data scaling and normalization
Whitening
Feature selection and filtering
Principal Component Analysis
Non-Negative Matrix Factorization
Sparse PCA
Kernel PCA
Independent Component Analysis
Atom extraction and dictionary learning
Visualizing high-dimensional datasets using t-SNE