Machine Learning Resources
Neural Networks
Recurrent Neural Networks
- Recurrent Neural Networks Tutorial, Part 1 – Introduction to RNNs
- Recurrent Neural Networks Tutorial, Part 2 – Implementing a RNN with Python, Numpy and Theano
- Recurrent Neural Networks Tutorial, Part 3 – Backpropagation Through Time and Vanishing Gradients
- Recurrent Neural Network Tutorial, Part 4 – Implementing a GRU/LSTM RNN with Python and Theano
- Understanding LSTM Networks
Deep Learning
Deep Learning Book (Ian Goodfellow, Yoshua Bengio and Aaron Courville) (link)
- Introduction
- Linear Algebra
- Probability and Information Theory
- Numerical Computation
- Machine Learning Basics
- Deep Feedforward Networks
- Regularization
- Optimization for Training Deep Models
- Convolutional Networks
- Sequence Modeling: Recurrent and Recursive Nets
- Practical Methodology
- Applications
NLP
Deep Learning for NLP (Stanford CS224)
Torch
Machine Learning (University of Oxford)
- Introduction to Lua and Torch
- Linear models
- Classifying digits and tuning optimizers
- Implementing your own layer
- nngraph
- LSTMs for language modelling
Theano
Mathematics Resources
Algebra
- Applied Numerical Linear Algebra (James W. Demmel)
- Numerical Linear Algebra (Lloyd N. Trefethen, and David Bau III)
- The Matrix Cookbook
- 高等代数学 (姚慕生)
- Linear Algebra Review and Reference (cs229)
- Machine Arithmetic: Fixed-Point and Floating-Point Numbers (MIT 18.330)
Optimization
- Numerical Optimization (Jorge Nocedal and Stephen J. Wright)
- Convex Optimization (slides, solutions)
- Proximal Algorithms
- Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers
- Convex Optimization Overview (cs229)
- Convex Optimization Overview 2 (cs229)
- An Introduction to the Conjugate Gradient Method Without the Agonizing Pain
Probability and Statistics
- The Elements of Statistical Learning
- An Introduction to Statistical Learning
- All of Statistics
- The Multivariate Gaussian Distribution (cs229)
- More on Multivariate Gaussians (cs229)
- Review of Probability Theory (cs229)
My Notes
Machine Learning
- Linear Regression
- Logistic Regression
- Principal Components Analysis
- K-Means
- K-Nearest Neighbors
- Decision Tree
- Naive Bayes
- Neural Networks
- Recurrent Neural Network
- LSTM Networks