Studies in Machine Learning & AI

cyberpunk

Papers

This is a selection of quintessential papers for anyone starting on Deep Learning (Thanks to Joe Zimmerman):

Using Keras and Deep Q-Network to Play FlappyBird. Hands-on on Google DeepMind's Deep Q-Network.
Neural Networks, Manifolds, and Topology. This is a 2-years-old article, but a very well-written high-level explanation of the topology of low-dimensional NNs. "The task of a classification algorithm is fundamentally to separate a bunch of tangled manifolds."
Calculus on Computational Graphs: Backpropagation. Backpropagation explained in a very well-written text.
Understanding LSTM Networks. Another hit :).
Visualizing Representations: Deep Learning and Human Beings. Another Christopher Olah's great post, now on NN's different layers representations, tanging some philosophic aspects of it.
Karpathy's t-SNE visualization of CNN codes. He takes the 50k ILSVRC 2012 validation images, extracts the 4096-dimensional fc7 CNN features using Caffe and then uses Barnes-Hut t-SNE to compute a 2-dimensional embedding that respects the high-dimensional (L2) distances.
NVIDIA's Accelerating AI with GPUs: A New Computing Model.
Torchnet: Lighting the way to deep machine learning. "Torchnet is different from frameworks such as Caffe, Chainer, TensorFlow, and Theano, in that it does not focus on performing efficient inference and gradient computations in deep networks. Instead, Torchnet provides a framework on top of a deep learning framework that makes rapid experimentation easier."

Remember that the hidden layer learns a representation so that the data is linearly separable, so that's is how you do separate a spiral two-dimensional dataset using Tensorflow playground and Convnetjs:

With tanh:

tahn2 tahn11 tan2 tan3

With ReLU:

relu2

linear relu_no sigmoid relu_no2 relu_no3 relu_no4

cyberpunk