This is a short companion page to our internal reading group of the book “Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow, 2nd Edition”. However I unashamedly used a lot of PyTorch examples.
Intuition: visual cortex
Convolution
Filters, kernels, padding, strides, and pooling layers
Putting it altogether: LeNet5
CNN as a strong prior: locality, stationarity, compositionality
Residue block and skip connections
Deep CNN model architectures
AlexNet
GoogLeNet
ResNet 50
Other models
Classification, detection, and semantic segmentation
IoU and NMS
R-CNN, Feature Pyramid Network (FPN), Region Proposal Networks (RPN)
Mask RCNN
Towards realtime, embedded: NAS and MobileNets
Towards low prior: vision without CNN
Towards low data: contrastive representation learning, zero shots or few shots
Towards explainability
jiayu@hey.com
July 27, 2021