Chapter 14. Deep Computer Vision Using Convolutional Neural Networks
🍪

Chapter 14. Deep Computer Vision Using Convolutional Neural Networks

This is a short companion page to our internal reading group of the book “Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow, 2nd Edition”. However I unashamedly used a lot of PyTorch examples.

Intuition: visual cortex

Convolution

Filters, kernels, padding, strides, and pooling layers

Putting it altogether: LeNet5

CNN as a strong prior: locality, stationarity, compositionality

Residue block and skip connections

Deep CNN model architectures

AlexNet

GoogLeNet

ResNet 50

Other models

Classification, detection, and semantic segmentation

IoU and NMS

R-CNN, Feature Pyramid Network (FPN), Region Proposal Networks (RPN)

Mask RCNN

Towards realtime, embedded: NAS and MobileNets

Towards low prior: vision without CNN

Towards low data: contrastive representation learning, zero shots or few shots

Towards explainability

 

📧
jiayu@hey.com
July 27, 2021