Deconstructing Deep Learning + δeviations
Drop me an email
Format :
Date | Title
TL; DR
[36] LightSeg (only notes for now) - Emara, T., Abd El Munim, H. E., & Abbas, H. M. (2019, December). LiteSeg: A Novel Lightweight ConvNet for Semantic Segmentation. In 2019 Digital Image Computing: Techniques and Applications (DICTA) (pp. 1-7). IEEE. Link
(P.S. This was done for an internship so is a bit more formal than usual)
TL;DR -> Faster Semantic Segmentation with a modified ASPP module(from the DeepLab paper) + MobileNetV2
- Decrease receptive field
- Upsample by adding 0s b/w two filter vals along spatial dimensions with a dilation factor
- Note for Pytorch -> Conv2d has dilation as a param
- Take feature map -> Add 4 parallel atrous convs with different rates
- Take ASPP
- Add 3x3 convs after 3x3 atrous convs
- Replace normal convs
- Split input and output into channels
- Convolve pointwise
- Note for Pytorch -> Add groups = no of in_channels in Conv2d
- Take residual/skip connections
- H(X) = F(X) + X ; H(X) is output; F(X) is residual ; X is input feature map
- Long -> Across larger no of layers
- Short -> Smaller no of layers (as memory units)
- Fuse them both
- Use 1x1 convs
- End up with richer features
- MobileNetV2 with output stride of 32
- DeepLabv3+ decoder with ASPP modified to DASPP