AdaIn
Adam
Batch Normalization
DeepNorm
Dropout
Effects of Regularization
Fine Tuning Based Pruning
Global Gradient Magnitude Based Pruning
Global Magnitude Based Pruning
He Initialization
Instance Normalization
Label Smoothing
Layer Normalization
Layerwise Gradient Magnitude Based Pruning
Layerwise Magnitude Based Pruning
Leaky Relu
Learning Rate Range Test
LeCun Init
Lp Regularization
Modality Dropout
No bias decay
Normalization
Optimizers
Orthogonal Initialization
Pruning
Random Pruning
Regularization Term
Regularization
Scheduling
Scoring Pruning Approaches
SELU
Structure Based Pruning
treecoverSegmentation
Tuning Model Flexibility
VariationalRecurrent Dropout
Weight Decay Vs L2 Regularization
Xavier Initialization