Subhaditya's KB
Search
Search
Dark mode
Light mode
Explorer
Home
❯
KB
❯
AI
❯
Machine Learning
❯
Models
Folder: KB/AI/Machine-Learning/Models
355 items under this folder.
Oct 14, 2025
1x1 conv
architecture
Oct 14, 2025
ADVENT
architecture
Oct 14, 2025
ALBERT
architecture
Oct 14, 2025
Activation Functions
architecture
Oct 14, 2025
Adagrad
architecture
Oct 14, 2025
Adaptive Gradient Clipping
architecture
Oct 14, 2025
Adaptive Input Representation
architecture
Oct 14, 2025
Additive Attention
architecture
Oct 14, 2025
Additive coupling layer
distributions
architecture
Oct 14, 2025
Alex Net
architecture
Oct 14, 2025
Alphacode
architecture
Oct 14, 2025
Atrous Convolution
architecture
Oct 14, 2025
Attention NMT
architecture
Oct 14, 2025
Attention
architecture
Oct 14, 2025
AudioLM
architecture
Oct 14, 2025
AutoEncoder
architecture
Oct 14, 2025
AutoDistill
architecture
Oct 14, 2025
BART
architecture
Oct 14, 2025
BERT
architecture
Oct 14, 2025
BEiT
temp
Oct 14, 2025
Backprop
loss
architecture
Oct 14, 2025
Bahdanau Attention
architecture
Oct 14, 2025
Basic GAN
architecture
Oct 14, 2025
Basic RNN Architectures
architecture
Oct 14, 2025
Basic Transformer
architecture
Oct 14, 2025
Basics of Federated Learning
federatedlearning
Oct 14, 2025
Beam search
architecture
Oct 14, 2025
Best Matching Unit
temp
Oct 14, 2025
Bi Directional RNN
architecture
Oct 14, 2025
Bias nodes
architecture
Oct 14, 2025
Big Bird
architecture
Oct 14, 2025
Big-Bench
benchmark
Oct 14, 2025
BinaryBERT
architecture
Oct 14, 2025
BlockDrop
architecture
Oct 14, 2025
BlockNeRF
architecture
Oct 14, 2025
Bruckhaus - 2024 - RAG Does Not Work for Enterprises
architecture
Oct 14, 2025
CAM
explainability
Oct 14, 2025
CLIP
architecture
Oct 14, 2025
Capsule Layer
architecture
Oct 14, 2025
Capsule Network
architecture
Oct 14, 2025
Causal 1D Conv
temp
Oct 14, 2025
Causal Dilated Conv
temp
Oct 14, 2025
Causal Language Model
nlp
Oct 14, 2025
Channels
architecture
Oct 14, 2025
chatgptisnotallyouneed
architecture
Oct 14, 2025
ChatGPT
architecture
Oct 14, 2025
Chinchilla
architecture
Oct 14, 2025
Classifier Gradients
architecture
Oct 14, 2025
Codex
architecture
Oct 14, 2025
Collaborative Topic Regression
architecture
Oct 14, 2025
Complete AI Pipeline
deeplearning
Oct 14, 2025
Computational Graph
temp
Oct 14, 2025
Conditional GAN
architecture
Oct 14, 2025
Conformer
architecture
Oct 14, 2025
Content Based Attention
architecture
Oct 14, 2025
Contrastive Predictive Coding
architecture
Oct 14, 2025
Convnd
temp
Oct 14, 2025
ConvBERT
architecture
Oct 14, 2025
ConvNeXt
architecture
Oct 14, 2025
Convolutional RNN
architecture
Oct 14, 2025
Curriculum Learning
architecture
Oct 14, 2025
CvT
temp
Oct 14, 2025
CycleGAN
architecture
Oct 14, 2025
DALL-E 2
architecture
Oct 14, 2025
DALL-E
architecture
Oct 14, 2025
DALL·E 2
architecture
Oct 14, 2025
DCGAN
architecture
Oct 14, 2025
DLRM
architecture
Oct 14, 2025
DeepFM
architecture
Oct 14, 2025
Index
index
anchor
Oct 14, 2025
DeepNet
architecture
Oct 14, 2025
DeepPERF
architecture
Oct 14, 2025
DeiT
temp
Oct 14, 2025
Denoising Autoencoder
architecture
Oct 14, 2025
Dense Net
architecture
Oct 14, 2025
Dense Skip Connections
temp
Oct 14, 2025
Dense Vector Indexes
architecture
Oct 14, 2025
Dense
temp
Oct 14, 2025
Density estimation using real NVP
distributions
architecture
Oct 14, 2025
Depth Efficiency of Neural Networks
deeplearning
Oct 14, 2025
Depthwise Separable
temp
Oct 14, 2025
Diffusion LM
architecture
Oct 14, 2025
Dilated Sliding Window Attention
architecture
Oct 14, 2025
Dirac Delta
temp
Oct 14, 2025
DistillBERT
architecture
Oct 14, 2025
Dot Product Attention
architecture
Oct 14, 2025
Dreamfusion
architecture
Oct 14, 2025
Dynamic Eager Execution
temp
Oct 14, 2025
Dynamic Sparsity
architecture
Oct 14, 2025
ELECTRA
architecture
Oct 14, 2025
ELMO
architecture
Oct 14, 2025
Effect of Depth
temp
Oct 14, 2025
EfficientNet
architecture
Oct 14, 2025
EigenCAM
explainability
Oct 14, 2025
Elementwise Flows
architecture
Oct 14, 2025
Elu
architecture
Oct 14, 2025
Encoder Decoder Attention
architecture
Oct 14, 2025
Ensemble Distillation
architecture
Oct 14, 2025
Exploding Gradient
architecture
Oct 14, 2025
FLASH
architecture
Oct 14, 2025
FLAVA
architecture
Oct 14, 2025
FTSwish
architecture
Oct 14, 2025
FaceNet
architecture
Oct 14, 2025
Factorized Embedding Parameters
architecture
Oct 14, 2025
FastText
architecture
Oct 14, 2025
Faster RCNN
architecture
Oct 14, 2025
Feature Correlationa
architecture
Oct 14, 2025
Fixed Factorization Attention
architecture
Oct 14, 2025
Flamingo
architecture
Oct 14, 2025
FlowNet
architecture
Oct 14, 2025
GAN Z Space
architecture
Oct 14, 2025
GAU
architecture
Oct 14, 2025
GELU
architecture
Oct 14, 2025
GLOW
architecture
Oct 14, 2025
GPT
architecture
Oct 14, 2025
GPT3
architecture
Oct 14, 2025
Gated Recurrent Unit (GRU)
architecture
Oct 14, 2025
Galactica
architecture
Oct 14, 2025
Gato
architecture
Oct 14, 2025
Generalizing Adversarial Explanations with Grad-CAM
explainability
Oct 14, 2025
Generative_Models
architecture
Oct 14, 2025
Generative RNN
architecture
Oct 14, 2025
Generative Spoken Language Modeling
architecture
Oct 14, 2025
Generative vs Discriminative Models
architecture
Oct 14, 2025
Git Commands
architecture
Oct 14, 2025
GloVE
architecture
Oct 14, 2025
Global Average Pooling
architecture
Oct 14, 2025
Global and Sliding Window Attention
architecture
Oct 14, 2025
Google NMT
architecture
Oct 14, 2025
GradCAM
architecture
Oct 14, 2025
Gradient Accumulation
architecture
Oct 14, 2025
Gradient Clipping
architecture
Oct 14, 2025
HNSW
architecture
Oct 14, 2025
Hallucination Text Generation
architecture
Oct 14, 2025
Heaviside
architecture
Oct 14, 2025
HiFI-GAN_Denoising
architecture
Oct 14, 2025
HiFI-GAN Synthesis
architecture
Oct 14, 2025
Higher Layer Capsule
architecture
Oct 14, 2025
Highway Convolutions
architecture
Oct 14, 2025
Hopfield networks
architecture
Oct 14, 2025
HyTAS
automl
Oct 14, 2025
IVFADC
architecture
Oct 14, 2025
Imagen
architecture
Oct 14, 2025
Improved variational inference with inverse autoregressive flows
architecture
Oct 14, 2025
Inception
architecture
Oct 14, 2025
Influence of image classification accuracy on saliency map estimation
explainability
Oct 14, 2025
Instant NeRF
architecture
Oct 14, 2025
Interpreting Attention
architecture
Oct 14, 2025
Isotropic Architectures
architecture
Oct 14, 2025
Joint Factor Analysis
architecture
Oct 14, 2025
Jukebox
architecture
Oct 14, 2025
Klue ML Engineer
jobsearch
Oct 14, 2025
LASER
architecture
Oct 14, 2025
LLM Guide
architecture
Oct 14, 2025
LaMDA
architecture
Oct 14, 2025
Large Kernel in Attention
architecture
Oct 14, 2025
Large Kernel in Convolution
architecture
Oct 14, 2025
Layers
temp
Oct 14, 2025
Le Net
architecture
Oct 14, 2025
Learning Rate Scheduling
temp
architecture
Oct 14, 2025
Linear Classifier Probes
architecture
Oct 14, 2025
Linear Flows
architecture
Oct 14, 2025
Lisht
architecture
Oct 14, 2025
Listen Attend Spell
architecture
Oct 14, 2025
Location Aware Attention
architecture
Oct 14, 2025
Location Base Attention
architecture
Oct 14, 2025
Long Short Term Memory (LSTM)
architecture
Oct 14, 2025
Longformer
architecture
Oct 14, 2025
Lost in the Middle How Language Models Use Long Contexts
architecture
Oct 14, 2025
MADE - Masked autoencoder for distribution estimation
architecture
Oct 14, 2025
MCnet
architecture
Oct 14, 2025
ML Production Flow
architecture
Oct 14, 2025
MLIM
architecture
Oct 14, 2025
MLM
architecture
Oct 14, 2025
MLOps
architecture
Oct 14, 2025
Machine Learning Tool Landscape
architecture
Oct 14, 2025
Magic3D
architecture
Oct 14, 2025
Masked Autoencoders
architecture
Oct 14, 2025
Masked autoregressive flow for density estimation
architecture
Oct 14, 2025
Minerva
architecture
Oct 14, 2025
Mixed chunk attention
architecture
Oct 14, 2025
Mobile Net
architecture
Oct 14, 2025
MobileOne
architecture
Oct 14, 2025
Model Capacity
architecture
deeplearning
Oct 14, 2025
Multi Head Attention
architecture
Oct 14, 2025
Multi Scale Flows
architecture
Oct 14, 2025
Multiplicative Attention
architecture
Oct 14, 2025
Muse
architecture
Oct 14, 2025
NICE - non linear independant components estimation
distributions
architecture
Oct 14, 2025
Nasnet
architecture
Oct 14, 2025
Network Dissection Quantifying Interpretability of Deep Visual Representions
explainability
Oct 14, 2025
Neural Network Architecture Cheat Sheet
architecture
Oct 14, 2025
Neural Probabilistic Model
architecture
Oct 14, 2025
Neural Text Degeneration
architecture
Oct 14, 2025
Noisy Relu
architecture
Oct 14, 2025
Non Maxima Supression
architecture
Oct 14, 2025
OPT
architecture
Oct 14, 2025
On the overlap between Grad-CAM saliency maps and explainable visual features in skin cancer images
explainability
Oct 14, 2025
OpenML x SURF
conference
Oct 14, 2025
PEER
architecture
Oct 14, 2025
PQ
architecture
Oct 14, 2025
Parametric Relu
architecture
Oct 14, 2025
PaLM
architecture
Oct 14, 2025
Padded Conv
architecture
Oct 14, 2025
Perceptron
temp
Oct 14, 2025
PhD vs Startup vs Big Company
architecture
Oct 14, 2025
Phase Transition Model Zoo
conference
Oct 14, 2025
Phenaki
architecture
Oct 14, 2025
Phrase Representation Learning
architecture
Oct 14, 2025
Pix2Seq
architecture
Oct 14, 2025
PixelShuffle
architecture
Oct 14, 2025
Point Cloud Data
architecture
Oct 14, 2025
PointNet++
architecture
todo
Oct 14, 2025
Pooling
temp
Oct 14, 2025
Position Encoding
architecture
Oct 14, 2025
Position Wise Feed Forward
architecture
Oct 14, 2025
Primary Capsule
architecture
Oct 14, 2025
Problems facing MLOps
architecture
Oct 14, 2025
Properties Required by Network Layers for Normalizing Flows
architecture
Oct 14, 2025
RAGAS - automated evaluation of RAG
architecture
Oct 14, 2025
RETRO
architecture
Oct 14, 2025
RandAugment
explainability
Oct 14, 2025
Real Time Image Saliency for Black Box Classifiers
explainability
Oct 14, 2025
Receptive field
architecture
Oct 14, 2025
Recurrent
temp
architecture
deeplearning
Oct 14, 2025
RegNet
architecture
Oct 14, 2025
Region Proposal
architecture
Oct 14, 2025
Relative Multi Head Self Attention
architecture
Oct 14, 2025
Relu
architecture
Oct 14, 2025
RepLKNet
architecture
Oct 14, 2025
RepVGG
temp
Oct 14, 2025
Representational Capacity
deeplearning
architecture
Oct 14, 2025
Res Net D
architecture
Oct 14, 2025
Res Net
architecture
Oct 14, 2025
ResNeXt
architecture
Oct 14, 2025
Research Engineer in Human Modeling for Automated Driving delft
architecture
Oct 14, 2025
Restricted Boltzmann Machine
architecture
Oct 14, 2025
RetinaNet
architecture
Oct 14, 2025
Rmsprop
architecture
Oct 14, 2025
RoBERTa
architecture
Oct 14, 2025
Robust RegNet
temp
Oct 14, 2025
Routing by Agreement
architecture
Oct 14, 2025
S2ST
architecture
Oct 14, 2025
SLAK
architecture
Oct 14, 2025
SRN
architecture
Oct 14, 2025
Saddle Points
architecture
Oct 14, 2025
Salemi and Zamani - 2024 - Evaluating Retrieval Quality in Retrieval-Augmente
architecture
Oct 14, 2025
Salience Map
explainability
Oct 14, 2025
Scaled Dot Product Attention
architecture
Oct 14, 2025
Scaling matrix for coupling layers
distributions
architecture
Oct 14, 2025
Scene based text to image generation
architecture
Oct 14, 2025
ScoreCAM
explainability
Oct 14, 2025
Seg Net
architecture
Oct 14, 2025
Self Attention GAN
architecture
Oct 14, 2025
Self Attention
architecture
Oct 14, 2025
Self Supervised Vision Transformers
temp
Oct 14, 2025
Seq2Seq
architecture
Oct 14, 2025
Sequential auto encoding of neural embeddings - SANE
architecture
Oct 14, 2025
Shake-Drop
architecture
Oct 14, 2025
Shake-Shake
architecture
Oct 14, 2025
Shallow vs deep networks
deeplearning
Oct 14, 2025
ShuffleNet
architecture
Oct 14, 2025
Sigmoid
architecture
Oct 14, 2025
SimCLR
temp
Oct 14, 2025
Mirman Et Al
temp
Oct 14, 2025
Skip Connection
temp
architecture
Oct 14, 2025
Sliding Window Attention
architecture
Oct 14, 2025
Soft Attention
architecture
Oct 14, 2025
Softmax
architecture
Oct 14, 2025
Softplus
architecture
Oct 14, 2025
Soundify
architecture
Oct 14, 2025
Sparse Encoder Indexes
architecture
Oct 14, 2025
Sparse Evolutionary Training
architecture
Oct 14, 2025
Sparse Transformer
architecture
Oct 14, 2025
Spatial Transformer
architecture
Oct 14, 2025
Speaker Verification
architecture
Oct 14, 2025
Speech Emotion Recognition
architecture
Oct 14, 2025
Speech Recognition
architecture
Oct 14, 2025
Speech Resynthesis
architecture
Oct 14, 2025
Spiking Networks
architecture
Oct 14, 2025
Stable Difusion
architecture
Oct 14, 2025
Stack GAN
architecture
todo
Oct 14, 2025
Stacking RNN
architecture
Oct 14, 2025
StarGAN v2
architecture
Oct 14, 2025
StarGAN
architecture
Oct 14, 2025
Static Graph Execution
temp
Oct 14, 2025
Strided Attention
architecture
Oct 14, 2025
Strided
architecture
Oct 14, 2025
Style GAN
architecture
Oct 14, 2025
Swin Transformer
temp
Oct 14, 2025
Swish
architecture
Oct 14, 2025
Tacotron
architecture
Oct 14, 2025
Tanh
architecture
Oct 14, 2025
Teacher Forcing
architecture
Oct 14, 2025
Temporal Conv
architecture
Oct 14, 2025
Temporal Learning
architecture
Oct 14, 2025
Textless Speech Emotion Conversion
architecture
Oct 14, 2025
The elephant in the interpretability room
explainability
Oct 14, 2025
Thesis Flow
Oct 14, 2025
TinyBERT
architecture
Oct 14, 2025
Token Embedding
architecture
Oct 14, 2025
Tower
temp
Oct 14, 2025
Transfer Learning or Self-supervised Learning? A Tale of Two Pretraining Paradigms
deeplearning
Oct 14, 2025
Transformer-XL
architecture
Oct 14, 2025
Transformer
architecture
Oct 14, 2025
Transposed Conv
architecture
Oct 14, 2025
Tug of war between RAG and LLM prior
architecture
Oct 14, 2025
Types of Normalizing flows
architecture
Oct 14, 2025
ULMFit
architecture
Oct 14, 2025
Un-LSTM
architecture
Oct 14, 2025
Unet
architecture
Oct 14, 2025
Upweighting
temp
Oct 14, 2025
Variational Autoencoder
architecture
Oct 14, 2025
VGGish
architecture
Oct 14, 2025
VICReg
architecture
Oct 14, 2025
VL-BEIT
architecture
Oct 14, 2025
Vanishing Gradient
architecture
Oct 14, 2025
Vapnik chervonenkis dimension
architecture
Oct 14, 2025
Vgg
architecture
Oct 14, 2025
ViLT
architecture
Oct 14, 2025
Vision Transformer
temp
Oct 14, 2025
VisualGPT
architecture
Oct 14, 2025
Voronoi Cell
temp
Oct 14, 2025
WOMBO Dream
art
Oct 14, 2025
WaveGlow
architecture
Oct 14, 2025
WebGPT
architecture
Oct 14, 2025
Weight space learning
conference
representationlearning
Oct 14, 2025
What is being Transferred in transfer learning
explainability
Oct 14, 2025
Whisper
architecture
Oct 14, 2025
Wide Deep Recommender
architecture
Oct 14, 2025
Window Based Regression
architecture
Oct 14, 2025
Word2Vec
architecture
Oct 14, 2025
X Vectors
architecture
Oct 14, 2025
XLM-R
architecture
Oct 14, 2025
XLNet
architecture
Oct 14, 2025
Xception
architecture
Oct 14, 2025
YOLO
architecture
Oct 14, 2025
Z-Space Entanglement
architecture
Oct 14, 2025
Zeiler Fergus
architecture
Oct 14, 2025
_Index_of_Models
Oct 14, 2025
__Index_of__Models
Oct 14, 2025
architecture
anchor
Oct 14, 2025
autoregressive flows
architecture
Oct 14, 2025
cognitivemodel
anchor
Oct 14, 2025
coupling flows
distributions
Oct 14, 2025
cross-layer parameter sharing
architecture
Oct 14, 2025
dGSLM
architecture
Oct 14, 2025
data2vec
architecture
Oct 14, 2025
i-Code
architecture
Oct 14, 2025
ill conditioning
architecture
Oct 14, 2025
inverse autoregressive flows
architecture
Oct 14, 2025
pGLSM
architecture
Oct 14, 2025
residual flows
architecture
Oct 14, 2025
usermodel
anchor
Oct 14, 2025
wave2vec
architecture