The Unified Causal AI Pipeline

Intro

predict the effect of actions and decide effective policies - and how
dl - can learn representations from unstructured data
predict effects of interventions
- causal variables not observed directly
- no labels
- just high level observations
tasks
- identify causal variables from observations
- learn causal relations between them

theoretical guarantees
without supervision, cannot identify exact causal variables - but up to equivalence class (aka not perfectly)
- somehow seems like trying to recover some variables from a tangled mess of the ground truth
- 1-1 map?

learn underlying causal process from temporal seq of high dim data (images) $X^{t}_{t - 1}^{T}$
Assume latent causal process in Dynamic Bayesian Networks with K multidim causal variables
Assume interventions can happen, observe binary targets $I_{i}^{t}$ and $I_{i} \to C_{i}$
targets are 1-1 with causal variables
model with latent variable
paper : CITRIS (phillip lippe, sara magliacane)
- stochastic intervention : we dont know where the ball will be
- simplified identification proof
- minimal causal variables
  - part of the variable that is varying
  - identify them up to invertible component wise transforms if $I_{i}^{t}$ up to $I_{j}^{t}$
- CITRIS - VAE
  - cant really use pretrained autoencoder since you assume its disentangling
- normalizing flows : CITRIS NF
  - use a pretrained autoencoder
BISCUIT - causal representation learning from binary interactions
- extension of TRIS
- effect encoded in binary interaction variables
- assume action has to affect causal variables in a binary way
- More BISCUIT - VAE and NF
Environments - CausalWorld, iTHOR
dynamic interaction map
CRL World models