章节 01
D-SAT Project Overview: Building AI That Understands Causal Relationships in Videos
Project Source
- Author/Maintainer: engineer-nithura
- Source Platform: GitHub
- Original Title: D-SAT-Phases-1-3-Data-Pipeline-Causal-Model-Training-Counterfactual-Fine-tuning
- Link: https://github.com/engineer-nithura/D-SAT-Phases-1-3-Data-Pipeline-Causal-Model-Training-Counterfactual-Fine-tuning
- Release/Update Time: 2026-06-01T17:12:09Z
Core Idea
D-SAT (Dynamic Scene-Action Transformer) aims to teach AI to understand 'why' (causal relationships) instead of just 'what' in videos. It builds a causal world model via three phases, using Gemma 3 and LoRA for scene graph-to-scene graph causal reasoning, plus counterfactual training to enhance causal understanding.