Character-Level GPT
A character-level language model implemented from scratch, demonstrating core Transformer principles. It predicts directly at the character level, includes a complete training process and progress check tools, helping to understand concepts like self-attention and positional encoding.
GAN
Standard GAN architecture, trained adversarially by generator and discriminator to learn data distribution, demonstrating the process of generating realistic synthetic data.
VAE (MNIST Version)
A generative architecture based on probabilistic graphical models, learning latent representations of data. It demonstrates the core ideas of encoding into probability distributions and sampling decoding on the MNIST dataset, suitable for tasks like image generation and anomaly detection.
WDCGAN (CIFAR-10 Version)
An improved GAN that introduces Wasserstein distance and gradient penalty to solve training instability issues. Its deep convolutional structure captures hierarchical image features, generating 32x32 color images.
Plenoxels
A neural radiance field method for 3D scene representation and rendering, using voxel grids to store radiance fields, improving training and inference speed. It represents the cutting-edge direction of generative AI expansion into 3D.