Zhenyuan Chen

Blog

Welcome to my blog! Here I share my thoughts and insights on AI, computer vision, and remote sensing.

Recent Posts

📮 Subscribe to my blog: RSS Feed (Use with Feedly, NewsBlur, or any RSS reader)

From U-Nets to DiTs: The Architectural Evolution of Text-to-Image Diffusion Models (2021–2025)

November 09, 2025

A comprehensive analysis of how diffusion model architectures evolved from U-Net backbones to Diffusion Transformers, transforming text-to-image generation capabilities. Covers the complete evolution from Stable Diffusion through the latest models like Qwen-Image and SANA 1.5, with interactive timelines and architectural comparisons.


Diffusion Model Track: Methods and Application in Image Synthesis

November 17, 2024 PDF

An overview of diffusion models for image synthesis, with a focus on pioneering work from OpenAI, including GLIDE, DALL-E 2, and Consistency Models.


Introducing Latex-Slides-Template-GenAI: A Modern LaTeX Beamer Template

October 11, 2025 Repository

A comprehensive LaTeX Beamer template with modern design, advanced citation management, and professional styling. Perfect for academic presentations and keynotes, featuring per-slide references, multi-page PDF support, and clean aesthetics.


World Models: Background, Applications and Opportunities

October 10, 2025 PDF

An exploration of world models in AI, covering video generation, representation learning, and the Platonic representation hypothesis. This post discusses recent advances in models like CogVideoX and their applications in image editing and world simulation.


More posts coming soon...

← Back to Home


Powered by Jekyll and Minimal Light theme.