Blog
Welcome to my blog! Here I share my thoughts and insights on AI, computer vision, and remote sensing.
Recent Posts
From U-Nets to DiTs: The Architectural Evolution of Text-to-Image Diffusion Models (2021–2025)
November 09, 2025
A comprehensive analysis of how diffusion model architectures evolved from U-Net backbones to Diffusion Transformers, and how that shift changed text-to-image generation. It covers the progression from Stable Diffusion through recent models such as Qwen-Image and SANA 1.5, with interactive timelines and architectural comparisons.
Diffusion Model Track: Methods and Application in Image Synthesis
November 17, 2024
An overview of diffusion models for image synthesis, with a focus on pioneering work from OpenAI, including GLIDE, DALL-E 2, and Consistency Models.
Introducing Latex-Slides-Template-GenAI: A Modern LaTeX Beamer Template
October 11, 2025 | Repository
A comprehensive LaTeX Beamer template with modern design, advanced citation management, and professional styling. Perfect for academic presentations and keynotes, featuring per-slide references, multi-page PDF support, and clean aesthetics.
World Models: Background, Applications and Opportunities
October 10, 2025
An exploration of world models in AI, covering video generation, representation learning, and the Platonic representation hypothesis. This post discusses recent advances in models like CogVideoX and their applications in image editing and world simulation.