Research Areas: Computer Vision, Multimodal Learning, Generative AI
Recent Publications
Shiv Gehlot, Guan-Ming Su
"LViCAR: Diffusion Models for Perceptual Quality Enhancement in Video Compression Artifact Reduction." ACMMM Workshops 2025
Haoming Cai, Tsung-Wei Huang, Shiv Gehlot, Brandon Y. Feng, Sachin Sha, Guan-Ming Su, Christopher Metzler
"Parametric Shadow Control for Portrait Generation in Text-to-Image Diffusion Models." IEEE/CVF ICCV 2025 [arXiv]
Shiv Gehlot, Guan-Ming Su, Peng Yin, Sean McCarthy, Gary J. Sullivan
"A Generative Face Video Coding Framework with Disentangled and Consistent Background." IEEE ICIP 2025Spotlight