Patel, Vishal M.
131 publications
MIDL
2026
CatVLM: Enhancing Temporal Understanding in Cataract Surgery Videos with Boundary-Aware VLM
NeurIPS
2025
A Technical Report on “Erasing the Invisible”: The 2024 NeurIPS Competition on Stress Testing Image Watermarks
MIDL
2025
A Vision Foundation Model for Cataract Surgery Using Joint-Embedding Predictive Architecture
CVPR
2025
GenDeg: Diffusion-Based Degradation Synthesis for Generalizable All-in-One Image Restoration
MIDL
2025
MedCL: Learn Consistent Anatomy Distribution for Scribble-Supervised Medical Image Segmentation
ICCV
2025
Scaling Transformer-Based Novel View Synthesis Models with Token Disentanglement and Synthetic Data
CVPR
2024
JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation
CVPR
2024
View-Decoupled Transformer for Person Re-Identification Under Aerial-Ground Camera Network
ECCV
2022
Auto-FedRL: Federated Hyperparameter Optimization for Multi-Institutional Medical Image Segmentation
CVPR
2022
TransWeather: Transformer-Based Restoration of Images Degraded by Adverse Weather Conditions
CVPR
2021
MeGA-CDA: Memory Guided Attention for Category-Aware Unsupervised Domain Adaptive Object Detection
CVPR
2019
Improving the Performance of Unimodal Dynamic Hand-Gesture Recognition with Multimodal Training
ICCVW
2015
An End-to-End System for Unconstrained Face Verification with Deep Convolutional Neural Networks