Gould, Stephen

97 publications

ICML 2025 Can We Predict Performance of Large Models Across Vision-Language Tasks? Qinyu Zhao, Ming Xu, Kartik Gupta, Akshay Asthana, Liang Zheng, Stephen Gould

ICCV 2025 Leaps and Bounds: An Improved Point Cloud Winding Number Formulation for Fast Normal Estimation and Surface Reconstruction Chamin Hewa Koneputugodage, Dylan Campbell, Stephen Gould

ICCV 2025 Manual-PA: Learning 3D Part Assembly from Instruction Diagrams Jiahao Zhang, Anoop Cherian, Cristian Rodriguez, Weijian Deng, Stephen Gould

CVPR 2025 Pos3R: 6d Pose Estimation for Unseen Objects Made Easy Weijian Deng, Dylan Campbell, Chunyi Sun, Jiahao Zhang, Shubham Kanitkar, Matt E. Shaffer, Stephen Gould

NeurIPS 2025 Sharper Convergence Rates for Nonconvex Optimisation via Reduction Mappings Evan Markou, Thalaiyasingam Ajanthan, Stephen Gould

WACV 2025 Temporally Grounding Instructional Diagrams in Unconstrained Videos Jiahao Zhang, Frederic Z. Zhang, Cristian Rodriguez, Yizhak Ben-Shabat, Anoop Cherian, Stephen Gould

CVPR 2025 VI^3NR: Variance Informed Initialization for Implicit Neural Representations Chamin Hewa Koneputugodage, Yizhak Ben-Shabat, Sameera Ramasinghe, Stephen Gould

CVPR 2024 3DInAction: Understanding Human Actions in 3D Point Clouds Yizhak Ben-Shabat, Oren Shrout, Stephen Gould

ICML 2024 An Empirical Study into What Matters for Calibrating Vision-Language Models Weijie Tu, Weijian Deng, Dylan Campbell, Stephen Gould, Tom Gedeon

WACV 2024 Bi-Directional Training for Composed Image Retrieval via Text Prompt Learning Zheyuan Liu, Weixuan Sun, Yicong Hong, Damien Teney, Stephen Gould

TMLR 2024 Candidate Set Re-Ranking for Composed Image Retrieval with Dual Multi-Modal Encoder Zheyuan Liu, Weixuan Sun, Damien Teney, Stephen Gould

CVPR 2024 Differentiable Neural Surface Refinement for Modeling Transparent Objects Weijian Deng, Dylan Campbell, Chunyi Sun, Shubham Kanitkar, Matthew E. Shaffer, Stephen Gould

NeurIPS 2024 Guiding Neural Collapse: Optimising Towards the Nearest Simplex Equiangular Tight Frame Evan Markou, Thalaiyasingam Ajanthan, Stephen Gould

WACV 2024 IKEA Ego 3D Dataset: Understanding Furniture Assembly Actions from Ego-View 3D Point Clouds Yizhak Ben-Shabat, Jonathan Paul, Eviatar Segev, Oren Shrout, Stephen Gould

CVPR 2024 Learning to Select Views for Efficient Multi-View Understanding Yunzhong Hou, Stephen Gould, Liang Zheng

WACV 2024 LipAT: Beyond Style Transfer for Controllable Neural Simulation of Lipstick Using Cosmetic Attributes Amila Silva, Olga Moskvyak, Alexander Long, Ravi Garg, Stephen Gould, Gil Avraham, Anton van den Hengel

WACV 2024 NeRFEditor: Differentiable Style Decomposition for 3D Scene Editing Chunyi Sun, Yanbin Liu, Junlin Han, Stephen Gould

NeurIPS 2024 Neural Experts: Mixture of Experts for Implicit Neural Representations Yizhak Ben-Shabat, Chamin Hewa Koneputugodage, Sameera Ramasinghe, Stephen Gould

WACV 2024 Ray Deformation Networks for Novel View Synthesis of Refractive Objects Weijian Deng, Dylan Campbell, Chunyi Sun, Shubham Kanitkar, Matthew Shaffer, Stephen Gould

CVPR 2024 Small Steps and Level Sets: Fitting Neural Surface Models with Point Guidance Chamin Hewa Koneputugodage, Yizhak Ben-Shabat, Dylan Campbell, Stephen Gould

CVPR 2024 Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation Ming Xu, Stephen Gould

ECCV 2024 The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models? Qinyu Zhao, Ming Xu, Kartik Gupta, Akshay Asthana, Liang Zheng, Stephen Gould

ICLR 2024 Towards Optimal Feature-Shaping Methods for Out-of-Distribution Detection Qinyu Zhao, Ming Xu, Kartik Gupta, Akshay Asthana, Liang Zheng, Stephen Gould

ECCV 2024 Unsupervised Dense Prediction Using Differentiable Normalized Cuts Yanbin Liu, Stephen Gould

CVPR 2023 Aligning Step-by-Step Instructional Diagrams to Video Demonstrations Jiahao Zhang, Anoop Cherian, Yanbin Liu, Yizhak Ben-Shabat, Cristian Rodriguez, Stephen Gould

ICML 2023 Confidence and Dispersity Speak: Characterizing Prediction Matrix for Unsupervised Accuracy Estimation Weijian Deng, Yumin Suh, Stephen Gould, Liang Zheng

ICLR 2023 Deep Declarative Dynamic Time Warping for End-to-End Learning of Alignment Paths Ming Xu, Sourav Garg, Michael Milford, Stephen Gould

ICCV 2023 Exploring Predicate Visual Context in Detecting of Human-Object Interactions Frederic Z Zhang, Yuhui Yuan, Dylan Campbell, Zhuoyao Zhong, Stephen Gould

CVPR 2023 High-Fidelity Guided Image Synthesis with Latent Diffusion Models Jaskirat Singh, Stephen Gould, Liang Zheng

ICCV 2023 Learning Navigational Visual Representations with Semantic mAP Supervision Yicong Hong, Yang Zhou, Ruiyi Zhang, Franck Dernoncourt, Trung Bui, Stephen Gould, Hao Tan

CVPR 2023 Octree Guided Unoriented Surface Reconstruction Chamin Hewa Koneputugodage, Yizhak Ben-Shabat, Stephen Gould

ICMLW 2023 PMaF: Deep Declarative Layers for Principal Matrix Features Zhiwei Xu, Hao Wang, Yanbin Liu, Stephen Gould

NeurIPS 2023 Revisiting Implicit Differentiation for Learning Problems in Optimal Control Ming Xu, Timothy L. Molloy, Stephen Gould

ICCV 2023 Scaling Data Generation in Vision-and-Language Navigation Zun Wang, Jialu Li, Yicong Hong, Yi Wang, Qi Wu, Mohit Bansal, Stephen Gould, Hao Tan, Yu Qiao

ICCV 2023 Semi-Supervised Semantic Segmentation Under Label Noise via Diverse Learning Groups Peixia Li, Pulak Purkait, Thalaiyasingam Ajanthan, Majid Abdolshah, Ravi Garg, Hisham Husain, Chenchen Xu, Stephen Gould, Wanli Ouyang, Anton van den Hengel

ICMLW 2023 Towards Understanding Gradient Approximation in Equality Constrained Deep Declarative Networks Stephen Gould, Ming Xu, Zhiwei Xu, Yanbin Liu

CVPR 2022 Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation Yicong Hong, Zun Wang, Qi Wu, Stephen Gould

CVPR 2022 DiGS: Divergence Guided Shape Implicit Neural Representation for Unoriented Point Clouds Yizhak Ben-Shabat, Chamin Hewa Koneputugodage, Stephen Gould

CVPR 2022 Efficient Two-Stage Detection of Human-Object Interactions with a Novel Unary-Pairwise Transformer Frederic Z. Zhang, Dylan Campbell, Stephen Gould

NeurIPS 2022 On the Strong Correlation Between Model Invariance and Generalization Weijian Deng, Stephen Gould, Liang Zheng

ICLR 2021 Conditional Generative Modeling via Learning the Latent Space Sameera Ramasinghe, Kanchana Nisal Ranasinghe, Salman Khan, Nick Barnes, Stephen Gould

ICCV 2021 Contextually Plausible and Diverse 3D Human Motion Prediction Sadegh Aliakbarian, Fatemeh Saleh, Lars Petersson, Stephen Gould, Mathieu Salzmann

WACV 2021 DORi: Discovering Object Relationships for Moment Localization of a Natural Language Query in a Video Cristian Rodriguez-Opazo, Edison Marrese-Taylor, Basura Fernando, Hongdong Li, Stephen Gould

ICCV 2021 Image Retrieval on Real-Life Images with Pre-Trained Vision-and-Language Models Zheyuan Liu, Cristian Rodriguez-Opazo, Damien Teney, Stephen Gould

CVPR 2021 Probabilistic Tracklet Scoring and Inpainting for Multiple Object Tracking Fatemeh Saleh, Sadegh Aliakbarian, Hamid Rezatofighi, Mathieu Salzmann, Stephen Gould

NeurIPS 2021 Rethinking Conditional GAN Training: An Approach Using Geometrically Structured Latent Manifolds Sameera Ramasinghe, Moshiur Farazi, Salman H Khan, Nick Barnes, Stephen Gould

ICCV 2021 Spatially Conditioned Graphs for Detecting Human-Object Interactions Frederic Z. Zhang, Dylan Campbell, Stephen Gould

WACV 2021 The IKEA ASM Dataset: Understanding People Assembling Furniture Through Actions, Objects and Pose Yizhak Ben-Shabat, Xin Yu, Fatemeh Saleh, Dylan Campbell, Cristian Rodriguez-Opazo, Hongdong Li, Stephen Gould

CVPR 2021 VLN BERT: A Recurrent Vision-and-Language BERT for Navigation Yicong Hong, Qi Wu, Yuankai Qi, Cristian Rodriguez-Opazo, Stephen Gould

ICML 2021 What Does Rotation Prediction Tell Us About Classifier Accuracy Under Varying Testing Environments? Weijian Deng, Stephen Gould, Liang Zheng

ICLR 2020 A Signal Propagation Perspective for Pruning Neural Networks at Initialization Namhoon Lee, Thalaiyasingam Ajanthan, Stephen Gould, Philip H. S. Torr

WACV 2020 Blended Convolution and Synthesis for Efficient Discrimination of 3D Shapes Sameera Ramasinghe, Salman Khan, Nick Barnes, Stephen Gould

ECCV 2020 DeepFit: 3D Surface Fitting via Neural Network Weighted Least Squares Yizhak Ben-Shabat, Stephen Gould

CVPRW 2020 Inferring Temporal Compositions of Actions Using Probabilistic Automata Rodrigo Santa Cruz, Anoop Cherian, Basura Fernando, Dylan Campbell, Stephen Gould

NeurIPS 2020 Language and Visual Entity Relationship Graph for Agent Navigation Yicong Hong, Cristian Rodriguez, Yuankai Qi, Qi Wu, Stephen Gould

ECCV 2020 Multiview Detection with Feature Perspective Transformation Yunzhong Hou, Liang Zheng, Stephen Gould

WACV 2020 Proposal-Free Temporal Moment Localization of a Natural-Language Query in Video Using Guided Attention Cristian Rodriguez, Edison Marrese-Taylor, Fatemeh Sadat Saleh, Hongdong Li, Stephen Gould

ECCV 2020 Solving the Blind Perspective-N-Point Problem End-to-End with Robust Differentiable Geometric Optimization Dylan Campbell, Liu Liu, Stephen Gould

WACV 2018 Neural Algebra of Classifiers Rodrigo Santa Cruz, Basura Fernando, Anoop Cherian, Stephen Gould

NeurIPS 2018 Partially-Supervised Image Captioning Peter Anderson, Stephen Gould, Mark Johnson

CVPR 2017 DeepPermNet: Visual Permutation Learning Rodrigo Santa Cruz, Basura Fernando, Anoop Cherian, Stephen Gould

CVPR 2017 Generalized Rank Pooling for Activity Recognition Anoop Cherian, Basura Fernando, Mehrtash Harandi, Stephen Gould

WACV 2017 Higher-Order Pooling of CNN Features via Kernel Linearization for Action Recognition Anoop Cherian, Piotr Koniusz, Stephen Gould

CVPR 2017 Self-Supervised Video Representation Learning with Odd-One-Out Networks Basura Fernando, Hakan Bilen, Efstratios Gavves, Stephen Gould

CVPRW 2017 Unsupervised Human Action Detection by Action Matching Basura Fernando, Sareh Shirazi, Stephen Gould

ECCV 2016 Built-in Foreground/Background Prior for Weakly-Supervised Semantic Segmentation Fatemehsadat Saleh, Mohammad Sadegh Ali Akbarian, Mathieu Salzmann, Lars Petersson, Stephen Gould, José M. Álvarez

ECCV 2016 Deep Convolutional Neural Networks for Human Embryonic Cell Counting Aisha Khan, Stephen Gould, Mathieu Salzmann

ECCVW 2016 Deep Convolutional Neural Networks for Human Embryonic Cell Counting Aisha Khan, Stephen Gould, Mathieu Salzmann

CVPR 2016 Discriminative Hierarchical Rank Pooling for Activity Recognition Basura Fernando, Peter Anderson, Marcus Hutter, Stephen Gould

CVPR 2016 Dynamic Image Networks for Action Recognition Hakan Bilen, Basura Fernando, Efstratios Gavves, Andrea Vedaldi, Stephen Gould

ICML 2016 Learning End-to-End Video Classification with Rank-Pooling Basura Fernando, Stephen Gould

ECCV 2016 SPICE: Semantic Propositional Image Caption Evaluation Peter Anderson, Basura Fernando, Mark Johnson, Stephen Gould

WACV 2015 A Linear Chain Markov Model for Detection and Localization of Cells in Early Stage Embryo Development Aisha Khan, Stephen Gould, Mathieu Salzmann

ICCV 2015 Hierarchical Higher-Order Regression Forest Fields: An Application to 3D Indoor Scene Labelling Trung T. Pham, Ian Reid, Yasir Latif, Stephen Gould

WACV 2015 Multi-Class Semantic Video Segmentation with Exemplar-Based Object Reasoning Buyu Liu, Xuming He, Stephen Gould

CVPR 2014 An Exemplar-Based CRF for Multi-Instance Object Segmentation Xuming He, Stephen Gould

WACV 2014 Joint Semantic and Geometric Segmentation of Videos with a Stage Model Buyu Liu, Xuming He, Stephen Gould

ECCV 2014 Superpixel Graph Label Transfer with Learned Distance Metric Stephen Gould, Jiecheng Zhao, Xuming He, Yuhang Zhang

IJCAI 2013 Efficient Extraction and Representation of Spatial Information from Video Data Hajar Sadeghi Sokeh, Stephen Gould, Jochen Renz

ICCVW 2013 Multi-Instance Object Segmentation with Exemplars Xuming He, Stephen Gould

MLOSS 2012 DARWIN: A Framework for Machine Learning and Computer Vision Research and Development Stephen Gould

CVPR 2012 Multiclass Pixel Labeling with Non-Local Matching Constraints Stephen Gould

ECCV 2012 On Learning Higher-Order Consistency Potentials for Multi-Class Pixel Labeling Kyoungup Park, Stephen Gould

ECCV 2012 PatchMatchGraph: Building a Graph of Dense Patch Correspondences for Label Transfer Stephen Gould, Yuhang Zhang

ICML 2011 Max-Margin Learning for Lower Linear Envelope Potentials in Binary Markov Random Fields Stephen Gould

ECCV 2010 A Unified Contour-Pixel Model for Figure-Ground Segmentation Benjamin Packer, Stephen Gould, Daphne Koller

ICML 2010 Accelerated Dual Decomposition for MAP Inference Vladimir Jojic, Stephen Gould, Daphne Koller

ECCV 2010 Discriminative Learning with Latent Variables for Cluttered Indoor Scene Understanding Huayan Wang, Stephen Gould, Daphne Koller

CVPR 2010 Single Image Depth Estimation from Predicted Semantic Labels Beyang Liu, Stephen Gould, Daphne Koller

CVPR 2009 Alphabet SOUP: A Framework for Approximate Energy Minimization Stephen Gould, Fernando Amat, Daphne Koller

ICCV 2009 Decomposing a Scene into Geometric and Semantically Consistent Regions Stephen Gould, Richard Fulton, Daphne Koller

NeurIPS 2009 Region-Based Segmentation and Object Detection Stephen Gould, Tianshi Gao, Daphne Koller

NeurIPS 2008 Cascaded Classification Models: Combining Models for Holistic Scene Understanding Geremy Heitz, Stephen Gould, Ashutosh Saxena, Daphne Koller

JMLR 2008 Learning Bounded Treewidth Bayesian Networks Gal Elidan, Stephen Gould

NeurIPS 2008 Learning Bounded Treewidth Bayesian Networks Gal Elidan, Stephen Gould

UAI 2008 Projected Subgradient Methods for Learning Sparse Gaussians John C. Duchi, Stephen Gould, Daphne Koller

IJCAI 2007 Peripheral-Foveal Vision for Real-Time Object Recognition and Tracking in Video Stephen Gould, Joakim Arfvidsson, Adrian Kaehler, Benjamin Sapp, Marius Messner, Gary R. Bradski, Paul Baumstarck, Sukwon Chung, Andrew Y. Ng