Publications

(2021). Align before Fuse: Vision and Language Representation Learning with Momentum Distillation. Advances in Neural Information Processing Systems (NeurIPS) 2021.

PDF Code

(2021). SOrTing VQA Models: Improving Consistency via Gradient Alignment. International Conference on Computer Vision (ICCV) 2021.

PDF Code

(2021). CASTing Your Model: Learning to Localize Improves Self-Supervised Representations. International Conference on Computer Vision (ICCV) 2021.

PDF Code

(2020). SQuINTing at VQA Models: Interrogating VQA Models with Sub-Questions. Conference on Computer Vision and Pattern Recognition (CVPR) 2020.

PDF Code

(2019). Visual Explanations from Deep Networks. International Conference on Computer Vision (ICCV) 2019.

PDF Code

(2019). Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded. International Conference on Computer Vision (ICCV) 2019.

PDF Code

(2019). Trick or Treat: Thematic Reinforcement for Artistic Typography. International Conference on Computer Vision (ICCV) 2019.

PDF Code

(2018). Choose Your Neuron: Incorporating Domain Knowledge into Deep Networks through Neuron Importance. European Conference on Computer Vision (ECCV) 2018.

PDF Code

(2018). Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models. Conference on Computer Vision and Pattern Recognition (CVPR) 2018.

PDF Code

(2017). Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization. International Conference on Computer Vision (ICCV) 2017.

PDF Code

(2017). Counting Everyday Objects in Everyday Scenes. Conference on Computer Vision and Pattern Recognition (CVPR) 2017.

PDF Code

(2015). The Semantic Paintbrush: Interactive 3D Mapping and Recognition in Large Outdoor Spaces. Conference on Human Factors in Computing Systems (CHI) 2015.

PDF Code