Ramprasaath R. Selvaraju
SQuINTing at VQA Models: Interrogating VQA Models with Sub-Questions
Trick or Treat: Thematic Reinforcement for Artistic Typography
Counting Everyday Objects in Everyday Scenes
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
Align before Fuse: Vision and Language Representation Learning with Momentum Distillation
Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models
The Semantic Paintbrush: Interactive 3D Mapping and Recognition in Large Outdoor Spaces
Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded
CASTing Your Model: Learning to Localize Improves Self-Supervised Representations
Choose Your Neuron: Incorporating Domain Knowledge into Deep Networks through Neuron Importance
SOrTing VQA Models: Improving Consistency via Gradient Alignment
Visual Explanations from Deep Networks