On Adversarial Training & Robustness with Bhavna Gopal

May 8, 2024
44 mins

Episode Description

"Understanding what's going on in a model is important to fine-tune it for specific tasks and to build trust."

Bhavna Gopal is a PhD candidate at Duke and a research intern at Slingshot, with experience at Apple, Amazon and Vellum.

We discuss

  • How does adversarial robustness research impact the field of AI explainability?
  • How do you evaluate a model's ability to generalize?
  • What adversarial attacks should we be concerned about with LLMs?