EP 14: Past Tense Pitfalls: The Curious Case of Refusal Training in AI Language Models

About this listen

This episode of "You Are A Helpful (Research) Assistant" is an AI-generated, human-curated discussion of vulnerabilities in refusal training for language models, focusing on how the past tense attack affects model behavior.
