descriptive claim
OpenAI publicly acknowledges that ChatGPT's safeguards (such as directing users to crisis helplines) work best in common, short exchanges and can become less reliable in long interactions, where parts of the model's safety training may degrade.
desc_chatgpt_safety_degrades_long_conversations
confidence 0.90
Evidence (1)
supports (1)
- weight0.90
locator: OpenAI spokesperson statement
“ChatGPT includes safeguards such as directing people to crisis helplines and referring them to real-world resources... we've learned over time that they can sometimes become less reliable in long interactions where parts of the model's safety training may degrade.”