Red Teaming Language Models for Contradictory Dialogues

Xiaofei Wen, Bangzheng Li, Tenghao Huang, Muhao Chen. “Red Teaming Language Models for Contradictory Dialogues.” In submission.