RP-Bench

Help calibrate the roleplay benchmark by rating AI responses. Your votes let us check whether our LLM judges agree with human raters.