Publishing Our Work on Evaluating the Effectiveness of LLMs, CoBBLEr
Our study on evaluating the effectiveness of LLMs as automatic evaluators using a new benchmark, CoBBLEr has been published on ArXiv.
Our study on evaluating the effectiveness of LLMs as automatic evaluators using a new benchmark, CoBBLEr has been published on ArXiv.