LucianoSphere (Luciano Abriata, PhD)
By calculating semantic entropy with a second LLM, we can better flag answers as unreliable due to lack of knowledge
Originally appeared here:
A New Method to Detect “Confabulations” Hallucinated by Large Language Models