LLMs have a "truth vector": an emergent linear structure that represents factual truth values
LLMs contain a specific "truth direction" denoting factual truth values
Artificial intelligence systems like large language models have shown impressive capabilities, such as engaging in conversation, answering questions, and generating coherent text. However, they are also prone to making clearly false statements or hallucinating incorrect in…