An awesome repository and a comprehensive survey on the interpretability of LLM attention heads.
Updated Nov 15, 2024 - TeX
Explore consciousness and self-awareness as they pertain to AI systems.
Repository for the LWDA'24 presentation on 'Psychometric Profiling of GPT Models for Bias Exploration', featuring conference materials including the poster, paper, slides, and references.