LLM Explainability

Research into interpreting and explaining Large Language Models.

Overview

As Large Language Models (LLMs) are deployed more widely, understanding how they arrive at their outputs becomes crucial. Our research applies SMILE and other interpretability techniques to LLMs in support of safety, fairness, and reliability.

(Content to be added)