Title | What Do Llamas Really Think? Revealing Preference Biases in Language Model Representations |
Publication Type | Journal Article |
Year of Publication | 2023 |
Authors | Tang, R., X. Zhang, J. Lin, and F. Türe |
Journal | ArXiv |
Volume | abs/2311.18812 |
DOI | 10.48550/ARXIV.2311.18812 |