The Factuality of Large Language Models in the Legal Domain

This paper investigates the factuality of large language models (LLMs) as knowledge bases in the legal domain, in a realistic usage scenario: we allow for acceptable variations in the answer, and let the model abstain from answering when uncertain. First, we design a dataset of diverse factual questions about case law and legislation. We then use the dataset to evaluate several LLMs under different evaluation methods, including exact, alias, and fuzzy matching. Our results show that the performance improves significantly under the alias and fuzzy matching methods. Further, we explore the impact of abstaining and in-context examples, finding that both strategies enhance precision. Finally, we demonstrate that additional pre-training on legal documents, as seen with SaulLM, further improves factual precision from 63% to 81%.

Mots clés

LLM Evaluation LLM Legal NLP Knowledge Base

Domaines

Intelligence artificielle [cs.AI] Traitement du texte et du document

Fichier principal

factuality_legal_LLM.pdf (185.43 Ko)

Origine	Fichiers produits par l'(les) auteur(s)

Rajaa EL HAMDANI : Connectez-vous pour contacter le contributeur

https://telecom-paris.hal.science/hal-04701151

Soumis le : mercredi 18 septembre 2024-11:28:48

Dernière modification le : lundi 2 décembre 2024-15:06:42

Dates et versions

hal-04701151 , version 1 (18-09-2024)

Licence

Paternité

Identifiants

HAL Id : hal-04701151 , version 1

Citer

Rajaa El Hamdani, Thomas Bonald, Fragkiskos D. Malliaros, Nils Holzenberger, Fabian M. Suchanek. The Factuality of Large Language Models in the Legal Domain. CIKM 2024 - 33rd ACM International Conference on Information and Knowledge Management, Oct 2024, Boise Idaho, United States. ⟨hal-04701151⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INRIA CVN CENTRALESUPELEC INRIA2 UNIV-PARIS-SACLAY LTCI INFRES DIG IP_PARIS ANR GS-COMPUTER-SCIENCE HUB-IA INSTITUT-MINES-TELECOM

328 Consultations

33 Téléchargements