Hostname: page-component-5db58dd55d-jnbmb Total loading time: 0 Render date: 2026-06-20T14:03:26.810Z Has data issue: false hasContentIssue false

Evaluating AI in Legal Operations: A Comparative Analysis of Accuracy, Completeness, and Hallucinations in ChatGPT-4, Copilot, DeepSeek, Lexis+ AI, and Llama 3

Published online by Cambridge University Press:  30 June 2025

Bakht Munir
Affiliation:
The University of Kansas School of Law. Email: bakht.munir@ku.edu.
Muhammad Zubair Abbasi
Affiliation:
Department of Law and Criminology, School of Law and Social Sciences, Royal Holloway, University of London . Email: Zubair.abbasi@rhul.ac.uk.
W. Blake Wilson
Affiliation:
The University of Kansas School of Law. Email: wmblakewilson@gmail.com.
Allen Colombo Jr.
Affiliation:
The University of Kansas School of Law. Email: allen.colombo@KU.edu.

Abstract

The proliferation of Artificial Intelligence (AI) is significantly transforming conventional legal practice. The integration of AI into legal services is still in its infancy and faces challenges such as privacy concerns, bias, and the risk of fabricated responses. This research evaluates the performance of the following AI tools: (1) ChatGPT-4, (2) Copilot, (3) DeepSeek, (4) Lexis+ AI, and (5) Llama 3. Based on their comparison, the research demonstrates that Lexis+ AI outperforms the other AI solutions. All these tools still encounter hallucinations, despite claims that utilizing the Retrieval-Augmented Generation (RAG) model has resolved this issue. The RAG system is not the driving force behind the results; it is one component of the AI architecture that influences but does not solely account for the problems associated with the AI tools. This research explores RAG architecture and its inherent complexities, offering viable solutions for improving the performance of AI-powered solutions.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s), 2025. Published by International Association of Law Libraries
Figure 0

Figure 1. Percentage of Accuracy.

Figure 1

Figure 2. Percentage of Incomplete Responses.

Figure 2

Figure 3. Percentage of Legal Hallucinations.

Figure 3

Figure 4. Proportion of Responses.

Figure 4

Figure 5. RAG Model.