Hostname: page-component-6766d58669-zlvph Total loading time: 0 Render date: 2026-05-19T13:45:45.388Z Has data issue: false hasContentIssue false

The Case for a Linked Data Research Engine for Legal Scholars

Published online by Cambridge University Press:  04 November 2019

Kody MOODLEY
Affiliation:
Faculty of Law, Maastricht University; email: kody.moodley@maastrichtuniversity.nl
Pedro V HERNANDEZ-SERRANO
Affiliation:
Institute of Data Science, Maastricht University
Amrapali J ZAVERI
Affiliation:
Institute of Data Science, Maastricht University
Marcel GH SCHAPER
Affiliation:
Faculty of Law, Maastricht University
Michel DUMONTIER
Affiliation:
Institute of Data Science, Maastricht University
Gijs VAN DIJCK
Affiliation:
Faculty of Law, Maastricht University
Rights & Permissions [Opens in a new window]

Abstract

This contribution explores the application of data science and artificial intelligence to legal research, more specifically an element that has not received much attention: the research infrastructure required to make such analysis possible. In recent years, EU law has become increasingly digitised and published in online databases such as EUR-Lex and HUDOC. However, the main barrier inhibiting legal scholars from analysing this information is lack of training in data analytics. Legal analytics software can mitigate this problem to an extent. However, current systems are dominated by the commercial sector. In addition, most systems focus on search of legal information but do not facilitate advanced visualisation and analytics. Finally, free to use systems that do provide such features are either too complex to use for general legal scholars, or are not rich enough in their analytics tools. In this paper, we motivate the case for building a software platform that addresses these limitations. Such software can provide a powerful platform for visualising and exploring connections and correlations in EU case law, helping to unravel the “DNA” behind EU legal systems. It will also serve to train researchers and students in schools and universities to analyse legal information using state-of-the-art methods in data science, without requiring technical proficiency in the underlying methods. We also suggest that the software should be powered by a data infrastructure and management paradigm following the seminal FAIR (Findable, Accessible, Interoperable and Reusable) principles.

Information

Type
Articles
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - SA
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike licence (http://creativecommons.org/licenses/by-nc-sa/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is included and the original work is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use.
Copyright
© The Author(s) 2019
Figure 0

Table 1: A summary of some prominent software platforms for performing semi-automated and automated analysis of case law and legislative texts

Figure 1

Figure 1: A dashboard within ContraxSuite for displaying relations between legal documents based on their semantic content

Figure 2

Figure 2: ConsumerCases feature for annotating legal texts with part of speech fragments, dates, organisations, and other relevant entities

Figure 3

Figure 3: User interface of UM/NLeSC case law software to perform network analysis on Dutch case law from Rechtspraak.nl.

Figure 4

Figure 4: EUCaseNet dashboard for analysing EU case law using network analysis

Figure 5

Figure 5: The Linked Data paradigm allows answering of research questions via single queries posed to a “knowledge graph”, as opposed to manual aggregation of answers across multiple queries and separate databases

Figure 6

Figure 6: A conceptual illustration of how structured information can be obtained from unstructured court decision text, using NLP

Figure 7

Figure 7: A possible look and feel of the proposed research engine’s user interface

Figure 8

Figure 8: An overview of the technical architecture of the proposed research engine. The list of data sources is indicative but not exhaustive

Figure 9

Figure 9: High-level categories of case law that should be included in the research engine

Figure 10

Figure 10: An example plot that might be displayed in the quantitative dashboard. This particular graph displays the average duration of sample cases decided by the Court of Justice of the European Union (extracted from the EUR-Lex database)