Hostname: page-component-89b8bd64d-b5k59 Total loading time: 0 Render date: 2026-05-07T11:14:35.648Z Has data issue: false hasContentIssue false

Where law meets data: a practical guide to expert coding in legal research

Published online by Cambridge University Press:  24 April 2025

Michal Ovádek*
Affiliation:
University College London, London, UK
Philipp Schroeder
Affiliation:
LMU Munich, Munchen, Germany
Jan Zglinski
Affiliation:
LSE Law School, London, UK
*
Corresponding author: Michal Ovádek; Email: m.ovadek@ucl.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

The rise of empirical methods has had a polarising effect on legal studies in Europe. On the one hand, quantitative empiricists have frequently dismissed traditional doctrinal scholarship as unscientific and its insights as unreliable. On the other hand, many doctrinal scholars are apprehensive about the perceived displacement of domain expertise from legal research caused by the empirical turn. To bridge the gap between the two camps and address their respective concerns, we propose a wider adoption of expert coding as a methodology for legal research. Expert coding is a method for systematic parsing and representation of phenomena such as legal principles in a structured form, using researchers’ subject matter expertise. To facilitate the uptake of expert coding, we provide a step-by-step guide that addresses not only the coding process but also fundamental prerequisites such as conceptualisation, operationalisation and document selection. We argue that this methodological framework leverages legal scholars’ expertise in a more impactful way than traditional doctrinal analyses. We illustrate each step and methodological principle with examples from European Union law.

Information

Type
Core analysis
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. The conceptual structure of ‘Political Deference’

Figure 1

Figure 2. The shaded columns show the (true) over-time distribution of the population of all General Court decisions (N = 14,706) in terms of proportion per year. The vertical lines display the yearly proportions when randomly drawing samples of different sizes (without stratification) from the population. We can see that the larger the sample size, the more it resembles the population distribution of rulings over time.

Figure 2

Table 1. Example table of two coders disagreeing about national autonomy implications of a position voiced in preliminary reference proceedings at the CJEU

Figure 3

Table 2. Example of a codebook entry for the variable ‘direct_effect’

Figure 4

Table 3. Example of a coding matrix. Each row is the work of a single coder (c1, c2, c3), each column is a court ruling (r1, . . . r10). ‘A’ stands for CJEU agreement with the national court, ‘D’ for disagreement and ‘N’ for not applicable.

Figure 5

Figure 3. Ordinary least squares regression coefficient estimates with 95% confidence intervals. Outcome variable is the Duration of proceedings for preliminary references submitted to the CJEU between 1995 and 2011 (N = 5,327).