We use cookies to distinguish you from other users and to provide you with a better experience on our websites. Close this message to accept cookies or find out how to manage your cookie settings.
To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
One concept that has seized the popular imagination is the idea of the digital judge. There is something intuitively appealing about the concept that one day our unruly, chaotic human disputes will be resolved by the cool, all-knowing rationality of a fair and impartial electronic decision maker. While the concept may be enticing, this leap from human-powered justice to electronic justice is a pretty big one. Much like the concept of self-driving cars or watches we can talk into, many people seem to have concluded that this future is inevitable, even when we don’t yet have the technology that could make it come to pass. Right now we’re just biding time, waiting for the future to arrive.
The advent of distributed ledger technology has challenged the legal community to learn how technology will change law, finance, and commerce in coming years. Blockchain technology, a method for storing data in cryptographically secure distributed ledgers, has spread all over the world and into nearly every vertical as global commerce searches for secure tools that support ecosystem-wide growth. The World Economic Forum estimates that by 2026, 10% of global commerce will be conducted on blockchain systems.1 Blockchain technology promises to bring more legal relationships into the realm of data and code, so there is a clear need for legal minds to help guide this development.
Throughout the twentieth and early twenty-first centuries, the world witnessed exponential growth in the amount of information collected, stored, and analyzed. Not unrelatedly, the cost of storing, analyzing, and transmitting information has fallen exponentially during the same period, resulting in substantially greater access to and awareness of data. A few hundred years ago, the cost of storing the sum of written human knowledge exceeded the budgets of all but the very wealthiest institutions; today, the body of all written human knowledge can be accessed by a device that nearly half of the world’s population carries in their pocket. As a result, people have become somewhat desensitized to the purpose and presence of data in modern society. In order to understand the purpose of the data and knowledge we have accrued and stored – and which is intrinsically linked to informatics – it is useful to look back to a time when knowledge was genuinely scarce and expensive to access. To understand informatics, we must understand the importance of writing and knowledge systems through history.
Understanding what happens at scale in our judicial systems seems a relatively simple problem, but has proved difficult in the past due to the inability of practitioners to access the needed information. In this chapter, we examine bankruptcy case information made available by the Federal Judicial Center in 2017 and use that data to understand the difficulty Chapter 13 bankruptcy filers have in obtaining their bankruptcy discharge, the potential factors that correlate with obtaining a discharge, and to predict how likely a specific case is to succeed. We discuss a project conducted using data from over 700,000 cases, as well as examine much smaller data sets suitable for manipulation using Excel.
The law is not free in either sense of the word: It is costly to access, and there are restrictions on its use. An ideal system would allow any member of the public free online access to all official primary law – cases, statutes, and regulations. In the twenty-first century, a digital-first court publishing system, and primary law in general, should be online, free, open, comprehensive, official, citable, and machine-readable. A modern online law system should also have eight additional, more specific characteristics: digital signing, versioning, good structure, medium-neutrality, archives, a search function, bulk conveyance, and an application program interface (API).1 Unfortunately, such an ideal system remains a distant aspiration in most jurisdictions.
Document assembly, the computer-assisted generation of texts, is a form of automation with wide applicability in law. Most legal tasks involve document preparation. Drafting effective texts is central to lawyering, judging, legislating, and regulating. The best way to support that work with intelligent tools is an ancient topic in legal informatics circles. This chapter surveys the history and current state of the legal document assembly field and reviews the typical concepts, features, and development processes involved. It describes illustrative applications of document drafting software and the knowledge representations and interfaces they leverage.
Somewhere between sonnets and Beat literature lies the notion of the semi-structured document, in which some type of, if not formalized, than perhaps anticipated, or at least understandable, textual layout, sectioning, and various forms of visual and labeled elements assist the reader in deciphering the document’s meaning. In fact, had the publisher been able to accommodate it, Faulkner had hoped to publish his masterpiece, The Sound and the Fury, in various colored text, each color denoting a different period of time, in order to assist the reader in making sense of what might otherwise be viewed as incoherent ramblings coming through his severely mentally disabled character’s narrative.2 And such is the current fate and ongoing dilemma facing a legal system that has historically been centered on the static, written document, collecting dust if lucky enough to be on a shelf, rather than buried in a random box under a desk in a remote office, yet all the while holding forth the rules and consequences of not abiding its every word and punctuation mark, scribbler’s errors included. This is the glue that holds society together.
The past decade has seen an increase in conversations about innovation in law, with a focus on how new technology can make the legal system more efficient and effective. Legal technology, in the form of artificial intelligence (AI), data analytics, and mobile applications, has been heralded as bringing a new era of legal services. This chapter advocates for a distinct but complementary approach to legal innovation, based in human-centered design, which can create new non-technological innovations and improve how lawyers and laypeople engage with legal technology. If a technology-driven approach focuses on how to make systems more intelligent and more efficient, a design-driven approach focuses on how to make systems that people want to use, are able to use, and that give them value.
The storage, description, collection, organization, and selection of legal information is central to a legal system, whether public or private, professional or lay person, structured or unstructured, textual or non-textual. The ability to get timely information where it is needed is as important to law as it is to any other field, given that law is a system of written rules and procedures. Ignorantia juris non excusat – ignorance of the law is no excuse. But where citizens seek to know the law and cannot find it, they have a right to question its legitimacy, at least as applied to them. The function of the reference librarian within a brick and mortar library, while still invaluable, does not scale to the magnitude of the problem. Technological information intermediation, such as a search engine, can thus be increasingly viewed as a necessary component of a modern legal system, and familiarity with the basic concepts is an important tool in the legal technologist’s toolkit.
The philosophy of law is the study of the nature of law: What is law? What are the criteria of a functioning legal system? What is the relationship between law and morality? A course on legal technology and legal informatics focuses on the technological implementation of a legal system. As we move away from static, printed documents toward virtual, distributed, integrated systems, software (“code”) plays an increasingly important role in the legal system. Obviously code has applications far beyond implementing law. Yet, as Lawrence Lessig points out in Code 2.0, non-law code also regulates behavior, often in a more fundamental way than laws do.1 As discussed in this chapter, code is architectural in nature and effectively limits behavior similarly to laws of physics. Only in science fiction do we entertain the notion of faster-than-light travel, perpetual motion, or evading gravitational forces. Similarly, we tend to accept the limitations of code that prevent us from, say, lending out our e-books, though such lending would certainly be legal. Even if we are aware of these limits and do not like them, most of us have no capacity to change them. As far as our behavior is concerned, code may as well be the law.2
Almost all law is expressed in natural language; therefore, natural language processing (NLP) is a key component of understanding and predicting law. Natural language processing converts unstructured text into a formal representation that computers can understand and analyze. This technology has already intersected with law, and is poised to experience rapid innovation and widespread adoption. There are three reasons for this: (1) the number of repositories of digitized machine-readable legal text data is growing; (2) advances in NLP tools are being driven by algorithmic and hardware improvements; and (3) there is great potential to dramatically improve the effectiveness of legal services due to inefficiencies in its current practice.
This case study describes how a team of computer scientists assisted a team of public health researchers by applying machine learning to extract information from statutory texts. Researchers at the University of Pittsburgh’s Graduate School of Public Health (SPH) had been manually mining specific information from federal, state, and local laws and regulations concerning public health system emergency preparedness and response. The analysts used the information to assess and compare states’ regulatory frameworks concerning emergency preparedness. They retrieved candidate legal and regulatory texts from a full-text legal information service, identified relevant spans of text, and systematically categorized the spans in terms of a coding scheme. The SPH’s coding scheme captured information about agencies and actors in a state’s public health system who were directed by statute to interact with one another in particular ways while dealing with public health emergencies. Based on the coded information, the SPH constructed statutory network diagrams of legally mandated interactions among actors. These network diagrams provide insight into those statutory texts that directed the interactions.
There are a variety of different approaches to representing legal information in order to make that information usable in legal analytics or automated reasoning systems. This variety of approaches stems from the variety of tasks that legal practitioners wish to undertake, and there is no single representation technique that squarely fits all tasks and contexts. This chapter will survey the methods available for representing legal information, and the benefits that each method can bring to the tools used in legal work. This section will also provide a high-level overview of approaches that have been developed to represent laws expressed as statutes, rules, and regulations. It will also survey the standards that have been developed and that are emerging for representing legal information. An overview of methods for representing case law will also be provided, along with a summary of how interpretation of the law can be achieved through automated reasoning mechanisms based on computational models of argument. This chapter will also consider methods of conceptualizing and reasoning about the relations within and between legal documents such as contracts, along with techniques to represent wider networks of information, such as document citations.