This concluding chapter argues that language is a first-order driver of economic behaviour and outlines where the research should go next. It extends the LENS framework beyond one-shot decisions to strategic settings shaped by beliefs, and outlines the co-evolution between language and behaviour. Large language models are proposed as virtual laboratories, while a quantitative utility approach must accommodate multidimensional, non-linear emotions and norms, and expand to visual cues (VENS). The chapter highlights applications – from policy design to norm-sensitive AI – alongside serious risks of manipulation, surveillance, and bias. It closes with a call for transparent, ethically governed models that explain and responsibly influence decisions.
This chapter argues that language matters for economic decisions and that modern large language models (LLMs) can quantify this effect. After outlining the limits of lexicon-based tools, it examines BERT and MoralBERT, showing that generic sentiment scores struggle to predict human behaviour, while adding moral dimensions helps but the results remain imperfect. LLM-based chatbots (e.g., GPT-4) enable context-sensitive sentiment estimates that predict framing effects, particularly in Dictator Games. Building on this, the chapter formalises language-based utility functions that combine payoffs with sentiment or moral polarity and derives testable predictions. Evidence across Dictator, Equity–Efficiency, and Bribery games supports the approach, while highlighting caveats and avenues for refinement.
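The general shape of a language-based utility function of this kind can be sketched as follows. The linear form, the symbols, and the numeric polarities are illustrative assumptions for exposition, not the chapter's exact specification:

```python
def language_utility(payoff, polarity, weight=1.0):
    """Utility = material payoff plus a weighted term for the sentiment
    or moral polarity of the action's verbal framing (illustrative form)."""
    return payoff + weight * polarity

# A dictator keeping 8 of 10 units under two framings of the same action:
# a 'give'-framed action (positive polarity) versus a 'take'-framed one
# (negative polarity). The payoff is identical; only the language differs.
u_give = language_utility(payoff=8, polarity=+0.3)
u_take = language_utility(payoff=8, polarity=-0.4)
assert u_give > u_take  # framing alone shifts utility
```

Under this sketch, framing effects arise whenever the polarity term differs across descriptions of materially identical actions.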
The development of artificial intelligence and machine learning is leading to a revolution in the way we think about economic decisions. The Economics of Language explores how the use of generative AI and large language models (LLMs) can transform the way we think about economic behaviour. It introduces the LENS framework (Linguistic content triggers Emotions and suggests Norms, which shape Strategy choice) and presents empirical evidence that LLMs can predict human behaviour in economic games more accurately than traditional outcome-based models. It draws on years of research to provide a step-by-step development of the theory, combining accessible examples with formal modelling. Offering a roadmap for future research at the intersection of economics, psychology, and AI, this book equips readers with tools to quantify the role of language in decision-making and redefines how we think about utility, rationality, and human choice.
Large Language Models (LLMs) have the potential to profoundly transform and enrich experimental economic research. We propose a new software framework, “alter_ego”, which makes it easy to design experiments between LLMs and to integrate LLMs into oTree-based experiments with human subjects. Our toolkit is freely available at github.com/mrpg/ego. To illustrate, we run differently framed prisoner’s dilemmas with interacting machines as well as with human-machine interaction. Framing effects in machine-only treatments are strong and similar to those expected from previous human-only experiments, yet less pronounced and qualitatively different if machines interact with human participants.
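A framed prisoner's dilemma of the kind described can be rendered as a parameterised prompt along the following lines. The frame names, action labels, and payoffs are illustrative, and this is a generic sketch, not the alter_ego API:

```python
# Sketch of a framed prisoner's dilemma rendered as an LLM prompt.
# Frame names, action labels, and payoffs are hypothetical examples.
PAYOFFS = {"mutual": 3, "defect": 1, "sucker": 0, "temptation": 5}

def framed_prompt(frame_name, action_labels):
    a, b = action_labels
    return (
        f"You are playing the {frame_name} with one other participant.\n"
        f"If you both choose '{a}', each earns {PAYOFFS['mutual']} points. "
        f"If you both choose '{b}', each earns {PAYOFFS['defect']}. "
        f"If you choose '{a}' and the other chooses '{b}', you earn "
        f"{PAYOFFS['sucker']} and they earn {PAYOFFS['temptation']}.\n"
        f"Reply with exactly one word: '{a}' or '{b}'."
    )

# Same payoff structure, two linguistic frames:
cooperative_frame = framed_prompt("Community Game", ("share", "keep"))
competitive_frame = framed_prompt("Wall Street Game", ("invest", "withhold"))
```

Each prompt would be sent to a separate model instance (or to mixed human–machine pairs), so that differences in choice frequencies can be attributed to the frame rather than the payoffs.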
Chapter 5 addresses yet another aspect of word meanings. Back in the mid-twentieth century, the linguist J. R. Firth (1957, p. 11) stated that “you shall know a word by the company it keeps.” More recently, this idea has been supported by distributional semantic models (DSMs), which come from computational linguistics and demonstrate that a word’s meaning can in fact be derived partly from its statistical co-occurrence patterns with other words. For instance, part of the meaning of scissors can be derived from its tendency to be used together with certain other words like sharp, pointy, cut, snip, paper, hair, etc. DSMs are surprisingly good at predicting people’s performance on many (although not all) conceptual tasks, and they are now so sophisticated that they constitute the engines of many chatbots and AI systems. What’s more, by combining DSMs with brain mapping methods, a rapidly growing line of research has been accumulating evidence that the distributionally based properties of word meanings are not only captured by purely verbal representations in the core language network but also enable a “quick and dirty” shortcut to comprehension.
Chapter 1 introduces basic terminology. Terms such as artificial intelligence, data, algorithm, machine learning, neural networks, deep learning, large language models, generative AI and symbolic AI are presented to develop a sense of what AI is, how it has evolved, and what it does. This chapter also introduces some of the major conceptual disagreements in the field. Different ideas about how to develop AI in the best way drive disagreements, as well as philosophical differences over what intelligence means and whether machines can develop human-like intelligence.
Emphasizing how and why machine learning algorithms work, this introductory textbook bridges the gap between the theoretical foundations of machine learning and its practical algorithmic and code-level implementation. Over 85 thorough worked examples, in both Matlab and Python, demonstrate how algorithms are implemented and applied whilst illustrating the end result. Over 75 end-of-chapter problems empower students to develop their own code to implement these algorithms, equipping them with hands-on experience. Matlab coding examples demonstrate how a mathematical idea is converted from equations to code, and provide a jumping-off point for students, supported by in-depth coverage of essential mathematics including multivariable calculus, linear algebra, probability and statistics, numerical methods, and optimization. Accompanied online by instructor lecture slides, downloadable Python code and additional appendices, this is an excellent introduction to machine learning for senior undergraduate and graduate students in Engineering and Computer Science.
Creative thinking is a crucial step in the design ideation process, where analogical reasoning plays a vital role in expanding the design concept space. The emergence of Generative AI has brought a significant revolution in co-creative systems, with a growing number of studies on Design-by-Analogy support tools. However, there is a lack of studies investigating the creative performance of Large Language Model (LLM)-generated analogical content and benchmarking of language models in creative tasks such as design ideation. Through this study, we aim to (i) investigate the effect of creativity heuristics by leveraging LLMs to generate analogical stimuli for novice designers in ideation tasks and (ii) evaluate and benchmark language models across analogical creative tasks. We developed a support tool based on the proposed conceptual framework and validated it by conducting controlled ideation experiments with 24 undergraduate design students. Groups assisted with the support tool generated higher-rated ideas, thus validating the proposed framework and the effectiveness of analogical reasoning for augmenting creative output with LLMs. Benchmarking of the models revealed significant differences in the creative performance of analogies across various language models, suggesting that future studies should focus on evaluating language models across creative, subjective tasks.
Efforts to curb online hate speech depend on our ability to reliably detect it at scale. Previous studies have highlighted the strong zero-shot classification performance of large language models (LLMs), offering a potential tool to efficiently identify harmful content. Yet for complex and ambivalent tasks like hate speech detection, pre-trained LLMs can be insufficient and carry systemic biases. Domain-specific models fine-tuned for the given task and empirical context could help address these issues, but, as we demonstrate, the quality of data used for fine-tuning decisively matters. In this study, we fine-tuned GPT-4o-mini using a unique corpus of online comments annotated by diverse groups of coders with varying annotation quality: research assistants, activists, two kinds of crowd workers, and citizen scientists. We find that only annotations from those groups of annotators that are better than zero-shot GPT-4o-mini in recognizing hate speech improve the classification performance of the fine-tuned LLM. Specifically, fine-tuning using the highest-quality annotator group – trained research assistants – boosts classification performance by increasing the model’s precision without notably sacrificing the good recall of zero-shot GPT-4o-mini. In contrast, lower-quality annotations do not improve and may even decrease the ability to identify hate speech. By examining tasks reliant on human judgment and context, we offer insights that go beyond hate speech detection.
As people increasingly interact with large language models (LLMs), a critical question emerges: do humans process language differently when communicating with an LLM versus another human? While there is good evidence that people adapt comprehension based on their expectations toward their interlocutor in human–human interaction, human–computer interaction research suggests the adaptation to machines is often suspended until expectation violation occurs. We conducted two event-related potential experiments examining Chinese sentence comprehension, measuring neural responses to semantic and syntactic anomalies attributed to an LLM or a human. Experiment 1 revealed reduced N400 but larger P600 responses to semantic anomalies in LLM-attributed than in human-attributed text, suggesting participants anticipated semantic errors yet required increased composition/integration efforts. Experiment 2 showed enhanced P600 responses to LLM-attributed than human-attributed syntactic anomalies, reflecting greater reanalysis or integration difficulty in the former than in the latter. Notably, neural responses to LLM-attributed semantic anomalies (but not syntactic anomalies) were further modulated by participants’ belief about humanlike knowledge in LLMs, with a larger N400 and a smaller P600 in participants with a stronger belief in humanlike knowledge in LLMs. These findings provide the first neurocognitive evidence that people develop mental models of LLM capabilities and adapt neural processing accordingly, offering theoretical insights aligned with multidisciplinary frameworks and practical implications for designing effective human–AI communication systems.
The use of Artificial Intelligence (AI) in Health Technology Assessment (HTA) activities presents an opportunity to enhance the efficiency, accuracy, and speed of HTA processes worldwide. However, the adoption of AI tools in HTA comes with diverse challenges and concerns that must be carefully managed to ensure their responsible, ethical, and effective deployment. The 2025 Health Technology Assessment international Global Policy Forum (GPF) informed GPF members of the integration of AI into HTA activities, with a particular focus on the use of Generative AI (GenAI). With the overarching goal of illuminating and inspiring tangible outputs and actionable recommendations, the event brought together a diverse range of interest holders to explore the opportunities and challenges of AI in HTA. This article summarizes the key discussions and themes that informed the GPF outcomes, including trust, human agency, and risk-based approaches, culminating in a proposed set of priority next steps for the HTA community regarding the integration of GenAI. It also highlights insights into the current state of digital transformation within HTA organizations and the life sciences industry, providing insights into where the field stands and where it is heading.
Artificial Intelligence is an area of law where legal frameworks are still in early stages. The chapter discusses some of the core HCI-related concerns with AI, including deepfakes, bias and discrimination, and concepts within AI and intellectual property including AI infringement and AI protection.
This chapter defines data-intensive research in the context of the English language and explores its prospects. It argues that data intensiveness extends beyond a single digital method or the use of advanced statistical tools; rather it encompasses a broader transformation and fuller integration of digital tools and methods throughout the research process. We also address the potential pitfalls of data fetishism and over-reliance on data, and we draw parallels with the digital transformation in another discipline, specifically biosciences, to illustrate the fundamental changes proposed as a result of digitalization. The lessons learned from other fields underscore the need for increased multi- and interdisciplinary collaboration and the development of broader digital infrastructures. This includes investments in enhanced computing power, robust data management processes, and a greater emphasis on replicability and transparency in reporting methods, data, and analytical techniques.
Sentiment analysis and stance detection are key tasks in text analysis, with applications ranging from understanding political opinions to tracking policy positions. Recent advances in large language models (LLMs) offer significant potential to enhance sentiment analysis techniques and to evolve them into the more nuanced task of detecting stances expressed toward specific subjects. In this study, we evaluate lexicon-based models, supervised models, and LLMs for stance detection using two corpora of social media data—a large corpus of tweets posted by members of the U.S. Congress on Twitter and a smaller sample of tweets from general users—which both focus on opinions concerning presidential candidates during the 2020 election. We consider several fine-tuning strategies to improve performance—including cross-target tuning using an assumption of congressmembers’ stance based on party affiliation—and strategies for fine-tuning LLMs, including few-shot and chain-of-thought prompting. Our findings demonstrate that: 1) LLMs can distinguish stance on a specific target even when multiple subjects are mentioned, 2) tuning leads to notable improvements over pretrained models, 3) cross-target tuning can provide a viable alternative to in-target tuning in some settings, and 4) complex prompting strategies lead to improvements over pretrained models but underperform tuning approaches.
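Few-shot prompting of the kind evaluated here amounts to prepending labelled demonstrations to the query. A minimal sketch of such a prompt builder follows; the label set, target, and example tweets are hypothetical, not the study's materials:

```python
def few_shot_stance_prompt(examples, target, tweet):
    """Build a few-shot stance-detection prompt (illustrative format)."""
    shots = "\n".join(
        f"Tweet: {text}\nStance toward {target}: {label}"
        for text, label in examples
    )
    return (
        f"Classify the stance of each tweet toward {target} "
        f"as FAVOR, AGAINST, or NONE.\n\n{shots}\n\n"
        f"Tweet: {tweet}\nStance toward {target}:"
    )

demos = [("Four more years!", "FAVOR"), ("Vote him out.", "AGAINST")]
prompt = few_shot_stance_prompt(
    demos, "the incumbent", "Counting down the days to the election."
)
```

The completed prompt ends at the label slot, so the model's next tokens serve directly as the predicted stance.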
Social scientists have quickly adopted large language models (LLMs) for their ability to annotate documents without supervised training, an ability known as zero-shot classification. However, due to their computational demands, cost, and often proprietary nature, these models are frequently at odds with open science standards. This article introduces the Political Domain Enhanced BERT-based Algorithm for Textual Entailment (DEBATE) language models: Foundation models for zero-shot, few-shot, and supervised classification of political documents. As zero-shot classifiers, the models are designed to be used for common, well-defined tasks, such as topic and opinion classification. When used in this context, the DEBATE models are not only as good as state-of-the-art LLMs at zero-shot classification, but are orders of magnitude more efficient and completely open source. We further demonstrate that the models are effective few-shot learners. With a simple random sample of 10–25 documents, they can outperform supervised classifiers trained on hundreds or thousands of documents and state-of-the-art generative models. Additionally, we release the PolNLI dataset used to train these models—a corpus of over 200,000 political documents with highly accurate labels across over 800 classification tasks.
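Entailment-based zero-shot classification of this kind recasts each candidate label as a hypothesis and picks the label the document most strongly entails. The sketch below shows that framing logic with a toy word-overlap scorer standing in for a real NLI model; it is not the DEBATE models themselves, and the hypothesis template is an illustrative assumption:

```python
import re

def zero_shot_classify(premise, labels, entail_score):
    """Pick the label whose hypothesis the premise most strongly entails.
    `entail_score(premise, hypothesis)` stands in for a real NLI model's
    entailment probability."""
    hypotheses = {lab: f"This document is about {lab}." for lab in labels}
    return max(hypotheses, key=lambda lab: entail_score(premise, hypotheses[lab]))

def toy_score(premise, hypothesis):
    # Toy stand-in: lowercase word overlap. A real NLI model would return
    # an entailment probability instead.
    tok = lambda s: set(re.findall(r"[a-z]+", s.lower()))
    return len(tok(premise) & tok(hypothesis))

label = zero_shot_classify(
    "The senate debated the new immigration bill today.",
    ["immigration", "healthcare", "trade"],
    toy_score,
)
assert label == "immigration"
```

Because the label set lives entirely in the hypotheses, the same trained model can serve new classification tasks without retraining, which is what makes the approach attractive for zero-shot use.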
Large language models (LLMs) are increasingly used to address real-world design problems, especially during the design ideation phase. Although LLMs hold substantial promise for concept generation, the understanding of how they can effectively assist designers in enhancing the diversity of design concepts is still limited. In this study, we set up different strategies for prompting multiple professional personas to the LLM for design concept generation, including (1) multiple prompts for concept generation in parallel, each with a professional persona, (2) a single prompt for concept generation with multiple professional personas, and (3) a sequence of prompts for concept generation and update, each with a professional persona. We formulate and test several hypotheses on the effectiveness of different strategies. All hypotheses are tested by constructing professional knowledge bases, selecting design problems and personas, and designing the prompts. The results suggest that LLMs can facilitate the design ideation process and provide more diverse design concepts when they are given multiple prompts in parallel, each with a professional persona, or given a sequence of prompts with multiple professional personas to generate and update design concepts gradually.
Autoregressive language models generate text by predicting the next word from the preceding context. The regularities internalized from specific training data make this mechanism a useful proxy for historically situated readerly expectations, reflecting what earlier linguistic communities would find probable or meaningful. In this article, I pre-train a GPT model (223M parameters) on a broad corpus of Chinese texts (FineWeb Edu Chinese V2.1) and fine-tune it on the collected writings of Mao Zedong (1893–1976) to simulate the evolving linguistic landscape of post-1949 China. Identifying token sequences with the sharpest drops in perplexity – a measure of the model’s surprise – reveals the core phraseology of “Maospeak,” the militant language style that developed from Mao’s writings and pronouncements. A comparative analysis of modern Chinese fiction demonstrates how literature becomes unfamiliar to the fine-tuned model, generating perplexity spikes of increasing magnitude. The findings suggest a mechanism of attentional control: whereas propaganda backgrounds meaning through repetition (cognitive overfitting), literature foregrounds it through deviation (non-anomalous surprise). By visualizing token sequences as perplexity landscapes with peaks and valleys, the article reconceives style as a probabilistic phenomenon and showcases the potential of “cognitive stylometry” for literary theory and close reading.
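Perplexity over a token sequence reduces to the exponential of the mean negative log-probability per token. The sketch below shows the computation; the per-token log-probabilities are hypothetical numbers, not outputs of the article's fine-tuned model:

```python
import math

def perplexity(token_logprobs):
    """Perplexity = exp of the mean negative log-probability per token."""
    return math.exp(-sum(token_logprobs) / len(token_logprobs))

# Hypothetical per-token natural-log probabilities: a fine-tuned model
# finds rote phraseology highly predictable (log-probs near 0) and
# deviating literary prose surprising (strongly negative log-probs).
slogan_lp = [-0.1, -0.2, -0.1, -0.15]
literary_lp = [-2.5, -3.1, -2.8, -3.4]
assert perplexity(slogan_lp) < perplexity(literary_lp)
```

Low-perplexity valleys thus mark formulaic phraseology the model has overfit to, while perplexity peaks mark the deviations the article associates with literary style.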
The volumes of historical data locked behind unstructured formats have long been a challenge for researchers in the computational humanities. While optical character recognition (OCR) and natural language processing have enabled large-scale text mining projects, the irregular formatting, inconsistent terminology and evolving printing practices complicate automated parsing and information extraction efforts for historical documents. This study explores the potential of large language models (LLMs) in processing and structuring irregular and non-standardized historical materials, using the U.S. Department of Agriculture’s Plant Inventory books (1898–2008) as a test case. Given the frequent evolution of these historical records, we implemented a pipeline combining OCR, custom segmentation rules and LLMs to extract structured data from the scanned texts. It provides an example of how incorporating LLMs into data-processing pipelines can enhance the accessibility and usability of historical and archival materials for scholars.
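The rule-based segmentation stage of such a pipeline can be sketched as follows. The entry pattern and the sample OCR text are hypothetical, not the actual USDA Plant Inventory format, and in the study's pipeline an LLM would handle the fields that fixed rules cannot:

```python
import re

# Illustrative segmentation rule: split OCR output into entries that begin
# with an accession number, capturing number, species name, and free text.
ENTRY = re.compile(r"^(\d+)\.\s+([A-Z][a-z]+ [a-z]+)\.\s+(.*)$", re.M)

ocr_text = """101. Malus domestica. Seeds received from a northern orchard.
102. Triticum aestivum. Hardy winter wheat, drought tolerant."""

records = [
    {"accession": num, "species": name, "note": note}
    for num, name, note in ENTRY.findall(ocr_text)
]
```

Structured records like these can then be passed to an LLM for normalization of the irregular free-text fields, combining cheap deterministic rules with model calls only where they are needed.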
This study investigates the English of-NP (noun phrase) evaluation construction (e.g., It’s nice of you to help me plan this wedding), hypothesizing that its constructional meaning encodes socially mediated evaluation and imposes semantic constraints on the NP slot. We adopt a dual methodological approach, combining collostructional analysis to identify lexeme–construction associations with surprisal analysis using a large language model (LLM) (GPT-2) to assess predictive processing difficulty. The two methods complement each other, capturing both static distributional patterns and dynamic expectancy profiles. Three experimental manipulations were implemented: preposition alternation, variation in NP agentivity and variation in NP intentionality. Results show that NPs conforming to the hypothesized slot constraints yield lower surprisal values, whereas constraint-violating NPs trigger higher surprisal, aligning with the observed collostructional strengths. These findings provide empirical support for the view that constructional compatibility shapes predictive processing and contributes to integrating Construction Grammar (CxG) with prediction-based models of language processing.
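Surprisal is the negative log of a word's conditional probability given its context. The sketch below shows the definition with hypothetical probabilities for the NP slot; the numbers are illustrative, not GPT-2 outputs from the study:

```python
import math

def surprisal(prob):
    """Surprisal in bits: -log2 of the word's conditional probability."""
    return -math.log2(prob)

# Hypothetical slot probabilities in "It's nice of ___ to help": an
# agentive, intentional NP ("you") conforms to the constructional
# constraint; an inanimate NP ("the table") violates it.
conforming = surprisal(0.20)   # higher probability, lower surprisal
violating = surprisal(0.001)   # lower probability, higher surprisal
assert conforming < violating
```

Lower surprisal for constraint-conforming NPs is exactly the signature the study reports aligning with collostructional strength.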
The conceptual design of mission-tailored aircraft is increasingly shifting towards system of systems (SoS) perspectives that account for system interactions using a holistic view. Agent-based modelling and simulation (ABMS) is a common approach for analysing an SoS, but the behaviour of its agents tends to be defined by rigid behaviour trees. The present work aims to evaluate the suitability of a prompt-engineered large language model (LLM) acting as the Incident Commander (IC), replacing the fixed behaviour trees that govern the agents’ decisions. The research contributes by developing a prompting framework for operational guidelines, constraints, and priorities to obtain an LLM commander within a wildfire suppression SoS, capable of replicating human decisions. By enabling agents in a simulation model with decision-making capabilities closer to those expected from humans, the commander’s decisions and potential emergent patterns can be translated into more defined requirements for aircraft conceptual design (ACD) (e.g., endurance, payload, sensors, communications, or turnaround requirements). Results showed that an LLM commander facilitated adaptive and context-aware decisions that can be analysed via decision logs. The results allow designers to derive aircraft requirements for their specific roles from operational outcomes rather than a priori assumptions, linking SoS mission needs and ACD parameters.