Hostname: page-component-5db58dd55d-jhf8m Total loading time: 0 Render date: 2026-06-01T11:09:40.022Z Has data issue: false hasContentIssue false

LLM for social good: A value-driven LLM framework to embed social good values

Published online by Cambridge University Press:  28 May 2026

Victor O. K. Li*
Affiliation:
Department of Electrical and Electronic Engineering, University of Hong Kong, Hong Kong
Jacqueline C. K. Lam*
Affiliation:
Department of Electrical and Electronic Engineering, University of Hong Kong, Hong Kong
Yang Han
Affiliation:
Department of Electrical and Electronic Engineering, University of Hong Kong, Hong Kong
Jon Crowcroft
Affiliation:
Department of Computer Science and Technology, University of Cambridge , Cambridge, UK
Lawrence Y. L. Cheung
Affiliation:
Department of Linguistics and Modern Languages, The Chinese University of Hong Kong, Hong Kong
*
Corresponding authors: Jacqueline C.K. Lam and Victor O.K. Li; Emails: jcklam@eee.hku.hk; vli@eee.hku.hk
Corresponding authors: Jacqueline C.K. Lam and Victor O.K. Li; Emails: jcklam@eee.hku.hk; vli@eee.hku.hk

Abstract

As large language models (LLMs) increasingly shape decision-making, public discourse, education, healthcare, and governance, a critical question emerges: Whose values are these systems truly reflecting? While today’s generative AI systems demonstrate extraordinary capabilities, they also inherit hidden biases, ethical blind spots, and implicit assumptions embedded within their training data. Existing alignment approaches often remain opaque, resource-intensive, or insufficiently adaptive to diverse societal expectations.

This paper introduces a novel value-driven LLM framework designed to systematically uncover, quantify, and realign the implicit values embedded within LLMs toward socially desirable outcomes. Built upon the AI for Social Good (AIfSG) framework, our proposed methodology operationalizes ethical alignment across six critical domains: reasoning and interpretability, bias removal, transparency and accountability, security and privacy, moral and ethical observations, and public understanding. Using advanced embedding techniques, cosine-based value-difference metrics, and topic-weighted iterative fine-tuning, the framework transforms ethical alignment from an abstract aspiration into a measurable and actionable computational process.

To demonstrate adaptability across moral and regulatory paradigms, the framework evaluates two distinct reference value systems: the Ten Commandments and the General Data Protection Regulation (GDPR). Experimental results using open-source LLMs, including Llama 3.2 and Gemma 2, reveal substantial reductions in value misalignment, ranging from approximately 25% to 70% across ethical domains, while preserving model flexibility and scalability. Visualizations in value-embedding space further confirm significant convergence between original model outputs and socially aligned reference values after iterative realignment.

Beyond technical innovation, this work positions value alignment as a foundational challenge for the future governance of generative AI. The proposed framework functions both as a diagnostic instrument for identifying ethical gaps and as an intervention mechanism for adaptive value realignment, offering policymakers, developers, and institutions a scalable pathway toward transparent, accountable, and socially responsible AI systems. By bridging computational methods with moral, legal, and societal principles, this research advances a new paradigm for embedding human-centered values directly into the next generation of intelligent systems.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press
Figure 0

Table 1. Topic and reference value set

Figure 1

Figure 1. A value-driven LLM framework to embed social good values into LLMs.

Figure 2

Figure 2. Value difference across topics before and after realignment (LLM: Llama 3.2 [3B]; Reference Value Set: Ten Commandments).

Figure 3

Figure 3. Value difference across topics before and after realignment (LLM: Gemma 2 [2B]; Reference Value Set: Ten Commandments).

Figure 4

Figure 4. Value difference across topics before and after realignment (LLM: Llama 3.2 [3B]; Reference Value Set: GDPR).

Figure 5

Figure 5. Value difference across topics before and after realignment (LLM: Gemma 2 [2B]; Reference Value Set: GDPR).

Figure 6

Figure 6. Value difference in the value-embedding space before and after realignment (LLM: Llama 3.2 [3B]; Reference Value Set: Ten Commandments).

Figure 7

Figure 7. Value difference in the value-embedding space before and after realignment (LLM: Gemma 2 [2B]; Reference Value Set: Ten Commandments).

Figure 8

Figure 8. Value difference in the value-embedding space before and after realignment (LLM: Llama 3.2 [3B]; Reference Value Set: GDPR).

Figure 9

Figure 9. Value difference in the value-embedding space before and after realignment (LLM: Gemma 2 [2B]; Reference Value Set: GDPR).

Supplementary material: File

Li et al. supplementary material

Li et al. supplementary material
Download Li et al. supplementary material(File)
File 519.5 KB
Submit a response

Comments

No Comments have been published for this article.