Books & Reading List — AI Transparency Hub

Affiliate disclosure: Book links marked with [Amazon] go to Amazon.co.uk or Amazon.com via an affiliate link. We may earn a small commission if you purchase, at no extra cost to you. Academic paper links go directly to free open-access versions (arXiv, institutional repositories, or publisher pages where open access is available).

Books

🤖

AI Ethics: A Textbook

P. Boddington · Springer · 2023

A comprehensive academic introduction to AI ethics, covering foundational moral frameworks, fairness, bias, accountability, and the governance of AI systems. Written for students and practitioners, it bridges philosophical theory with the practical challenges of building and deploying AI responsibly.

Buy on Amazon →

Tags: AI Ethics · Governance · Textbook

⚖️

The Ethical Algorithm: The Science of Socially Aware Algorithm Design

M. Kearns & A. Roth · Oxford University Press USA · 2019

Two leading computer scientists argue that algorithmic harms are not inevitable — they are engineering choices. The book introduces technical methods for building fairness, privacy, and social responsibility directly into algorithm design, making it essential reading for practitioners who want to move beyond "do no harm" rhetoric to concrete solutions.

Buy on Amazon →

Tags: Algorithmic Fairness · Privacy · AI Ethics

🌍

Atlas of AI: Power, Politics, and the Planetary Costs of Artificial Intelligence

K. Crawford · Yale University Press · 2022

A critical examination of the hidden costs of AI, from the mines that supply its hardware and the low-paid workers who label its training data to the ways AI systems concentrate power and reshape labour, politics, and the environment. An important counterweight to purely technical accounts of AI progress.

Buy on Amazon →

Tags: Critical AI · Politics · Environment

🧠

Artificial Intelligence: A Modern Approach (4th ed.)

S. Russell & P. Norvig · Pearson · 2020

The definitive AI textbook, used in universities worldwide for over two decades. The fourth edition covers the full breadth of AI — from search, logic, and planning through machine learning, deep learning, and natural language processing — with new material on probabilistic programming, privacy, fairness, and the societal impacts of AI. The essential reference for anyone wanting a rigorous technical foundation.

Buy on Amazon →

Tags: Textbook · Foundational · Machine Learning

Foundational Academic Papers

These are some of the most cited and influential papers on large language models, AI safety, and responsible AI. All links go to free open-access versions.

Attention Is All You Need

Vaswani et al. · Google Brain / Google Research · NeurIPS 2017

The paper that introduced the Transformer architecture — the foundation of every major LLM in use today, including GPT-4, Claude, Gemini, and Llama. Essential background reading for understanding why modern AI works the way it does.

Read on arXiv →

Tags: Transformers · Architecture · Foundational

Language Models are Few-Shot Learners (GPT-3)

Brown et al. · OpenAI · NeurIPS 2020

The paper introducing GPT-3, which demonstrated that a large enough language model could perform tasks it was never explicitly trained on — the breakthrough that sparked the current era of generative AI. Key reading for understanding in-context learning and prompt engineering.

Read on arXiv →

Tags: GPT-3 · Few-Shot Learning · OpenAI

Model Cards for Model Reporting

Mitchell et al. · Google · FAT* 2019 (now FAccT)

The paper that defined the model card standard — a structured format for documenting an AI model's intended uses, performance across subgroups, and limitations. This is the framework that underpins the documentation covered throughout this site.

Read on arXiv →

Tags: Model Cards · Transparency · Responsible AI

Constitutional AI: Harmlessness from AI Feedback

Bai et al. · Anthropic · 2022

Anthropic's paper describing Constitutional AI (CAI) — the training methodology behind Claude. Instead of relying solely on human feedback, CAI uses a set of principles ("a constitution") to guide the model toward helpful and harmless behaviour. Directly relevant to understanding Claude's design.

Read on arXiv →

Tags: Claude · RLHF · AI Safety

Sparks of Artificial General Intelligence: Early Experiments with GPT-4

Bubeck et al. · Microsoft Research · 2023

An influential early evaluation of GPT-4 that sparked significant debate about whether large language models exhibit early signs of general reasoning. Whether or not you agree with the conclusions, the paper is essential context for the current public discourse on AGI.

Read on arXiv →

Tags: GPT-4 · AGI · Evaluation

Risks from Learned Optimization in Advanced Machine Learning Systems

Hubinger et al. · MIRI / OpenAI · 2019

Introduces "mesa-optimization" and the inner alignment problem: a model trained on one objective may itself become an optimizer whose learned objective differs subtly from the one it was trained on. Foundational reading for AI safety and long-term risk discussions.

Read on arXiv →

Tags: AI Safety · Alignment · Risk

On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?

Bender, Gebru et al. · FAccT 2021

A critical perspective arguing that very large language models carry significant risks — including environmental cost, bias amplification, and the illusion of meaning. One of the most cited critical AI papers; important for a balanced view of LLM development.

Read paper →

Tags: Critical AI · Bias · Ethics

Scaling Laws for Neural Language Models

Kaplan et al. · OpenAI · 2020

Establishes empirical "scaling laws" showing that language-model loss falls predictably, as a power law, with more parameters, more data, and more compute. These results are the empirical basis for why AI labs keep building bigger models. Essential background for understanding the race to scale.

Read on arXiv →

Tags: Scaling · Compute · OpenAI

EU AI Act — Key Texts

Primary legal sources and official guidance documents.

Regulation (EU) 2024/1689 — The EU AI Act (full text)

European Parliament and Council of the European Union · Official Journal · 12 July 2024

The full legislative text of the EU AI Act, in force since 1 August 2024. The definitive primary source for all obligations covered on this site.

Read on EUR-Lex →

Tags: Primary Law · EU AI Act

GPAI Code of Practice

EU AI Office · final version published 10 July 2025

The voluntary code of practice for providers of general-purpose AI models under Article 56 of the EU AI Act. Practical guidance on technical documentation, copyright transparency, and systemic risk obligations.

EU AI Office →

Tags: GPAI · EU AI Act · Compliance

AI Agents Under EU Law

Nannini, Smith, Maggini et al. · arXiv · April 2026

A timely analysis of how AI agents — systems that autonomously plan, invoke external tools, and execute multi-step action chains with reduced human involvement — sit within the EU AI Act framework alongside GDPR and related directives. Covers deployments ranging from customer service and recruitment to clinical decision support, and examines how the Act's risk classification, transparency, and human oversight obligations apply to agentic systems.

Read on arXiv →

Tags: AI Agents · EU AI Act · GDPR · Compliance

Have a book or paper to suggest?

We are building this list and welcome recommendations — particularly practitioner guides, legal commentary on the EU AI Act, and recent empirical LLM research.

Suggest a title
