Ethics in Data Science

Social and moral considerations for responsible data scientists

Author

Jack Bandy

Published

June 12, 2026

1 Preface

← Back to Course Homepage

This is a placeholder page. The book is a work-in-progress.

The full data science lifecycle.

Figure 1.1: A rendition of the data science lifecycle, slightly modified from Learning Data Science.

How can data be collected responsibly? Which data are considered private or sensitive? What makes some questions inappropriate? When and how can data be published? How does one responsibly handle the persuasive power of data?

The purpose of this mini-book is to introduce data scientists to ethical considerations that arise throughout the data science lifecycle. There are many versions of the lifecycle, and this book is organized around the one described in Learning Data Science, which divides data work into four broad stages (Lau et al. 2023).

These broad stages correspond to the chapters in this book as follows:

Learning Data Science Corresponding material in this book
Introduction (this chapter)
Working Toward Wisdom
1.1. The Stages of the Lifecycle: Ask a Question Ethics in Asking Questions
1.1. The Stages of the Lifecycle: Obtain Data Ethics in Obtaining Data
1.1. The Stages of the Lifecycle: Understand the Data Ethics in Understanding
1.1. The Stages of the Lifecycle: Understand the World Ethics in Understanding
Reports, decisions, solutions Ethics in Reporting Decisions & Solutions
Concluding remarks

This is not the only book to discuss ethics in data science. Many of the topics discussed in the book have been described elsewhere, in books and articles and essays which often serve as the source material for this book, including:

What is intended to be unique and useful about this book is its alignment with the “data science lifecycle” from Learning Data Science (Lau et al. 2023). The lifecycle model offers helpful way to organize the wide array of ethical questions, dilemmas, and decisions that arise in the work of the data scientist.

Before exploring the details of these questions in Ethics in Asking Questions, Ethics in Obtaining Data, Ethics in Understanding, and Ethics in Reporting Decisions & Solutions, I have taken the liberty to include one framing chapter, Working Toward Wisdom, which zooms out and asks a somewhat audacious question: why are we doing any of this at all?

1.1 References

Barocas, Solon, and Andrew D. Selbst. 2016. “Big Data’s Disparate Impact.” California Law Review 104 (3): 671–732. https://lawcat.berkeley.edu/record/1127463.
Baumer, Benjamin S., Daniel T. Kaplan, and Nicholas J. Horton. 2021. “Chapter 8 Data Science Ethics.” https://mdsr-book.github.io/mdsr2e/ch-ethics.html.
Cairo, Alberto. 2019. How Charts Lie: Getting Smarter about Visual Information. W. W. Norton & Company. https://bookshop.org/p/books/how-charts-lie-getting-smarter-about-visual-information-alberto-cairo/6952598.
D’Ignazio, Catherine, and Lauren F. Klein. 2020. Data Feminism. MIT Press. https://doi.org/10.7551/mitpress/11805.001.0001.
Floridi, Luciano, and Mariarosaria Taddeo. 2016. “What Is Data Ethics?” Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 374 (2083): 20160360. https://doi.org/10.1098/rsta.2016.0360.
Klein, Lauren, and Catherine D’Ignazio. 2025. “Data Feminism for Data Visualization.” October. https://openvisualizationacademy.org/courses/data-feminism-for-data-visualization/.
Lau, Sam, Joseph Gonzalez, and Deborah Nolan. 2023. Learning Data Science: Data Wrangling, Exploration, Visualization, and Modeling with Python. O’Reilly Media. https://learningds.org/ch/01/lifecycle_intro.html.
O’Neil, Cathy. 2017. Weapons of Math Destruction: How Big Data Increases Inequality and Threatens Democracy. Broadway Books. https://www.penguinrandomhouse.com/books/241363/weapons-of-math-destruction-by-cathy-oneil/.
Thomas, Rachel. 2020. “Practical Data Ethics.” https://ethics.fast.ai/.