The remainder of this page is auto-generated from all of the references across the slides for each week: click on the name of a reference to download the ebook version, if available!
Barber, David. 2012.
Bayesian Reasoning and Machine Learning. Cambridge University Press.
Damji, Jules S., Brooke Wenig, Tathagata Das, and Denny Lee. 2020.
Learning Spark. O’Reilly Media, Inc.
Eagar, Gareth. 2021. Data Engineering with AWS: Learn How to Design and Build Cloud-Based Data Transformation Pipelines Using AWS. 1st ed. Birmingham: Packt Publishing Limited.
Firth, John Rupert. 1957. Papers in Linguistics, 1934-1951. Oxford University Press.
Harenslak, Bas P., and Julian de Ruiter. 2021. Data Pipelines with Apache Airflow. Simon and Schuster.
Leskovec, Jure, Anand Rajaraman, and Jeffrey David Ullman. 2014.
Mining of Massive Datasets. Cambridge University Press.
Loukides, Mike. 2010.
“What Is Data Science?” O’Reilly Media.
Mell, Peter, and Timothy Grance. 2011.
“The NIST Definition of Cloud Computing.” National Institute of Standards and Technology, Special Publication 800 (2011): 145.
Needham, Mark, Michael Hunger, and Michael Simons. 2024.
DuckDB in Action.
Simon and Schuster.
Raasveldt, Mark, and Hannes Mühleisen. 2019.
“DuckDB: An Embeddable Analytical Database.” In
Proceedings of the 2019 International Conference on Management of Data, 1981–84.
SIGMOD ’19. New York, NY, USA: Association for Computing Machinery.
Raff, Edward, Drew Farris, and Stella Biderman. n.d. “How Large Language Models Work.”
Ruiter, Julian de, Ismael Cabral, Kris Geusebroek, Daniel van der Ende, and Bas Harenslak. 2026.
Data Pipelines with Apache Airflow, Second Edition.
Simon and Schuster.
Saussure, Ferdinand de. 1916. Course in General Linguistics. Open Court.
Topol, Matthew, and Wes McKinney. 2024.
In-Memory Analytics with Apache Arrow. Packt Publishing Ltd.
White, Tom E. 2015.
Hadoop: The Definitive Guide. O’Reilly Media, Inc.