About“It always seems impossible until it is done.” - Nelson Mandela

Brooke Wenig is a Machine Learning Practice Lead at Databricks. She leads a team of data scientists who develop large-scale machine learning pipelines for customers, as well as teach courses on distributed machine learning best practices. She is a co-author of Learning Spark, 2nd Edition, co-instructor of the Distributed Computing with Spark SQL Coursera course, and co-host of the Data Brew podcast. She received an MS in Computer Science from UCLA with a focus on distributed machine learning and a BA in Chinese. She enjoys cycling and taking her dog, Bingo, to the beach.


2020-Present: Co-host of Data Brew podcast

Data + AI Summit 2021: Pandas API on Spark Keynote Demo

Data + AI Summit 2021: YOLO with Data-Driven Software (with Tim Hunter)

Data + AI Summit 2020: SQL Analytics Keynote Demo

Spark + AI Spark Summit 2020: Spark 3.0 Keynote Demo

Toronto Machine Learning Summit 2020: Managing Machine Learning Experiments with MLflow

Big Things Data + AI Conference 2020: Managing Chaos: Reproducible Machine Learning

Spark + AI Summit 2019 Europe: Koalas (Pandas on Spark) Keynote Demo

Spark + AI Summit 2019 SF: Koalas (Pandas on Spark) Keynote Demo


Fatal Force: Exploring Police Shootings With SQL Analytics (with Chengyin Eng)

How We Launched a Podcast: Lessons, (Minor) Mishaps & Key Takeaways (with Denny Lee)