The Python Exchange:

Helping Python Thrive within the National Labs & Department of Energy

Join the Python Exchange Mailing List!

The next Python Exchange is coming up!

Wednesday, April 30, 4:00 pm ET (note special time)

Our Guest Panelist will be:

Guillaume Lemaitre

“Developments in the scikit-learn Ecosystem: Going Beyond model.fit(X, y).predict(X)”

Scikit-learn is a de facto standard library for predictive modeling with tabular data. For over a decade, it has provided traditional and reliable algorithms to address data science problems. While it excels at model fitting and prediction, these stages represent only a small portion of a data science project and are relatively well-defined. Many data scientists are familiar with the notion that 90% of their time is spent on preprocessing, while the modeling stage takes up only 10% of their efforts. Additionally, tracking and organizing experiments, as well as transitioning from experimentation to production, can be challenging. This talk aims to shed light on recent developments and efforts within the scikit-learn ecosystem. We will provide an overview of the following tools through a series of short notebook demos:

  • scikit-learn to provide some context.
  • skrub, which offers tools to prepare your tabular data and aims to bridge the gap between the database world and the scikit-learn modeling environment.
  • skops, which provides tools for transitioning from experimentation to production settings.
  • skore, which aims to provide guidance throughout the lifecycle of a data science project by abstracting the above tools, making them more general and less opinionated.

About Our Guest

Guillaume Lemaitre is Chief Machine Learning Officer and an open-source software engineer at Probabl. He is a core maintainer of several packages from the scikit-learn ecosystem such as scikit-learn, skrub, skore, and imbalanced-learn. He holds a PhD in computer science / medical imaging.

Zoom Webinar Link:

Past Events

To view all past recordings, visit The Python Exchange YouTube Playlist.

March 25th, 2025

“Narwhals: Lightweight and extensible compatibility layer between dataframe libraries!”

Marco Gorelli

February 26th, 2025

“skrub for tables: Less wrangling, more machine learning”

Gaël Varoquaux

January 25th, 2025

“Accelerate your Python code with Rust”

Juan Luis Cano Rodríguez

December 4th, 2024

“Possible futures for Jupyter”

Sylvain Corlay

October 30th, 2024

“Recent Tools and Trends in the Python Ecosystem”

Wes McKinney