Polars can significantly accelerate Parquet file reads. In this post, we demonstrate how to leverage Polars query optimizations to enhance the efficiency of reading from a Parquet file. Want to...
Polars, Altair and Vegafusion
Altair has been my favourite visualisation library for a long time. It allows me to make attractive visualisations with an API that is concise and consistent. The good news is that Altair is now th...
Cheatsheet for Pandas to Polars
Getting started with Polars? This post shows you how to convert some familar Pandas commands to Polars. But it also tries to go beyond that to introduce you to some of the more fundamental differen...
Fast python package installs with uv in Docker
There is a new, faster alternative to pip installs called uv from the team at Astral behind ruff. It is a drop-in replacement for pip that is designed to be faster and more reliable. I’ve been usin...
Fitting linear models within Polars
Linear models are a staple of data science and machine learning. They are simple to understand and quick to fit. With this great new Polars plugin you can now fit linear model - including Lasso and...
Reading from S3 with Polars (or DeltaLake) using AWS SSO
Polars can read and write files from S3. However, to do this Polars needs to authenticate into your AWS account. While there is a crude solution where we copy our AWS access key and secret key from...
What is a Polars expression?
I got a good question recently from a new Polars user: What is the difference between a Series and an expression in Polars? Well, a Series is a 1D data structure. An expression is a function that ...
Pandas to Polars: what to know for time series analysis
There are differences between some important time series concepts in Pandas and Polars that you should know. In this post, to help you make the Pandas to Polars switch I talk through some of these ...
Which operations work in streaming mode in Polars?
In a recent post I set out the key ideas behind streaming mode in Polars. However, streaming is still a developing part of Polars and not all operations support streaming. Be aware: the Polars dev...
Streaming large datasets in Polars
One major advantage of Polars over Pandas is that working with larger-than-memory datasets can be as easy as adding a single argument to a function call. However, streaming doesn’t work in all case...