Python Polars: The Definitive Guide

Regular price €76.99

Product details

  • ISBN: 9781098156084
  • Dimensions: 178 x 233mm
  • Publication Date: 28 Feb 2025
  • Publisher: O'Reilly Media
  • Publication City/Country: US
  • Product Form: Paperback
Delivery/Collection within 10-20 working days

Want to speed up your data analysis and work with larger-than-memory datasets? Python Polars offers a blazingly fast, multithreaded, and elegant API for data loading, manipulation, and processing. With this hands-on guide, you'll walk through every aspect of Polars and learn how to tackle practical use cases using real-world datasets.

Jeroen Janssens and Thijs Nieuwdorp from Xomnia in Amsterdam show you how this superfast DataFrame library is perfect for efficient data wrangling, ETL pipelines, and so much more. This book helps you quickly learn the syntax and understand Polars' underlying concepts. You don't need to have experience with pandas or Spark, but if you do, this book will help you make a smooth transition.

With this definitive guide at your side, you'll be able to:

  • Process larger-than-memory datasets at record speed
  • Apply the eager, lazy, and streaming APIs of Polars and decide when to use them
  • Transition smoothly from pandas or Spark to Polars
  • Integrate Polars into your existing code base
  • Work with Arrow and Parquet to efficiently read and write data
  • Translate complex ETL tasks into efficient and elegant queries

Jeroen Janssens is a Senior Machine Learning Engineer at Xomnia in Amsterdam, where he uses Polars on a daily basis. He enjoys wrangling data, implementing machine learning models, and building solutions using Python, R, JavaScript, and Bash. Previously, he ran Data Science Workshops, a training and coaching firm. Jeroen is the author of Data Science at the Command Line (O'Reilly, 2021). He has been an assistant professor at Jheronimus Academy of Data Science and a data scientist at various startups in New York City. Jeroen holds a PhD in machine learning from Tilburg University and an MSc in artificial intelligence from Maastricht University. He lives with his wife and two kids in Rotterdam, the Netherlands.