Effective Pandas: Patterns for Data Manipulation (Treading on Python)

Effective Pandas: Patterns for Data Manipulation (Treading on Python)

  • Downloads:8504
  • Type:Epub+TxT+PDF+Mobi
  • Create Date:2022-10-27 06:56:45
  • Update Date:2025-09-06
  • Status:finish
  • Author:Matt Harrison
  • ISBN:B09MYXXSFM
  • Environment:PC/Android/iPhone/iPad/Kindle

Download

Reviews

Ethan J

I love pandas, but would hope to have seen more details, this book feels like a print of the official documentation。。。but what differentiates it from documentation is it provides more structure? :/ meh

Patrick

Good if using pandas

Robert

Effective Pandas is an excellent opinionated guide to pandas。 “Opinionated” is important in the context of pandas: it’s a very flexible library that gives you rope to hang yourself, often contradicting the zen of Python: “There should be one — and preferably only one — obvious way to do it”。 This can lead to some very messy code, in which the time-pressed data scientist ends up melding several different programming philosophies just to get their aggregation to work。 No doubt some of the people r Effective Pandas is an excellent opinionated guide to pandas。 “Opinionated” is important in the context of pandas: it’s a very flexible library that gives you rope to hang yourself, often contradicting the zen of Python: “There should be one — and preferably only one — obvious way to do it”。 This can lead to some very messy code, in which the time-pressed data scientist ends up melding several different programming philosophies just to get their aggregation to work。 No doubt some of the people reading this will consider “effective pandas” to be an oxymoron。 This is justified。 Pandas seldom feels like it was “designed” in the way that R’s tidyverse is。 It is a collection of tricks, wherein fluency only arises from many hours spent pulling one’s hair out。 Effective Pandas is Harrison’s effort to define and encourage “idiomatic pandas”, using chaining。 It just so happens that chaining is the style of pandas that I had converged on due to its readability and elegance。 Seeing a nice piece of chained pandas should mellow the complaints of tidyverse folks。One area that wasn’t properly addressed (which is why I give this 4 stars instead of 5) is memory usage and performance。 Some of this is quite important, as there are some methods in pandas that create copies of objects but others that modify objects in place。 This aside, Effective Pandas is a useful and readable outline of an important tool; it has the flavour of a user guide rather than a documentation reference。 I skimmed through several chapters because I’m reasonably familiar with pandas, but I’d recommend the book to anyone who uses pandas a lot。My highlights here。 。。。more

Joao Pedro

This book is the best book on pandas out there。 It is the first work on pandas that seriously leverages the power of method chaining for running clean and efficient pandas code。 There are no in-place commands, no copies of data frames (wonderful!)。 It is also a legit hands-on material, and even the session on debugging chains that starts with commenting commands before applying more robust debugging methods mimics what data scientists are really doing in the wild。This is the future of pandas and This book is the best book on pandas out there。 It is the first work on pandas that seriously leverages the power of method chaining for running clean and efficient pandas code。 There are no in-place commands, no copies of data frames (wonderful!)。 It is also a legit hands-on material, and even the session on debugging chains that starts with commenting commands before applying more robust debugging methods mimics what data scientists are really doing in the wild。This is the future of pandas and this type of material paves the way for lazy evaluation。 。。。more

Ricardo

Best Pandas book so far。

Walter

Pandas is one of those libraries that suffers from the "guitar principle" (also known as the "Bushnell Principle" in the video game circles): it is easy to use, but difficult to master。Truly, it is one of the most straightforward and powerful data manipulation libraries, yet, because it is so easy to use, no one really spends much time trying to understand the best, most pythonic way to employ the library to its full extent。 If you haven't read Matt Harrison's book and use Pandas, chances are yo Pandas is one of those libraries that suffers from the "guitar principle" (also known as the "Bushnell Principle" in the video game circles): it is easy to use, but difficult to master。Truly, it is one of the most straightforward and powerful data manipulation libraries, yet, because it is so easy to use, no one really spends much time trying to understand the best, most pythonic way to employ the library to its full extent。 If you haven't read Matt Harrison's book and use Pandas, chances are you're like that Chad at the picnic or camping trip that pulls out his guitar to strum along the same basic chords for an hour straight。。。 Well, NO MORE! Matt Harrison is ready to drop some knowledge on you and have you riffing your own data manipulation solos like you're Slash in "November Rain", or Prince in "Purple Rain"。。。The book goes beyond explaining the data structures and methods that underpin Pandas, but he also provides a ton of practical advice regarding best practices in data manipulation and transformations。For instance, by the time you're done you'll know which functions to use to leverage Pandas' vectorized structures to ensure your code is fast and efficient, which data types provide huge savings in terms of memory allocation, how to chain operations to ensure you're always accessing the correct intermediary dataframe, how to utilize indices to give you superpowers over your data, how to debug chains, merge, join, melt, style, and more。It is by far, the best book you can get yourself if you want to take your data science skills to the next level, after all, they say modern data science is 90% data cleaning。 I mostly agree。I have recommended this book to every member of my team。 REQUIRED READING。Highest possible recommendation。 。。。more

Andrey Andreev

Without no doubt that is masterpiece。