Python for Data Analysis 3e: Data Wrangling with Pandas, Numpy, and Jupyter
H**.
Excellent
This book is very thorough and written very clearly. I bought it to assist my learning about DS/ML/AI. As Wes says in the book, the vast majority of the work in this area is data "wrangling", or getting the data into a form suitable for analysis. This book is very comprehensive and covers every aspect of the process.It starts with some introductions to Python, IPython and Jupyter. It doesn't really assume any Python knowledge, but I knew and worked with Python already and would describe it as a "whistle-stop tour". IPython is a more interactive (and dare I say, better) Python shell/interpreter which is great for exploring / testing stuff out. Jupyter is simply a web-based tool which can use IPython and has a slightly different workflow based on "cells" which can be executed.After that, there is a brief section on NumPy, and it highlights the key ideas: vectorised operations with multi-dimensional arrays. That is, to create the result of summing the elements of two arrays, a and b, you can simply write a + b. This makes for quick and convenient batch operations on data with no for loops in sight.These arrays offer a compact and highly efficient way to work with data, and they the "lingua franca" of the Python data world.The bulk of what follows is on pandas, which makes working with numpy even easier and is a heavyweight tool for loading data from a variety of formats (and storing it), cleaning it, processing it, aggregating, joining, visualising (with help from matplotlib) - everything you could want really. pandas is amazing software, and like most great software projects, it has the right abstractions (like the DataFrame), integrates seamlessly with a lot of other data libraries, and starts to feels intuitive after a while.There is also a lot of information on working with time data, and a brief chapter on building models.My favourite part was probably the end-to-end data analysis examples where Wes explores datasets from movies to baby names. Seeing the whole process there was great.The appendices are well worth reading, if not skimming, since there are more NumPy and IPython tricks.Could this book be improved in any way? Yes, I think a lot of reads a bit like a reference. Some more "real-world" examples would have been better in a lot of cases - but you do see those in the data analysis chapter. Also, the reason for using simpler examples was probably to demonstrate the point more clearly and independent of other chapters.Overall though, this gets a strong recommendation from me if you want to get more into data and machine learning.Thanks to Wes McKinney, we not only have pandas, but this book to navigate it.
A**L
Brilliant book
A must have for anyone who really wants to master Pandas.
A**R
Good book
Good gift useful
T**S
Good guide !
Handy book for introduction on how to data preparation with python. Good guide for some on journey to become a data scientists.
V**D
Excellent
I have been learning Python and it's modules for few months and I bought three books.This is the best one by far.It takes you through Python itself, Jupyter, Numpy, Pandas and some other very important modules. Nicely explained, lots of examples and you can download the code, too.Highly recommended!
A**R
Damaged on arrival
Damaged product
Trustpilot
2 days ago
2 weeks ago