Data Analysis with Open Source Tools by Philipp K. Janert


Collecting data is relatively easy, but turning raw information into something useful requires that you know how to extract precisely what you need. With this insightful book, intermediate to experienced programmers interested in data analysis will learn techniques for working with data in a business environment. You'll learn how to look at data to discover what it contains, how to capture those ideas in conceptual models, and then feed your understanding back into the organization through business plans, metrics dashboards, and other applications.

Along the way, you'll experiment with concepts through hands-on workshops at the end of each chapter. Above all, you'll learn how to think about the results you want to achieve -- rather than rely on tools to think for you.

Use graphics to describe data with one, two, or dozens of variables
Develop conceptual models using back-of-the-envelope calculations, as well as scaling and probability arguments
Mine data with computationally intensive methods such as simulation and clustering
Make your conclusions understandable through reports, dashboards, and other metrics programs
Understand financial calculations, including the time-value of money
Use dimensionality reduction techniques or predictive analytics to conquer challenging data analysis situations
Become familiar with different open source programming environments for data analysis

"Finally, a concise reference for understanding how to conquer piles of data."--Austin King, Senior Web Developer, Mozilla

"An indispensable text for aspiring data scientists."--Michael E. Driscoll, CEO/Founder, Dataspora


About Philipp K. Janert

After previous careers in physics and software development, Philipp K. Janert currently provides consulting services for data analysis, algorithm development, and mathematical modeling. He has worked for small start-ups and in large corporate environments, both in the U.S. and overseas. He prefers simple solutions that work to complicated ones that don't, and thinks that purpose is more important than process. Philipp is the author of "Gnuplot in Action - Understanding Data with Graphs" (Manning Publications), and has written for the O'Reilly Network, IBM developerWorks, and IEEE Software. He is named inventor on a handful of patents, and is an occasional contributor to CPAN. He holds a Ph.D. in theoretical physics from the University of Washington. Visit his company website at
Published November 11, 2010 by O'Reilly Media. 540 pages
Genres: Computers & Technology, Education & Reference. Non-fiction

Unrated Critic Reviews for Data Analysis with Open Source Tools

BC Books

The book has its limits: it is not meant to be a book on analysis of scientific data, formal statistical analysis, network analysis, text mining, or Big Data.

Dec 09 2010

Reader Rating for Data Analysis with Open Source Tools

An aggregated and normalized score based on 37 user ratings from iDreamBooks & iTunes
