Here are some books which will help you to boost your knowledge of data science and some of its fundamental tools.
Data Science in Practice (Studies in Big Data) 1st ed. 2019 Edition
by Alan Said
This book approaches big data, artificial intelligence, machine learning, and business intelligence through the lens of Data Science. We have grown accustomed to seeing these terms mentioned time and time again in the mainstream media. However, our understanding of what they actually mean often remains limited. This book provides a general overview of the terms and approaches used broadly in data science and provides detailed information on the underlying theories, models, and application scenarios. Divided into three main parts, it addresses what data science is; how and where it is used; and how it can be implemented using modern open source software. The book offers an essential guide to modern data science for all students, practitioners, developers, and managers seeking a deeper understanding of how various aspects of data science work, and of how they can be employed to gain a competitive advantage.
Data Science (MIT Press Essential Knowledge series) Paperback - 18 May 2018
by John D. Kelleher
A concise introduction to the emerging field of data science, explaining its evolution, relation to machine learning, current uses, data infrastructure issues, and ethical challenges. The goal of data science is to improve decision making through the analysis of data. Today data science determines the ads we see online, the books and movies that are recommended to us online, which emails are filtered into our spam folders, and even how much we pay for health insurance. This volume in the MIT Press Essential Knowledge series offers a concise introduction to the emerging field of data science, explaining its evolution, current uses, data infrastructure issues, and ethical challenges. It has never been easier for organizations to gather, store, and process data. Use of data science is driven by the rise of big data and social media, the development of high-performance computing, and the emergence of such powerful methods for data analysis and modeling as deep learning.
The Data Science Handbook Hardcover - 14 Apr 2017
by Field Cady
A comprehensive overview of data science covering the analytics, programming, and business skills necessary to master the discipline Finding a good data scientist has been likened to hunting for a unicorn: the required combination of technical skills is simply very hard to find in one person. In addition, good data science is not just a rote application of trainable skill sets; it requires the ability to think flexibly about all these areas and understand the connections between them. This book provides a crash course in data science, combining all the necessary skills into a unified discipline. Unlike many analytics books, computer science and software engineering are given extensive coverage since they play such a central role in the daily work of a data scientist.
The book is appropriate for people who want to practice data science but lack the required skill sets. This includes software professionals who need to better understand analytics and statisticians who need to understand software. Modern data science is a unified discipline, and it is presented as such. This book is also an appropriate reference for researchers and entry-level graduate students who need to learn real-world analytics and expand their skill set. FIELD CADY is the data scientist at the Allen Institute for Artificial Intelligence, where he develops tools that use machine learning to mine scientific literature. He has also worked at Google and several Big Data startups. He has a BS in physics and math from Stanford University, and an MS in computer science from Carnegie Mellon.
Practical Statistics for Data Scientists: 50 Essential Concepts Paperback - 2017
by Peter Bruce
Statistical methods are a key part of data science, yet very few data scientists have any formal statistics training. Courses and books on basic statistics rarely cover the topic from a data science perspective. This practical guide explains how to apply various statistical methods to data science, tells you how to avoid their misuse and gives you advice on what's important and what's not.
Many data science resources incorporate statistical methods but lack a deeper statistical perspective. If you are familiar with the R programming language and have some exposure to statistics, this quick reference bridges the gap in an accessible, readable format.
With this book, youíll learn:
- Why exploratory data analysis is a key preliminary step in data science
- How random sampling can reduce bias and yield a higher quality dataset, even with big data
- How the principles of experimental design yield definitive answers to questions
- How to use regression to estimate outcomes and detect anomalies
- Key classification techniques for predicting which categories a record belongs to
- Statistical machine learning methods that ìlearnî from data
- Unsupervised learning methods for extracting meaning from unlabeled data
Mathematical Problems in Data Science: Theoretical and Practical Methods Hardcover - 22 Dec 2015
by Li M. Chen
This book describes current problems in data science and Big Data. Key topics are data classification, Graph Cut, the Laplacian Matrix, Google Page Rank, efficient algorithms, the hardness of problems, different types of big data, geometric data structures, topological data processing, and various learning methods. For unsolved problems such as incomplete data relation and reconstruction, the book includes possible solutions and both statistical and computational methods for data analysis. Initial chapters focus on exploring the properties of incomplete data sets and partial-connectedness among data points or data sets. Discussions also cover the completion problem of Netflix matrix; machine learning method on massive data sets; image segmentation and video search. This book introduces software tools for data science and Big Data such as MapReduce, Hadoop, and Spark.
This book contains three parts. The first part explores the fundamental tools of data science. It includes basic graph theoretical methods, statistical and AI methods for massive data sets. In the second part, chapters focus on the procedural treatment of data science problems including machine learning methods, mathematical image and video processing, topological data analysis, and statistical methods. The final section provides case studies on special topics in variational learning, manifold learning, business, and financial data rec