...
Full Bio
Use Machine Learning To Teach Robots to Navigate by CMU & Facebook Artificial Intelligence Research Team
226 days ago
Top 10 Artificial Intelligence & Data Science Master's Courses for 2020
227 days ago
Is Data Science Dead? Long Live Business Science
255 days ago
New Way to write code is about to Change: Join the Revolution
256 days ago
Google Go Language Future, Programming Language Programmer Will Get Best Paid Jobs
577 days ago
Top 10 Best Countries for Software Engineers to Work & High in-Demand Programming Languages
724788 views
Highest Paying Programming Language, Skills: Here Are The Top Earners
669342 views
Which Programming Languages in Demand & Earn The Highest Salaries?
474462 views
Top 5 Programming Languages Mostly Used By Facebook Programmers To Developed All Product
463254 views
World's Most Popular 5 Hardest Programming Language
394593 views
If You Want To Learn Data Science, Take These Statistics Classes
- It must be an introductory course with little to no statistics or probability experience required.
- It must be on-demand or offered every few months.
- It must be of decent length: at least ten hours in total for estimated completion.
- It must be an interactive online course, so no books or read-only tutorials. Though these are viable ways to learn statistics and probability, this guide focuses on courses.
- The degree to which each course teaches statistics through coding up examples - preferably in R or Python.
- Coverage of the fundamentals of probability and statistics. Covering descriptive statistics, inferential statistics, and probability theory is ideal.
- How much of the syllabus is relevant to data science? Does the syllabus have specialized content like genomics, as several biostatistics courses do? Does the syllabus cover advanced concepts not often used in data science?
For any aspiring data scientist, I would highly recommend learning statistics with a heavy focus on coding up examples, preferably in Python or R.
Probability deals with predicting the likelihood of future events, while statistics involves the analysis of the frequency of past events.
- Foundations of Data Analysis? - Part 1: Statistics Using R by the University of Texas at Austin (edX)
- Foundations of Data Analysis - Part 2: Inferential Statistics by the University of Texas at Austin (edX)
Excellent course! I took part 1 and enjoyed it a lot, so it was very easy to decide to go on with part 2. Dr. Mahometa and team are very good teachers and their material is of a very high quality. The exercises are interesting and the materials (videos, labs and problems) are appropriate and well chosen. I recommend this course to anyone interested in statistical analysis (as an introduction to machine learning, big data, data science, etc.). On a scale from 1 to 10, I give 50!
- Statistics with R Specialization by Duke University on Coursera
One of the greatest courses I've taken so far. [Dr. Mine Cetinkaya-Rundel is] a great teacher, very much involved in exchanges with her students. A large variety of teaching approaches and tools. Lots of practice through short tests, R-programming labs, and an in-depth project. A very lively forum with lots of help to cope with difficulties. The course is not too difficult, but the variety of the proposed material requires that students get involved quite substantially. A very nice book available for free with plenty of practice exercises.
- Introduction to Probability-The Science of Uncertainty by the Massachusetts Institute of Technology (MIT)
Many online courses are watered down in some way, but this one feels like a proper rigorous exercise-driven course similar to what you'd get in-person at a top school like MIT. The professors present concepts in lectures that have obviously been honed to a laser focus through years of pedagogical experience�¢??-�¢??there is not a single wasted second in the presentations and they go exactly at the right pace and detail for you to understand the concepts. The exercises will make you work for your knowledge and are critical for really internalizing the concepts. This is the best online course I have taken in any subject.
- MedStats: Statistics in Medicine (Stanford University/Stanford OpenEdx): Great syllabus where the examples have a medical focus. Covers a bit of R programming at the end, though not as much as UT Austin's series. A worthy option for anyone, even those not targeting medicine. It has a 4.58-star weighted average rating over 32 reviews.
- SOC120x: I "Heart" Stats: Learning to Love Statistics (University of Notre Dame/edX): Targets a non-technical audience, though likely would be good for anyone. No coding. Good production value. Course and instructors look really fun. It has a 4.54-star weighted average rating over 12 reviews.
- QM101x: Statistics for Business (Indian Institute of Management Bangalore/edX): Part of a 4-course series. Business focus. Good syllabus that uses coding. The last two courses in the series are unreleased as of November 2016 so can't make a judgment yet. It has a 4.43-star weighted average rating over 27 reviews.
- Workshop in Probability and Statistics (Udemy): Taught by Dr. George Ingersoll, Associate Dean of Executive MBA Programs at the UCLA Anderson School of Management. Costs money. Uses Excel. It has a 4.4-star weighted average rating over 452 reviews.
- Intro to Descriptive Statistics (San Jose State University/Udacity): Part of a 2-course series. Bite-sized videos. No coding. It has a 3.88-star weighted average rating over 8 reviews.
- Intro to Inferential Statistics (San Jose State University/Udacity): Part of a 2-course series. I took both courses as refreshers for my undergrad statistics classes and came away with a deeper understanding. Really enjoyed Katie Kormanik's teaching style (see video below). Bite-sized videos. No coding. It has a 4.4-star weighted average rating over 5 reviews.
- 6.008.1x: Computational Probability and Inference (Massachusetts Institute of Technology/edX): One of two courses/series to teach statistics with a focus of coding up examples in Python. Reviews suggest prior stats experience is needed and that the course is a bit unorganized. It has a 4-star weighted average rating over 12 reviews.
- Basic Statistics (University of Amsterdam/Coursera): One of two statistics courses in the University of Amsterdam's Methods and Statistics in Social Sciences Specialization. One exceedingly positive review on the series and its instructors. No coding. It has a 4.06-star weighted average rating over 8 reviews.
- Inferential Statistics (University of Amsterdam/Coursera): One of two statistics courses in the University of Amsterdam's Methods and Statistics in Social Sciences Specialization. One exceedingly positive review on the series and its instructors. No coding. It has a 4-star weighted average rating over 3 reviews.
- PH525.1x: Statistics and R (Harvard University/edX): Part of a 7-course series on edX. Life sciences focus. Uses R programming, but the reviews suggest UT Austin's series is better. It has a 3.96-star weighted average rating over 26 reviews.
- PH525.3x: Statistical Inference and Modeling for High-throughput Experiments (Harvard University/edX): Part of a 7-course series on edX. Life sciences focus. Uses R programming, but the reviews suggest UT Austin's series is better. It has a 4.63-star weighted average rating over 4 reviews.
- Intro to Statistics (Udacity): This is one of Udacity's earliest courses and it has its shortcomings, as described in this memorable review by a college educator. No coding. It has a 3.93-star weighted average rating over 41 reviews.
- Mathematical Biostatistics Boot Camp 1 (Johns Hopkins University/Coursera): Part of a 2-course series. Biostatistics focus. It has a 3.13-star weighted average rating over 23 reviews.
- Mathematical Biostatistics Boot Camp 2 (Johns Hopkins University/Coursera): Part of a 2-course series. Biostatistics focus. It has a 3.83-star weighted average rating over 3 reviews.
- KIexploRx: Explore Statistics with R (Karolinska Institutet/edX): More of a data exploration course than a statistics course. Uses coding. It has a 3.77-star weighted average rating over 22 reviews.
- Statistical Inference (Johns Hopkins University/Coursera): One of two statistics courses in JHU's data science specialization. Bad reviews. It has a 2.9-star weighted average rating over 29 reviews.
- Regression Models (Johns Hopkins University/Coursera): One of two statistics courses in JHU's data science specialization. Bad reviews. It has a 2.73-star weighted average rating over 30 reviews.
- DS101X: Statistical Thinking for Data Science and Analytics(Columbia University/edX): Part of the Microsoft Professional Program Certificate in Data Science. Short syllabus. Bad reviews. It has a 2.77-star weighted average rating over 24 reviews.
- Understanding Clinical Research: Behind the Statistics (University of Cape Town/Coursera): "This isn't a comprehensive statistics course, but it offers a practical orientation to the field of medical research and commonly used statistical analysis." Health care focus. It has a 5-star weighted average rating over 15 reviews.
- MED101x: Introduction to Applied Biostatistics: Statistics for Medical Research (Osaka University/edX): Biostatistics focus. Uses coding. It has a 4.5-star weighted average rating over 3 reviews.
- Probability and Statistics (Stanford University/Stanford OpenEdx): Curriculum looks great. The one review is really positive. No coding. It has a 4.5-star weighted average rating over 1 review.
- Inferential and Predictive Statistics for Business (University of Illinois at Urbana-Champaign/Coursera): Part of a 7-course Managerial Economics and Business Analysis Specialization. Uses Excel. It has a 5-star weighted average rating over 1 review.
- Exploring and Producing Data for Business Decision Making (University of Illinois at Urbana-Champaign/Coursera): Part of a 7-course Managerial Economics and Business Analysis Specialization. Uses Excel. It has a 5-star weighted average rating over 1 review.
- Introduction to Probability, Statistics, and Random Processes (University of Massachusetts Amherst/Independent): Videos not available for the whole course. It has a 2.5-star weighted average rating over 2 reviews.
- 005x: Introduction to Statistical Methods for Gene Mapping (Kyoto University/edX): Genetics focus. Need prior statistics and R knowledge. It has a 2.5-star weighted average rating over 1 review.
- Statistics for Genomic Data Science (Johns Hopkins University/Coursera): Genomic focus. Not a good introductory course: "A fair class for someone with an interest in this field who also happens to have a decent background in R programming." It has a 2-star weighted average rating over 2 reviews.
- Statistical Thinking in Python (Part 1) and Statistical Thinking in Python (Part 2) (DataCamp): Uses coding and Python specifically, making it one of few worthy courses or series that use that language. Seven hours of video and 120+ exercises. DataCamp is a popular option.
- A Hands-on Introduction to Statistics with R (DataCamp): Uses coding. 26 hours of video and 150+ exercises. Again, DataCamp is a popular option.
- Statistical Computing with R�¢??-�¢??a gentle introduction (University College London/Independent): Uses coding.
- Probability & Statistics (Carnegie Mellon): Uses R. Primarily text-based instruction. Designed to be equivalent to one semester of a college statistics course.
- Introduction to Probability and Statistics (Massachusetts Institute of Technology/MIT OCW): Traditional lecture format (video-taped).
- Fundamentals of Engineering Statistical Analysis (The University of Oklahoma/Janux): Engineering focus.
- Elementary Business Statistics (The University of Oklahoma/Janux): Business focus.
- STAT101x: Biostatistics for Big Data Applications (The University of Texas Medical Branch/edX): Biostatistics focus.
- 416.1x: Probability: Basic Concepts & Discrete Random Variables(Purdue University/edX): Part of a 2-course series.
- 416.2x: Probability: Distribution Models & Continuous Random Variables (Purdue University/edX): Part of a 2-course series.
- Business Statistics and Analysis Specialization (Rice University/Coursera): Uses Excel.
- Statistics 110: Probability (Harvard University): Traditional lecture format (video-taped). Often recommended on Quora.
- Statistics (Dataquest): A multi-course series with about 12 hours of content. Subscription required. One of two courses/series to teach statistics with a focus of coding up examples in Python. A note from Dataquest: "the statistics courses are being entirely re-written at the moment, due for release around the end of November."