Nand Kishor Contributor

Nand Kishor is the Product Manager of House of Bots. After finishing his studies in computer science, he ideated & re-launched Real Estate Business Intelligence Tool, where he created one of the leading Business Intelligence Tool for property price analysis in 2012. He also writes, research and sharing knowledge about Artificial Intelligence (AI), Machine Learning (ML), Data Science, Big Data, Python Language etc... ...

Full Bio 
Follow on

Nand Kishor is the Product Manager of House of Bots. After finishing his studies in computer science, he ideated & re-launched Real Estate Business Intelligence Tool, where he created one of the leading Business Intelligence Tool for property price analysis in 2012. He also writes, research and sharing knowledge about Artificial Intelligence (AI), Machine Learning (ML), Data Science, Big Data, Python Language etc...

3 Best Programming Languages For Internet of Things Development In 2018
372 days ago

Data science is the big draw in business schools
545 days ago

7 Effective Methods for Fitting a Liner
555 days ago

3 Thoughts on Why Deep Learning Works So Well
555 days ago

3 million at risk from the rise of robots
555 days ago

Top 10 Hot Artificial Intelligence (AI) Technologies
312675 views

Here's why so many data scientists are leaving their jobs
81234 views

2018 Data Science Interview Questions for Top Tech Companies
77778 views

Want to be a millionaire before you turn 25? Study artificial intelligence or machine learning
76968 views

Google announces scholarship program to train 1.3 lakh Indian developers in emerging technologies
61782 views

Learn Data Science for Excellence and not just for the Exams

By Nand Kishor |Email | Apr 9, 2018 | 11520 Views

Are you currently pursuing your masters in Data Science? Overwhelmed with Buzzwords and Information? Don't know where and how to start your study? Then start with this article and a starter kit provided, but learn it for excellence and not just for the exams.

Dear future Masters of Data Science,
Many of you might be recently started your master's studies in data science/data analytics/business analytics and little overwhelmed with buzzwords (like Big Data, Machine learning, Artificial Intelligence) and information coming from every possible channel like friends, social media, blogs and websites. Everybody is talking about it and you must be thinking "I'm doing my Masters in Data Science and I want to learn everything about it but where should actually I start and how?"

First of all, Welcome! You have entered fascinating world of Data Science, which comprises science, technology and creativity. From career perspective, you made a right decision at right time, but believe me, "This is not a Joy ride". We, the humans, are on our way to fourth industrial revolution which will lead to dramatic transformation of our world. It will change the way we transact, communicate and make decisions. Data science has opened tremendous possibilities and opportunities in every aspect of business and our daily lives. Shortly, machines will take over low skilled jobs, bots will become smarter and smarter, and the only job of humans will be to teach these robots using algorithms, mathematics and other basic sciences. Data science world is evolving and moving forward so fast that the only option remains is to adopt it as fast as you can. Which, I believe, is only possible if you make your foundation of data science very strong. Programming languages and frameworks will come and go but basics will always be the same. What I'm trying to say is, don't just learn to pass the exams or add a degree on your LinkedIn profile, earn it to excel in Tomorrow's Competitive World and also to contribute to it. Bottom line: Utilise your academic time wisely to learn data science thoroughly and make your foundation rock solid.

Across the world, there are various courses in data science offered with different names and different combinations of modules in it by different universities. E.g. MS in Data Science, MS in Business Analytics, MS in Data Analytics etc. There are certain reasons behind it, based on different expertise you will earn and the profile you will work as. Well, this will be altogether different topic if discussed further, what important is to get to the crux of it. As we all know the Venn diagram of data science below, it requires skills in mathematics, computer science and particular domain. (You can refer different versions Data Science Venn diagrams from previous KDnuggets post: /2016/10/battle-data-science-venn-diagrams.html ) As below diagram is very generic in nature, it is important to know specifics about it from academic perspective. I'll not use names of modules from any of universities/colleges but try to generalise module names in context of academia.

Mathematics Skills: Mathematics is the most important fundamental science (after Physics) which many of us hated in our schools and colleges. Ya, I know that smile :) from all engineering students, but guys this maths will really going to help you mint $s. Statistics, an important branch of mathematics, is your starting point for your journey in data science world. As a starter kit, a list of selected videos and links have provided at the end of this article which will guide you to understand core concepts of statistics.

Computer Science Skills: In the context of data science, important modules from computer science are programming, database and data warehouse, data mining and data visualization. Programming is a must and very critical skill in data science, which is the best way to automate the tasks in your analysis. Database skills are also very important to understand how data is stored and retrieved in structured as well as un-structured format. Data warehouse is advanced concept of database which is designed with specific business needs. And here comes the creative part of data science - data mining and visualization. For me, data science is fascinating because of these two modules. Data mining is about analyzing the data and drawing meaningful insights from it using different scientific methods, machine learning algorithms and creative thinking. Data mining could also be a manual. E.g. You have your monthly expenses data for last few years. If you plot a graph of months vs expenses, you will really find some interesting insights like, in which month you spent or saved less money and why? How can you save money in future by understanding how you saved in past? Did you realize, in this example while understanding what is data mining, we also talked about data visualization (Graph of months vs expenses). Like data mining, data visualization module also needs creative thinking and where you learn how to visualize data so that insights can be easily found by visual exploration or how to communicate your findings in intuitive way to different people. Please check some of the useful links and videos provided at the end of this article that will help you study these subjects thoroughly.

Domain Skills: Modules in this group are mostly elective and numerous in number depending on university/college. These are not the core but the applied modules where you learn domain/business skills and how to apply concepts from core modules to solve real world problems in particular domain/business function E.g. Marketing Analytics, Financial Analytics, Web Analytics, Social Media Analytics etc. You can choose the module depending on your interest in particular area.

Research/Industrial Project: Research project is very important module from academic as well as career perspective. You should try to apply most of your knowledge that you learnt from core and other modules to your research project. Try to read as many articles/academic papers as possible, brainstorm ideas and think loud with friends/professors/meetup groups while finding your research topic.

Apart from academics, soft skills are also important for analytics professionals. Good communication, team work, leadership and creativity, these skills can't be learned from books or videos, you have to learn it by actually working on it. These days, in every big city, different technological meetups/conferences/datathons are regularly arranged by many experts and companies to share knowledge and learn from each other. Attending such events helps you to improve your skills to convey your ideas, to learn from other's ideas and mistakes, to create your professional network and ultimately to be better than what you were yesterday.

So, learn thoroughly, share globally and grow rapidly.

Data Science Core Modules Starter Kit
This is just a starter kit, you can refer many other Data Science knowledge resources available on the Internet.
1. Statistics:
  1. Statistics PL01 - Data and Statistics
  2. Statistics PL03 - Descriptive Statistics II
  3. Statistics PL04 - Introduction to Probability
  4. Statistics PL05 - Discrete Probability Distributions
  5. Statistics PL06 - Continuous Probability Distributions
  6. Statistics PL07 - Sampling and Sampling Distributions
  7. Statistics PL08 - Interval Estimation
  8. Statistics PL09 - Hypothesis Tests
  9. Statistics PL10 - Inferences about Two Populations
  10. Statistics PL11 - Inferences about Population Variances
  11. Statistics PL12 - Goodness of Fit and Independence Tests
  12. Statistics PL13 - ANOVA (Analysis of Variance)
  13. Statistics PL14 - Simple Linear Regression
  14. Statistics PL15 - Multiple Regression
  15. Statistics PL16 - Logistic Regression
2. Programming:
  1. Introduction to Data Science with R - Data Analysis Part 1 - YouTube
  2. Introduction to R for Data Mining
  3. https://www.r-bloggers.com
  4. http://www.statmethods.net
  5. https://learnpythonthehardway.org/
  6. Google Developer Python Course
  7. Pro Python Programming Course
  8. Java Programming - Step by Step tutorial
  9. Big Data and Hadoop 1| Hadoop Tutorial 1 - YouTube
3. Database and Data Warehouse
  1. Introduction to Database Management Systems 1: Fundamental Concepts
  2. SQL for Beginners. Learn basics of SQL in 1 Hour
  3. SQL tutorials for beginners/ Oracle Database tutorials.
  4. Data Warehousing Tutorial Videos
  5. http://www.codeproject.com/Articles/652108/Create-First-Data-WareHouse
4. Data Mining:
  1. Machine Learning (Coursera)
  2. Machine Learning with R Progressing, by Brett Lantz. (Book)
5. Data Visualization:
  1. Harvard i-lab| Data Visualization for Non-Programmers
  2. What is Tableau? - 1| Data Visualization Tools| Tableau Tutorial for Beginners| Edureka
  3. Data Visualization Best Practices

Source: Kdnugget