Overview and objectives
This overarching course teaches how to tackle the various types of data science interview questions, including all the core topics from coding prerequisites to business applications.
This course is for anyone preparing for a data science interview or seeking a structured approach to mastering the wide range of data science interview questions.
Courses in this learning path are:
Introduction to Data Science
Learn how to prepare for the wide range of questions that come up in data science interviews.
8 of 8 Completed
Easy SQL Questions
Get started on tackling easy level SQL questions involving aggregations, joining multiple tables, and pulling data for beginning analytical reports.
5 of 12 Completed
Medium SQL Questions
Medium level SQL questions utilize more advanced concepts like sub-queries, window functions, and solving case study problems.
5 of 19 Completed
Hard SQL Questions
Let's tackle advanced SQL interview questions that focus on multi-joins and layers of data interpretation. These questions may come up in take-home challenges and senior level interviews.
1 of 9 Completed
Data structures in Python attempt to be more intuitive and flexible than traditional data structures in other programming languages.
4 of 9 Completed
Common DS Packages
As we said in the first section of this course, a major benefit of using Python for data science in comparison to other programming languages is the availability of a large number of useful packages that are distributed under a free license.
6 of 11 Completed
Python Questions: Hard
Let's try some hard Python questions that you would see in tougher data science interviews and many machine learning interviews.
2 of 6 Completed
Probability Theory is the branch of mathematics that deals with uncertainty, underpinning all of statistics and machine learning.
4 of 10 Completed
All areas of study in math can roughly be divided into two camps: discrete mathematics and continuous mathematics. Perhaps the best way to describe the difference between the two is to talk about what each of the branches means by "number."
3 of 12 Completed
Continuous probability distribution: A probability distribution in which the random variable X can take on any value (is continuous).
3 of 6 Completed
Thus far in this course, we have considered random variables under an idealized scenario where we know the distribution of the random variable.
3 of 7 Completed
Hypothesis testing covers the fundamental theory and background behind A/B Testing. In this course we'll cover Z and T test, multiple hypothesis testing, and the different type errors.
11 of 11 Completed
Confidence intervals help us deal with this imprecision by giving us a way to talk about a range of values with some certainty where the true value of the statistic is contained in.
2 of 6 Completed
A/B Testing & Experiment Design
Let's start with a general framework for A/B testing. In practice, an A/B testing and experimentation all follow a step by step process of setting metrics and designing experiments.
3 of 10 Completed
A/B Testing Common Scenarios
The next couple of chapters will cover common scenarios and concepts involved in A/B testing. As A/B testing involves statistical concepts, there may be terms that you need refreshing on.
3 of 9 Completed
A/B Testing Tradeoffs
There are scenarios where A/B testing is not necessarily the best course of action. Often, there are technical, infrastructure, or practical concerns that come up while planning an A/B test.
2 of 6 Completed
This is a refresher on some important statistical concepts that will help us with A/B testing and beyond. While by no means a comprehensive guide, this chapter will go over some important basics about statistical testing and probability distributions.
4 of 11 Completed
Data Analytics Fundamentals: Causal Inference
In this course we’ll go over the core concepts of causality, significance, and analyzing data. This is meant as a quick refresher and a high level overview of causal inference basics to eventually apply them in data analytics problems.
2 of 9 Completed
Diagnosing and Investigating Metrics
Investigating metrics is a type of product intuition problem that will come up frequently in interviews. Examples of this are typically phrased along the lines of - If X metric is up/down by Y percent, how would you investigate it?
3 of 12 Completed
Measuring the success of products is critical to data science and analytics interviews. Generally, this question is an encapsulation of every time a product manager or executive asks the question: “So, how is it doing?”.
2 of 11 Completed
Before launching a feature, we can imagine that the first step we’d have to take is analyzing the existing data in our product to make a decision about exactly what to build. This process is what creates the building or change of a feature problem that gets asked in product interviews.
2 of 10 Completed
Metric trade-off type questions can occur on their own in product interviews or as part of a larger product or AB testing interview discussion.
1 of 4 Completed
Modeling Case Study
The machine learning and modeling case study is the most common type of interview question that tests a combination of modeling intuition and business application.
2 of 2 Completed
Data processing and analysis is the first step that we need to consider once we've clarified details and started down the path of building the model.
1 of 5 Completed
Feature selection and feature engineering is the second part of the data processing step. Once we've understood what our data looks like, we need to begin to theorize the kinds of features we would use to build the model.
1 of 4 Completed
Model selection is usually the crux of any modeling case study problem. We want to be able to select a model or machine learning algorithm that will combine a bunch of factors to become the most optimal algorithm for the problem.
0 of 4 Completed
Machine Learning Algorithms
We have touched on the different machine learning algorithms throughout this lesson, but haven't yet dived deep into each one. The prior for this course is that you, as a candidate, have an idea of basic machine learning concepts, and the different modeling algorithms are one such example of them.
0 of 7 Completed
Most machine learning model deployment requires some technical details and implementation to doing so. But we can abstract away from that in an interview when we’re focusing on the model roll out.
0 of 9 Completed
Applied modeling is a type of case question asked about practical machine learning. The most common type of question framework is: Given an example scenario with a machine learning system or model, how would you analyze and fix the problem?
0 of 5 Completed
Generalized Linear Models and Regression
Regression models are used to predict the value of a dependent variable from one or more independent variables.
9 of 13 Completed