Getting ready for a Data Engineer interview at Airgas? The Airgas Data Engineer interview process typically spans technical, analytical, and scenario-based questions and evaluates skills in areas like data pipeline design, ETL development, data modeling, data quality assurance, and communication of complex data insights. Excelling in this interview requires a strong grasp of both foundational data engineering concepts and the ability to solve real-world business problems, as Airgas places a high value on scalable solutions and clear communication across technical and non-technical teams. Thorough preparation is essential: candidates are expected to demonstrate hands-on experience with building robust pipelines, ensuring data reliability, and translating business needs into actionable engineering solutions in a fast-paced, data-driven environment.
At Interview Query, we regularly analyze interview experience data shared by candidates. This guide uses that data to provide an overview of the Airgas Data Engineer interview process, along with sample questions and preparation tips tailored to help you succeed.
Airgas is a leading U.S. supplier of industrial, medical, and specialty gases, as well as related equipment, safety products, and welding supplies. Serving industries such as manufacturing, healthcare, and energy, Airgas operates an extensive national network of distribution centers and retail locations. The company is committed to safety, reliability, and innovation in providing essential products and services to its customers. As a Data Engineer, you will contribute to optimizing operations and decision-making by developing data solutions that support Airgas’s mission to deliver excellence and value across its supply chain and customer interactions.
As a Data Engineer at Airgas, you are responsible for designing, building, and maintaining scalable data pipelines and architectures that support the company’s operational and analytical needs. You will work closely with cross-functional teams, including IT, business analysts, and data scientists, to ensure reliable data flow and integration from various sources such as sales, inventory, and logistics systems. Key tasks include optimizing data storage, implementing ETL processes, and ensuring data quality and security. This role is essential for enabling data-driven decision-making at Airgas, helping improve business efficiency and support strategic initiatives in the industrial gas and related services sector.
The process begins with an in-depth screening of your application and resume, conducted by the Airgas recruiting team. Here, the focus is on your experience with data engineering, including your proficiency in designing and managing data pipelines, ETL processes, cloud data platforms, and your ability to work with large, heterogeneous datasets. Highlighting hands-on experience with scalable data architectures, data quality initiatives, and relevant programming languages (such as Python and SQL) will strengthen your candidacy. To prepare, ensure your resume clearly demonstrates your technical skills, project impact, and any experience with cross-functional data collaboration.
Next, a recruiter will reach out for a 30- to 45-minute phone conversation. This stage evaluates your motivation for joining Airgas, your understanding of the company’s mission, and your overall fit for the data engineer role. Expect questions about your background, interest in the industrial gases sector, and high-level technical competencies. Preparation should include a concise summary of your experience, familiarity with Airgas’s business, and clear articulation of your career goals and how they align with the company.
The technical round is typically conducted by a data engineering team member or hiring manager and may involve one or more sessions. You can expect a combination of real-world case studies and hands-on technical exercises, such as designing robust data pipelines (batch and streaming), data warehouse modeling, ETL system design, and troubleshooting data quality or transformation failures. You may also be asked to demonstrate your SQL and Python skills, analyze and integrate data from multiple sources, and discuss your approach to scalable data architectures. Preparation should focus on reviewing end-to-end pipeline design, data cleaning and aggregation, and problem-solving for high-volume, real-time data environments.
A behavioral interview, often led by a hiring manager or cross-functional team member, assesses your ability to communicate complex data concepts, collaborate with stakeholders, and adapt insights for non-technical audiences. You’ll be expected to share examples of past projects, challenges overcome, and your approach to teamwork and problem-solving in ambiguous situations. Practice using the STAR (Situation, Task, Action, Result) method to structure your responses, and be ready to discuss how you ensure data accessibility and clarity for diverse teams.
The final stage typically consists of multiple interviews (either onsite or virtual) with data engineering leadership, potential teammates, and cross-functional partners. This round often includes a mix of advanced technical scenarios (e.g., optimizing existing pipelines, integrating new data sources, or architecting for scalability), as well as deeper behavioral and situational questions. You may be asked to present a past project or walk through a technical solution in detail, demonstrating both your technical depth and your ability to communicate effectively across disciplines. To prepare, review your most impactful projects, brush up on data infrastructure best practices, and be ready to discuss your decision-making process with both technical and business stakeholders.
If successful, the recruiter will present a formal offer outlining compensation, benefits, and start date. This stage may include a discussion with HR or the hiring manager to clarify role expectations and answer any final questions. Preparation should include researching typical compensation for data engineers in your region and considering your personal priorities for negotiation.
The Airgas Data Engineer interview process typically spans 3 to 5 weeks from initial application to offer, depending on scheduling and candidate availability. Fast-track candidates with highly relevant experience or internal referrals may complete the process in as little as 2 to 3 weeks, while standard timelines allow for a week between each stage to accommodate technical assessments and team interviews. The process is designed to thoroughly evaluate both technical expertise and cultural fit, ensuring a strong match for both candidate and company.
Next, let’s explore the types of interview questions you can expect at each stage of the Airgas Data Engineer process.
Expect questions about architecting robust, scalable data pipelines and warehouses. Focus on your ability to design solutions that handle diverse data sources, optimize for performance, and ensure maintainability.
3.1.1 Design a scalable ETL pipeline for ingesting heterogeneous data from Skyscanner's partners.
Describe how you would build a modular ETL process that can adapt to different data formats and volumes. Emphasize scalability, fault tolerance, and monitoring in your answer.
Example: "I’d use a distributed processing framework like Spark, set up schema validation, and implement automated error handling to ensure resilience as partner data evolves."
3.1.2 Design a data warehouse for a new online retailer.
Outline the key data entities, relationships, and partitioning strategies for a retail data warehouse. Discuss how you’d optimize for query performance and future growth.
Example: "I’d define fact tables for sales and inventory, dimension tables for products and customers, and use partitioning by date to speed up analytics."
3.1.3 Redesign batch ingestion to real-time streaming for financial transactions.
Explain how you would migrate from batch to streaming architectures, considering latency, fault tolerance, and scalability.
Example: "I’d leverage Kafka for ingestion, use stream processing with Spark Structured Streaming, and ensure at-least-once delivery for transactional integrity."
3.1.4 Design an end-to-end data pipeline to process and serve data for predicting bicycle rental volumes.
Walk through the stages of ingestion, cleaning, transformation, storage, and serving predictions. Highlight automation and monitoring.
Example: "I’d automate ingestion from IoT sensors, clean and aggregate data, store in a cloud warehouse, and serve predictions via API endpoints."
3.1.5 Design a robust, scalable pipeline for uploading, parsing, storing, and reporting on customer CSV data.
Detail how you would handle schema validation, error logging, and reporting, making the process resilient to malformed files.
Example: "I’d use parallel ingestion jobs, validate headers and data types, log errors for review, and automate summary reporting."
These questions assess your approach to data cleansing, validation, and ensuring reliability across large, complex datasets. Demonstrate your ability to diagnose and remediate data issues efficiently.
3.2.1 How would you approach improving the quality of airline data?
Discuss profiling techniques, identifying anomalies, and establishing data validation rules.
Example: "I’d start with profiling for missing and inconsistent fields, implement validation checks, and set up automated alerts for anomalies."
3.2.2 Describing a real-world data cleaning and organization project
Share a detailed example of a complex cleaning task, emphasizing tools, strategies, and impact.
Example: "I used Python and SQL to deduplicate records, standardized formats, and built validation scripts that reduced manual review by 80%."
3.2.3 Ensuring data quality within a complex ETL setup
Explain how you monitor, validate, and reconcile data across multiple sources and transformations.
Example: "I implemented automated row counts, reconciliation checks, and data lineage tracking to maintain integrity across ETL stages."
3.2.4 How would you systematically diagnose and resolve repeated failures in a nightly data transformation pipeline?
Describe your troubleshooting workflow, root cause analysis, and steps to prevent recurrence.
Example: "I’d review error logs, isolate failing modules, set up retries and alerting, and refactor the pipeline for greater resilience."
3.2.5 Modifying a billion rows
Discuss strategies for updating massive datasets efficiently, minimizing downtime and resource usage.
Example: "I’d use partitioned updates, batch processing, and leverage database-specific bulk update features to reduce impact on production."
Be ready to demonstrate your SQL expertise and ability to design data models for analytics and operational needs. Emphasize performance, normalization, and scalability.
3.3.1 Model a database for an airline company.
Describe the schema, including key tables and relationships, and justify your design choices for scalability and analytics.
Example: "I’d create tables for flights, bookings, passengers, and crew, with foreign keys to maintain referential integrity and enable efficient reporting."
3.3.2 Select All Flights
Explain how you’d write queries to efficiently retrieve flight records, considering indexing and filtering.
Example: "I’d use SELECT statements with indexed columns, filter by date or status, and limit results for pagination."
3.3.3 Write a query to compute the average time it takes for each user to respond to the previous system message.
Outline how you’d use window functions to align messages and calculate response times.
Example: "I’d use a lag function to pair messages, calculate time differences, and aggregate by user for averages."
3.3.4 Write a query to find all users that were at some point 'Excited' and have never been 'Bored' with a campaign.
Describe using conditional aggregation or subqueries to identify qualifying users.
Example: "I’d group events by user, filter for 'Excited' and exclude those with 'Bored' events using HAVING clauses."
3.3.5 Design a solution to store and query raw data from Kafka on a daily basis.
Discuss how you’d structure storage for efficient querying and scalability.
Example: "I’d use partitioned tables by day, ingest data via streaming jobs, and optimize queries with appropriate indexes."
These questions test your ability to reason through ambiguous problems, estimate unknowns, and synthesize insights from disparate data sources.
3.4.1 How would you estimate the number of gas stations in the US without direct data?
Walk through your estimation logic using proxy data, assumptions, and sanity checks.
Example: "I’d use population data, average stations per capita, and cross-reference with industry reports for a reasoned estimate."
3.4.2 You’re tasked with analyzing data from multiple sources, such as payment transactions, user behavior, and fraud detection logs. How would you approach solving a data analytics problem involving these diverse datasets? What steps would you take to clean, combine, and extract meaningful insights that could improve the system's performance?
Detail your approach to data integration, cleaning, and analysis, focusing on impact.
Example: "I’d standardize formats, join datasets on common keys, and use statistical analysis to surface actionable insights."
3.4.3 Delivering an exceptional customer experience by focusing on key customer-centric parameters
Describe how you’d identify and analyze metrics that matter most for customer satisfaction.
Example: "I’d track delivery times, order accuracy, and customer feedback, correlating these with retention and NPS scores."
3.4.4 Design and describe key components of a RAG pipeline.
Explain the architecture and integration points for retrieval-augmented generation in a data context.
Example: "I’d design document retrieval, context enrichment, and model serving layers, ensuring scalability and low latency."
3.4.5 Design a data pipeline for hourly user analytics.
Discuss how you’d aggregate, store, and serve hourly metrics efficiently.
Example: "I’d use windowed aggregations, store results in a time-series database, and automate dashboard updates."
3.5.1 Tell me about a time you used data to make a decision.
How to Answer: Highlight a situation where your analysis directly influenced a business outcome. Focus on the problem, your insight, and the measurable impact.
Example: "I analyzed sales trends to recommend a new inventory strategy, which reduced stockouts by 20%."
3.5.2 Describe a challenging data project and how you handled it.
How to Answer: Outline the project's complexity, the obstacles faced, and the steps you took to overcome them. Emphasize resilience and problem-solving.
Example: "A legacy migration required integrating messy, undocumented data. I built automated cleaning scripts and collaborated with stakeholders to clarify requirements."
3.5.3 How do you handle unclear requirements or ambiguity?
How to Answer: Demonstrate your approach to clarifying goals, communicating with stakeholders, and iterating on deliverables.
Example: "I schedule stakeholder interviews, draft prototypes, and use feedback loops to refine requirements."
3.5.4 Tell me about a time when your colleagues didn’t agree with your approach. What did you do to bring them into the conversation and address their concerns?
How to Answer: Focus on collaboration, open communication, and compromise.
Example: "I presented data-driven rationale, encouraged team input, and adjusted my solution to incorporate their perspectives."
3.5.5 Describe a time you had to negotiate scope creep when two departments kept adding “just one more” request. How did you keep the project on track?
How to Answer: Show how you managed priorities, communicated trade-offs, and protected project integrity.
Example: "I quantified the impact, presented trade-offs, and facilitated a leadership review to align on priorities."
3.5.6 You’re given a dataset that’s full of duplicates, null values, and inconsistent formatting. The deadline is soon, but leadership wants insights from this data for tomorrow’s decision-making meeting. What do you do?
How to Answer: Explain your triage process, focusing on critical issues and transparent communication.
Example: "I prioritized cleaning high-impact fields, documented limitations, and delivered actionable insights with caveats."
3.5.7 Tell me about a time you delivered critical insights even though 30% of the dataset had nulls. What analytical trade-offs did you make?
How to Answer: Discuss your approach to missing data, imputation techniques, and how you communicated uncertainty.
Example: "I used statistical imputation for missing values, highlighted confidence intervals, and recommended next steps for data improvement."
3.5.8 How do you prioritize multiple deadlines, and how do you stay organized while juggling them?
How to Answer: Share your prioritization framework and organizational tools.
Example: "I use the Eisenhower matrix to rank tasks and rely on project management software to track progress."
3.5.9 Give an example of automating recurrent data-quality checks so the same dirty-data crisis doesn’t happen again.
How to Answer: Describe the automation tools and processes you implemented, and the resulting improvements.
Example: "I built scheduled scripts to validate data nightly, reducing manual review time and catching errors early."
3.5.10 Tell me about a project where you had to make a tradeoff between speed and accuracy.
How to Answer: Discuss the decision-making process, stakeholder communication, and outcome evaluation.
Example: "Pressed for time, I delivered a directional analysis with clear caveats, then followed up with a rigorous deep dive post-launch."
Familiarize yourself with Airgas’s core business areas, especially the supply chain and logistics of industrial, medical, and specialty gases. Understanding how data engineering directly supports operational efficiency, safety, and customer service will help you contextualize technical answers and demonstrate business acumen.
Review recent Airgas initiatives around digital transformation, automation, and data-driven decision-making. Be ready to discuss how scalable data solutions can improve inventory management, distribution, and regulatory compliance in the context of Airgas’s nationwide network.
Learn the terminology and workflows specific to the industrial gas industry, such as cylinder tracking, delivery routing, and compliance reporting. This will help you tailor your examples and show genuine interest in Airgas’s mission to deliver reliability and safety.
4.2.1 Practice designing robust, scalable data pipelines for heterogeneous data sources.
Expect to describe end-to-end solutions for ingesting, cleaning, transforming, and serving data from systems like sales, inventory, and logistics. Emphasize modular architectures, fault tolerance, and automation. Prepare to discuss how you would handle schema evolution, error handling, and monitoring to ensure reliability.
4.2.2 Demonstrate expertise in ETL development and optimization.
Be ready to walk through your approach to building efficient ETL processes, including batch and streaming ingestion. Highlight your experience with tools and frameworks relevant to Airgas’s stack, and discuss strategies for minimizing latency and maximizing throughput in high-volume environments.
4.2.3 Show your ability to model data warehouses for analytical and operational needs.
Prepare to discuss how you would design schemas for complex entities such as customers, products, transactions, and inventory. Focus on normalization, partitioning, and indexing strategies that support fast, flexible analytics and reporting.
4.2.4 Highlight your data quality assurance skills with real-world examples.
Share detailed stories of diagnosing and resolving data issues, such as duplicates, missing values, or transformation failures. Emphasize your use of automated validation checks, reconciliation processes, and root cause analysis to maintain data integrity across the pipeline.
4.2.5 Be prepared to troubleshoot and optimize large-scale data operations.
Discuss your experience updating massive datasets, handling performance bottlenecks, and ensuring minimal downtime. Describe how you leverage partitioned updates, bulk operations, and monitoring tools to keep pipelines running smoothly.
4.2.6 Display strong SQL and Python proficiency, especially for analytics and pipeline automation.
Expect hands-on exercises that test your ability to write efficient queries, use window functions, and automate data cleaning or aggregation tasks. Prepare to explain your logic and optimize for scalability and maintainability.
4.2.7 Communicate technical concepts clearly to non-technical stakeholders.
Practice explaining complex engineering solutions in simple terms, focusing on the business impact. Use the STAR method to structure behavioral answers, and be ready to discuss how you collaborate across teams to deliver actionable insights.
4.2.8 Prepare to discuss trade-offs and decision-making in ambiguous scenarios.
Share examples of balancing speed versus accuracy, handling unclear requirements, and prioritizing competing deadlines. Show that you can adapt quickly, quantify risks, and communicate transparently with stakeholders.
4.2.9 Demonstrate your ability to automate and scale data quality checks.
Describe how you have implemented scheduled scripts, validation routines, or monitoring dashboards to catch errors early and reduce manual intervention. Highlight any measurable improvements in reliability or efficiency.
4.2.10 Be ready to present and defend a technical solution or past project.
Choose a project that showcases your technical depth, problem-solving skills, and ability to communicate across disciplines. Prepare to walk through your design choices, challenges faced, and the impact your work had on business outcomes.
5.1 How hard is the Airgas Data Engineer interview?
The Airgas Data Engineer interview is challenging but fair, designed to assess both your technical expertise and your ability to solve real-world business problems. You’ll be tested on data pipeline architecture, ETL development, data modeling, and data quality assurance, as well as your communication skills and ability to collaborate across teams. Candidates who demonstrate hands-on experience with scalable solutions and a clear understanding of how data engineering drives operational efficiency at Airgas stand out.
5.2 How many interview rounds does Airgas have for Data Engineer?
Typically, there are five to six rounds: an initial application and resume review, a recruiter screen, one or two technical/case rounds, a behavioral interview, and a final onsite or virtual round with leadership and cross-functional partners. The process is thorough and evaluates both technical fit and cultural alignment.
5.3 Does Airgas ask for take-home assignments for Data Engineer?
While Airgas may occasionally include a take-home technical exercise or case study, most of the technical evaluation occurs during live interviews. You may be asked to solve real-world data engineering scenarios or walk through a past project, but dedicated take-home assignments are less common.
5.4 What skills are required for the Airgas Data Engineer?
Key skills include designing and building robust data pipelines, ETL development, data modeling for warehouses and operational systems, data quality assurance, and proficiency in SQL and Python. Experience with cloud data platforms, automation, and scalable architectures is highly valued. Strong communication skills and the ability to translate business needs into technical solutions are essential.
5.5 How long does the Airgas Data Engineer hiring process take?
The typical timeline is 3 to 5 weeks from initial application to offer, depending on scheduling and candidate availability. Candidates with highly relevant experience or internal referrals may move faster, while standard timelines allow for a week between each stage.
5.6 What types of questions are asked in the Airgas Data Engineer interview?
Expect questions on designing scalable data pipelines, optimizing ETL processes, data modeling, troubleshooting data quality issues, and large-scale data operations. You’ll also face SQL and Python exercises, analytical problem-solving scenarios, and behavioral questions about communication, collaboration, and decision-making in ambiguous situations.
5.7 Does Airgas give feedback after the Data Engineer interview?
Airgas typically provides feedback through recruiters, especially if you progress to later stages. While detailed technical feedback may be limited, you can expect high-level insights on your interview performance and fit for the role.
5.8 What is the acceptance rate for Airgas Data Engineer applicants?
While specific acceptance rates aren’t publicly available, the Data Engineer role at Airgas is competitive, with a strong emphasis on both technical depth and business acumen. Only a small percentage of applicants progress through all interview rounds to receive an offer.
5.9 Does Airgas hire remote Data Engineer positions?
Yes, Airgas offers remote opportunities for Data Engineers, though some roles may require occasional onsite visits for team collaboration or project-specific needs. Flexibility depends on the team and project requirements, so clarify expectations with your recruiter during the process.
Ready to ace your Airgas Data Engineer interview? It’s not just about knowing the technical skills—you need to think like an Airgas Data Engineer, solve problems under pressure, and connect your expertise to real business impact. That’s where Interview Query comes in with company-specific learning paths, mock interviews, and curated question banks tailored toward roles at Airgas and similar companies.
With resources like the Airgas Data Engineer Interview Guide and our latest case study practice sets, you’ll get access to real interview questions, detailed walkthroughs, and coaching support designed to boost both your technical skills and domain intuition. Dive into topics like scalable data pipeline design, robust ETL development, data modeling for operational efficiency, and data quality assurance—all with examples directly relevant to Airgas’s business and technical landscape.
Take the next step—explore more case study questions, try mock interviews, and browse targeted prep materials on Interview Query. Bookmark this guide or share it with peers prepping for similar roles. It could be the difference between applying and receiving an offer. You’ve got this!