Data Analyst

Interview questions for Data Analyst roles.

10 questions

Question 1

Difficulty: medium

Tell me about a time you turned a messy dataset into something useful for decision-making.

Sample answer

In a previous role, I inherited a customer support dataset that had duplicate records, inconsistent date formats, and missing values across several key fields. Before doing any analysis, I spent time profiling the data to understand where the quality issues were coming from. I created a cleaning workflow in Excel and SQL to standardize formats, remove duplicates, and flag records that needed manual review. Once the data was reliable, I analyzed ticket volume trends by product line and response time. That revealed one product area was generating a disproportionate number of repeat tickets, which had not been obvious before. I presented the findings to the support and product teams with a simple dashboard and clear recommendations. They used it to prioritize fixes and adjust staffing during peak hours. That experience reinforced for me that good analysis starts with strong data hygiene.

Question 2

Difficulty: easy

How do you decide which metrics matter most when a stakeholder asks for a report?

Sample answer

I start by clarifying the business goal behind the request, because the same report can mean very different things depending on the decision being made. If a stakeholder asks for a sales report, for example, I would ask whether they want to improve conversion, identify pipeline risk, or forecast revenue. That helps me focus on the right metrics instead of flooding them with every possible number. I also try to understand who will use the report and how often, since executives usually want high-level KPIs while operational teams may need more detailed breakdowns. From there, I choose metrics that are actionable, measurable, and tied to the outcome they care about. I also make sure to define each metric clearly so there is no confusion later. In my experience, a useful report is not the one with the most data, but the one that helps someone make a better decision quickly.

Question 3

Difficulty: hard

Walk me through how you would analyze a sudden drop in website conversion rate.

Sample answer

I would approach it in layers, starting with verification. First, I would confirm the drop is real and not caused by tracking issues, broken events, or a reporting delay. Then I would segment the data by device, traffic source, geography, landing page, and user type to see where the decline is concentrated. If the issue is isolated to mobile users or one acquisition channel, that gives a strong clue about where to investigate next. I would also compare the timing of the drop against product changes, site releases, campaign launches, or pricing updates. If possible, I’d look at funnel stage performance to see whether the problem is happening at entry, product view, cart, or checkout. From there, I’d test hypotheses using trend analysis and, if needed, cohort analysis. My goal would be to identify the smallest segment where the change started, then work with the relevant team to validate the cause and recommend a fix.

Question 4

Difficulty: medium

Describe a time when you had to explain a complex analysis to a non-technical audience.

Sample answer

I once worked on an analysis comparing customer retention across several acquisition channels, and the model output was more detailed than the leadership team needed. Instead of walking them through the statistical methods first, I focused on the business story: which channels brought in customers who stayed longer and which ones looked efficient upfront but underperformed later. I translated the results into plain language and used simple visuals to show the difference in retention curves. I also avoided technical jargon and explained uncertainty in practical terms, such as saying we were highly confident in the trend rather than talking about p-values. During the discussion, I checked for understanding and paused to answer questions before moving on. That approach worked well because the audience left with clear takeaways and next steps rather than just a pile of charts. It reminded me that communication is part of analysis, not something separate from it.

Question 5

Difficulty: medium

What SQL skills do you use most often as a data analyst?

Sample answer

The SQL skills I use most often are joins, aggregations, window functions, and CTEs. Joins are essential for combining data from different systems, especially when customer, transaction, and product data live in separate tables. Aggregations help me summarize performance by day, month, segment, or channel, depending on the question. Window functions are especially useful for things like running totals, ranking, and calculating retention or moving averages without losing row-level detail. I also use CTEs to keep queries readable and easier to debug, particularly when the logic gets more complex. Beyond writing the query itself, I pay close attention to data quality checks, such as duplicate handling, null values, and join cardinality, because a query can be technically correct and still produce misleading results. For me, good SQL is not just about getting the answer quickly; it is about making sure the answer is accurate, repeatable, and easy for someone else to follow.

Question 6

Difficulty: medium

How do you handle missing or inconsistent data in your analysis?

Sample answer

I handle missing or inconsistent data by first understanding why the issue exists and how much it affects the analysis. If the missingness is random and small, I may exclude those records from a specific calculation, but I always check whether that exclusion could bias the result. If the missing data is concentrated in a certain segment, that’s more serious because it could distort the conclusion. In those cases, I look for patterns, compare against source systems, and sometimes collaborate with operations or engineering to identify the root cause. For inconsistent formats, I standardize values early in the process so the analysis is based on one clean definition. I also document every assumption I make, especially when I have to impute or filter data. I think the key is not pretending the issue doesn’t exist. A strong analyst acknowledges data limitations, explains their impact, and makes the most defensible choice possible based on the context.

Question 7

Difficulty: hard

Tell me about a time you found an insight that changed a business decision.

Sample answer

At one point, I was asked to analyze why a subscription product had slower growth than expected. The early assumption was that pricing was the main issue, but when I segmented new sign-ups by channel and onboarding completion, I found something more important. Customers who came through one acquisition channel were far less likely to finish onboarding, and those users had much lower activation and retention rates. That meant the problem was not just pricing, but a mismatch between the audience and the product experience. I presented the data along with a recommendation to adjust campaign targeting and improve onboarding prompts for that segment. The team decided to shift budget away from the underperforming channel and test a simpler onboarding flow. Over the next few weeks, activation improved noticeably. What I learned from that project is that the obvious answer is not always the right one, and careful segmentation can uncover the real driver behind a business problem.

Question 8

Difficulty: easy

How do you prioritize multiple analysis requests with competing deadlines?

Sample answer

I prioritize based on business impact, urgency, and how dependent other teams are on the work. If two requests come in at the same time, I first clarify what decision each one supports and what happens if the answer is delayed. A report that influences a launch or executive decision usually gets higher priority than a general exploratory request. I also estimate the effort required and whether the request can be broken into phases, so stakeholders can get an initial answer sooner. Communication is important here because people are usually willing to adjust timelines if they understand the tradeoff. I try to be transparent about what I can deliver, when I can deliver it, and what level of detail they should expect. If necessary, I will recommend a smaller version of the analysis that answers the key question first. That approach helps me stay responsive without sacrificing quality or overcommitting.

Question 9

Difficulty: medium

What is your approach to building dashboards that people will actually use?

Sample answer

My approach is to start with the user, not the tool. I ask who the dashboard is for, what decisions they need to make, and how often they will use it. A dashboard that supports weekly operations should look very different from one meant for executive review. I keep the layout simple, highlight the most important KPIs at the top, and make sure the visuals answer specific questions instead of just looking polished. I also add clear definitions and filters so users understand what they are seeing and can explore the data without confusion. Another thing I pay attention to is load time and maintenance, because a dashboard that is slow or unreliable quickly loses trust. Before launch, I usually review it with a few end users to see whether they can find what they need without help. The best dashboards become part of the workflow because they are clear, trustworthy, and relevant.

Question 10

Difficulty: medium

How do you ensure your analysis is accurate before presenting it?

Sample answer

I use a combination of validation checks, peer review, and common sense. First, I verify that the data sources are complete and that the query logic matches the business question. I look for issues like duplicate rows, unexpected nulls, incorrect date filters, or joins that may inflate counts. Then I compare key outputs against known totals, historical trends, or an independent source when one is available. If the numbers look unusual, I dig deeper before assuming the business changed. I also like to sanity-check the result by asking whether it makes sense in context. For example, if one segment suddenly spikes, I want to know whether there was a campaign, a system change, or a data issue. Before presenting, I often have a colleague review the logic if the analysis is important or complex. That extra step catches mistakes and builds confidence in the final output. Accuracy matters because once an incorrect insight is shared, it can affect decisions quickly.