> ## Documentation Index
> Fetch the complete documentation index at: https://resources.devweekends.com/llms.txt
> Use this file to discover all available pages before exploring further.

# Probability and Statistics for Machine Learning

> Master probability and statistics through real-world examples - from house prices to Netflix recommendations

<Frame>
  <img src="https://mintcdn.com/devweeekends/X0Fp4X8lMl-ZftoO/images/courses/statistics-for-ml/probability-real-world.svg?fit=max&auto=format&n=X0Fp4X8lMl-ZftoO&q=85&s=b3e094b448ef649ca9bfd6d5c22770cf" alt="Probability and Statistics for ML" width="1080" height="1080" data-path="images/courses/statistics-for-ml/probability-real-world.svg" />
</Frame>

# Probability and Statistics for Machine Learning

## The Questions That Statistics Answers

You're looking at houses to buy. The real estate agent says: *"This 3-bedroom house is priced at \$450,000 - that's a great deal for this neighborhood!"*

**How do you know if that's true?**

You could:

1. Trust the agent blindly (risky)
2. Look at one other house and compare (not enough info)
3. Analyze ALL houses in the neighborhood to understand what's "normal"

That third option? **That's statistics.**

Statistics helps you answer questions like:

* What's the "typical" house price in this area?
* How much do prices vary?
* Is this house unusually cheap, or is it hiding problems?
* If I wait 6 months, what might prices be?

Think of statistics as a pair of corrective lenses for your brain. Without them, you're squinting at data and seeing patterns that might not be there -- or missing ones that are. With them, the signal separates from the noise.

<Warning>
  **Real Talk**: You probably remember statistics as boring formulas about "mean, median, mode" that you memorized for exams and promptly forgot.

  This time is different. We're going to show you why data scientists get paid \$150K+ to answer these exact questions - and you'll be able to answer them too.
</Warning>

<Info>
  **Estimated Time**: 20-25 hours\
  **Difficulty**: Beginner-friendly (no math prerequisites)\
  **Prerequisites**: Basic Python\
  **What You'll Build**: House price predictor, A/B test analyzer, spam classifier, and more
</Info>

<Accordion title="📋 Prerequisite Self-Check" icon="clipboard-check">
  **Before starting, make sure you can:**

  ✅ **Python Basics**

  * Work with lists and dictionaries
  * Use pandas DataFrames: `df['column']`, `df.mean()`
  * Create basic plots with matplotlib
  * Import and use libraries

  ✅ **Comfort Level**

  * Not afraid of looking at data tables
  * Willing to think about "what's typical" vs "what's unusual"
  * Curious about why experiments need control groups

  ❌ **You DON'T need:**

  * Previous statistics courses
  * Linear algebra (though it helps for regression)
  * Calculus knowledge
  * Any ML/AI experience

  **Recommended Path Options:**

  1. **Standalone**: Just this course if focused on data analysis
  2. **Full ML Prep**: [Linear Algebra](/courses/math-for-ml-linear-algebra) → [Calculus](/courses/math-for-ml-calculus) → This Course
  3. **Parallel**: Take this alongside Calculus course (they complement each other)
</Accordion>

<Accordion title="🧪 Quick Diagnostic: Are You Ready?" icon="flask">
  **Try these checks to gauge your readiness:**

  **Pandas Check** (can you read this code?):

  ```python theme={null}
  import pandas as pd
  df = pd.DataFrame({'price': [250000, 300000, 450000, 380000]})
  print(df['price'].mean())
  print(df['price'].max() - df['price'].min())
  ```

  <details>
    <summary>What do these print?</summary>
    `345000.0` (average price) and `200000` (price range). If you got this, you're ready!
  </details>

  **Intuition Check** (can you answer this?):
  You flip a fair coin 10 times and get 7 heads. Is the coin biased?

  <details>
    <summary>Think about it...</summary>
    Probably not! Getting 7 heads in 10 flips is unusual but not impossibly rare (∼12% chance). This is exactly the kind of reasoning we'll formalize in this course.
  </details>

  **Remediation Paths**:

  | Gap Identified    | Recommended Action                                                               |
  | ----------------- | -------------------------------------------------------------------------------- |
  | Python basics     | [Python Crash Course](/courses/python-crash-course) - 4-6 hours                  |
  | Pandas unfamiliar | [Pandas section of Python course](/courses/python-crash-course#pandas) - 2 hours |
  | Basic arithmetic  | Khan Academy "Basic statistics" - 1 hour                                         |
  | Graphing basics   | YouTube "Reading histograms and scatter plots" - 30 min                          |
</Accordion>

<Tip>
  **Career Multiplier**: Statistics is the language of data-driven decision making. Every tech company makes decisions based on statistical analysis. Understanding these concepts separates product managers who guess from those who know, and data scientists who report from those who impact.
</Tip>

***

## Why Statistics Matters (Before We Even Mention ML)

### Real World Example: The Coffee Shop Owner

Sarah owns a coffee shop. She's considering these decisions:

| Question                                 | What She Needs                    |
| ---------------------------------------- | --------------------------------- |
| "Should I stay open until 10 PM?"        | Average sales by hour + variation |
| "Is my new latte recipe selling better?" | Comparison between old vs new     |
| "How many cups will I sell tomorrow?"    | Prediction from patterns          |
| "Why did sales drop last Tuesday?"       | Outlier detection                 |

**Every one of these is a statistics problem.** No machine learning required!

### The Hospital Administrator

Dr. Patel needs to make decisions with limited data:

| Question                                                         | Statistical Concept      |
| ---------------------------------------------------------------- | ------------------------ |
| "Is this new drug actually better?"                              | Hypothesis testing       |
| "What's the chance a patient has diabetes given their symptoms?" | Bayes' theorem           |
| "Which factors predict heart disease?"                           | Correlation & regression |
| "Is this blood test result normal?"                              | Normal distribution      |

### The E-commerce Manager

Alex runs an online store:

| Question                                    | Statistical Concept  |
| ------------------------------------------- | -------------------- |
| "Did the new checkout page increase sales?" | A/B testing          |
| "Which customers are likely to buy again?"  | Probability          |
| "How confident am I in this survey result?" | Confidence intervals |
| "Are these two product categories related?" | Correlation          |

***

## How This Connects to Machine Learning

Now here's the beautiful thing. Once you understand statistics, **machine learning is just statistics at scale**.

| Statistics Problem                | Machine Learning Version                             |
| --------------------------------- | ---------------------------------------------------- |
| "What's the average house price?" | "Predict ANY house's price from its features"        |
| "Is the new drug better?"         | "Which of 1000 treatments is best for each patient?" |
| "Are height and weight related?"  | "Learn the relationship between 100 variables"       |
| "Is this blood test normal?"      | "Is this transaction fraudulent?"                    |

**Statistics gives you the foundation. Machine learning gives you superpowers.**

But here's what most courses get wrong: they jump straight to ML without building the statistical intuition first. That's like trying to run before you can walk.

<Tip>
  **Why ML Engineers Need Statistics**: The most common reason ML models fail in production is not bad algorithms -- it is bad data understanding. A senior engineer would say: "I can swap models in an afternoon, but misunderstanding the distribution of my training data costs me months." Statistics is how you avoid that trap.
</Tip>

<Note>
  **🔗 ML Connection**: Throughout this course, we'll highlight exactly how each concept powers real ML systems:

  | Statistics Concept      | ML Application                                     |
  | ----------------------- | -------------------------------------------------- |
  | **Mean & Variance**     | Batch normalization in neural networks             |
  | **Bayes' Theorem**      | Naive Bayes classifiers, Bayesian neural networks  |
  | **Normal Distribution** | Weight initialization, understanding model outputs |
  | **Hypothesis Testing**  | A/B tests for model comparison, feature importance |
  | **Regression**          | Linear layers in neural networks, baseline models  |
  | **MLE**                 | Training objective for most ML models              |

  Look for the 🔗 symbol in each module for these connections!
</Note>

***

## 🎮 Interactive Visualization Tools

Statistics is best learned by *seeing* data. Use these tools alongside the course:

<CardGroup cols={2}>
  <Card title="Seeing Theory" icon="eye" href="https://seeing-theory.brown.edu/">
    Beautiful interactive visualizations of probability and statistics. Use with Modules 2-4.
  </Card>

  <Card title="StatKey" icon="key" href="http://www.lock5stat.com/StatKey/">
    Simulate sampling distributions, hypothesis tests, and confidence intervals. Perfect for Modules 4-5.
  </Card>

  <Card title="Regression Visualizer" icon="chart-line" href="https://www.geogebra.org/m/xC6zq7Zv">
    Fit lines to data, see residuals, understand least squares. Use with Module 6.
  </Card>

  <Card title="Distribution Explorer" icon="chart-bar" href="https://distribution-explorer.github.io/">
    Visualize any probability distribution with adjustable parameters. Essential for Module 3.
  </Card>
</CardGroup>

<Note>
  **🔗 When to Use These Tools**:

  * **Module 2 (Probability)**: Seeing Theory - probability chapter
  * **Module 3 (Distributions)**: Distribution Explorer for every distribution we cover
  * **Module 4 (Inference)**: StatKey for sampling simulations
  * **Module 5 (Hypothesis Testing)**: StatKey for test simulations
  * **Module 6 (Regression)**: Regression Visualizer GeoGebra app
</Note>

<Accordion title="🚀 Going Deeper: For Advanced Learners" icon="graduation-cap">
  **Want more mathematical rigor?** Each module includes optional "Going Deeper" sections:

  | Module             | Advanced Topic                   | Why It Matters                                       |
  | ------------------ | -------------------------------- | ---------------------------------------------------- |
  | Probability        | Measure theory foundations       | Understand probabilistic ML rigorously               |
  | Distributions      | Moment generating functions      | Derive distribution properties from first principles |
  | Inference          | Maximum likelihood derivations   | Understand why ML training objectives work           |
  | Hypothesis Testing | Power analysis, multiple testing | Design statistically valid ML experiments            |
  | Regression         | Matrix formulation, OLS theory   | Connect to neural network linear layers              |
  | Bayesian           | Conjugate priors, MCMC           | Foundation for probabilistic ML models               |

  **These sections are OPTIONAL.** You can run A/B tests and build regression models without them. They're for learners who:

  * Have a quantitative background and want the formal treatment
  * Plan to work on probabilistic ML or Bayesian methods
  * Want to understand ML research papers deeply

  **Recommended Resources for Deep Dives:**

  * *Think Stats* by Allen Downey (free, programming-first approach)
  * *Statistical Rethinking* by Richard McElreath (Bayesian, excellent videos)
  * MIT OpenCourseWare 18.05 (rigorous but accessible probability/stats)
</Accordion>

***

## What You'll Learn (The Roadmap)

### 🏠 Module 1: Describing Data

*"What does 'normal' look like?"*

**Real-World Problem**: You're buying a house. What's a fair price?

**What You'll Learn**:

* Mean, median, mode (and when each matters)
* Variance and standard deviation (how spread out are prices?)
* Percentiles (is \$450K in the top 10%?)

**Mini-Project**: Analyze house prices in your city

***

### 🎲 Module 2: Probability Foundations

*"How likely is this to happen?"*

**Real-World Problem**: You're a doctor. A patient tests positive for a rare disease. What's the chance they actually have it?

**What You'll Learn**:

* Basic probability rules
* Conditional probability (given this, what's the chance of that?)
* Bayes' theorem (the most important formula in data science)

**Mini-Project**: Build a spam email detector

***

### 📊 Module 3: Probability Distributions

*"What patterns does randomness follow?"*

**Real-World Problem**: A factory produces light bulbs. How many will fail in the first 1000 hours?

**What You'll Learn**:

* Normal distribution (the bell curve that rules the world)
* Binomial distribution (success/failure events)
* Why these patterns appear everywhere

**Mini-Project**: Quality control simulator

***

### 🔬 Module 4: Statistical Inference

*"How confident can I be from limited data?"*

**Real-World Problem**: You survey 500 voters. Can you predict the entire election?

**What You'll Learn**:

* Sampling and why it works
* Confidence intervals (how sure are we?)
* Standard error (how much could our estimate be off?)

**Mini-Project**: Election predictor from polls

***

### ⚖️ Module 5: Hypothesis Testing

*"Is this difference real or just luck?"*

**Real-World Problem**: Your new website design got 5% more clicks. Is that real improvement or random chance?

**What You'll Learn**:

* Null and alternative hypotheses
* P-values (the most misunderstood concept in statistics)
* A/B testing the right way

**Mini-Project**: A/B test analyzer for websites

***

### 📈 Module 6: Correlation & Regression

*"How are things related?"*

**Real-World Problem**: Do houses with more bedrooms cost more? By how much?

**What You'll Learn**:

* Correlation (are two things related?)
* Simple linear regression (predict Y from X)
* Multiple regression (predict Y from X1, X2, X3...)

**Mini-Project**: House price predictor

***

### 🎯 Module 7: From Statistics to Machine Learning

*"Connecting everything together"*

**Real-World Problem**: You have all these statistical tools. How do they power AI?

**What You'll Learn**:

* The statistical foundations of ML algorithms
* Bias-variance tradeoff
* Cross-validation and model selection
* When to use statistics vs ML

**Capstone Project**: Build a complete prediction system

***

## Course Structure

Each module follows this formula:

**1. Real-World Hook** 🏠

* Start with a problem you can relate to
* No jargon, no formulas yet

**2. Intuition Building** 💡

* Visual explanations with SVG diagrams
* Multiple examples from different domains

**3. The Mathematics** 📐

* Formulas (after you understand why they exist)
* Step-by-step derivations when helpful

**4. Python Implementation** 🐍

* Code from scratch first
* Then the "real" way with libraries

**5. Practice Problems** ✍️

* Exercises with solutions
* Real datasets to explore

**6. Mini-Project** 🚀

* Apply everything you learned
* Build something you can show off

***

## Prerequisites

**Required**:

* Basic Python (variables, loops, functions)
* Willingness to think differently about data
* No math background needed (we build from scratch)

**Helpful but not required**:

* Basic algebra (we'll review what we need)
* NumPy/Pandas experience (we'll teach as we go)

***

## Industry Applications

<CardGroup cols={2}>
  <Card title="Data Science" icon="chart-pie">
    Every data science interview includes probability and statistics. From A/B testing at tech companies to risk modeling at banks.
  </Card>

  <Card title="Machine Learning" icon="brain">
    ML algorithms are built on statistical foundations. Understanding stats makes you a better ML engineer.
  </Card>

  <Card title="Product Analytics" icon="gauge">
    Product managers use hypothesis testing daily to make decisions about features, pricing, and user experience.
  </Card>

  <Card title="Quantitative Finance" icon="chart-candlestick">
    Trading algorithms, risk management, and portfolio optimization all rely heavily on probability theory.
  </Card>
</CardGroup>

***

## Interview Relevance

<Accordion title="Common Interview Topics by Company Type">
  **FAANG / Big Tech**:

  * A/B testing methodology
  * Probability puzzles (conditional probability, Bayes)
  * Experimental design
  * Statistical significance vs practical significance

  **Startups**:

  * Product metrics interpretation
  * Quick hypothesis testing
  * Data-driven decision making

  **Finance / Quant**:

  * Probability distributions
  * Time series concepts
  * Risk quantification
  * Monte Carlo methods

  **Healthcare / Biotech**:

  * Clinical trial statistics
  * Survival analysis basics
  * Multiple testing corrections
</Accordion>

***

## Your Learning Path

```
Week 1-2: Describing Data
  ↓ "What does normal look like?"
Week 2-3: Probability
  ↓ "How likely is this?"
Week 3-4: Distributions
  ↓ "What patterns exist?"
Week 4-5: Inference
  ↓ "How confident am I?"
Week 5-6: Hypothesis Testing
  ↓ "Is this real or luck?"
Week 6-7: Regression
  ↓ "How are things related?"
Week 7-8: Statistics → ML
  ↓ "How does this power AI?"
Final: Capstone Project
```

***

## Let's Start!

Ready to see the world differently? Let's begin with the most fundamental question in all of statistics:

**"What's normal?"**

Not philosophically. Statistically. When you look at a bunch of numbers, what's typical? What's unusual? And how do you tell the difference?

***

## Key Takeaways

<Note>
  **What You'll Master in This Course**:

  * ✅ **Descriptive Statistics** - Summarize any dataset with meaningful numbers
  * ✅ **Probability Theory** - Quantify uncertainty and make predictions
  * ✅ **Statistical Inference** - Draw valid conclusions from limited data
  * ✅ **Hypothesis Testing** - Determine if differences are real or random noise
  * ✅ **Regression Analysis** - Predict outcomes and understand relationships
  * ✅ **ML Foundations** - Connect statistical concepts to machine learning algorithms
</Note>

<Warning>
  **Common Mistake**: Many learners rush through probability to get to "the cool ML stuff." Don't do this! Probability and distributions are the foundation of everything in ML -- from understanding model confidence to debugging training issues. Master the basics and everything else becomes easier.

  Here is a concrete example: if you do not understand what a skewed distribution is, you will use mean squared error on skewed target variables and wonder why your model's predictions are systematically off. That single statistical insight -- "my data is skewed, so I should log-transform the target or use a different loss" -- can improve model performance more than any hyperparameter search.
</Warning>

<Accordion title="🧹 Real-World Data: It's Messy" icon="broom">
  **What textbooks don't tell you:** Real data is messy. Throughout this course, we'll explicitly address:

  | Messy Data Problem        | Where We Cover It | Why It Matters                   |
  | ------------------------- | ----------------- | -------------------------------- |
  | **Missing values**        | Module 2, 6       | 90% of datasets have them        |
  | **Outliers**              | Module 1, 6       | Can destroy your analysis        |
  | **Skewed distributions**  | Module 1, 3       | Mean ≠ median for most real data |
  | **Selection bias**        | Module 4          | Surveys often lie                |
  | **Multiple testing**      | Module 5          | P-hacking is everywhere          |
  | **Confounding variables** | Module 6          | Correlation ≠ causation          |

  **Our approach**: Every module includes a "Real-World Complications" section showing how to handle messy data. You'll work with actual messy datasets, not just clean textbook examples.
</Accordion>

<Card title="Next: Describing Data" icon="arrow-right" href="/courses/statistics-for-ml/02-describing-data">
  Learn to summarize any dataset with the right numbers
</Card>

***

## Interview Deep-Dive

<AccordionGroup>
  <Accordion title="Why should a data scientist study statistics instead of jumping straight to ML frameworks?">
    **Strong Answer:**

    * Statistics builds the reasoning layer that ML frameworks deliberately hide from you. When you call `model.fit()`, you are performing maximum likelihood estimation, gradient-based optimization, and implicit hypothesis testing all at once. Without understanding those pieces, you cannot diagnose why a model is failing.
    * In production, the hard problems are rarely "which API do I call?" They are questions like: "Is the 0.3% lift from the new model real or noise?" or "Why did accuracy drop after the last data pipeline change?" These are pure statistics problems -- confidence intervals, distribution shift detection, and sampling bias.
    * In interviews at companies like Google, Meta, and Stripe, roughly 40-60% of data science questions are statistics-first: conditional probability, experimental design, and p-value interpretation. Candidates who only know sklearn syntax get filtered out at the phone screen.
    * The practical payoff is judgment. A statistician who understands variance will not over-index on a single A/B test result. An engineer who understands confounders will not ship a feature based on a spurious correlation. That judgment is what separates senior from junior practitioners.

    **Follow-up: You mentioned distribution shift. Walk me through how you would detect it in a production ML system.**

    You monitor the input feature distributions over time -- not just model output metrics. Concretely, you compare the distribution of incoming feature values against the training distribution using a two-sample test like the Kolmogorov-Smirnov test or Population Stability Index (PSI). If PSI exceeds 0.2 for a key feature, that is a strong signal the model's assumptions are breaking. At a company processing millions of predictions daily, you would set up automated alerts on these metrics and trigger a retrain pipeline when thresholds are breached. The important nuance is that accuracy can look fine for weeks after a shift starts -- by the time accuracy visibly degrades, you have already served thousands of bad predictions.
  </Accordion>

  <Accordion title="A product manager asks you to 'just use the average' to summarize customer data. What questions do you ask before agreeing?">
    **Strong Answer:**

    * First, I ask about the shape of the distribution. If the data is symmetric and roughly normal -- like standardized test scores -- the average is a fine summary. But most business data is heavily skewed: transaction amounts, session durations, salaries, and page load times all have long right tails where the mean is misleading.
    * Second, I ask what decision is being made. If the PM wants to understand "what does a typical customer experience," the median is almost always better. If they need to forecast total revenue (where the actual sum matters), then the mean multiplied by count is what they need.
    * Third, I check for outliers. A single whale customer spending $50,000 in a dataset of $50 transactions will pull the average up dramatically. I would show both the mean and median side-by-side, and if they diverge significantly, that tells a story about the data that a single number hides.
    * In practice, at any company with real user data, I would present the median alongside the mean and P90/P99 percentiles. That trio -- median, mean, and tail percentiles -- gives stakeholders a complete picture without requiring them to look at a histogram.

    **Follow-up: Give me a real scenario where using the mean instead of the median led to a bad business decision.**

    A classic example: a SaaS company reported "average customer lifetime value is $2,400" to justify a $1,500 customer acquisition cost. But the median CLV was only $800 -- a handful of enterprise contracts worth $50K-\$200K were inflating the average. The company was actually losing money on 70% of its customers. When the board finally looked at the median and the full distribution, they realized the acquisition strategy was unsustainable for the SMB segment and pivoted their targeting. The lesson: whenever you see "average revenue per user" in a pitch deck, ask for the median. The gap between mean and median tells you how concentrated the value is.
  </Accordion>

  <Accordion title="Explain the connection between statistical inference and machine learning model evaluation.">
    **Strong Answer:**

    * At the core, ML model evaluation IS statistical inference. When you report "accuracy = 92% on the test set," you are computing a sample statistic from a finite sample and hoping it generalizes to the population of all future inputs. The standard error and confidence interval around that 92% tell you how much to trust it.
    * Cross-validation is really a form of repeated sampling. Each fold gives you one sample estimate of model performance, and the variance across folds gives you the standard error. A model with 92% mean accuracy but 8% standard deviation across folds is far less trustworthy than one with 89% mean and 1% standard deviation.
    * Hypothesis testing appears whenever you compare two models. If Model A gets 91% and Model B gets 93% on the same test set, you cannot just declare B the winner. You need a paired statistical test (like a paired t-test on fold-level metrics) to determine if the 2% gap is real or just sampling noise.
    * The bias-variance tradeoff is itself a statistical decomposition. Expected prediction error equals irreducible noise plus bias-squared plus variance. Understanding this decomposition -- which comes directly from probability theory -- is what tells you whether to make your model more complex or collect more data.

    **Follow-up: How would you explain to a junior engineer why reporting accuracy without a confidence interval is incomplete?**

    I would use this analogy: reporting "92% accuracy" without a confidence interval is like a political poll saying "Candidate A is at 52%" without mentioning the margin of error. If the margin of error is plus or minus 4%, that 52% could easily be 48% -- and the whole conclusion flips. Same with model accuracy. If you evaluated on 200 test samples, your 92% has a much wider confidence interval than if you evaluated on 20,000 samples. Two models at 92% and 90% might have overlapping confidence intervals, meaning you cannot actually distinguish them. The CI forces intellectual honesty about what you actually know versus what you are guessing.
  </Accordion>

  <Accordion title="A colleague says 'correlation equals 0.95, so X clearly causes Y.' How do you respond?">
    **Strong Answer:**

    * I would gently but firmly push back. Correlation measures linear association -- it says nothing about the direction of causation or whether a third variable is driving both. The classic example is ice cream sales and drowning deaths: r is very high, but the confounding variable is hot weather causing both.
    * To establish causation, you need one of three things: a randomized controlled experiment (the gold standard), a natural experiment with an instrumental variable, or a carefully designed observational study that controls for all known confounders plus passes sensitivity analysis for unmeasured ones.
    * In a data science context, this mistake is expensive. A company might see that "users who use Feature X have 40% higher retention" and conclude Feature X drives retention. But it could be that highly engaged users both use Feature X and retain better -- the feature is a symptom of engagement, not a cause. Shipping a prompt to force all users into Feature X would not move the retention needle.
    * The way I think about it: correlation is a necessary condition for linear causation, but it is nowhere near sufficient. When I see a strong correlation in observational data, my first instinct is to ask "what confounders could explain this?" not to celebrate.

    **Follow-up: How would you design an experiment to test whether Feature X actually causes improved retention?**

    I would run a randomized A/B test. Randomly assign 50% of new users to see a prominent onboarding prompt pushing them toward Feature X, and 50% to the normal experience. Then measure 30-day retention for both groups. The randomization ensures that the groups are balanced on all confounders -- both measured and unmeasured. If the prompted group shows significantly higher retention, that is causal evidence that Feature X exposure (or at least the prompt) drives retention. The important subtlety: you are testing the intent-to-treat effect (the prompt), not Feature X usage itself, because you cannot force usage. You would also want to check that the prompt actually increased Feature X usage in the treatment group, forming a proper causal chain.
  </Accordion>
</AccordionGroup>
