Product manager and discovery
Contents:
What discovery actually is
Discovery is the part of product management that answers what to build, not how to build it. The goal is to understand which user problems exist, which are worth solving, and what shape the solution should take — before the team spends a quarter writing code nobody wanted. A senior PM at Stripe or Notion is paid roughly $220k base plus equity to be right about this question more often than wrong, and the only reliable way is to stay in close contact with users.
Without discovery, the PM degrades into a "ticket writer": work flies in from leadership and sales, the team grinds it out, and metrics stay flat for two quarters. One sprint at a 6-person team is roughly $60–80k of fully-loaded engineering time — shipping the wrong feature burns that plus the opportunity cost of the right feature.
A team without a discovery practice: retros never ask "what user pain did this close?" — only "did we hit the dates?"
Discovery vs delivery
Delivery is the craft side: spec, design, build, ship, monitor. Discovery is the research side: interviews, prototypes, hypothesis tests, data digs. It is what separates a Linear-style product team from a feature factory.
Strong teams run both in parallel — while engineering ships feature A, the PM is already discovering feature B. Weak teams skip discovery and run on "the CEO's intuition," which is fine until the CEO is wrong twice in a row.
| Dimension | Discovery | Delivery |
|---|---|---|
| Core question | What to build | How to build it |
| Main artifact | Opportunity tree, hypothesis log | PRD, technical spec |
| Success metric | Share of validated hypotheses | Release velocity and quality |
| Primary risk | Building the wrong thing | Building it badly |
| Owners | PM + design + research | PM + engineering |
The same person often owns both — and that is exactly where the time pressure to skip discovery comes from.
Continuous discovery cadence
The approach popularized by Teresa Torres reframes discovery as a weekly habit, not a quarterly project. The load-bearing rule: at least one user touchpoint per week, every week, indefinitely.
Load-bearing rule: if your team has not talked to a user in the last 7 days, you do not have a discovery practice — you have a quarterly research event with PM theatre in between.
Regularity beats volume. One interview a week for a year (≈50 touchpoints) gives more signal than 50 bunched into one quarter — you see drift earlier, catch competitor moves sooner, and the team builds working memory per segment.
| Team size | Interviews / week | Length | Monthly theme |
|---|---|---|---|
| 1 PM, 3–5 engineers | 1–2 | 30–45 min | One opportunity branch |
| 1 PM, 6–10 engineers | 2–3 | 30–60 min | One outcome + adjacent branch |
| PM + dedicated researcher | 3–5 | 45–60 min | Two outcomes, paired analysis |
| Platform / B2B team | 1–2 deep | 60–90 min | One persona, rotating accounts |
The "monthly theme" column is what most teams miss. Without a theme, weekly interviews drift into anecdote collection. With one — "this month we figure out why annual plans convert at half the monthly rate" — every conversation compounds into a position you can defend on the roadmap.
Opportunity-solution tree
The tree is the artifact that holds the whole practice together. Four layers:
- Outcome — the business result you are moving. Example: "lift activation 38% → 50% in Q3."
- Opportunities — user pains that, if solved, would move the outcome.
- Solutions — concrete bets that could close one of those opportunities.
- Experiments — how you will test that a solution works, before committing.
Example tree — activation outcome:
- Outcome: activation 38% → 50% in Q3
- Opportunity: users do not finish onboarding because step 3 needs a teammate
- Solution: ghost teammate mode for solo signups
- Experiment: A/B vs current flow on 30% of signups, n=8,000, 14 days, primary D1 activation
- Opportunity: users do not realize the free tier covers their use case
- Solution: pricing-page rewrite + delayed upsell
- Experiment: holdout on new pricing, two-week ramp, guardrail = paid conversion
- Opportunity: users do not finish onboarding because step 3 needs a teammate
The tree lets the team see the whole map. Without it, discovery devolves into a random list of features top-of-mind for whoever spoke last in standup.
Three antipatterns. First, an opportunity as a solution in disguise — "build a referral feature" is a solution; the opportunity is "power users have no way to bring in teammates." Second, 50 leaves on one branch — you have not prioritized opportunities. Third, a tree where you cannot see disproved hypotheses — it becomes a graveyard, not a living artifact.
Discovery toolkit
Each tool answers a different question. A senior PM picks the right one rather than running all in parallel.
| Question | Right tool |
|---|---|
| What is the pain and how sharp is it? | Depth interviews + support tickets |
| How many people have this pain? | Surveys + product analytics |
| Is the solution understandable? | Usability test on prototype |
| Will people actually use it? | Prototype or Wizard-of-Oz test |
| Does it move the metric? | A/B experiment in production |
| What alternatives do users have today? | Competitive teardown + interview probing |
Surveys are not a substitute for interviews — they scale answers to questions you already understand, not new ones. Wizard-of-Oz (faking automation with humans behind the curtain) is brutally underused; it validates willingness to pay before writing backend. Support and sales calls are the freshest signal in the building — most PMs ignore them, then act surprised when a competitor ships the exact feature support has been begging for since January.
Problem interview script
A 30–45 minute interview, six sections:
- Warm-up (2 min) — who they are, what they do. No product talk yet.
- Context (5–10 min) — when they last hit the scenario, what alternatives they considered.
- Story (10–15 min) — "walk me through the last time you did X." Not "would you use a feature that does Y?"
- Pain dig (5–10 min) — what was hardest, how they worked around it, what the workaround cost.
- Do not show the solution. Pitching contaminates the data.
- Close (2 min) — who else should you talk to? Referral chains find the next five interviews without a recruiting budget.
The rule that separates good interviews from theatre: ask about the past, not the future. "Would you use a feature like this?" is almost always polite fiction. "Tell me about the last time you tried X" leaves receipts.
Artifacts and rituals
Discovery without artifacts is just conversation. The minimum kit: a user segment map (who they are, what jobs they hire the product for), a hypothesis log (assumed / testing / validated / killed), the opportunity-solution tree, an interview journal with quotes and recording links, and a PRD only when a solution graduates into delivery.
Rituals worth keeping: a discovery weekly (30–60 min) to review what we heard and what we are testing next; a quarterly review to prune the tree and retire dead opportunities; and a reverse demo where the team shows the PM what they learned from users instead of the PM showing them a feature. The trap is over-formalizing — discovery should help the team think clearly, not drown them in Notion templates.
Rolling it out from zero
A realistic 6-month rollout for a team that has never done discovery:
- Week 1. Put a recurring 45-minute discovery weekly on the calendar. Find one user. Open a hypothesis spreadsheet with five rows.
- Weeks 2–4. One interview per week. Fifteen-minute team debrief after each. No artifacts yet beyond the journal.
- Month 2. Pull a designer and one engineer into interviews. Draft the first opportunity-solution tree against the current outcome.
- Month 3. Link the roadmap to the tree. Every roadmap item must show which opportunity it closes — items that cannot answer get cut or parked.
- Months 4–6. Push cadence to 2–3 interviews per week. Introduce reverse demos. Write the "ready for delivery" checklist that gates the handoff from tree to PRD.
The signal that discovery has taken root: at sprint planning, an engineer asks "what pain does this close?" without the PM prompting. When that happens unprompted in two consecutive plannings, the practice is real.
Common pitfalls
The most common failure is treating discovery as a one-off phase. The team runs a big research project at the start of the quarter, captures findings in a deck, then defaults back to delivery for 10 weeks. By week six the findings are stale and the team has forgotten half the deck. The fix is to drop the project framing and replace it with the weekly cadence — every week the team learns one thing and updates the tree.
A close cousin is doing discovery alone. The PM books interviews, takes notes, writes the summary, presents conclusions like a verdict. Engineers and designers nod, then build whatever they would have built anyway — they were not part of the reasoning. Require a designer and one engineer in at least one interview per month, and make the debrief a conversation, not a presentation.
A third trap is jumping straight to solutions. The PM hears a pain on Tuesday and by Friday there is a Figma file. Without the full map under the outcome, the team optimizes for whichever pain spoke loudest last week. If a solution does not slot under a prioritized opportunity, it does not get built — no matter how shiny the mock.
A more subtle pitfall is presenting conclusions as facts. Anything coming out of discovery is a hypothesis. Treating it as a finding means you skip the experiment and ship based on what 8 users said in a Zoom call. Every claim in the tree should carry a confidence level and an expected experiment.
Finally, asking about the future is the interview trap everyone repeats. "Would you pay for this?" produces false positives at scary rates. The only reliable material is past behavior — what they actually did, what they spent, what they switched to.
Related reading
- Product manager and AI
- Product manager and experimentation
- JTBD framework: how to apply
- North Star metric for PM
- Product manager case interview guide
To drill PM cases, metric trees, and discovery interview prompts daily, NAILDD is launching with a structured library of product-sense and analytics problems from real PM interview loops.
FAQ
How much time should a PM spend on discovery?
A reasonable target is 30–50% of working hours. If 100% of your week disappears into delivery — standups, JIRA grooming, release coordination — the product will keep shipping but metrics will stop moving within two quarters. Defend the budget by putting discovery touchpoints on the calendar before delivery fills it.
Where do I start if my team has never done discovery?
Start with one weekly interview and a hypothesis log. That is it. Do not try to install the full Teresa Torres playbook in week one — rituals stick only after the team feels the value, which shows up around the third or fourth interview when someone says "wait, that is the third person who mentioned that."
Do I need a dedicated researcher?
If you have one, let them lead the hard sessions and segment work. If not, the PM runs the interviews — but never outsource discovery entirely. The point is for the product team to stay in direct contact with users; a layer of researchers reintroduces the telephone-game problem discovery was meant to fix.
How do I protect discovery from "we have no time"?
Three defenses. Put it on the calendar as a recurring meeting — slots are easier to defend than aspirations. Tie discovery output to the roadmap so leadership sees the link. Make interview count a team metric, not a PM metric — once engineering and design are accountable, the pressure stops landing on one person.
How is discovery different from research?
Research is the broader discipline — UX, market, academic. Discovery is the specific operating practice a product team uses to find the right things to build. Research is one tool inside discovery, alongside analytics, prototypes, and experiments.
How many interviews are enough to make a decision?
Qualitative patterns usually stabilize at 5–8 interviews per segment — beyond that you mostly hear repeats. Interviews tell you the shape of the pain, not the size. For sizing, run a survey with n ≥ 200 or pull analytics on the relevant funnel. Interviews for shape, quant for size.
How do I know discovery is failing?
If a quarter goes by and no hypothesis was killed, that is not success — you are either asking soft hypotheses you already know the answer to, or bending data to fit. A healthy practice retires roughly one in three hypotheses; if your kill rate is zero, the questions are not honest yet.