Skip to main content
← Perspectives Agency Hiring

The Agency Hiring Scorecard: 12 Questions That Predict Whether Your Next Hire Will Stick

The Agency Hiring Scorecard: 12 Questions That Predict Whether Your Next Hire Will Stick

You interviewed someone for 45 minutes. They were articulate, experienced, and well-prepared. They answered every question thoughtfully. You left the conversation feeling confident.

Six weeks later, you are having a very different conversation.

The problem isn’t that your interview was bad. The problem is that it measured the wrong things. Interview questions test whether someone can describe their work. They don’t test whether someone can do the work in your specific environment. Describing and doing are different skills. Your interview optimized for the first.

Structured evaluation research consistently shows that traditional interviews predict job performance at roughly the same rate as a coin flip. Not because interviewers are bad at their jobs. Because the format itself measures fluency, not fit.

Here are 12 questions that measure fit. They’re organized around the four dimensions of what we call the Work Formula: the framework that predicts whether someone will thrive in a specific seat. Human Energy, Context, Behaviors, and Alignment. Three questions per dimension. Each question has a specific purpose, a green-flag answer, and a red-flag answer.

These 12 questions are a filter, not a verdict. They’ll tell you whether to keep talking. They won’t tell you whether to hire. The difference between a filter and a verdict is the difference between a flashlight and a floodlight. One shows you what is directly ahead. The other shows you everything.

Human Energy: Who This Person Is at Their Core

These questions assess the candidate’s intrinsic motivation, creativity, and drive. Skills can be trained. Energy can’t. A candidate with the right energy in the wrong role will eventually leave. A candidate with the wrong energy in the right role will eventually be managed out.

Question 1: “Tell me about the last project where you completely lost track of time.”

What it reveals: Where the candidate’s genuine engagement lives. Not what they claim to enjoy. What they actually get absorbed by.

Green flag: A specific, detailed story with emotional texture. They remember the project, the problem, and why it hooked them. The work they describe maps to the work your role requires.

Red flag: A generic answer about “challenging projects” or “working with great teams.” Lack of specificity suggests they haven’t experienced deep engagement recently, or their engagement is in a domain unrelated to the role.

Question 2: “What kind of work drains you, even when you are good at it?”

What it reveals: The gap between capability and motivation. Many people can do work that slowly depletes them. They perform well on paper while becoming increasingly disengaged.

Green flag: An honest, specific answer. The draining work they describe is genuinely different from the core of your role. Even better: they explain what makes it draining, which reveals self-awareness about their own operating patterns.

Red flag: “Nothing really drains me” or “I am pretty adaptable.” Everyone has energy drains. A candidate who can’t identify theirs hasn’t done the self-reflection, or they are performing adaptability rather than demonstrating it.

Why this matters at an agency: Agency work requires sustained intensity across multiple clients. A media buyer who is drained by context-switching will burn out in an environment that requires managing six accounts simultaneously. The skill isn’t the issue. The energy cost of the skill is.

Question 3: “What would your last manager say you care about too much?”

What it reveals: The candidate’s over-indexing tendency. Every strong performer has something they care about disproportionately. The question is whether their over-indexing tendency helps or hurts in your specific environment.

Green flag: Something that aligns with the role’s demands. An account manager who “cares too much about client satisfaction” is showing you a relevant strength. A media buyer who “obsesses over data accuracy” is showing you precision.

Red flag: Something that conflicts with the role’s demands. An operations manager who “cares too much about getting buy-in from everyone” may struggle in an environment where decisions need to happen quickly without consensus.

Context: How They Work in Your Environment

These questions assess whether the candidate will thrive in your specific operating environment. The same person who excels at a 200-person agency may fail at a 15-person one. Context fit is the most overlooked predictor of retention.

Question 4: “Describe the most ambiguous work situation you have been in. What did you do before anyone told you what to do?”

What it reveals: Ambiguity tolerance and self-direction. Agency environments, especially sub-50-person ones, are structurally ambiguous. Processes are informal. Authority is unclear. The right person navigates this naturally.

Green flag: A story where they took action without waiting for permission or clarity. They defined the problem, chose a direction, and adjusted as they learned more. The ambiguity energized them rather than paralyzed them.

Red flag: A story where they sought clarity before acting. “I scheduled a meeting with my manager to define the scope.” This is a perfectly valid response in a corporate environment. It’s a warning sign at an agency where the manager is the founder and the founder is in a client meeting.

Question 5: “How many hours into your workday do you usually need before you are fully up to speed?”

What it reveals: Pace match with your agency’s rhythm. Some people need a slow ramp-up: coffee, email review, calendar check, then deep work. Others arrive operating at full speed.

Green flag: An answer that matches your agency’s actual pace. If your agency has 9 AM client calls three days a week, you need someone who is sharp at 9 AM. If deep work happens in the morning and meetings cluster in the afternoon, you need someone who can do focused work early.

Red flag: Not the answer itself, but a mismatch with your reality. A slow starter at a fast-paced agency will be behind before lunch. A fast starter at a methodical organization will feel frustrated by the warm-up culture. Neither is wrong. Both are mismatches.

Question 6: “When you disagree with a decision your manager has made, what do you do?”

What it reveals: How they navigate authority in real time. This question sounds standard. The diagnostic power is in the follow-up.

Green flag: They describe raising the disagreement directly, making their case, and then executing the decision regardless of whether their view prevailed. The sequence matters: voice the concern, then commit.

Red flag: Two opposite red flags exist. “I just do what I’m told” suggests someone who won’t push back when they see a problem heading toward a client. “I would keep arguing until I convinced them” suggests someone who will create friction in an environment where speed matters more than consensus.

The counterintuitive read: A candidate who says “I disagreed, I said so, and then I did it their way because they had context I didn’t” is showing you something rare. They have the confidence to challenge and the discipline to defer. At an agency, this combination is more valuable than either trait alone.

Behaviors: What They Actually Do

These questions test observable patterns, not self-reported skills. Behavior questions should reference specific past situations, not hypothetical futures. “What would you do if” questions test imagination. “Tell me about a time when” questions test behavior.

Question 7: “Tell me about a time when you had to say no to a client or manager who was asking for something reasonable.”

What it reveals: Boundary-setting skill. The word “reasonable” is the key. Anyone can refuse an absurd request. Refusing a reasonable one requires a different skill: the ability to protect the team or the project without making the client feel dismissed.

Green flag: The candidate explains what made the request reasonable, why they still said no, and how they offered an alternative. They didn’t just refuse. They redirected. The client relationship survived.

Red flag: They have never said no to a reasonable request. Or their example involves an unreasonable request, which is a different skill entirely. The hardest “no” at an agency is the one where the client is right that the request is fair but the team can’t absorb it without breaking something else.

We tested account manager candidates with a scenario built around this exact tension. A client requests two extra pages on a website redesign project. Reasonable scope expansion. The delivery team is at capacity. The written response reveals whether the candidate defaults to compliance, refusal, or reframing. The live role-play reveals which of those is their actual behavior under pressure.

Written and live responses match in strong candidates. They diverge in candidates who know the right answer but can’t execute it.

Question 8: “What is your process for handling three competing deadlines when you can’t complete all three on time?”

What it reveals: Prioritization philosophy and communication instinct. The question isn’t about time management. It’s about triage and transparency.

Green flag: They describe a triage process: assess impact, communicate proactively with the client, the team, and anyone waiting on the work about the tradeoff, and make a judgment call about what to delay. The communication step is non-negotiable. Silent prioritization is a red flag.

Red flag: “I would work extra hours to finish all three.” This sounds admirable. It’s actually a sign that the candidate doesn’t know how to triage and will burn out rather than disappoint. At an agency, the ability to disappoint strategically is more valuable than the ability to overwork.

Question 9: “Tell me about a project that failed. What did you do in the first hour after you realized it was failing?”

What it reveals: Crisis response pattern. Not their retrospective analysis. Their in-the-moment behavior. The “first hour” framing prevents rehearsed answers about lessons learned.

Green flag: They describe a specific action they took, not a feeling they had. “I called the client before they found out from someone else.” “I pulled the team together and asked what we could salvage.” “I stopped the campaign spend before we lost more budget.”

Red flag: Their first response to failure was to analyze, document, or assign blame rather than act. Analysis is valuable. It’s a second-hour activity. In the first hour, agency problems require action.

Alignment: Whether This Seat Fits Their Life

These questions assess whether the role delivers what the candidate is actually optimizing for. Alignment is the most subtle dimension and the one most interviews skip entirely. A candidate can have the right energy, fit the context, and exhibit the right behaviors, and still leave in eight months because the role doesn’t give them what they need from work.

Question 10: “What are you optimizing for in your career right now? Not eventually. Right now.”

What it reveals: Whether the role gives the candidate what they are currently seeking. “Right now” is the critical modifier. Someone optimizing for stability won’t thrive in a high-growth agency. Someone optimizing for learning won’t stay in a role that becomes routine.

Green flag: A clear, specific answer that aligns with what your role actually provides. If they are optimizing for autonomy and your role offers genuine autonomy, that is alignment. If they are optimizing for mentorship and you have no one to mentor them, that is a mismatch you can identify before making the offer.

Red flag: Aspirational answers disconnected from the role. “I want to build a team” when the role is an individual contributor. “I want strategic influence” when the role is execution-focused. These aren’t wrong desires. They’re misaligned with the seat.

Question 11: “What did your last role get right about how work should work? What did it get wrong?”

What it reveals: The candidate’s work environment preferences, stated through experience rather than abstraction. The “right” part tells you what conditions they need. The “wrong” part tells you what conditions they are leaving.

Green flag: Their “right” answers describe features of your environment. Their “wrong” answers describe features absent from your environment. The candidate is moving toward conditions you already have.

Red flag: Their “wrong” answers describe features present in your environment. If they are leaving because of “too many clients and not enough focus time” and your agency manages eight accounts per person, you have learned something important before making the offer.

Question 12: “If this role were exactly what we described, but the salary were 10 percent less, would you still be interested? What about 10 percent more at a role that matched less well?”

What it reveals: The relative weight of compensation versus fit in the candidate’s decision calculus. This isn’t a salary negotiation tactic. It’s a retention predictor.

Green flag: They would take the pay cut for the right fit, or at least struggle with the decision. This tells you they are choosing the role for reasons beyond money, which means they are less likely to leave for a marginal salary increase elsewhere.

Red flag: They would take the higher salary without hesitation. Compensation-first candidates aren’t wrong. They’re expensive to retain, because any competitor can outbid you.

Why this question is counterintuitive: Most interviewers avoid asking about salary tradeoffs because it feels aggressive. The research on job satisfaction shows that the relationship between pay and retention flattens above a threshold. Beyond that threshold, fit and autonomy are stronger predictors of whether someone stays. This question reveals which side of that threshold the candidate lives on.

How to Use These 12 Questions

These questions are a diagnostic, not a decision engine.

A candidate who answers all 12 well has passed the first filter. They may still be wrong for the role. A candidate who answers three poorly has failed the filter. They’re almost certainly wrong for the role.

The questions tell you what to test next. If a candidate’s energy answers are strong but their context answers reveal a pace mismatch, you know exactly what to probe in the next round. If their behavior stories are compelling but their alignment answers suggest they are optimizing for something the role doesn’t offer, you know the risk before you make the offer.

For role-specific depth on how these dimensions play out in practice, see our guides on hiring media buyers, account managers, and operations managers at agencies.

What these questions don’t do is replace behavioral evidence. A candidate can describe the right behaviors in an interview and exhibit different ones under pressure. The gap between description and demonstration is where bad hires live. Structured evaluation research calls this the “intention-behavior gap.” People genuinely believe they will behave as they describe. The work environment changes the equation.

This scorecard is what we use in the first 30 minutes of discovery. If you want the other 30 questions, the Work Drivers mapping across 32 dimensions, and the behavioral evidence testing through work simulations, that is what the $7,500 covers.

The best hires are never surprising in hindsight. They’re only surprising because nobody asked the right questions in advance.

These 12 questions are yours. Use them in your next interview. If they surface a pattern you can’t explain, that’s the signal that you need the other 30 questions. Our discovery process takes three hours of your time. We charge a flat $7,500, not a percentage of salary. Every placement carries a 120-day guarantee. 90% of our placements are still in role at 18 months.

Book a Scoping Call. We’ll tell you what your scorecard missed.

You've got the questions. See what the answers look like in practice.

Thirty minutes. No pitch. We'll walk through what your scorecard missed and whether discovery-led hiring fits.

Book a Scoping Call →