How to Evaluate Your Coaches (and What to Do With the Results)

Most gyms handle coach evaluation one of two ways: they don’t do it at all, or they do it once a year in a sit-down that nobody enjoys and nothing changes. Neither works.

The operators who consistently have strong class delivery and low instructor turnover aren’t doing anything exotic. They have a rubric, they observe regularly, and they follow through on what they find. That’s it.

Here’s a system you can run without an HR department.

Why Most Evaluations Fail

The conversation-based annual review falls apart because it relies on memory and perception instead of observed evidence. You end up reviewing your general impression of a coach, not their actual delivery. If they’re likeable, they score well. If they’ve had one bad interaction with you recently, they score poorly. Neither reflects what’s happening on the floor.

The other failure mode is skipping it entirely. “I can tell when a coach is good” is true up to a point. But without a consistent framework, you’re making staffing decisions on gut feel, and you’ll miss slow declines until they’ve already cost you members.

Build a Rubric First

Before you observe anyone, write down what “good” looks like in your gym. The criteria should reflect your specific format and culture, but most group fitness operations need to assess five areas:

1. Technical delivery

Does the coach cue movement accurately?
Are corrections timely and effective (not ignored, not constant)?
Do they adapt cues when members aren’t getting it?

2. Class management

Does the session start and end on time?
Is the energy level appropriate for the format?
Does the coach control the room without controlling every moment?

3. Member connection

Do they know names? Regular members, not just their favorites.
Do they notice who’s new, who’s struggling, who’s killing it?
Is the post-class interaction genuine or a routine?

4. Programming execution

Are they following the intended plan or ad-libbing?
If they modify, are modifications intentional and communicated?
Do they understand the “why” behind the session structure?

5. Professionalism

Are they there early enough to set up and greet arrivals?
Do they handle late joiners, equipment issues, or complaints without drama?
Do they represent the brand in the way you’d want?

Score each area 1-5. Five categories, 25 points maximum. Keep it simple enough that you’ll actually use it.

Two Types of Observations

Once you have a rubric, build in two observation types per coach per year:

Planned observations The coach knows you’re watching. These are useful for evaluating how a coach performs when they’re prepared. It removes the “caught off guard” variable and sets a fair baseline. Give them notice, observe a full session, score the rubric, debrief within 48 hours.

Unplanned drop-ins You show up at a random class, sit in the back, score the rubric. Don’t announce it in advance. The gap between planned and unplanned scores tells you something. A coach who drops two or three points when they don’t know you’re watching has a consistency problem worth discussing.

Combine the two scores for an annual picture. One observation of each type is enough for most gyms. High-volume operations or coaches with known issues might warrant more.

The Debrief Conversation

The observation is wasted if the debrief is vague. Three things make it useful:

Lead with specifics, not impressions. “Your energy was great” is useless. “You greeted four members by name in the first two minutes and transitioned from the warm-up without losing the room” is actionable information. Same on the critical side: “Some of your cues weren’t landing” versus “Three members were bracing before the hip hinge when you wanted them to hinge first.”

Make it a two-way conversation. Ask what they thought went well and where they felt the session stall. Good coaches will often identify the same issues you saw. When they don’t, you’ll learn something about their self-awareness.

Agree on one or two specific things to improve. Not a list of everything that could be better. One or two clear, observable changes with a timeline. Then check in on those specifically at the next observation.

What to Do With Low Scores

A single low score isn’t a performance issue. Patterns are.

If a coach scores below 15/25 on a planned observation, schedule a follow-up in 60-90 days and be explicit about what you’re looking for. Give them the rubric if you haven’t already. Most coaches who are underperforming have no idea what the standard actually is.

If scores are consistently low across two cycles, or if technical delivery is a persistent weak point, you have a skills gap. That’s a training conversation, not necessarily a termination conversation. Pair them with a stronger coach. Invest in a specific credential. Adjust their schedule to lower-stakes time slots while they develop.

If attitude or professionalism is the issue and it’s not improving, that’s different. Culture problems cost you more than one coach’s value.

Set Standards at Hire, Not After

The most common mistake is introducing an evaluation system to coaches who’ve been with you for years and who’ve never been told what you expect. That’s a setup for defensiveness.

The fix is front-loading. When you bring a new coach on, give them the rubric on day one. Tell them you do planned and unplanned observations. Tell them what scores mean. None of this should be a surprise at evaluation time.

For existing coaches, frame the introduction honestly: you’re formalizing a process that’s going to help everyone get better. It’s not remediation. Then follow through on that framing by treating the first round of evaluations as developmental, not disciplinary.

A Note on Frequency

Two formal observations per year is a floor, not a ceiling. If you have fewer than 10 coaches, you can realistically do one unplanned drop-in per coach per quarter without it becoming your second job. For larger teams, prioritize your newest coaches and anyone showing signs of decline.

The informal conversations matter too. Walking the floor during classes, asking for feedback from members, watching how coaches interact before and after sessions. Formal evaluation confirms what you’re already seeing, or surfaces what you’re missing.

Build It Once, Use It Consistently

The operators who get the most out of evaluation systems aren’t necessarily the ones with the most sophisticated rubrics. They’re the ones who do it consistently. Same rubric, same process, every coach, every year.

Consistency is what makes the data useful. When a coach improves their connection score from a 3 to a 5 over two cycles, you can see it and recognize it. When someone’s technical delivery starts slipping, you catch it early. You can’t do either if you’re reinventing the process every time.

Build it once. Run it consistently. The classes get better.

Coach evaluations and schedule decisions are closely connected - if a class is underperforming, it’s worth knowing whether the issue is timing or delivery. The group fitness schedule audit walks through how to tell the difference. For a deeper look at why instruction quality is the product, The Workout Isn’t the Product. The Instruction Is. picks up where the rubric leaves off. And if you’re evaluating coaches in strength formats specifically, what makes group strength classes actually build members outlines the coaching competencies that are different from cardio or mixed formats.

Why Most Evaluations Fail

Build a Rubric First

Two Types of Observations

The Debrief Conversation

What to Do With Low Scores

Set Standards at Hire, Not After

A Note on Frequency

Build It Once, Use It Consistently

More from Operations

Why Members Really Cancel (And It's Not the Price)

The Pilates Boom Is Real. Here's Why Studios Are Struggling to Keep Up

Your Average Class Has 6 People In It. Are You Charging for That?