Category Deep-Dives

Professional Services Spend Is the Hardest Category to Benchmark — Here's Why and What to Do

Sofia Mendes 9 min read
Abstract visualization of professional services spend benchmarking

If you ask procurement practitioners which indirect spend category is hardest to benchmark, professional services — legal, consulting, accounting, IT services, staffing, recruiting — comes up in almost every answer. And there are good structural reasons why. The frustrating part is that difficulty benchmarking doesn't mean you can't do it; it means most people stop trying before they get to useful data.

Professional services typically represents 15–30% of total indirect spend for mid-market companies, which puts it well inside what procurement should be actively managing. But the category gets treated as unbenchmarkable because the pricing variables are real and the category spans services that seem incomparable on the surface. A legal brief and a management consulting engagement aren't the same thing — and even within legal, a trademark registration and an M&A review aren't the same thing.

That variability is real. It doesn't mean benchmarking is impossible. It means you have to benchmark at the right level of granularity, against the right comparison set, with a clear understanding of what the benchmark can and can't tell you.

Why Standard Benchmarking Approaches Break Down Here

The Deliverable Problem

When you benchmark janitorial services, the unit is square footage cleaned per week at a given service level. The unit is stable and comparable across vendors. Professional services don't have a stable unit. A $45,000 "strategy consulting engagement" from one firm is a two-week sprint with two senior consultants producing a 40-page deck. The same label from another firm might be a ten-week engagement with rotating team members producing a recommendation report that gets revised four times. The invoice says the same thing; the value delivered is entirely different.

This means you can't benchmark professional services at the engagement level the way you can benchmark a commodity. What you can benchmark is the rate card — the underlying hourly or daily rate for each seniority tier and service type — and that's where the useful price intelligence lives.

The Relationship Variable

Professional services pricing is explicitly relationship-dependent in ways that commodity categories are not. A law firm with a long-standing client relationship and significant institutional knowledge might reasonably charge a different rate than an equivalent firm starting fresh. A consulting partner who knows your business deeply has different value than one who doesn't — and that shows up in effective hourly rate even if the nominal rate looks similar.

This is real but also frequently overstated. In our experience, mid-market companies invoke the relationship argument to avoid benchmarking entirely rather than to calibrate how much relationship premium is justified. A 10–15% premium for a genuinely embedded, high-knowledge supplier might be defensible. A 40% premium isn't, regardless of how long you've worked together.

The Scope Instability Problem

Professional services engagements are notoriously prone to scope expansion. A fixed-fee audit that expands into "additional advisory services" billed at time and materials, a recruiting retainer that includes six months of candidate relationship maintenance that wasn't in the original SOW — these create situations where the billed amount has little relationship to what was originally agreed and benchmarked.

Maverick spend in professional services frequently comes not from rogue purchasing but from approved contracts that drift beyond original scope through informal expansions that no one formally approved through a procurement process. Contract leakage from scope drift is structurally different from contract leakage in other categories, and fixing it requires scope discipline in the engagement management process, not just better rate benchmarking.

What Actually Works: A Practical Benchmarking Approach

Classify First, Then Benchmark

Before you can benchmark professional services, you need to classify your spend at a granular enough level that comparison makes sense. UNSPSC codes for professional services go down to four levels of specificity — using the top-level "professional services" code for all your legal, consulting, and IT services spend means your benchmark pool is too heterogeneous to be useful.

At minimum, you need to separate: legal services by practice area (transactional vs. litigation vs. IP vs. regulatory), management consulting by engagement type (strategy vs. operational vs. digital), IT services by engagement type (implementation vs. managed services vs. staff augmentation), accounting and audit, and recruitment and staffing separately. Within each sub-category, if your spend volume is large enough, further segmentation by supplier tier (Big 4 vs. boutique vs. solo practitioner) is worth doing.

Build a Rate Card View, Not an Engagement View

The most useful benchmark for professional services isn't "what did similar companies pay for a similar engagement" — that data is too scarce and too variable. The useful benchmark is: what is the market rate for a given role tier (partner/senior/associate) in a given service type in a given geography?

Rate cards are often obtainable through RFI processes even when you're not actively sourcing. A well-structured RFI that asks for standard billing rates by role and seniority, without committing to an engagement, gives you rate intelligence you can use for the next 18–24 months. The firms that decline to respond to rate card RFIs are giving you useful information about their willingness to engage on price.

P25–P75 percentile ranges for professional services hourly rates vary significantly by market. In major US metro areas (New York, San Francisco, Chicago), senior management consulting rates for mid-market engagements run roughly $350–$600/hour at the P50, with the spread between P25 and P75 exceeding $200/hour for the same role description. That spread is your negotiation space — not to drive everyone to P25, but to understand whether your current suppliers are operating at the high end with a justification or simply because you've never applied any pricing pressure.

Use Benchmark Data as a Conversation Opener, Not a Price Mandate

This is the piece that most procurement approaches to professional services get wrong. Entering a rate negotiation with a law firm or consulting firm with a spreadsheet that says "you're at P75 and I need you at P50 by next quarter" is a negotiation strategy that reliably damages relationships without getting you to the outcome you want.

Professional services suppliers respond well to: demonstrating that you've done market research, showing that you have options, focusing the conversation on rate structure and engagement design rather than a flat rate cut, and offering volume commitment or payment terms in exchange for rate improvement. They respond poorly to treating the relationship as purely transactional when it isn't.

We're not saying rate benchmarks aren't useful in professional services negotiations — they are, and they matter. We're saying they're most effective as context-setting and as a signal that you're engaged, rather than as a price mandate.

A Scenario: Mid-Market Manufacturing Company, $3.2M in Legal Spend

A 550-person specialty manufacturer had $3.2M in annual legal spend across eight law firms. Their procurement team had never formally managed legal spend because "legal is outside procurement's lane" — a common and often wrong assumption. When we helped them build a spend cube view of their legal spend, three things became immediately visible.

First, they had significant rate card variation across firms handling similar work. For IP prosecution (patent filings and maintenance), they were paying partner rates at two firms that ranged from $620/hour to $395/hour — roughly a 57% spread for services that are highly standardized. The higher-rate firm had never been competitive-pressured because no one in procurement had visibility into the rate comparison.

Second, a significant amount of what was classified as "legal services" in their ERP was actually paralegal and associate-level work billed at partner rates — either because the firms were understaffing engagements from a supervision perspective or because the billing codes weren't being applied correctly. This is spend under management in name only; the actual work isn't being managed to the rate structure that's supposed to apply.

Third, they had seven different invoicing arrangements across their eight firms — some fixed-fee, some hourly, some blended retainer — which made it nearly impossible to compare cost-per-outcome across suppliers. Standardizing invoicing formats was a prerequisite to getting meaningful benchmarks.

None of this required firing any law firm or making dramatic supplier changes. It required making the spend visible in a form that enabled conversation, and then having those conversations from a position of data rather than relationship inertia.

Where This Breaks Down: Honest Limits

Rate card benchmarking can get you to market rates. It can't get you to quality-adjusted rates, and for high-stakes professional services, quality matters enough that chasing the lowest rate benchmark is wrong. A company in the middle of a regulatory investigation isn't the moment to switch to a lower-rate law firm because their rate card looks better.

The right frame for professional services benchmarking is: are we paying appropriately for the quality tier we've decided to work with? That's a different question from "are we paying the cheapest price available?" and it's a much more defensible one in an internal conversation.

Addressable spend in professional services — the spend you can realistically move on without disrupting quality — is typically 40–60% of total professional services spend. The other 40–60% involves relationship-critical, quality-sensitive engagements where you're making a deliberate choice to pay above-median rates. That's fine, as long as it's a deliberate choice made with data, rather than inertia mistaken for strategy.