p99 Latency
p99 latency is the 99th-percentile response time: 99% of requests complete at or below this value, and the slowest 1% take longer. Engineering teams prefer it to averages and medians because those hide the tail, and the requests in the tail are often the most complex ones or come from the highest-value users. A slow p99 is frequently what power users and large customers experience, so high p99 latency can drive disengagement even when median latency looks acceptable.
Note: Collect latency as a histogram or use percentile approximations (HdrHistogram, DDSketch). Simple averages are insufficient: a single 30-second request disappears into the mean but ruins that user's experience.
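As a rough illustration of why the note above favors percentiles over averages, the sketch below computes the mean, median, and p99 of a made-up latency sample using the nearest-rank method. The `percentile` helper and the sample data are assumptions for illustration only; a production system would normally rely on a histogram or sketch library rather than sorting raw samples.

```python
import math
from statistics import mean, median


def percentile(samples, pct):
    """Nearest-rank percentile: the smallest sample such that at least
    pct percent of all samples are at or below it."""
    if not samples:
        raise ValueError("percentile() needs at least one sample")
    ordered = sorted(samples)
    rank = math.ceil(pct * len(ordered) / 100)  # 1-based rank
    return ordered[max(rank, 1) - 1]


# Hypothetical sample: 980 fast requests (80 ms) and 20 slow ones (3,000 ms).
latencies_ms = [80] * 980 + [3000] * 20

summary = {
    "mean_ms": mean(latencies_ms),
    "median_ms": median(latencies_ms),
    "p99_ms": percentile(latencies_ms, 99),
}
print(summary)
```

With this sample the mean is about 138 ms and the median is 80 ms, while p99 is 3,000 ms, so only the percentile exposes the slow 2% of requests.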
Target: Web API p99 < 500 ms; critical path p99 < 200 ms; real-time features p99 < 100 ms
Warning sign: p99 > 2,000 ms on core user flows tends to increase abandonment and churn
Benchmarks by segment
How to improve p99 Latency
Profile p99 specifically — the queries and code paths that dominate p99 are often different from those that dominate average latency
Add database query timeouts and connection pooling: a slow query on a hot path holds its connection and makes other requests queue behind it, which raises p99 for all users
Implement circuit breakers for external API calls: a slow third-party service should trigger graceful degradation, not inflate your p99
Use CDN caching for static and semi-static content — moving delivery to the edge dramatically reduces p99 for geographic outliers
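The circuit-breaker item in the list above can be sketched as follows. This is a minimal, single-threaded illustration and not a production implementation: `CircuitBreaker`, `CircuitOpenError`, `fetch_recommendations`, and the threshold values are all hypothetical names and numbers chosen for the example. A breaker does not shorten a slow call by itself, so the wrapped call is assumed to carry its own timeout.

```python
import time


class CircuitOpenError(RuntimeError):
    """Raised when the breaker is open and the call is skipped."""


class CircuitBreaker:
    def __init__(self, max_failures=3, reset_after_s=30.0, clock=time.monotonic):
        self.max_failures = max_failures
        self.reset_after_s = reset_after_s
        self.clock = clock
        self.failures = 0
        self.opened_at = None  # None means the circuit is closed

    def call(self, func, *args, **kwargs):
        if self.opened_at is not None:
            if self.clock() - self.opened_at < self.reset_after_s:
                # Fail fast instead of waiting on a dependency known to be unhealthy.
                raise CircuitOpenError("dependency is failing; call skipped")
            # Cool-down has elapsed: let one trial call through (half-open).
        try:
            result = func(*args, **kwargs)
        except Exception:
            self.failures += 1
            if self.opened_at is not None or self.failures >= self.max_failures:
                self.opened_at = self.clock()  # open, or re-open after a failed trial
            raise
        # Success closes the circuit and clears the failure count.
        self.failures = 0
        self.opened_at = None
        return result


def fetch_recommendations():
    """Hypothetical third-party call that is currently timing out."""
    raise TimeoutError("upstream timed out")


breaker = CircuitBreaker(max_failures=2, reset_after_s=30.0)


def recommendations_or_fallback():
    try:
        return breaker.call(fetch_recommendations)
    except (CircuitOpenError, TimeoutError):
        return []  # degrade gracefully: render the page without recommendations
```

After two consecutive failures the breaker opens, and later calls return the fallback immediately for the next 30 seconds instead of each waiting for the upstream timeout, which keeps the slow dependency out of the request's tail latency.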
Common measurement mistakes
Tools for measuring p99 Latency
Best-in-class behavioral analytics with powerful event segmentation, funnel analysis, and retention charts that go far deeper than Google Analytics
Best-in-class event-based analytics with intuitive funnel, retention, and flow reports that surface actionable insights quickly
All-in-one product analytics platform combining analytics, session replay, feature flags, A/B testing, surveys, and a data warehouse — replacing multiple point solutions
Autocapture eliminates the need for manual event instrumentation — every click, pageview, and form interaction is tracked automatically from day one
All-in-one platform combining feature flags, A/B testing, product analytics, session replay, and web analytics — eliminating the need for separate tools
Best-in-class no-code editor for creating in-app walkthroughs, tooltips, and interactive guides without developer involvement
Frequently Asked Questions
p99 is a widely used balance between signal and noise. p95 misses more tail problems, while p999 (the 99.9th percentile) is so extreme that it is often dominated by outliers (a single bad deployment, a specific geographic edge case) rather than systemic issues. Use p99 as your primary SLO metric; track p999 for advanced reliability work.
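One way to see why higher percentiles are noisier is to count how many requests sit in each tail. The sketch below is simple arithmetic; the `tail_request_count` helper and the 50,000-request window are assumptions made for the example.

```python
def tail_request_count(total_requests, one_in):
    """Approximate number of requests slower than a percentile whose
    tail covers 1 in `one_in` requests (p99 -> 100, p999 -> 1000)."""
    return total_requests // one_in


window = 50_000  # hypothetical number of requests in one measurement window

for name, one_in in [("p95", 20), ("p99", 100), ("p999", 1000)]:
    print(name, tail_request_count(window, one_in))
```

In this window roughly 2,500 requests sit beyond p95, 500 beyond p99, and only 50 beyond p999, so a handful of requests from one bad deployment or one distant region can move p999 on their own.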
A widely cited industry figure, usually attributed to Amazon, is that each additional 100 ms of latency costs roughly 1% in sales. For collaborative tools, latency above roughly 300 ms breaks the sense of real-time interaction. Slow p99 disproportionately affects power users, who perform the most complex operations and are often your highest-value accounts.