Resilience Patterns with Polly: Circuit Breakers, Retries, and Timeouts

Q: When should I use retry vs circuit breaker vs both?

Use retry for transient failures that resolve quickly (network blips, brief timeouts). Use circuit breaker when a dependency is consistently failing and you want to fail fast. Use both together: retry handles momentary issues, circuit breaker prevents hammering a failing service. The standard resilience handler combines both with sensible defaults.

Q: Should I use Microsoft.Extensions.Http.Polly or Microsoft.Extensions.Http.Resilience?

Use Microsoft.Extensions.Http.Resilience . The older Microsoft.Extensions.Http.Polly package is deprecated. The new package is built on Polly v8 and integrates better with .NET's resilience infrastructure.

Q: What is the difference between Polly v7 and v8?

Polly v8 introduced a new API based on ResiliencePipeline instead of Policy . The new API is more composable and integrates with Microsoft.Extensions.Resilience . The concepts (retry, circuit breaker, timeout) remain the same.

Q: How do I know if my circuit breaker thresholds are correct?

Monitor your circuit breaker state transitions. If the circuit opens too frequently on minor issues, increase MinimumThroughput or FailureRatio. If it never opens during actual outages, decrease thresholds.

Q: Should I retry database operations?

Yes, for transient failures like connection timeouts. EF Core has built-in retry with EnableRetryOnFailure() . For raw ADO.NET, wrap in a Polly retry policy. Ensure operations are idempotent or use transactions.

Q: How do I test resilience patterns?

Use chaos engineering approaches. Inject failures in test environments using tools like Simmy (Polly's chaos engineering extension) or configure test doubles that fail intermittently. Verify that retries happen, circuits open, and timeouts trigger as expected.

Q: What about distributed circuit breakers?

The patterns in this article are per-instance. For distributed circuit breakers (shared state across multiple service instances), consider external state stores like Redis or purpose-built solutions. This adds complexity and is often unnecessary for most applications.

Abstract

TL;DR Polly patterns for production: when to retry, circuit breaker configuration, timeout strategies, and combining policies for fault-tolerant ASP.NET Core applications.