Rate Limiting in ASP.NET Core: Patterns That Actually Protect

Q: Which rate limiting algorithm should I use?

Fixed window for simple scenarios, sliding window for smoother limits, token bucket for burst tolerance, concurrency limiter for protecting downstream resources. The algorithm comparison section provides specific guidance for each use case.

Q: Does rate limiting protect against DDoS?

No. Rate limiting helps with application-level abuse but cannot handle volumetric DDoS attacks. Use infrastructure-level protection (Azure WAF, Cloudflare, AWS Shield).

Q: Should I use queuing?

Generally no for web applications. Queuing delays responses, which is usually worse than immediate rejection. Consider queuing only for background job APIs.

Q: How do I test rate limits?

Use load testing tools (k6, JMeter) to verify limits work correctly. Test both under-limit and over-limit scenarios.

Q: What about distributed rate limiting?

The built-in middleware uses in-memory storage. For multi-server deployments, consider Redis-backed implementations or API gateway rate limiting.

Q: How do I handle shared IPs (NAT)?

This is a trade-off. Per-IP limiting may block legitimate users behind corporate proxies. Consider higher limits, user-based limiting for authenticated requests, or API keys.

Abstract

TL;DR Fixed window vs sliding window vs token bucket: choose the right algorithm, partition by IP or user, and handle edge cases like missing IPs and exempt endpoints.