Design a Rate Limiter
Design a distributed rate limiting service that protects APIs from abuse. Consider token bucket, sliding window, and fixed window algorithms. Handle distributed coordination across multiple server instances.
Functional Requirements
- Identify users by ID, IP, or API key
- Limit requests based on configurable rules
- Return proper error headers and status codes
Non-Functional Requirements
- Prioritize availability over consistency
- Low-latency rate limit checks (< 10 ms)
- Scalable to 1M requests per second
Frequently Asked Questions
What is a rate limiter and why is it important?
A rate limiter controls how many requests a client can make to an API within a given time window. It protects backend services from abuse, prevents resource exhaustion, and ensures fair usage across clients.
What are the main rate limiting algorithms?
The most common algorithms are token bucket (allows bursts up to a limit), sliding window log (tracks exact request timestamps), sliding window counter (approximates sliding window with less memory), and fixed window counter (simplest but allows burst at window boundaries).
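Of these, token bucket is the one most often asked for in interviews because it is compact and naturally models bursts. A minimal single-node sketch (class name, parameters, and the injectable clock are illustrative choices, not a fixed API):

```python
import time

class TokenBucket:
    """Token bucket: tokens refill at a steady rate; a request is allowed
    if a token is available, so bursts up to `capacity` are permitted."""

    def __init__(self, capacity, refill_rate, clock=time.monotonic):
        self.capacity = capacity        # max tokens (burst size)
        self.refill_rate = refill_rate  # tokens added per second
        self.tokens = float(capacity)   # start full
        self.clock = clock              # injectable for testing
        self.last = clock()

    def allow(self, cost=1):
        now = self.clock()
        # Lazy refill: add tokens proportional to elapsed time, capped at capacity.
        self.tokens = min(self.capacity,
                          self.tokens + (now - self.last) * self.refill_rate)
        self.last = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False
```

Note the lazy-refill trick: rather than a background timer topping up every bucket, tokens are computed on demand from the elapsed time, which keeps per-client state to just two numbers.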
How do you handle rate limiting in a distributed system?
In distributed setups, you need a centralized store like Redis to track request counts across all server instances. Alternatives include sticky sessions (routing a client to the same server) or approximate algorithms that tolerate slight inconsistency for better performance.
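The sliding window counter mentioned above is a common choice for the centralized-store approach because it needs only two counters per client. The sketch below uses a plain dict as a stand-in for the shared store; in a real deployment each increment would be an atomic Redis `INCR` with a TTL (names and structure here are illustrative):

```python
import time

class SlidingWindowCounter:
    """Approximates a sliding window by weighting the previous fixed
    window's count by how much of it still overlaps the sliding window.
    `self.store` stands in for a shared store such as Redis, where each
    (key, window) counter would be an atomic INCR with an expiry."""

    def __init__(self, limit, window_secs, clock=time.monotonic):
        self.limit = limit
        self.window = window_secs
        self.clock = clock
        self.store = {}  # (client_key, window_index) -> request count

    def allow(self, key):
        now = self.clock()
        idx = int(now // self.window)
        curr = self.store.get((key, idx), 0)
        prev = self.store.get((key, idx - 1), 0)
        # Fraction of the previous window still inside the sliding window.
        overlap = 1.0 - (now % self.window) / self.window
        estimated = curr + prev * overlap
        if estimated >= self.limit:
            return False
        self.store[(key, idx)] = curr + 1  # Redis equivalent: INCR + EXPIRE
        return True
```

Because the previous window's requests are assumed to be evenly distributed, the count is approximate; that slight inaccuracy is the trade-off that buys the two-counters-per-client memory footprint.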
What HTTP status code should a rate limiter return?
A rate limiter should return HTTP 429 (Too Many Requests) with a Retry-After header indicating when the client can retry. Including X-RateLimit-Limit, X-RateLimit-Remaining, and X-RateLimit-Reset headers helps clients self-regulate.
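A small helper that assembles those headers might look like the sketch below (the function name is illustrative; `X-RateLimit-*` headers are a widely used convention rather than a formal standard, and `Reset` is expressed here as a Unix timestamp, one of several conventions in the wild):

```python
import math
import time

def rate_limit_headers(limit, remaining, reset_epoch, now=None):
    """Build response headers for a rate-limited endpoint.

    limit       -- max requests allowed in the window
    remaining   -- requests the client has left (<= 0 means limited)
    reset_epoch -- Unix time when the window resets
    """
    now = time.time() if now is None else now
    headers = {
        "X-RateLimit-Limit": str(limit),
        "X-RateLimit-Remaining": str(max(0, remaining)),
        "X-RateLimit-Reset": str(int(reset_epoch)),
    }
    if remaining <= 0:
        # A 429 response should tell the client how many seconds to back off.
        headers["Retry-After"] = str(max(0, math.ceil(reset_epoch - now)))
    return headers
```

Returning these headers on every response, not just on 429s, is what lets well-behaved clients slow down before they ever hit the limit.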
Ready to design this system?