How to Design a URL Shortening Service Like Bit.ly: A Complete System Design Guide

Question

Design a URL shortening service like Bit.ly. Learn about MD5 hashing, Base62 encoding, database choices, caching, and API design for a complete system.

Accepted Answer

Design a URL shortening service by using MD5 hashing combined with Base62 encoding to generate unique 7-character keys, storing mappings in a SQL/NoSQL database with Redis caching, and exposing REST APIs for creation and resolution with analytics and expiration handling. The repository provides a battle-tested blueprint for building scalable URL shorteners and paste services. This guide walks through the exact architecture used to handle millions of short links, referencing the implementation details found in and the accompanying Python code. Use Cases and Constraints Before writing code, define the functional and non-functional requirements. According to the primer's Step 1 section in , a production-grade URL shortener must support: - Create : Generate unique short links for arbitrary URLs - Read : Resolve short links to original URLs with minimal latency - Analytics : Track hit counts and usage patterns - Expiration : Automatic cleanup of stale entries after a TTL Target constraints typically include high availability , read-heavy traffic (approximately 100 M reads vs. 10 M writes per month), and the capacity to store 360 M links over three years without collisions. High-Level Architecture Components The system architecture follows a layered approach detailed in Step 2 of the primer. Each component handles a specific responsibility: - Client : Sends HTTP requests to create or retrieve shortlinks - Web Server (Reverse Proxy) : Terminates TLS and load-balances traffic to API servers - Write API : Generates unique short keys, stores mappings in the database, and persists large payloads to object storage when necessary - Read API : Looks up short keys, fetches original URLs, and handles redirects - SQL/NoSQL Store : Persistent hash table mapping with secondary indexes on - Object Store (e.g., S3): Holds large payloads for paste contents (optional for pure URL shortening) - Cache (Redis/Memcached): Serves hot lookups to maintain sub-millisecond latency - Analytics Processor : Runs MapReduce jobs over web-server logs to aggregate hit counts - Expiration Service : Scans the database for expired entries and deletes or marks them Core Implementation Details Short-Link Generation Strategy The primer specifies a two-step generation process to ensure uniqueness and sufficient key space. As implemented in the source analysis, the service uses MD5 hashing of to produce a uniformly distributed 128-bit value, then applies Base62 encoding to create URL-safe strings. Base62 uses the alphabet (62 characters). The first 7 characters provide 62⁷ ≈ 3.5 × 10¹² possible keys, exceeding the projected 360 M links requirement. Database Schema and Persistence The storage layer requires fast lookups by shortlink and efficient expiration scanning. The primer recommends: 1. Relational Database : Use as the primary key with a secondary index on to support expiration queries 2. Object Store : Offload large paste contents to S3 when payloads exceed typical database row limits (relevant for Pastebin-style services) Write scaling starts with a single master handling 4 writes/sec, with sharding or federation added as traffic grows. API Design The REST API exposes two primary endpoints. According to : Create Endpoint Returns: Resolve Endpoint The Read API handles by returning an HTTP 301 redirect to the stored URL after performing a cache lookup. Caching and Read Scaling To handle 100M+ monthly reads, place Redis or Memcached in front of the database: - Cache misses fall back to the master-slave replica set - Hot keys remain in memory for sub-millisecond resolution - Write-through or write-around strategies prevent cache staleness Analytics and Expiration Analytics processing runs MapReduce jobs over web-server logs to produce monthly hit counts, as shown in the primer's reference implementations. Expiration cleanup requires a periodic background job that scans the column and removes stale rows from both the database and cache. Scaling Considerations The suggests iterative scaling: benchmark → profile → address bottlenecks. Each component can expand independently: 1. Load Balancer : Add layer-7 load balancers to distribute traffic across API servers 2. CDN : Cache redirect responses at edge locations for popular links 3. Read Replicas : Scale database reads with additional replicas 4. Sharding : Partition the key space when the single master becomes a bottleneck The design in demonstrates that horizontal scaling requires no API changes—only infrastructure adjustments. Summary - Generate keys using MD5 hashing and Base62 encoding to create 7-character unique identifiers with 62⁷ possible combinations - Store mappings in a SQL database with as primary key and secondary indexes on for expiration queries - Expose APIs via POST for creation and GET with 301 redirects for resolution - Add Redis caching to handle read-heavy traffic and maintain low latency - Process analytics using MapReduce over logs and clean expired entries with background jobs

How to Design a URL Shortening Service Like Bit.ly: A Complete System Design Guide

Use Cases and Constraints

High-Level Architecture Components

Core Implementation Details

Short-Link Generation Strategy

Database Schema and Persistence

API Design

Caching and Read Scaling

Analytics and Expiration

Scaling Considerations

Summary

Frequently Asked Questions

How do you prevent collisions when generating short links?

What database should I use for a URL shortening service?

How do you handle analytics for billions of redirects?

What is the difference between designing Bit.ly and Pastebin?

Have a question about this repo?