How to Design Twitter's Timeline and Search Functionality: A Complete System Architecture

Question

Design Twitter's timeline and search functionality with a robust distributed system. Learn to handle 500 million daily tweets using caching, databases, and search clusters.

Accepted Answer

Design Twitter's timeline and search functionality using a distributed architecture that combines in-memory caching for real-time home timelines, relational databases for user timelines, and distributed search clusters for full-text indexing, capable of handling 500 million daily tweets and 10 billion monthly search requests. This guide breaks down the architecture to design Twitter's timeline and search functionality based on the open-source reference implementation in the repository. The design handles 100 million active users, 250 billion read requests per month, and demonstrates production-ready patterns for fan-out services, cache management, and distributed search indexing as documented in . Core Components and Responsibilities The architecture separates concerns across dedicated services to optimize for read-heavy workloads and real-time consistency requirements. | Component | Responsibility | Typical Technology | |-----------|----------------|-------------------| | Client / Web Server | Accepts HTTP/REST requests from users. | Reverse-proxy web server | | Write API | Persists new tweets, triggers fan-out, stores media, updates search index. | SQL DB for user-timeline, NoSQL/Cache (Redis) for fan-out, Object Store for media, Queue for notifications | | Fan-Out Service | Looks up followers from the User Graph Service and writes tweet IDs into each follower's home-timeline cache. | In-memory cache (Redis list) | | Timeline Service | Serves home-timeline reads by pulling tweet IDs from cache and fetching details via Tweet Info and User Info services. | Cache-aside reads O(1) for IDs, O(n) for details | | User Graph Service | Stores "who follows whom" relationships. | Graph DB or in-memory adjacency list with memory cache for fast fan-out | | Search Service | Indexes tweet text and answers keyword queries. | Search cluster (Lucene, Elasticsearch-style) | | Notification Service | Sends push/email notifications via asynchronous queue. | Message queue (e.g., Kafka) | | Read API | Coordinates reads for home-timeline, user-timeline, and search. | Application layer exposing REST endpoints | Data Flow and API Design The system implements distinct workflows for write operations (posting tweets) and read operations (timeline retrieval and search). Posting a Tweet When a user creates content, the system triggers a complex fan-out process to populate follower timelines: 1. The client POSTs to with payload containing , , , and optional 2. The Write API stores the tweet in the user-timeline SQL table for persistent storage 3. The Fan-Out Service queries the User Graph Service to retrieve follower lists 4. For each follower, the service pushes the tweet ID into their home-timeline cache (Redis list) 5. Media files are written to the Object Store 6. The Search Service indexes the tweet text for full-text search 7. A notification event is enqueued for push/email delivery Reading the Home Timeline Home timeline reads must serve pre-computed content to millions of users with low latency: 1. The client GETs 2. The Read API requests cached tweet IDs from the Timeline Service (O(1) retrieval from Redis) 3. The system performs multi-get operations on the Tweet Info and User Info services to assemble full tweet payloads 4. The response returns JSON containing , , , timestamps, and metadata Search Functionality Search requires distributed indexing and ranking across the entire tweet corpus: 1. The client GETs 2. The Search API forwards the query to the Search Service cluster 3. The service tokenizes and normalizes the query text 4. Distributed Lucene queries execute across the search cluster using scatter-gather patterns 5. Results are merged, ranked, and returned as tweet IDs 6. The Tweet Info service fetches full tweet objects for the response Scaling Considerations The architecture addresses specific bottlenecks inherent in social media platforms with asymmetric follower distributions. Fan-Out Bottleneck for Celebrity Users Users with millions of followers create write amplification challenges. The system implements hybrid strategies: - Pre-computation with limits : Store only the most recent few hundred tweets per home timeline in Redis cache; older content is rebuilt from SQL stores on demand - On-read fan-out fallback : For celebrity posts, skip real-time fan-out and instead merge search results with cached timelines during read operations, reducing write load at the cost of slightly higher read latency Cache Sizing and Eviction Home timeline caches maintain only hot data to balance memory costs against performance: - Retain approximately the most recent 100-300 tweets per user in Redis lists - Implement cache-aside patterns where cache misses trigger database queries to rebuild the timeline - Use TTL (time-to-live) policies to automatically expire stale entries Database Sharding and Replication The user-timeline database employs horizontal partitioning: - Shard by to distribute write load across multiple SQL instances -

Decision	Pro	Con
In-memory fan-out	Sub-millisecond reads for most users; predictable latency	High write amplification for celebrity users; requires large cache clusters
On-read fan-out for heavy users	Reduces write load on celebrity posts; simpler capacity planning	Slightly higher read latency; requires complex merge logic between cache and search
SQL for user-timeline	Strong consistency; easy to query historical data; ACID compliance	Not suited for massive fan-out writes; requires sharding at scale
NoSQL/Cache for home-timeline	Fast writes and reads; horizontal scalability; O(1) retrieval	Eventual consistency; cache evictions require database fallback
Lucene search	Powerful full-text search; relevance ranking; proven distributed patterns	Indexing overhead; storage cost for maintaining full tweet text in memory

Path	Purpose
`solutions/system_design/twitter/README.md`	Complete design description, use-case definitions, component breakdown, and scaling discussion
`solutions/system_design/web_crawler/README.md`	Reference implementation showing cache-aside and queue patterns applicable to the fan-out service
`README.md` (repo root)	Master index of system design topics, including latency numbers, consistency patterns, and architectural primitives
`CONTRIBUTING.md`	Guidelines for extending design documentation or contributing new system design examples

How to Design Twitter's Timeline and Search Functionality: A Complete System Architecture

Core Components and Responsibilities

Data Flow and API Design

Posting a Tweet

Reading the Home Timeline

Search Functionality

Scaling Considerations

Fan-Out Bottleneck for Celebrity Users

Cache Sizing and Eviction

Database Sharding and Replication

Search Cluster Optimization

Architecture Trade-offs

Key Implementation Files

Summary

Frequently Asked Questions

How does the fan-out service handle users with millions of followers?

What is the difference between user timeline and home timeline storage?

How does the search service index and retrieve tweets at scale?

Have a question about this repo?

Component	Responsibility	Typical Technology
Client / Web Server	Accepts HTTP/REST requests from users.	Reverse-proxy web server
Write API	Persists new tweets, triggers fan-out, stores media, updates search index.	SQL DB for user-timeline, NoSQL/Cache (Redis) for fan-out, Object Store for media, Queue for notifications
Fan-Out Service	Looks up followers from the User Graph Service and writes tweet IDs into each follower's home-timeline cache.	In-memory cache (Redis list)
Timeline Service	Serves home-timeline reads by pulling tweet IDs from cache and fetching details via Tweet Info and User Info services.	Cache-aside reads O(1) for IDs, O(n) for details
User Graph Service	Stores "who follows whom" relationships.	Graph DB or in-memory adjacency list with memory cache for fast fan-out
Search Service	Indexes tweet text and answers keyword queries.	Search cluster (Lucene, Elasticsearch-style)
Notification Service	Sends push/email notifications via asynchronous queue.	Message queue (e.g., Kafka)
Read API	Coordinates reads for home-timeline, user-timeline, and search.	Application layer exposing REST endpoints