Implementing Database Replication: Master-Slave vs Master-Master Architectures

Question

Master-slave vs master-master replication explained. Learn how to implement database replication for scalability and availability, choosing the right architecture for your needs.

Accepted Answer

Master-slave replication uses a single writable primary with read-only replicas, while master-master replication allows multiple writable nodes, which removes the single-writer bottleneck but requires conflict resolution. Database replication is a core technique for scaling relational databases and improving availability in distributed systems. According to the repository, the choice between master-slave and master-master architectures is a critical design decision that affects consistency models, failover complexity, and write throughput. This guide examines both patterns as documented in the repository's core documentation and implementation examples.

Understanding Database Replication Patterns

What is Master-Slave Replication?

In the system-design-primer, master-slave replication is defined as a topology where a single primary node (the master) handles all write operations while one or more secondary nodes (slaves) serve read requests. The master streams changes from its binary log to each slave, which replays these events asynchronously or semi-synchronously. If the master fails, a slave must be promoted to become the new writer.

What is Master-Master Replication?

The repository describes master-master replication (also called multi-master) as a configuration where two or more nodes accept write operations simultaneously. Each node records its own writes and replicates them to the other nodes. This keeps read/write service available even if one node goes down, but introduces challenges such as conflict resolution, increased write latency, and the need for load-balancer or application-level routing logic.

Architectural Comparison

Write Path Characteristics

Master-Slave: A single writer eliminates write conflicts and keeps write operations strongly consistent. However, it also creates a write bottleneck: write throughput can only grow by vertically scaling the master node.
Master-Master: Multiple writers distribute load across nodes, enabling geographic distribution of write operations. This requires a conflict resolution strategy such as last-write-wins timestamps, vector clocks, or CRDTs (Conflict-free Replicated Data Types) to handle simultaneous updates to the same data.

Read Path and Consistency

Master-Slave: Slaves handle read traffic, reducing load on the master and enabling horizontal read scaling. However, replication lag between the master and its replicas means a read from a slave may not yet reflect a write that was just committed on the master, so read-after-write consistency is not guaranteed.

Master-Master: Each node can serve reads of its own local writes immediately, providing low-latency read access. However, reads on other nodes are only eventually consistent: a node may not yet have received the latest writes from the other masters.

Failover and Availability

Master-Slave: When the master fails, automated or manual promotion logic must convert a slave into the new master. This involves updating DNS entries or connection strings and ensuring the promoted slave has applied all binary log events it received before it accepts writes.

Master-Master: Any remaining master can continue accepting writes if one node fails, so no promotion logic is needed. However, this requires careful handling of split-brain scenarios, where a network partition lets nodes accept conflicting writes and diverge.

Implementation Examples

The repository provides concrete configuration examples for both replication patterns.
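Before the database-specific setup, the last-write-wins strategy named in the write path comparison can be illustrated with a minimal sketch. It is not taken from the repository: the `VersionedValue` type, the `merge_replicas` helper, and the sample keys and node names are all hypothetical, and the sketch assumes node clocks are close enough to be compared.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VersionedValue:
    """A stored value plus the metadata needed to pick a winner (hypothetical type)."""
    value: str
    timestamp: float  # wall-clock time of the write, in seconds
    node_id: str      # identifier of the node that accepted the write


def resolve_last_write_wins(a: VersionedValue, b: VersionedValue) -> VersionedValue:
    """Keep the newer write; break timestamp ties by node_id so every node picks the same winner."""
    return max(a, b, key=lambda v: (v.timestamp, v.node_id))


def merge_replicas(local: dict, remote: dict) -> dict:
    """Merge two key -> VersionedValue maps, resolving conflicting keys with last-write-wins."""
    merged = dict(local)
    for key, remote_version in remote.items():
        if key in merged:
            merged[key] = resolve_last_write_wins(merged[key], remote_version)
        else:
            merged[key] = remote_version
    return merged


# Two masters accepted different writes for the same key.
node_a = {"user:1:email": VersionedValue("old@example.com", 100.0, "node-a")}
node_b = {
    "user:1:email": VersionedValue("new@example.com", 105.0, "node-b"),
    "user:2:email": VersionedValue("bob@example.com", 90.0, "node-b"),
}

merged = merge_replicas(node_a, node_b)
print(merged["user:1:email"].value)
```

Because the tie-break is deterministic, merging in either direction yields the same result, which is what lets the nodes converge. The trade-off is that the losing write is silently discarded, and the outcome depends on clock synchronization between nodes; vector clocks and CRDTs exist to avoid those two weaknesses.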
Master-Slave Setup (MySQL)

The setup has three steps: configure the master node in its MySQL configuration file, create the replication user on the master, and configure each slave to connect to the master.

Master-Master Setup (Galera Cluster)

For master-master replication, configure Galera Cluster in each node's configuration file. With Galera, writes automatically replicate to all nodes.

Application-Level Routing

The repository suggests implementing connection routing in application code to handle write/read splitting.

Key Repository References

The database replication patterns are documented in the following files:

- The primary documentation in the repository root: covers both replication patterns, including the Master-slave replication and Master-master replication sections that detail advantages, disadvantages, and architectural trade-offs.
- The web crawler system design example: demonstrates a practical application of read replicas, showing how master-slave replication supports high-throughput read operations.

Summary

- Master-slave replication provides a single writable primary with read-only secondaries. It offers strong consistency for writes and simple horizontal read scaling, but it requires slave promotion during failover and cannot scale writes horizontally.
- Master-master replication enables multiple writable nodes for geographic distribution and continued write availability when a node fails, but it introduces conflict resolution complexity and eventual consistency, and it requires load-balancing or application-level routing.
- Both patterns require monitoring replication lag, handling network partitions to prevent split-brain scenarios, and implementing automated failover or conflict resolution strategies.
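The write/read splitting described under Application-Level Routing can be sketched roughly as follows. This is an illustrative sketch rather than the repository's code: the `ReplicationRouter` class, the `require_fresh` flag, and the host names are hypothetical, and statements are classified by their leading keyword only.

```python
import random


class ReplicationRouter:
    """Pick a target host: writes go to the primary, reads go to a replica (hypothetical helper)."""

    # Leading keywords treated as writes; anything else is treated as a read.
    WRITE_PREFIXES = ("INSERT", "UPDATE", "DELETE", "REPLACE", "CREATE", "ALTER", "DROP")

    def __init__(self, primary, replicas, rng=None):
        self.primary = primary
        self.replicas = list(replicas)
        self.rng = rng or random.Random()

    def is_write(self, sql):
        """Classify a statement by its first keyword."""
        return sql.lstrip().upper().startswith(self.WRITE_PREFIXES)

    def target_for(self, sql, require_fresh=False):
        """Return the host for a statement.

        require_fresh=True sends a read to the primary, for callers that must
        see their own recent write despite replication lag.
        """
        if self.is_write(sql) or require_fresh or not self.replicas:
            return self.primary
        return self.rng.choice(self.replicas)


# Host names below are placeholders, not real endpoints.
router = ReplicationRouter(
    "primary.db.internal:3306",
    ["replica-1.db.internal:3306", "replica-2.db.internal:3306"],
)

print(router.target_for("UPDATE users SET name = 'a' WHERE id = 1"))
print(router.target_for("SELECT * FROM users WHERE id = 1", require_fresh=True))
```

Keyword matching is deliberately simple. Statements such as `SELECT ... FOR UPDATE`, and any read inside a write transaction, would need to be pinned to the primary as well, which is why production systems often make the routing decision per transaction rather than per statement.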
