Horizontal Partitioning in System Design

3 min readJan 25, 2023

Horizontal partitioning, also known as sharding, is a technique used in the database and system design to distribute data across multiple servers or machines. The goal of horizontal partitioning is to distribute data evenly across multiple servers, so that each server can handle a subset of the data, rather than having all the data stored on a single server. This can help to improve the performance, scalability, and availability of the system. To implement horizontal partitioning, the data is typically divided into smaller subsets, called shards, based on a partition key, such as a user ID, and each shard is stored on a separate server. Horizontal partitioning can be done manually or using a partitioning tool.

There are several techniques for implementing horizontal partitioning:

Range-based partitioning: Data is partitioned based on a range of values for a specific attribute, such as a date range or a range of numerical values.
Hash-based partitioning: Data is partitioned based on a hash function applied to a specific attribute, such as a user ID. This technique ensures that data is distributed evenly across partitions.
List partitioning: Data is partitioned based on a list of values for a specific attribute, such as a list of customer IDs.
Composite partitioning: This technique combines two or more partitioning methods to partition the data.
Bucket partitioning: In this technique, data is partitioned into a fixed number of buckets, and each bucket is stored on a separate server.

It’s important to choose the right partitioning technique based on the characteristics of the data and the specific requirements of the system. It also can be done with a combination of different methods to achieve better results.

Horizontal partitioning and vertical partitioning are two different techniques used in the database and system design to scale and distribute data.

Horizontal partitioning is typically used when the data is growing too large to be stored on a single server or when the system needs to handle a large number of concurrent requests. Horizontal partitioning distributes the data across multiple servers, which can improve the performance, scalability, and availability of the system.

On the other hand, vertical partitioning is used to split a table into smaller tables, each containing a subset of the columns. This technique is used to improve query performance by reducing the amount of data that needs to be read from the disk and to reduce contention for locks on the table.

We should consider horizontal partitioning over vertical partitioning when:
1. The data size is so large that it cannot fit on a single machine
2. The number of concurrent requests is too high for a single machine to handle
3. The goal is to scale the system horizontally
4. The data is not easily divisible into smaller subsets based on the columns

It’s important to note that each approach has its own set of trade-offs and it’s important to evaluate the specific requirements of the system before deciding which technique to use.

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

System Design Interview

Written by thebytestream

30 Followers

256 Following

Experienced software professional with more than 8 years of experience shipping highly scalable applications in various verticals.

No responses yet

Write a response

What are your thoughts?

Also publish to my profile

Recommended from Medium

Javarevisited

Veenarao

System Design CheatSheet for Interview

Dear Readers, I am summarizing the commonly asked concepts in system design interviews. These questions are asked in almost all the system…

Dec 23, 2024

System Design Blueprint: The Ultimate Guide

ByteByteGo System Design Alliance

Love Sharma

System Design Blueprint: The Ultimate Guide

Developing a robust, scalable, and efficient system can be daunting. However, understanding the key concepts and components can make the…

Sep 17, 2023

Coding Odyssey

Shivam Srivastava

JP Morgan Java Developer Interview — 2

Senior Java Developer Interview Experience

Mar 24

This new IDE from Google is an absolute game changer

Coding Beauty

Tari Ibaba

This new IDE from Google is an absolute game changer

This new IDE from Google is seriously revolutionary.

Mar 11

192

High-Level System Architecture of Booking.com

Talha Şahin

High-Level System Architecture of Booking.com

Take an in-depth look at the possible high-level architecture of Booking.com.

Jan 10, 2024

5 Microservices Design Patterns You Must Know in 2025

JavaGuides

Ramesh Fadatare

5 Microservices Design Patterns You Must Know in 2025

Here are five important microservices design patterns you should know in 2025, explained in simple terms with examples. Microservices…

Jan 24

See more recommendations

Help
Status
About
Careers
Press
Blog
Privacy
Rules
Terms
Text to speech

Horizontal Partitioning in System Design

Sign up to discover human stories that deepen your understanding of the world.

Free

Membership

Written by thebytestream

No responses yet

More from thebytestream

Reverse Proxy

A reverse proxy is a server that sits between the client and the origin server, forwarding client requests to the server and returning the…

Patterns and variations of the Binary Search algorithm

Binary Search is a fundamental algorithm widely used in competitive programming. Below are several patterns and variations of the Binary…

Advanced Binary Search Problem: Parallel Computing Resource Allocation Optimization

I’ll present an advanced binary search problem that combines multiple complex concepts:

Content-Based Chunking Algorithm

Problem Statement: Splitting large data into small blocks in a deterministic way.