System design algorithms

Algorithm you should know for preparing system design interview. For example, Learn Geohash / S2 Geometry algorithm for "How to design Uber like system?"

Sponsor

海外兔小班教学帮助你高效准备面试，学员大厂率超过 50%，入职不成功不收费。

How to contribute

The algorithm and its resources should:

Can answer a system design question. For instance, Building a complete Tweet index can answer "How to implement Twitter search" or "How to implement hashtag in Twitter".
Free to read or watch.
Text would be better than videos.

Requirements

Know when to use ☑️
Know how it works ✅

Bloom filter ✅
Consistent hashing ✅
Geohash / S2 Geometry ✅
Leaky bucket / Token bucket ✅
Inverted index ✅
Distributed Consensus Algorithms (e.g., Paxos, Raft) ✅
Cache Eviction Policies (e.g., LRU, LFU) ✅
Rsync algorithm ✅
HyperLogLog ✅
Trie algorithm ✅
Lossy Counting ☑️
Frugal Streaming ☑️
Operational transformation ☑️
Quadtree / Rtree ☑️
Ray casting ☑️

Bloom filter

A Bloom filter is a data structure designed to tell you, rapidly and memory-efficiently, whether an element is present in a set.

Supercharging the Git Commit Graph IV: Bloom Filters

Consistent hashing

Consistent hashing is an algorithm designed to distribute data across a cluster in a way that minimizes re-distribution when nodes are added or removed. It is particularly useful in distributed systems, such as distributed caches, distributed storage systems, and load balancing.

A Guide to Consistent Hashing

Geohash / S2 Geometry

Geohash can used by 1) dating apps to find romantic matches within a particular cell, and to create chat apps.2) Find nearby locations, and identify places of interest, restaurants, shops and accommodation establishments in an area. 3) Geohashers go on global expeditions to meet people and explore new places.

Location-based search results with DynamoDB and Geohash

Leaky bucket / Token bucket

A mechanism to control the amount and the rate of the traffic sent to the network

Ieversed index

An inverted index is a data structure used primarily in text search engines.

Distributed Consensus Algorithms

Distributed consensus algorithm that enables nodes to agree on a value despite failures, requiring a majority for safety and progress.

Cache Eviction Policies

Cache eviction policies determine which items to remove from a cache when it reaches its capacity, with common strategies including Least Recently Used (LRU), First In First Out (FIFO), and Least Frequently Used (LFU).

Caching in theory and practice

Rsync algorithm

The rsync algorithm is a technique for reducing the cost of a file transfer by avoiding the transfer of blocks that are already at the destination.

Streaming File Synchronization

HyperLogLog

HyperLogLog is an algorithm for the count-distinct problem, approximating the number of distinct elements in a multiset.

Redis HyperLogLog Explained

Trie algorithm

Trie is an efficient information reTrieval data structure. Using Trie, search complexities can be brought to optimal limit (key length)

Lossy Counting

The lossy count algorithm is an algorithm to identify elements in a data stream whose frequency count exceed a user-given threshold.

Frugal Streaming

Frugal Streaming uses only one unit of memory per group to compute a quantile for each group

Find the nth percentile of the data stream

Operational transformation

Operational transformation (OT) is a technology for supporting a range of collaboration functionalities in advanced collaborative software systems.

Quadtree / Rtree

Spatial Indexing with Quadtrees
Find nearby interest points

Ray casting

Ray casting is the most basic of many computer graphics rendering algorithms that use the geometric algorithm of ray tracing. Given a point with longitude and latitude, return the Country of the point.

Ray Casting Algorithm

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

System design algorithms

Sponsor

海外兔小班教学帮助你高效准备面试，学员大厂率超过 50%，入职不成功不收费。

How to contribute

Requirements

Table of contents

Bloom filter

Consistent hashing

Geohash / S2 Geometry

Leaky bucket / Token bucket

Ieversed index

Distributed Consensus Algorithms

Cache Eviction Policies

Rsync algorithm

HyperLogLog

Trie algorithm

Lossy Counting

Frugal Streaming

Operational transformation

Quadtree / Rtree

Ray casting

About

Releases

Packages

Contributors 3

License

resumejob/system-design-algorithms

Folders and files

Latest commit

History

Repository files navigation

System design algorithms

Sponsor

海外兔小班教学帮助你高效准备面试，学员大厂率超过 50%，入职不成功不收费。

How to contribute

Requirements

Table of contents

Bloom filter

Consistent hashing

Geohash / S2 Geometry

Leaky bucket / Token bucket

Ieversed index

Distributed Consensus Algorithms

Cache Eviction Policies

Rsync algorithm

HyperLogLog

Trie algorithm

Lossy Counting

Frugal Streaming

Operational transformation

Quadtree / Rtree

Ray casting

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Packages