A curated list of awesome resources, tools, frameworks, and educational content related to Distributed Databases. Distributed databases are designed to scale across multiple nodes, offering high availability, fault tolerance, and horizontal scalability.
- General Resources
- Distributed Database Systems
- Distributed Key-Value Stores
- Distributed SQL Databases
- Distributed Graph Databases
- Data Replication and Consensus
- Development and Testing Tools
- Educational Resources
- Community and Forums
- Contribute
- License
- Distributed Databases Explained - An overview of distributed databases and their benefits.
- Database Systems: The Complete Book - A comprehensive textbook covering distributed database design and architecture.
- The CAP Theorem - An explanation of the CAP theorem and its implications for distributed databases.
- Apache Cassandra - A highly scalable NoSQL distributed database designed for handling large amounts of data across many servers.
- MongoDB - A popular NoSQL document-based distributed database known for its flexibility and ease of use.
- Apache HBase - A distributed, scalable, NoSQL database built on top of Apache Hadoop.
- Couchbase - A distributed NoSQL database designed for interactive applications.
- Riak - A distributed NoSQL key-value store known for its high availability and fault tolerance.
- FoundationDB - A distributed database designed to handle large volumes of structured data.
- Amazon DynamoDB - A fully managed, serverless key-value store designed for high availability.
- Redis - An in-memory key-value store supporting replication and sharding.
- Etcd - A distributed key-value store for configuration management and service discovery.
- Consul - A distributed key-value store used for service discovery and configuration.
- Hazelcast - A distributed in-memory data grid for real-time applications.
- CockroachDB - A distributed SQL database designed for global consistency and high availability.
- YugabyteDB - A high-performance distributed SQL database for building cloud-native applications.
- TiDB - An open-source, distributed SQL database compatible with MySQL.
- Google Spanner - A fully managed, globally distributed SQL database offering strong consistency.
- Vitess - A database clustering system for horizontal scaling of MySQL.
- Neo4j - A scalable graph database known for its powerful querying capabilities.
- JanusGraph - A scalable graph database optimized for storing and querying large graphs.
- Dgraph - A high-performance, distributed graph database with a native GraphQL API.
- ArangoDB - A multi-model database supporting graph, document, and key-value data models.
- TigerGraph - A distributed graph database designed for real-time analytics on massive datasets.
- Paxos Consensus Algorithm - A consensus protocol used for achieving agreement in distributed systems.
- Raft Consensus Algorithm - A consensus algorithm designed to be easy to understand, used by systems like Etcd and CockroachDB.
- Zookeeper - A centralized service for maintaining configuration information, naming, and providing distributed synchronization.
- Gossip Protocol - A protocol used for data propagation in distributed databases like Cassandra and Riak.
- Apache Kafka - A distributed event streaming platform often used with distributed databases.
- Debezium - A distributed platform for change data capture (CDC) in databases.
- Jepsen - A framework for testing distributed systems and databases.
- MinIO - A high-performance, distributed object storage system compatible with Amazon S3.
- DataStax Studio - A tool for visualizing data and running queries on Apache Cassandra.
- Designing Data-Intensive Applications - A comprehensive book on building scalable and reliable distributed systems.
- Distributed Systems 101 - An online course covering the fundamentals of distributed systems and databases.
- The CAP Theorem Explained - A detailed discussion of the CAP theorem and its impact on distributed databases.
- Jepsen Blog - A blog documenting testing and analysis of distributed databases for safety and correctness.
- Reddit: r/Database - A community for discussions about databases, including distributed systems.
- Stack Overflow: Distributed Databases - A Q&A site for developers working on distributed databases.
- Discord: Distributed Systems - A server for discussions about distributed systems and databases.
- Slack: Distributed Databases - A Slack channel for real-time discussions about distributed databases.
- Database Community on Hacker News - A forum for news and discussions about distributed databases and technology.
Contributions are welcome!