RocksDB
Original author(s) | Dhruba Borthakur |
---|---|
Developer(s) | |
Initial release | May 2012 |
Stable release | 9.7.3[1]
/ 16 October 2024 |
Repository | |
Written in | C++ |
Operating system | Windows, macOS, Linux, FreeBSD, OpenBSD, Solaris, AIX |
Platform | x86, x86_64, ppc64, ppc64le, aarch64 |
Type | Embedded database |
License | Apache 2.0 or GPL 2 |
Website | rocksdb |
RocksDB is a high performance[2][3][4][5][6] embedded database for key-value data. It is a fork of Google's LevelDB optimized to exploit many CPU cores, and make efficient use of fast storage, such as solid-state drives (SSD), for input/output (I/O) bound workloads. It is based on a log-structured merge-tree (LSM tree) data structure. It is written in C++ and provides official language bindings for C++, C, and Java; alongside many third-party language bindings. RocksDB is open-source software, and was originally released under a BSD 3-clause license[7][8][9]. However, in July 2017 the project was migrated to a dual license of both Apache 2.0 and GPLv2 license[10], possibly in response to the Apache Software Foundation's blacklist of the previous BSD+Patents license clause.[11][12]
RocksDB is used in production systems at various web-scale enterprises[13] including Facebook, Yahoo!,[14] and LinkedIn.[15]
Features
RocksDB, like LevelDB, stores keys and values in arbitrary byte arrays, and data is sorted byte-wise by key or by providing a custom comparator.
RocksDB provides all of the features of LevelDB, plus:
- Transactions[16]
- Backups[17] and snapshots[18]
- Column families[19]
- Bloom filters[20]
- Time to live (TTL) support[21]
- Universal compaction[22]
- Merge operators[23]
- Statistics collection[24]
- Geospatial indexing[25]
and others: List of RocksDB features that are not in LevelDB.
RocksDB is not an SQL database (although MyRocks combines RocksDB with MySQL). Like other NoSQL and dbm stores, it has no relational data model, and it does not support SQL queries. Also, it has no direct support for secondary indexes, however a user may build their own internally using Column Families or externally. Applications use RocksDB as a library, as it does not provide a server or command-line interface.
History
RocksDB was created at Facebook by Dhruba Borthakur[26][27] in April 2012, as a fork of LevelDB with the initial stated goal of improving performance for server workloads.[28][29]
Integration
As an embeddable database, RocksDB can be used as a storage engine within a larger database management system (DBMS). For example, CockroachDB uses RocksDB as its storage engine[30], mostly for transactional workloads while Rockset uses RocksDB mostly for analytical data processing. This shows that RocksDB can be used as a storage engine for both Online transaction processing and Online analytical processing.
Alternative backend
The following projects have been started to replace or offer an alternative storage engines for already-established database systems with RocksDB:
ArangoDB
ArangoDB has added RocksDB to its previous storage engine ("mmfiles").[31] Starting with ArangoDB 3.4, RocksDB will be the default storage engine in ArangoDB.[32]
Cassandra
Cassandra on RocksDB can improve the performance of Apache Cassandra significantly (3-4 times faster in general, 100 times faster in some use-cases).[citation needed] The Instagram team at Facebook developed and open-sourced their code, along with benchmarks of their performance results.[33]
MariaDB
MariaDB can use the MyRocks storage engine (which is forked from RocksDB) since MariaDB 10.2.5 (Alpha status) [34] and stable since MariaDB 10.2.16 in 2018.[35]
MongoDB
The MongoRocks project provides a storage module for MongoDB where the storage engine is RocksDB.[36][37][38]
A related program is Rocks Strata, a tool written in Go, which allows managing incremental backups of MongoDB when RocksDB is used as the storage engine.[39]
MySQL
The MyRocks project creates a new RocksDB based storage engine for MySQL.[40][41] In-depth details about MyRocks were presented at Percona Live 2016.[42]
Embedded
The following database systems and applications have chosen to use RocksDB as their embedded storage engine:
Ceph's BlueStore
The Ceph's BlueStore storage layer uses RocksDB for metadata management in OSD devices.[43]
Apache Flink
Apache Flink uses RocksDB to store checkpoints.[44]
FusionDB
FusionDB[45] uses RocksDB as its storage engine for XML, Key/Value, and JSON.[46]
LogDevice LogsDB
LogDevice's LogsDB is built atop RocksDB.[47]
Rockset
The Rockset service that is used for operational data analytics uses RocksDB as its storage engine.[48]
SSDB
The ssdb-rocks[49] project uses RocksDB as the storage engine for the SSDB[50] NoSQL Database.
TiDB
The TiDB[51] project uses RocksDB as its storage engine.[52]
Third-party language bindings
Third-party programming language bindings available for RocksDB include:
References
- ^ . 16 October 2024 https://github.com/facebook/rocksdb/releases/tag/v9.7.3. Retrieved 16 October 2024.
{{cite web}}
: Missing or empty|title=
(help) - ^ "Performance Benchmarks". Retrieved November 29, 2015.
- ^ "Benchmarking the leveldb family". Retrieved March 10, 2016.
- ^ "Comparing LevelDB and RocksDB, take 2". Retrieved March 10, 2016.
- ^ "Benchmarking LevelDB vs. RocksDB vs. HyperLevelDB vs. LMDB Performance for InfluxDB". Retrieved March 10, 2016.
- ^ Golan-Gueta, Guy; Bortnikov, Edward; Hillel, Eschar; Keidar, Idit (April 21, 2015). "Scaling Concurrent Log-Structured Data Stores". EuroSys '15 Proceedings of the Tenth European Conference on Computer Systems. doi:10.1145/2741948.2741973.
- ^ "Facebook's latest open source effort: a flash-powered database called RocksDB". Retrieved March 10, 2016.
- ^ "Under the Hood: Building and open-sourcing RocksDB". Retrieved March 10, 2016.
- ^ "RocksDB - Facebook's Database Now Open Source". Retrieved March 10, 2016.
- ^ "GitHub pull request". Retrieved July 20, 2017.
- ^ "Apache says 'no' to Facebook code libraries". Retrieved July 20, 2017.
- ^ "GitHub issue". Retrieved July 20, 2017.
- ^ "Users.md". Retrieved December 1, 2015.
- ^ "RocksDB on Steroids". Retrieved March 10, 2016.
- ^ "Benchmarking Apache Samza: 1.2 million messages per second on a single node". Retrieved March 10, 2016.
- ^ "RocksDB transactions". GitHub. Retrieved 2016-04-04.
- ^ "How to backup RocksDB?". Retrieved 2017-07-19.
- ^ "Checkpoints". Retrieved 2017-07-19.
- ^ "Column families in RocksDB". GitHub. Retrieved 2016-04-04.
- ^ "RocksDB bloom filters". GitHub. Retrieved 2016-04-04.
- ^ "RocksDB TTL support". GitHub. Retrieved 2016-04-04.
- ^ "Universal compaction". GitHub. Retrieved 2016-04-04.
- ^ "RocksDB merge operator". GitHub. Retrieved 2016-04-04.
- ^ "RocksDB perf context and IO stats context". GitHub. Retrieved 2016-04-04.
- ^ "Spatial indexing in RocksDB". rocksdb.org. Retrieved 2018-07-19.
- ^ "First commit where RocksDB diverges from LevelDB". May 10, 2012. Retrieved March 15, 2016.
- ^ "rocksdb README file". Nov 30, 2012. Retrieved March 15, 2016.
- ^ "The History of RocksDB". November 24, 2013. Retrieved March 10, 2016.
- ^ Borthakur, Dhruba (November 22, 2013). "RocksDB: A High Performance Embedded Key-Value Store for Flash Storage - Data@Scale". Retrieved March 10, 2016.
... The story of why we decided to do RocksDB ...
- ^ Edwards, Jessica (2015-10-29). "Hello World: Meet CockroachDB, the Resilient SQL Database". The New Stack. Retrieved 2016-07-08.
- ^ "Comparing new RocksDB and MMFiles storage engines".
- ^ "RC1 ArangoDB 3.4 - Whats new?".
- ^ "Open-sourcing a 10x reduction in Apache Cassandra tail latency".
- ^ "MyRocks". MariaDB KnowledgeBase. Retrieved 2019-04-28.
- ^ https://mariadb.com/kb/en/mariadb-10216-release-notes/
- ^ "mongodb-partners/mongo-rocks".
- ^ "Integrating RocksDB with MongoDB". Retrieved July 19, 2018.
- ^ "MongoDB + RocksDB at Parse". Retrieved December 1, 2015.
- ^ "facebookgo/rocks-strata".
- ^ "facebook/mysql-5.6".
- ^ "MyRocks: MySQL on RocksDB" (PDF). Retrieved November 29, 2015.
- ^ "MyRocks Deep Dive". Retrieved May 9, 2016.
- ^ "Storage Devices -- Ceph Documentation".
- ^ "Apache Flink 1.8 Documentation: State Backends". ci.apache.org. Retrieved 2019-08-11.
- ^ "FusionDB". Evolved Binary.
- ^ "The Design and Implementation of FusionDB" (PDF). XML Prague.
- ^ "LogDevice: a distributed data store for logs". Mark Marchukov, Facebook.
- ^ "How we use RocksDB at Rockset". rockset.com. Retrieved 2019-07-10.
- ^ "ideawu/ssdb-rocks".
- ^ https://ssdb.io
- ^ "pingcap/tidb".
- ^ "TiDB Internal (I) - Data Storage". Shen Li.
- ^ "warrenfalk/rocksdb-sharp".
- ^ "b1naryth1ef/rocksdb".
- ^ "urbint/rox".
- ^ "leo-project/erocksdb".
- ^ "barrel-db/erlang-rocksdb".
- ^ "tecbot/gorocksdb".
- ^ "rocksdb-haskell: Haskell bindings to RocksDB".
- ^ "RocksJava".
- ^ "rocksdb".
- ^ "iabudiab/ObjectiveRocks".
- ^ "OCaml bindings for RocksDB".
- ^ "An OCaml RocksDb binding using ocaml-ctypes".
- ^ "RocksDB - Perl extension for RocksDB - metacpan.org".
- ^ "Photonios/rocksdb-php".
- ^ "SWI-Prolog interface for RocksDB".
- ^ "stephan-hof/pyrocksdb".
- ^ "rocksdb-ruby - RubyGems.org - your community gem host".
- ^ "spacejam/rust-rocksdb".