The main categories of NewSQL databases and their features
DOI:
https://doi.org/10.30837/rt.2022.4.211.03Keywords:
database, relational database, database management system, NoSQL, NewSQLAbstract
In the modern world, the problem of working with big data and workloads is becoming more and more acute. For more than forty years, relational databases have been the main leading systems for storing, searching and managing data. However, despite their great popularity, application experience and universality, traditional relational DBMS, due to the growing needs for scalability and performance, often cannot meet modern requirements. This has led to the emergence of new alternative data management systems, including NewSQL systems. NewSQL is a class of modern relational database management systems that provide performance comparable to NoSQL systems while maintaining the data consistency guarantees inherent in traditional database systems. The growing interest in NewSQL technology in recent times has led to an increase number of evaluations and comparisons among competing NewSQL technologies. However, today there is still a certain lack of work devoted to the study of the features of NewSQL solutions and their capabilities in comparison with other technologies. This paper discusses the main features of the most famous NewSQL products of different categories and the identified problems associated with them. To overcome the certain ambiguity in the names and translations of some terms related to the subject area under consideration, which takes place in numerous relevant sources, additional explanations are given. For comparison, the paper presents the values of important characteristics inherent in NewSQL, traditional relational and NoSQL database systems. This paper can help researchers and people from the industry choose the best storage solutions for their needs.
References
Abadi D., Ailamaki A., Andersen D., Bailis P., Balazinska M., Bernstein P., Boncz P., Chaudhuri S., et al // The Seattle Report on Database Research. ACM SIGMOD Record. 2020. 48. P. 44 – 53. https://doi.org/10.1145/3385658.3385668.
Gudivada V. N., Rao D., Raghavan V. V. Renaissance in database management: navigating the landscape of candidate systems // Computer. 2016. 49(4). P. 31–42. https://doi.org/10.1109/MC.2016.115.
Sadalage P. J., Fowler M. NoSQL Distilled A Brief Guide to the Emerging World of Polyglot Persistence. Addison-Wesley Professional, 2012. 188 p.
Using Oracle Sharding. Oracle Sharding Overview. URL: https://docs.oracle.com/en/database/oracle/oracle-database/19/shard/sharding-overview.html#GUID-0F39B1FB-DCF9-4C8A-A2EA-88705B90C5BF. (дата звернення: 17.02.2023).
Shute J., Vingralek R., Samwel B., Handy B., Whipkey C., Rollins E., Oancea M., Littlefield K., Menestrina D., Ellner S., Cieslewicz J., Rae I., Stancescu T., Apte H. F1: A distributed SQL database that scales // Proceedings of the 39th International Conference on Very Large Data Bases (VLDB) Endowment. 2013. 6(11). 1068 – 1079.
Pavlo A., Aslett M. What's really new with NewSQL? // ACM Sigmod Record. 2016. 45(2). P. 45 – 55. https://doi.org/10.1145/3003665.3003674.
NoSQL. URL: https://hostingdata.co.uk/nosql-database/ (дата звернення: 17.02.2023).
Corbett, J. C., Dean, J., Epstein, M., Fikes, A., Frost, C., Furman, J. J., Ghemawat S., Gubarev A., Heiser C., Hochschild P., Hsieh W., Kanthak S., Kogan E., Li H., Lloyd A., Melnik S., Mwaura D., Nagle D., Quinlan S., Rao R., Rolig L., Saito Y., Szymaniak M., Taylor C., Wang R., Woodford D. Spanner: Google’s globally distributed database // ACM Transactions on Computer Systems (TOCS). 2013. 31(3). P. 1 – 22. https://doi.org/10.1145/2491245.
Özsu M. T., Valduriez P. Principles of Distributed Database Systems. Fourth Edition. Springer Cham, 2020. 674 p.
Aslett M. What we talk about when we talk about NewSQL. URL: http://blogs.the451group.com/information_management/2011/04/06/what-we-talk-about-when-we-talk-about-newsql/ (дата звернення: 17.02.2023).
Valduriez P., Jiménez-Peris R., Özsu M. T. Distributed database systems: The case for NewSQL // Hameurlain, A., Tjoa, A.M. (eds) Transactions on Large-Scale Data- and Knowledge-Centered Systems XLVIII. Lecture Notes in Computer Science. Vol 12670. Berlin, Heidelberg: Springer Berlin Heidelberg, 2021. P. 1 – 15. https://doi.org/10.1007/978-3-662-63519-3_1.
Moniruzzaman A. B. M. NewSQL: Towards next-generation scalable RDBMS for online transaction processing (OLTP) for Big Data management // International Journal of Database Theory and Application. 2014. 7(6) P. 121 – 130 http://dx.doi.org/10.14257/ijdta.2014.7.6.11.
Stonebraker M. The case for shared nothing // IEEE Database Eng. Bull. 1986. 9(1). P. 4 – 9.
Duggirala S. NewSQL databases and scalable in-memory analytics // Advances in Computers. Elsevier, 2018. 109. P. 49 – 76. https://doi.org/10.1016/bs.adcom.2018.01.004.
MariaDB Xpand. URL: https://mariadb.com/products/enterprise/xpand/ (дата звернення: 17.02.2023).
MariaDB. MariaDB Acquires Clustrix Adding Distributed Database Technology. URL: https://mariadb.com/newsroom/press-releases/mariadb-acquires-clustrix-adding-distributed-database-technology/ (дата звернення: 17.02.2023).
Namuag P. An Overview of MariaDB Xpand (formerly ClustrixDB). URL: https://severalnines.com/blog/overview-mariadb-xpand-formerly-clustrixdb/ (дата звернення: 17.02.2023).
Clustrix. URL: https://dbdb.io/db/clustrix (дата звернення: 17.02.2023).
MariaDB Xpand. URL: https://mariadb.com/docs/xpand/products/mariadb-xpand/ (дата звернення: 17.02.2023).
CockroachDB. URL: www.cockroachlabs.com (дата звернення: 17.02.2023).
Cloud Spanner. URL: https://cloud.google.com/spanner/ (дата звернення: 17.02.2023).
Bacon D. F., Bales N., Bruno N., Cooper B. F., Dickinson A., Fikes A., Fraser C., Gubarev A., Joshi M., Kogan E., Lloyd A., Melnik S., Rao R., Shue D., Taylor C., Holst M. H., Woodford D. Spanner: Becoming a SQL system // Proceedings of the 2017 ACM International Conference on Management of Data (SIGMOD '17). Association for Computing Machinery, New York, NY, USA, 2017. P. 331 – 343. https://doi.org/10.1145/3035918.3056103.
HyPer. URL: https://hyper-db.de/ (дата звернення: 17.02.2023).
SingleStore. URL: https://www.singlestore.com/ (дата звернення: 17.02.2023).
NuoDB. URL: https://www.nuodb.com. (дата звернення: 17.02.2023).
SAP HANA Cloud. URL: www.sap.com/products/hana.html (дата звернення: 17.02.2023).
Sikka V., Färber F., Lehner W., Cha S. K., Peh T., Bornhövd C. Efficient transaction processing in SAP HANA database: the end of a column store myth // Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data. 2012. P. 731 – 742).
H-Store. URL: https://hstore.cs.brown.edu/ (дата звернення: 17.02.2023).
Volt Active Data. URL: http://voltactivedata.com/ (дата звернення: 17.02.2023).
Simborg M. Introducing Volt Active Data. URL: https://www.voltactivedata.com/blog/2022/02/introducing-volt-active-data/ (дата звернення: 17.02.2023).
LeanXcale. URL: https://www.leanxcale.com/ (дата звернення: 17.02.2023).
Van Steen M., Tanenbaum A. S. Distributed systems. Third edition. Pearson Education, Inc. 2017. 596 p.
Gazis A., Katsiri E. Middleware 101: What to know now and for the future // Queue. 2022. 20(1). P. 10 – 23. https://doi.org/10.1145/3526211.
Harizopoulos S., Abadi D. J., Madden S., Stonebraker M. OLTP through the looking glass, and what we found there // Making Databases Work: the Pragmatic Wisdom of Michael Stonebraker. Association for Computing Machinery and Morgan & Claypool. 2018. P. 409–439. https://doi.org/10.1145/3226595.3226635.
MariaDB MaxScale. URL: https://mariadb.com/products/enterprise/components/#maxscale (дата звернення: 17.02.2023).
ScaleArc. URL: www.devgraph.com/scalearc (дата звернення: 17.02.2023).
Bernstein P. A., Cseri I., Dani N., Ellis N., Kalhan A., Kakivaya G., Lomet D. B., Manne R., Novik L., Talius T. Adapting Microsoft SQL server for cloud computing // 2011 IEEE 27th International Conference on Data Engineering, 2011. P. 1255 – 1263. https://doi.org/10.1109/ICDE.2011.5767935.
Amazon Aurora. https://aws.amazon.com/rds/aurora (дата звернення: 17.02.2023).
Connolly T. M., Begg C. E. Database systems: a practical approach to design, implementation, and management. Sixth edition. Harlow, Essex, England: Pearson Education Limited, 2015. 1329 p.
Kleppmann M. Designing data-intensive applications: The big ideas behind reliable, scalable, and maintainable systems. O'Reilly Media, Inc., 2017. 590 p.
Gray J., Reuter A. Transaction processing: concepts and techniques. Elsevier, 1992. 1070 p.
S. Kimball. Living without atomic clocks. URL: https://www.cockroachlabs.com/blog/living-without-atomic-clocks/ (дата звернення: 17.02.2023).
Spanner: TrueTime and external consistency. URL: https://cloud.google.com/spanner/docs/true-time-external-consistency (дата звернення: 17.02.2023).
TimeTools. What is the GPS Clock? URL: https://timetoolsltd.com/gps/what-is-the-gps-clock/ (дата звернення: 17.02.2023).
Kimball S., Sharif I. Living without atomic clocks. URL: https://www.cockroachlabs.com/blog/living-without-atomic-clocks/. (дата звернення: 17.02.2023).
Lamport L. The implementation of reliable distributed multiprocess systems. Computer Networks (1976). 1978. 2(2). P. 95–114. https://doi.org/10.1016/0376-5075(78)90045-4.
The Tandem Database Group. NonStop SQL: A distributed, high-performance, high-availability implementation of SQL // Gawlick, D., Haynie, M., Reuter, A. (eds) High Performance Transaction Systems. HPTS 1987. Lecture Notes in Computer Science. Springer, Berlin, Heidelberg. 2005. Vol. 359. P. 60 – 104. https://doi.org/10.1007/3-540-51085-0_43.
Malviya N., Weisberg A., Madden S., Stonebraker M. Rethinking main memory OLTP recovery. In 2014 IEEE 30th International Conference on Data Engineering, Chicago, IL, USA, IEEE. 2014. P. 604 – 615, https://doi.org/10.1109/ICDE.2014.6816685.
Schwartz B., Zaitsev P., Tkachenko V. High performance MySQL: optimization, backups, and replication. Third Edition. O'Reilly Media, Inc., 2012. 826 p.
Harrington J. L. Relational database design and implementation. 4th edition. Morgan Kaufmann, 2016. 712 p.
Navathe S., Ceri S., Wiederhold G., Dou J. Vertical partitioning algorithms for database design // ACM Transactions on Database Systems (TODS). 1984. 9(4). P. 680–710. https://doi.org/10.1145/1994.2209.
Curino C., Jones E., Zhang Y., Madden S. Schism: a workload-driven approach to database replication and partitioning // Proceedings of the VLDB Endowment. 2010. 3(1-2). P. 48 – 57. https://doi.org/10.14778/1920841.1920853.
Law Insider. Legal Definitions Dictionary. URL: https://www.lawinsider.com/dictionary/transaction-engine (дата звернення: 17.02.2023).
Elmore A. J., Arora V., Taft R., Pavlo A., Agrawal D., Abbadi A. E. Squall: Fine-grained live reconfiguration for partitioned main memory databases // Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data. 2015. P. 299–313. https://doi.org/10.1145/2723372.2723726.
Serafini M., Mansour E., Aboulnaga A., Salem K., Rafiq T., Minhas U. F. Accordion: Elastic scalability for database systems supporting distributed transactions // Proceedings of the VLDB Endowment. 2014. 7(12). P. 1035 –1046. https://doi.org/10.14778/2732977.2732979.
Ismail W., Muhammed A., Abdullah Z. H., Radman A., Hendradi R., Afandi R. R. A Survey of NewSQL DBMSs focusing on Taxonomy, Comparison and Open Issues. Journal of Computer Science & Computational Mathematics. 2021. P. 87-95. https://doi.org/10.20967/jcscm.2021.04.002.
Rothnie J. B., Bernstein P. A., Fox S., Goodman N., Hammer M., Landers T. A., Reeve C., Shipman D. W., Wong E. Introduction to a system for distributed databases (SDD-1) // ACM Transactions on Database Systems (TODS). 1980. 5(1), P. 1–17. https://doi.org/10.1145/320128.320129.
Williams R., Daniels D., Haas L., Lapis G., Lindsay B. G., Ng, P., Obermarck R., Selinger P., Walker A., Wilms P., Yost R. R*: An overview of the architecture // IBM Thomas J. Watson Research Division. 1981. P. 329 – 347.
Epstein R., Stonebraker M., Wong E. Distributed query processing in a relational data base system // Proceedings of the 1978 ACM SIGMOD international conference on management of data. (SIGMOD '78). Association for Computing Machinery, New York, NY, USA. 1978. P. 169 – 180. https://doi.org/10.1145/509252.509292.
DeWitt D. J., Katz R. H., Olken F., Shapiro L. D., Stonebraker M. R., Wood D. A. Implementation tech-niques for main memory database systems // Proceedings of the 1984 ACM SIGMOD international conference on management of data. 1984. P. 1 – 8. https://doi.org/10.1145/602259.602261.
Faerber F., Kemper A., Larson,P. A., Levandoski J., Neumann T., Pavlo A. Main memory database systems // Foundations and Trends in Databases. 2017. 8(1 – 2), P. 1 – 130.
Gawlick D., Kinkade D. Varieties of concurrency control in IMS/VS fast path // IEEE Database Eng. Bull. 1985. 8(2). P. 3 – 10.
Lehman T. J., Carey M. J. A Study of Index Structures for Main Memory Database Management Systems // Proceedings of the Twelfth International Conference on Very Large Data Bases, VLDB. 1985. P. 294 – 303.
Lehman T. J., Carey M. J. Query processing in main memory database management systems // Proceedings of the 1986 ACM SIGMOD international conference on Management of data. 1986. P. 239 – 250. https://doi.org/10.1145/16894.16878.
Lehman T. J., Carey M. J. A recovery algorithm for a high-performance memory-resident database system // ACM SIGMOD Record. 1987. 16(3). P. 104 – 117. https://doi.org/10.1145/38714.38730.
Kersten M.L., Apers P.M.G., Houtsma M.A.W., van KuykE.J.A., van de Weg R.L.W. A Distributed, Main-Memory Database Machine // Kitsuregawa M., Tanaka H. (eds) Database Machines and Knowledge Base Machines. The Kluwer International Series in Engineering and Computer Science. Springer, Boston, MA. 1988. Vol. 43. P. 353 – 369. https://doi.org/10.1007/978-1-4613-1679-4_26.
Altibase. URL: https://www.altibase.com (дата звернення: 17.02.2023).
TimesTen: Fastest OLTP Database, Ultra High Availability, Elastic Scalability. URL: https://www.oracle.com/database/technologies/related/timesten.html (дата звернення: 17.02.2023).
Cambridge Dictionary. URL: https://dictionary.cambridge.org/dictionary/english/workload. (дата звернення: 17.02.2023).
HPE Nimble Storage Deployment Considerations for Microsoft SQL Server. OLTP Workloads. URL: https://infosight.hpe.com/InfoSight/media/cms/active/public/tmg_HPE_Nimble_Storage_Deployment_Considerations_for_Microsoft_SQL_Server_doc_version_family.whz/xpm1491839725334.html (дата звернення: 17.02.2023).
DeBrabant J., Pavlo A., Tu S., Stonebraker M., Zdonik S. Anti-caching: A new approach to database management system architecture // Proceedings of the VLDB Endowment. 2013. 6(14). P. 1942 – 1953. https://doi.org/10.14778/2556549.2556575.
Arulraj J., Pavlo A., Dulloor S. R. Let's talk about storage & recovery methods for non-volatile memory database systems // Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data (SIGMOD '15). Association for Computing Machinery, New York, NY, USA. 2015. P. 707 – 722. https://doi.org/10.1145/2723372.2749441.
Andrei M., Lemke C., Radestock G., Schulze R., Thiel C., Blanco R., Meghlan A., Sharique M., Seifert S., Vishnoi S., Booss D., Peh T., Schreter I., Thesing W., Wagle M., Willhalm T. SAP HANA adoption of non-volatile memory // Proceedings of the VLDB Endowment. 2017. 10(12). P. 1754 – 1765. https://doi.org/10.14778/3137765.3137780.
Intel Optane Persistent Memory and SAP HANA Platform Configuration. Technology overview and de-ployment guidelines for using Intel Optane persistent memory with SAP HANA. Configuration Guide. 2019. URL: https://cdrdv2-public.intel.com/753738/sap-hana-and-intel-optane-configuration-guide.pdf (дата звернення: 17.02.2023).
Intel Optane Persistent Memory. URL: https://www.intel.com/content/www/us/en/products/docs/memory-storage/optane-persistent-memory/overview.html (дата звернення: 17.02.2023).
Kroenke D. M., Auer D. J., Yoder R. C., Vandenberg S. L. Database processing fundamentals, design, and implementation. 15th edition. Pearson. 2018. 648 p.
Taft R., Sharif I., Matei A., VanBenschoten N., Lewis, J., Grieger, T., Niemi K., Woods A., Birzin A., Poss R., Bardea P., Ranade A., Darnell B., Gruneir B., Jaffray J., Zhang L., Mattis P. Cockroachdb: The resilient geo-distributed SQL database // Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data (SIGMOD '20). Association for Computing Machinery, New York, NY, USA, 2020. P. 1493 – 1509. https://doi.org/10.1145/3318464.3386134.
Garcia-Molina H., Salem K. Main memory database systems: An overview // IEEE Transactions on knowledge and data engineering. 1992. 4(6). P. 509–516. https://doi.org/10.1109/69.180602.
Google Cloud Spanner. Secondary indexes. URL: https://cloud.google.com/spanner/docs/secondary-indexes (дата звернення: 17.02.2023).
SAP HANA Performance Guide for Developers. Secondary Indexes. URL: https://help.sap.com/docs/SAP_HANA_PLATFORM/9de0171a6027400bb3b9bee385222eff/3441acf7dcf64e169ba94121acaf2350.html?version=2.0.04&locale=en-US (дата звернення: 17.02.2023).
Fitzpatrick B. Distributed caching with Memcached // Linux journal. 2004. 2004(124). P. 5.
Mohan C., Haderle D., Lindsay B., Pirahesh H., Schwarz P. ARIES: A transaction recovery method supporting fine-granularity locking and partial rollbacks using write-ahead logging // ACM Transactions on Database Systems (TODS). 1992. 17(1). P. 94 – 162. https://doi.org/10.1145/128765.128770.
Using VoltDB. V.12.1. URL: https://docs.voltdb.com/UsingVoltDB/ (дата звернення: 17.02.2023).
Ren K., Diamond T., Abadi D. J., Thomson A. Low-overhead asynchronous checkpointing in main-memory database systems // Proceedings of the 2016 International Conference on Management of Data (SIGMOD '16). Association for Computing Machinery, New York, NY, USA. 2016. P. 1539 – 1551. https://doi.org/10.1145/2882903.2915966.
Reid R. Practical CockroachDB: Building Fault-Tolerant Distributed SQL Databases. 1st ed. Apress. 2022. 254 p.
MariaDB MaxScale technical brief. Enterprise security. URL: https://mariadb.com/wp-content/uploads/2019/09/mariadb-maxscale-security_datasheet_1041.pdf (дата звернення: 17.02.2023).
Khasawneh T. N., AL-Sahlee M. H., Safia A. A. SQL, NewSQL, and NoSQL databases: A comparative survey // 2020 11th International Conference on Information and Communication Systems (ICICS), Irbid, Jordan, IEEE. 2020. P. 013 – 021. https://doi.org/10.1109/ICICS49469.2020.239513.
Downloads
Published
How to Cite
Issue
Section
License
Authors who publish with this journal agree to the following terms:
1. Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).