摘要:Google 20年发的时序库的paper 首先看看系统架构, 设计时,首先考虑CAP的balance问题,向Spanner这样的高一致性方案,明显不适合监控系统,所以Monarch牺牲一致性来保证实时性和可用性 Monarch’s design is determined by its prima
阅读全文
摘要:TCPH的表结构 最小的且不变的, - Region,Nation(FK,REGION记录属于哪个区域) 1万级别, - Supplier(供应商),Skey,Name,地址,NKey(FK,记录所属国家key),电话,acctbal(Account Balance,账户余额) 10万级别, - C
阅读全文
摘要:该论文主要的目的是,讨论TCPH中的Choke points, Chokepoints are those technological challenges underlying a benchmark, whose resolution will significantly improve the
阅读全文
摘要:INTRODUCTION Estimating the selectivity of a query—the fraction of input tuples that satisfy the query’s predicate—is a fundamental component in cost-
阅读全文
摘要:25年前提出的问题仍然没有得到解决,学者们总是在攻克他们能攻克的问题, 而不是他们应该去攻克的问题,有点意思! 说到Cardinality是查询优化的阿基琉斯之踵,但是之前很多研究都关注在新的结构来解决individual local predicates的估计问题,作者认为完全就是无用功 并提出应
阅读全文
摘要:Oracle的SQL引擎大体分为这几部分, (1)首先要进入Prepare的阶段,Parser,SA和TC A SQL statement goes through the Parser, Semantic Analysis (SA), and Type-Check (TC) first befor
阅读全文
摘要:Why Key Components in Optimizer are Still Not Accurate? In this section, we summarize the reasons why the cardinality estimation, cost model, and plan
阅读全文
摘要:INTRODUCTION The Needs of IoT systems 正对IoT系统的以下4点需求,高吞吐,有效存储,实时查询和高可用,当前的系统无法比较好的解决。 With the rapid proliferation(rapid increase in numbers) of conne
阅读全文
摘要:INTRODUCTION The problem of finding a good join order is one of the most studied problems in the database field. Figure 1 illustrates the classical, c
阅读全文
摘要:Introduction 引入Shared-nothing架构 Shared-nothing architectures have been the foundation of traditional query execution engines and data warehousing syst
阅读全文
摘要:主导的学习组件,是query-based,workload-driven,以一堆有代表性的queryset去实际运行,产生训练集。 The predominant「占主导的」 approach for learned DBMS components is that they capture the
阅读全文
摘要:这篇文章,主体在说,如何降低训练的代价 从两个方面,降低training set大小,降低单个训练样本的label的收集成本 ABSTRACT (现状)Today's query optimizers use fast selectivity estimation techniques but ar
阅读全文
摘要:Histogram和KDE的分别 参考,https://www.jianshu.com/p/428ae3658f85 Histogram的问题比较明显,首先不平滑,再者分布和区间大小和偏移有很大关系,比如底下两个分布图差别非常大 所以Histogram误差是比较大的 KDE的基本思想比较简单 首先K
阅读全文
摘要:摘要 (点题)Query optimizers depend on selectivity estimates of query predicates to produce a good execution plan. (当前的方案)When a query contains multiple pr
阅读全文
摘要:INTRODUCTION Managing large data sets has always been the raison d’ˆetre (a French expression commonly used in English, meaning "reason for being" or
阅读全文
摘要:INTRODUCTION (Disk-oriented数据库的问题,维护buffer pool)DBMSs invariably(always) maintain a buffer pool of blocks in main memory for faster access. When an ex
阅读全文
摘要:ABSTRACT (点题)A new class of database management systems (DBMSs) called NewSQL tout(吹捧) their ability to scale modern on-line transaction processing (O
阅读全文
摘要:ABSTRACT (点题)Multi-version concurrency control (MVCC) is currently the most popular transaction management scheme in modern database management system
阅读全文
摘要:摘要 Hybrid Transactional and Analytical Processing (HTAP) databases require processing transactional and analytical queries in isolation to remove the
阅读全文
摘要:这篇主要是说,如何利用compressed bitmap来提升查询性能 虽然之前有很多bitmap的压缩方案,但是,新提出的Roaring bitmap会更加高效 参考,Better bitmap performance with Roaring bitmaps BitMap,为了有效和compac
阅读全文