Post category - Database

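Paper Analysis -- Monarch: Google's Planet-Scale In-Memory Time Series Database
Summary: Google's time series database paper, published in 2020. First, the system architecture: the design starts from the CAP trade-off. A strongly consistent approach like Spanner is clearly a poor fit for a monitoring system, so Monarch sacrifices consistency to preserve timeliness and availability. Monarch's design is determined by its prima…

posted @ 2022-04-19 17:55 fxjwind Views (294) Comments (0) Recommendations (0)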

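TPC-H Analysis
Summary: TPC-H table structure. Smallest and fixed in size: - Region, Nation (FK to REGION, recording which region the nation belongs to). On the order of 10,000 rows: - Supplier: SKey, Name, address, NKey (FK, key of the nation it belongs to), phone, acctbal (Account Balance). On the order of 100,000 rows: - C…

posted @ 2021-09-28 17:29 fxjwind Views (643) Comments (0) Recommendations (0)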

Paper Analysis -- TPC-H Analyzed: Hidden Messages and Lessons Learned from an Influential Benchmark
Summary: The main aim of this paper is to discuss the choke points in TPC-H. Chokepoints are those technological challenges underlying a benchmark, whose resolution will significantly improve the…

posted @ 2021-09-28 17:13 fxjwind Views (681) Comments (0) Recommendations (0)

Paper Analysis -- QuickSel: Quick Selectivity Learning with Mixture Models (SIGMOD'20)
Summary: INTRODUCTION. Estimating the selectivity of a query, the fraction of input tuples that satisfy the query's predicate, is a fundamental component in cost-…

posted @ 2021-05-10 14:32 fxjwind Views (388) Comments (0) Recommendations (0)

Is Query Optimization a “Solved” Problem? (2014, Guy Lohman)
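Summary: A problem posed 25 years ago remains unsolved; researchers keep attacking the problems they can solve rather than the ones they should solve. Interesting! Cardinality estimation is described as the Achilles' heel of query optimization, yet much prior research focused on new structures for estimating individual local predicates, which the author considers entirely wasted effort, and he proposes that one should…

posted @ 2021-04-29 14:45 fxjwind Views (202) Comments (0) Recommendations (0)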

Adaptive Statistics in Oracle 12c (PVLDB 2017)
Summary: Oracle's SQL engine is broadly divided into the following parts. (1) First comes the Prepare stage: Parser, SA, and TC. A SQL statement goes through the Parser, Semantic Analysis (SA), and Type-Check (TC) first befor…

posted @ 2021-04-28 17:18 fxjwind Views (270) Comments (0) Recommendations (0)

Paper Analysis -- A Survey on Advancing the DBMS Query Optimizer: Cardinality Estimation, Cost Model, and Plan Enumeration (Data Science and Engineering 2021)
Summary: Why Key Components in Optimizer are Still Not Accurate? In this section, we summarize the reasons why the cardinality estimation, cost model, and plan…

posted @ 2021-04-23 17:47 fxjwind Views (426) Comments (0) Recommendations (0)

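Paper Analysis -- Db2 Event Store: A Purpose-Built IoT Database Engine (PVLDB 2020)
Summary: INTRODUCTION. The Needs of IoT systems. Current systems cannot adequately meet the following four requirements of IoT systems: high throughput, efficient storage, real-time queries, and high availability. With the rapid proliferation (rapid increase in numbers) of conne…

posted @ 2021-04-16 15:13 fxjwind Views (332) Comments (0) Recommendations (0)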

Paper Analysis -- How Good Are Query Optimizers, Really? (TUM PVLDB 2015)
Summary: INTRODUCTION. The problem of finding a good join order is one of the most studied problems in the database field. Figure 1 illustrates the classical, c…

posted @ 2021-04-02 16:04 fxjwind Views (649) Comments (0) Recommendations (0)

Paper Analysis -- Building An Elastic Query Engine on Disaggregated Storage (NSDI 2020)
Summary: Introduction. Introducing the shared-nothing architecture. Shared-nothing architectures have been the foundation of traditional query execution engines and data warehousing syst…

posted @ 2021-03-25 13:50 fxjwind Views (542) Comments (0) Recommendations (1)

DeepDB: Learn from Data, not from Queries! (2020, PVLDB)
Summary: The predominant learned components are query-based and workload-driven: a set of representative queries is actually executed to produce the training set. The predominant (dominant) approach for learned DBMS components is that they capture the…

posted @ 2021-03-11 14:19 fxjwind Views (538) Comments (0) Recommendations (0)

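Paper Analysis -- Efficiently Approximating Selectivity Functions using Low Overhead Regression Models (2020)
Summary: This paper is mainly about how to reduce training cost, from two angles: shrinking the training set, and lowering the cost of collecting the label for each training sample. ABSTRACT (Status quo) Today's query optimizers use fast selectivity estimation techniques but ar…

posted @ 2021-03-03 17:50 fxjwind Views (168) Comments (0) Recommendations (0)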

Self-Tuning, GPU-Accelerated Kernel Density Models for Multidimensional Selectivity Estimation
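Summary: The difference between histograms and KDE; see https://www.jianshu.com/p/428ae3658f85. The problems with histograms are fairly obvious: first, they are not smooth; second, the resulting distribution depends heavily on the bin width and offset (for example, the two distribution plots below differ greatly), so histogram error is relatively large. The basic idea of KDE is fairly simple. First, K…

posted @ 2021-01-21 11:36 fxjwind Views (278) Comments (0) Recommendations (0)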

Paper Analysis -- Selectivity Estimation for Range Predicates using Lightweight Models (2019)
Summary: Abstract. (Thesis) Query optimizers depend on selectivity estimates of query predicates to produce a good execution plan. (Current approach) When a query contains multiple pr…

posted @ 2021-01-19 15:45 fxjwind Views (827) Comments (0) Recommendations (0)

Paper Analysis -- LeanStore: In-Memory Data Management Beyond Main Memory
Summary: INTRODUCTION. Managing large data sets has always been the raison d'être (a French expression commonly used in English, meaning "reason for being" or…

posted @ 2021-01-05 14:55 fxjwind Views (849) Comments (1) Recommendations (1)

Paper Analysis -- Anti-Caching: A New Approach to Database Management System Architecture
Summary: INTRODUCTION. (The problem with disk-oriented databases: maintaining a buffer pool) DBMSs invariably (always) maintain a buffer pool of blocks in main memory for faster access. When an ex…

posted @ 2020-12-25 16:08 fxjwind Views (431) Comments (0) Recommendations (0)

Paper Analysis -- What's Really New with NewSQL?
Summary: ABSTRACT. (Thesis) A new class of database management systems (DBMSs) called NewSQL tout (hype) their ability to scale modern on-line transaction processing (O…

posted @ 2020-12-23 18:06 fxjwind Views (438) Comments (0) Recommendations (0)

Paper Analysis -- An Empirical Evaluation of In-Memory Multi-Version Concurrency Control
Summary: ABSTRACT. (Thesis) Multi-version concurrency control (MVCC) is currently the most popular transaction management scheme in modern database management system…

posted @ 2020-12-16 17:38 fxjwind Views (648) Comments (0) Recommendations (0)

Paper Analysis -- TiDB: A Raft-based HTAP Database
Summary: Abstract. Hybrid Transactional and Analytical Processing (HTAP) databases require processing transactional and analytical queries in isolation to remove the…

posted @ 2020-12-01 15:50 fxjwind Views (785) Comments (0) Recommendations (0)

Optimizing Druid with Roaring bitmaps
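Summary: This post is mainly about how to use compressed bitmaps to improve query performance. Although many bitmap compression schemes existed before, the newly proposed Roaring bitmap is more efficient. Reference: Better bitmap performance with Roaring bitmaps. Bitmap: in order to efficiently and compac…

posted @ 2020-04-27 15:52 fxjwind Views (452) Comments (0) Recommendations (0)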