Summary: https://avro.apache.org/docs/current/ Introduction Apache Avro™ is a data serialization system. Avro provides: Rich data structures. A compact, fast, ...
posted @ 2017-10-31 23:45 papering Views(210) Comments(0) Recommendations(0)
Summary: https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/ArchivalStorage.html Introduction Archival Storage is a solution to decouple gr...
posted @ 2017-10-31 23:38 papering Views(309) Comments(0) Recommendations(0)
Summary: splittability CompressedStorage
posted @ 2017-10-31 23:26 papering Views(313) Comments(0) Recommendations(0)
Summary: http://cis.stvincent.edu/html/tutorials/swd/btree/btree.html Introduction A B-tree is a specialized multiway tree designed especially for use on disk. ...
posted @ 2017-10-31 21:51 papering Views(170) Comments(0) Recommendations(0)
Summary: https://kafka.apache.org/intro.html
posted @ 2017-10-31 17:03 papering Views(164) Comments(0) Recommendations(0)
Summary: https://kafka.apache.org/intro.html Kafka as a Messaging System How does Kafka's notion of streams compare to a traditional enterprise messaging syste...
posted @ 2017-10-31 12:03 papering Views(199) Comments(0) Recommendations(0)
Summary: limit
posted @ 2017-10-31 11:22 papering Views(152) Comments(0) Recommendations(0)
Summary: rmds mapper
posted @ 2017-10-31 11:21 papering Views(126) Comments(0) Recommendations(0)
Summary: https://spark.apache.org/sql/ Performance & Scalability Spark SQL includes a cost-based optimizer, columnar storage and code generation to make querie...
posted @ 2017-10-31 00:10 papering Views(171) Comments(0) Recommendations(0)