摘要: https://avro.apache.org/docs/current/ Introduction Apache Avro™ is a data serialization system. Avro provides: Rich data structures. A compact, fast, 阅读全文
posted @ 2017-10-31 23:45 papering 阅读(206) 评论(0) 推荐(0)
摘要: https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/ArchivalStorage.html Introduction Archival Storage is a solution to decouple gr 阅读全文
posted @ 2017-10-31 23:38 papering 阅读(305) 评论(0) 推荐(0)
摘要: splittability CompressedStorage CompressedStorage CompressedStorage Skip to end of metadata Created by Confluence Administrator, last modified by Left 阅读全文
posted @ 2017-10-31 23:26 papering 阅读(309) 评论(0) 推荐(0)
摘要: http://cis.stvincent.edu/html/tutorials/swd/btree/btree.html Introduction A B-tree is a specialized multiway tree designed especially for use on disk. 阅读全文
posted @ 2017-10-31 21:51 papering 阅读(165) 评论(0) 推荐(0)
摘要: https://kafka.apache.org/intro.html 阅读全文
posted @ 2017-10-31 17:03 papering 阅读(159) 评论(0) 推荐(0)
摘要: https://kafka.apache.org/intro.html Kafka as a Messaging System How does Kafka's notion of streams compare to a traditional enterprise messaging syste 阅读全文
posted @ 2017-10-31 12:03 papering 阅读(198) 评论(0) 推荐(0)
摘要: limit 阅读全文
posted @ 2017-10-31 11:22 papering 阅读(151) 评论(0) 推荐(0)
摘要: rmds mapper 阅读全文
posted @ 2017-10-31 11:21 papering 阅读(123) 评论(0) 推荐(0)
摘要: https://spark.apache.org/sql/ Performance & Scalability Spark SQL includes a cost-based optimizer, columnar storage and code generation to make querie 阅读全文
posted @ 2017-10-31 00:10 papering 阅读(168) 评论(0) 推荐(0)