摘要: Apache Kafka is optimized for small messages. According to benchmarks, the best performance occurs with 1 KB messages. Larger messages (for example, 1 阅读全文
posted @ 2018-05-25 16:53 大数据从业者FelixZh 阅读(575) 评论(0) 推荐(0)
摘要: To achieve high availability and consistency targets, adjust the following parameters to meet your requirements: Replication Factor Preferred Leader E 阅读全文
posted @ 2018-05-25 16:52 大数据从业者FelixZh 阅读(413) 评论(0) 推荐(0)
摘要: This topic describes additional steps you can take to ensure the safety and integrity of your data stored in Apache Kafka, with features available in 阅读全文
posted @ 2018-05-25 16:50 大数据从业者FelixZh 阅读(814) 评论(0) 推荐(0)
摘要: I wrote a blog post about how LinkedIn uses Apache Kafka as a central publish-subscribe log for integrating data between applications, stream processi 阅读全文
posted @ 2018-05-25 16:33 大数据从业者FelixZh 阅读(588) 评论(0) 推荐(0)
摘要: 如果某个broker挂了,leader副本在该broker上的分区就要重新进行leader选举。来简要描述下leader选举的过程 1.4.1 KafkaController会监听ZooKeeper的/brokers/ids节点路径,一旦发现有broker挂了,执行下面的逻辑。这里暂时先不考虑Kaf 阅读全文
posted @ 2018-05-25 16:02 大数据从业者FelixZh 阅读(2165) 评论(1) 推荐(0)
摘要: 消费端出现offset重置为latest, earliest现象,类似log: 原因:该consumer消费的topic的leader和followers的状态不一致时,发生leader切换,会发生offset out of range,此时consumer进行消费时发现offset非法,会进行of 阅读全文
posted @ 2018-05-25 15:20 大数据从业者FelixZh 阅读(3877) 评论(0) 推荐(0)
摘要: MapReduce简介 在Hadoop MapReduce中,框架会确保reduce收到的输入数据是根据key排序过的。数据从Mapper输出到Reducer接收,是一个很复杂的过程,框架处理了所有问题,并提供了很多配置项及扩展点。一个MapReduce的大致数据流如下图: 更详细的MapReduc 阅读全文
posted @ 2018-05-25 15:03 大数据从业者FelixZh 阅读(1083) 评论(0) 推荐(1)
摘要: Cloudera recently announced formal support for Apache Kafka. This simple use case illustrates how to make web log analysis, powered in part by Kafka, 阅读全文
posted @ 2018-05-25 14:32 大数据从业者FelixZh 阅读(510) 评论(0) 推荐(0)
摘要: The new integration between Flume and Kafka offers sub-second-latency event processing without the need for dedicated infrastructure. In this previous 阅读全文
posted @ 2018-05-25 14:26 大数据从业者FelixZh 阅读(691) 评论(0) 推荐(0)
摘要: The versatility of Apache Spark’s API for both batch/ETL and streaming workloads brings the promise of lambda architecture to the real world. Few thin 阅读全文
posted @ 2018-05-25 14:17 大数据从业者FelixZh 阅读(385) 评论(0) 推荐(0)
摘要: http://www.cnblogs.com/xuliangxing/p/7151812.html 本文对Redis的过期机制简单的讲解一下 讲解之前我们先抛出一个问题,我们知道很多时候服务器经常会用到redis作为缓存,有很多数据都是临时缓存一下,可能用过之后很久都不会再用到了(比如暂存sessi 阅读全文
posted @ 2018-05-25 09:15 大数据从业者FelixZh 阅读(2962) 评论(0) 推荐(0)
大数据从业者