打赏
摘要: 不多说,直接上干货! 一切来源于官网 Kafka for Stream Processing It isn't enough to just read, write, and store streams of data, the purpose is to enable real-time proc 阅读全文
posted @ 2017-04-25 23:33 大数据和AI躺过的坑 阅读(435) 评论(0) 推荐(0)
摘要: 不多说,直接上干货! 一切来源于官网 Kafka as a Storage System Any message queue that allows publishing messages decoupled from consuming them is effectively acting as 阅读全文
posted @ 2017-04-25 23:32 大数据和AI躺过的坑 阅读(258) 评论(0) 推荐(0)
摘要: 不多说,直接上干货! 一切来源于官网 Kafka as a Messaging System How does Kafka's notion of streams compare to a traditional enterprise messaging system? Messaging trad 阅读全文
posted @ 2017-04-25 23:28 大数据和AI躺过的坑 阅读(230) 评论(0) 推荐(0)
摘要: 不多说,直接上干货! 一切来源于官网 Guarantees At a high-level Kafka gives the following guarantees: Messages sent by a producer to a particular topic partition will b 阅读全文
posted @ 2017-04-25 23:26 大数据和AI躺过的坑 阅读(274) 评论(0) 推荐(0)
摘要: 不多说,直接上干货! 一切来源于官网 Consumers Consumers label themselves with a consumer group name, and each record published to a topic is delivered to one consumer 阅读全文
posted @ 2017-04-25 23:18 大数据和AI躺过的坑 阅读(484) 评论(0) 推荐(0)
摘要: 不多说,直接上干货! 一切来源于官网 Producers Producers publish data to the topics of their choice. The producer is responsible for choosing which record to assign to 阅读全文
posted @ 2017-04-25 23:12 大数据和AI躺过的坑 阅读(301) 评论(0) 推荐(0)
摘要: 不多说,直接上干货! 一切来源于官网 Distribution The partitions of the log are distributed over the servers in the Kafka cluster with each server handling data and req 阅读全文
posted @ 2017-04-25 23:03 大数据和AI躺过的坑 阅读(284) 评论(0) 推荐(0)
摘要: 不多说,直接上干货! 一切来源于官网 Topics and Logs Let's first dive into the core abstraction Kafka provides for a stream of records—the topic. A topic is a category 阅读全文
posted @ 2017-04-25 23:01 大数据和AI躺过的坑 阅读(399) 评论(0) 推荐(0)
摘要: 不多说,直接上干货! 一切来源于官网 Apache Kafka™ is a distributed streaming platform. What exactly does that mean? We think of a streaming platform as having three ke 阅读全文
posted @ 2017-04-25 22:59 大数据和AI躺过的坑 阅读(591) 评论(0) 推荐(0)
摘要: 看kafka消费者信息? 进入cd $ZOOKEEPER_HOME/bin下。 阅读全文
posted @ 2017-04-25 22:25 大数据和AI躺过的坑 阅读(17526) 评论(0) 推荐(0)
摘要: 什么是kafka同步生产者,什么是kafka异步生产者? 比如这里某个topic有3个分区。 kafka同步生产者:这个生产者写一条消息的时候,它就立马发送到某个分区去。 kafka异步生产者:这个生产者写一条消息的时候,先是写到某个缓冲区,这个缓冲区里的数据还没写到broker集群里的某个分区的时 阅读全文
posted @ 2017-04-25 21:45 大数据和AI躺过的坑 阅读(6711) 评论(0) 推荐(2)
摘要: 不多说,直接上干货! Flume Sources官网剖析(博主推荐) Flume Channels官网剖析(博主推荐) Flume Channel Selectors官网剖析(博主推荐) Flume Sinks官网剖析(博主推荐) Flume Sink Processors官网剖析(博主推荐) Ev 阅读全文
posted @ 2017-04-25 15:40 大数据和AI躺过的坑 阅读(458) 评论(0) 推荐(0)
摘要: 不多说,直接上干货! Flume Sources官网剖析(博主推荐) Flume Channels官网剖析(博主推荐) Flume Channel Selectors官网剖析(博主推荐) Flume Sinks官网剖析(博主推荐) Flume Sink Processors官网剖析(博主推荐) 一切 阅读全文
posted @ 2017-04-25 15:35 大数据和AI躺过的坑 阅读(275) 评论(0) 推荐(0)
摘要: 不多说,直接上干货! Flume Sources官网剖析(博主推荐) Flume Channels官网剖析(博主推荐) Flume Channel Selectors官网剖析(博主推荐) Flume Sinks官网剖析(博主推荐) 一切来源于flume官网 Flume Sink Processors 阅读全文
posted @ 2017-04-25 15:32 大数据和AI躺过的坑 阅读(494) 评论(0) 推荐(0)
摘要: 不多说,直接上干货! Flume Sources官网剖析(博主推荐) Flume Channels官网剖析(博主推荐) Flume Channel Selectors官网剖析(博主推荐) 一切来源于flume官网 Flume Sinks HDFS Sink Hive Sink Logger Sink 阅读全文
posted @ 2017-04-25 15:28 大数据和AI躺过的坑 阅读(755) 评论(0) 推荐(0)
摘要: 不多说,直接上干货! Flume Sources官网剖析(博主推荐) Flume Channels官网剖析(博主推荐) 一切来源于flume官网 Flume Channel Selectors Replicating Channel Selector (default) Multiplexing C 阅读全文
posted @ 2017-04-25 15:07 大数据和AI躺过的坑 阅读(337) 评论(0) 推荐(0)
摘要: 不多说,直接上干货! Flume Sources官网剖析(博主推荐) 一切来源于flume官网 Flume Channels Memory Channel JDBC Channel Kafka Channel File Channel Spillable Memory Channel Pseudo 阅读全文
posted @ 2017-04-25 14:55 大数据和AI躺过的坑 阅读(749) 评论(0) 推荐(0)
摘要: 不多说,直接上干货! 一切来源于flume官网 Flume Sources Avro Source Thrift Source Exec Source JMS Source Converter Spooling Directory Source Event Deserializers LINE AV 阅读全文
posted @ 2017-04-25 12:07 大数据和AI躺过的坑 阅读(1972) 评论(0) 推荐(0)
摘要: 进入我这篇博客的博友们,相信你们具备有一定的spark学习基础和实践了。 先给大家来梳理下。spark的运行模式和常用的standalone、yarn部署。这里不多赘述,自行点击去扩展。 1、Spark运行模式概述 2、Spark standalone模式的安装(spark-1.6.1-bin-ha 阅读全文
posted @ 2017-04-25 11:50 大数据和AI躺过的坑 阅读(503) 评论(0) 推荐(0)