Apache Flink - 随笔分类(第3页) - fxjwind

flink - accumulator

摘要：读accumlator JobManager 在job finish的时候会汇总accumulator的值， newJobStatus match { case JobStatus.FINISHED => try { val accumulatorResults = executionGraph.getAccumulatorsSerialized() va... 阅读全文

posted @ 2016-08-10 17:28 fxjwind 阅读(1723) 评论(0) 推荐(0) 编辑

摘要：http://wuchong.me/blog/2016/04/26/flink-internals-how-to-handle-backpressure/ https://ci.apache.org/projects/flink/flink-docs-master/internals/back_pressure_monitoring.html 反压之所以会是需要的，因为当源端流量过大，而... 阅读全文

posted @ 2016-08-03 15:33 fxjwind 阅读(775) 评论(0) 推荐(0) 编辑

Flink - DataStream

摘要：先看例子， final StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment(); DataStream> stream = env.addSource(...); stream .keyBy(0) .timeWindow(Time.of(2500, MILLI... 阅读全文

posted @ 2016-07-26 09:48 fxjwind 阅读(2733) 评论(0) 推荐(0) 编辑

Flink - FlinkKafkaConsumer08

摘要：先看 AbstractFetcher 这个可以理解就是，consumer中具体去kafka读数据的线程，一个fetcher可以同时读多个partitions的数据来看看 /** * Base class for all fetchers, which implement the connections to Kafka brokers and * pull records f... 阅读全文

posted @ 2016-07-06 20:11 fxjwind 阅读(3077) 评论(0) 推荐(0) 编辑

Flink - Working with State

摘要：https://ci.apache.org/projects/flink/flink-docs-release-1.3/dev/stream/state.html#custom-serialization-for-managed-state All transformations in Flink may look like functions (in the functional pro... 阅读全文

posted @ 2016-07-01 15:45 fxjwind 阅读(2796) 评论(0) 推荐(0) 编辑

Why Apache Beam? A data Artisans perspective

摘要：https://cloud.google.com/dataflow/blog/dataflow-beam-and-spark-comparison https://github.com/apache/incubator-beam https://www.oreilly.com/ideas/the-world-beyond-batch-streaming-101 https://www.ore... 阅读全文

posted @ 2016-06-30 18:20 fxjwind 阅读(742) 评论(0) 推荐(0) 编辑

Flink - Generating Timestamps / Watermarks

摘要：https://ci.apache.org/projects/flink/flink-docs-release-1.0/apis/streaming/event_timestamps_watermarks.html To work with Event Time, streaming programs need to set the time characteristic accordin... 阅读全文

posted @ 2016-06-29 15:58 fxjwind 阅读(2732) 评论(3) 推荐(0) 编辑

Flink DataStream API Programming Guide

摘要：Example Program The following program is a complete, working example of streaming window word count application, that counts the words coming from a w 阅读全文

posted @ 2016-04-26 12:07 fxjwind 阅读(5775) 评论(6) 推荐(0) 编辑

Flink Internals

摘要：https://cwiki.apache.org/confluence/display/FLINK/Flink+Internals Memory Management (Batch API) Introduction Memory management in Flink serves the purpose to control how much memory certain runt... 阅读全文

posted @ 2016-04-08 16:58 fxjwind 阅读(1211) 评论(0) 推荐(1) 编辑

Off-heap Memory in Apache Flink and the curious JIT compiler

摘要：https://flink.apache.org/news/2015/09/16/off-heap-memory.html Running data-intensive code in the JVM and making it well-behaved is tricky. Systems that put billions of data objects naively onto th... 阅读全文

posted @ 2016-04-08 11:20 fxjwind 阅读(1014) 评论(0) 推荐(0) 编辑

Peeking into Apache Flink's Engine Room

摘要：http://flink.apache.org/news/2015/03/13/peeking-into-Apache-Flinks-Engine-Room.html Join Processing in Apache Flink In this blog post, we cut through Apache Flink’s layered architecture and take ... 阅读全文

posted @ 2016-04-06 14:41 fxjwind 阅读(933) 评论(0) 推荐(0) 编辑

Flink - Juggling with Bits and Bytes

摘要：http://www.36dsj.com/archives/33650 http://flink.apache.org/news/2015/05/11/Juggling-with-Bits-and-Bytes.html http://www.bigsynapse.com/addressing-big-data-performance ，addressing-big-data-performan... 阅读全文

posted @ 2016-03-29 20:02 fxjwind 阅读(911) 评论(0) 推荐(0) 编辑

Flink DataSet API Programming Guide

摘要：https://ci.apache.org/projects/flink/flink-docs-release-0.10/apis/programming_guide.html Example Program 编程的风格和spark很类似， ExecutionEnvironment -- SparkContext DataSet – RDD Transformations 这... 阅读全文

posted @ 2016-01-29 13:50 fxjwind 阅读(3142) 评论(0) 推荐(0) 编辑

fxjwind

随笔分类 - Apache Flink