Summary:
On this page, I discuss linear regression, the "hello world" model, and train it in two different ways. One is the "closed-form" Read more
Summary:
Linear System; Vector Equations; The Matrix Equation; Solution Sets of Linear Systems; Linear Independence; Introduction to Linear Transformation; The Matri Read more
Summary:
If a tree is not balanced, searching for a given key is inefficient; in the worst case it is no more efficient than a linked list. Self-bal Read more
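The worst case mentioned in this summary can be illustrated with a minimal sketch. It assumes a plain binary search tree with no rebalancing; the names `Node`, `insert`, `height`, and `build` are hypothetical and are not taken from the post itself. Inserting keys in sorted order produces a chain whose height equals the number of keys, which is the linked-list behaviour described above.

```python
class Node:
    """A node of a plain (non-balancing) binary search tree."""
    def __init__(self, key):
        self.key = key
        self.left = None
        self.right = None


def insert(root, key):
    """Insert key iteratively and return the (possibly new) root."""
    if root is None:
        return Node(key)
    cur = root
    while True:
        if key < cur.key:
            if cur.left is None:
                cur.left = Node(key)
                break
            cur = cur.left
        else:
            if cur.right is None:
                cur.right = Node(key)
                break
            cur = cur.right
    return root


def height(root):
    """Number of nodes on the longest root-to-leaf path."""
    best = 0
    stack = [(root, 1)] if root is not None else []
    while stack:
        node, depth = stack.pop()
        best = max(best, depth)
        for child in (node.left, node.right):
            if child is not None:
                stack.append((child, depth + 1))
    return best


def build(keys):
    """Build a tree by inserting keys in the given order."""
    root = None
    for k in keys:
        root = insert(root, k)
    return root


# Sorted insertion order: every node has only a right child.
sorted_height = height(build(range(1, 16)))
# Same 15 keys, inserted level by level: a perfectly balanced tree.
mixed_height = height(build([8, 4, 12, 2, 6, 10, 14, 1, 3, 5, 7, 9, 11, 13, 15]))
print(sorted_height, mixed_height)
```

With the same 15 keys, the sorted insertion order needs up to 15 comparisons per lookup, while the balanced shape needs at most 4.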
Summary:
Why? Look at the following two pieces of code implementing a simple socket-based web server. Can you point out the problems (I put them in the com Read more
Summary:
We are going to explain how joins work in MR, focusing on the reduce-side join and the map-side join. Reduce Side Join Assuming we have 2 datasets, one Read more
Summary:
Map Reduce Application (Partitioning/Grouping data by a defined key) Assuming we want to group data by the year (2008 to 2016) of their [last access date Read more
Summary:
Top 10 IDs based on their value First, we need to set the number of reduce tasks to 1. For each map task, it is not a good idea to output each key/value pair. Instead Read more
Summary:
On this page, I will explain the following important MR concepts. 1) Job: how the job is initialized and executed. 2) MR components: how they work to process Read more
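Summary:
Why write in English? Most of the knowledge and technology I pick up comes through English, so writing things down in English according to my own understanding comes fairly easily. Besides, many terms cannot really be translated. To keep learning technology, one has to get used not only to reading and listening but also to writing and speaking. Unfortunately, since leaving IBM I have had few chances to speak with people, so I can only write. I treat it as a way to improve my English. I am going Read more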
Summary:
HDFS Architecture The core of Hadoop/distributed systems is storage (HDFS) and the resource manager (YARN) for the computing engines built on it. Master/Slave: The charac Read more
Summary:
Overview YARN provides an API not for application developers but for the great developers working on new computing engines. YARN makes it easy and unified Read more
Summary:
Definition of Thread Safety "A class is thread-safe if it behaves correctly when accessed from multiple threads, regardless of the scheduling or interleaving of the execut Read more