06 2015 档案

摘要:如果有表中有重复键,做表连接后,可能会产生很大的数据表A 有两个重复键a:col1 col2a 1a 2表B 有两个重复键a:col1 col2a 3a 4做内连接:select * from A join B on A.col1 = B.col1, 结果a 1 ... 阅读全文
posted @ 2015-06-09 20:35 lihui1625 阅读(128) 评论(0) 推荐(0)
摘要:1. 元数据配置 配置属性 javax.jdo.option.ConnectionURL, javax.jdo.option.ConnectionDriverName 等 2. 集合数据类型 STRUCT: MAP: ARRAY: 3.分割符 \n 分行 \001 分列 \002 分隔ARRAY、S 阅读全文
posted @ 2015-06-03 21:16 lihui1625 阅读(157) 评论(0) 推荐(0)
摘要:1. Word Count 例子 inputfile = load 'file' as (line); 内容: {line: bytearray} (Look at the stars, ) (Look how they shine for you, ) (And everything you do 阅读全文
posted @ 2015-06-03 20:38 lihui1625 阅读(160) 评论(0) 推荐(0)
摘要:Pig: 1. 内连接: join_table = join table1 by col1, join table2 by col2 2. 外连接: 左外连接:join_table = join table1 by col1 left outer, join table2 by col2 其中out 阅读全文
posted @ 2015-06-02 18:11 lihui1625 阅读(133) 评论(0) 推荐(0)