Kettle启动及简单操作(1)

官方Hadoop配置

http://wiki.pentaho.com/display/BAD/Configuring+Pentaho+for+your+Hadoop+Distro+and+Version

1.官网下载kettle

 

http://community.pentaho.com/projects/data-integration/

 

https://sourceforge.net/projects/pentaho/files/Data%20Integration/6.1/pdi-ce-6.1.0.1-196.zip/download

 

2.解压kettle

 

3.进入目录运行kettle

 

Windows下双击spoon.bat

 

Linux下运行

 

sh spoon.sh

 

 

4.配置kettle连接hadoop

 

 

 

 

1)修改

 

E:\pdi-ce-6.1.0.1-196 power\data-integration\plugins\pentaho-big-data-plugin\plugin.properties

 

 

 

 

修改此文件中active.hadoop.configuration=hdp24

 

 Copy文件core-site.xml, hdfs-site.xml, httpfs-site.xml, mapred-site.xml, yarn-site.xml到E:\pdi-ce-6.1.0.1-196 power\data-integration\plugins\pentaho-big-data-plugin\hadoop-configurations\hdp24下

 

 

 

修改文件中主机名为ip地址

 

重启spoon.bat进入图形化界面

 

I 新建转换

 

 

 

II 新建job

 

 

 

posted @ 2016-07-30 13:53  派。  阅读(3228)  评论(0编辑  收藏  举报