April 2014 Archive

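Summary: Three servers (n0, n2, n3) running CentOS 6.4 x64 with JDK, Scala 2.11, Hadoop 2.2.0, and spark-0.9.1-bin-hadoop2.tgz. Notes: 1. Install Scala on all machines. 2. Install Spark on all machines; it can be configured on the master and then copied to the remaining nodes with scp. Read more
posted @ 2014-04-24 17:10 GrantYu Views(492) Comments(0) Recommended(0)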
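Summary: Inverted Index. An inverted index is an index structure that stores a mapping from words to their locations in one or more documents. It is usually implemented with an associative array. It comes in two forms: the inverted file index, of the form {term, IDs of the documents containing the term}; and the full inverted index, whose form... Read more
posted @ 2014-04-16 17:22 GrantYu Views(1338) Comments(0) Recommended(1)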
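The two forms named in the inverted index summary can be illustrated with a minimal Python sketch. The summary is cut off before it defines the full inverted index, so this sketch assumes the common definition: each term maps to (document ID, word position) pairs. The function names, the whitespace tokenizer, and the sample documents are all hypothetical and are not taken from the original post.

```python
from collections import defaultdict


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def build_inverted_file_index(docs):
    """Map each term to the sorted list of IDs of the documents containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in tokenize(text):
            index[term].add(doc_id)
    return {term: sorted(ids) for term, ids in index.items()}


def build_full_inverted_index(docs):
    """Map each term to (doc ID, word position) pairs.

    Assumption: this is the usual meaning of "full inverted index";
    the source summary is truncated before its own definition.
    """
    index = defaultdict(list)
    for doc_id in sorted(docs):
        for position, term in enumerate(tokenize(docs[doc_id])):
            index[term].append((doc_id, position))
    return dict(index)


# Hypothetical sample documents, keyed by document ID.
docs = {
    0: "it is what it is",
    1: "what is it",
    2: "it is a banana",
}

file_index = build_inverted_file_index(docs)
full_index = build_full_inverted_index(docs)

print(file_index["banana"])  # [2]
print(full_index["what"])    # [(0, 2), (1, 0)]
```

Both indexes are plain dictionaries, which matches the associative-array implementation the summary mentions. The positional form costs more space but makes phrase queries possible.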
Summary: Creating a Hadoop-2.x project in Eclipse: http://snap.stanford.edu/class/cs246-data-2014/hw0.pdf; Hadoop WordCount with new map reduce api: http://codesfusion.blogspot.com/2013/10/hadoop-wordcount-with-new-map-reduce-api.html Read more
posted @ 2014-04-09 23:58 GrantYu Views(251) Comments(0) Recommended(0)
Summary: IntelliJ Project for Building Hadoop – The Definitive Guide Examples: http://vichargrave.com/intellij-project-for-building-hadoop-the-definitive-guide-examples/ Read more
posted @ 2014-04-09 22:51 GrantYu Views(181) Comments(0) Recommended(0)
Summary: Creating a Hadoop-2.x project in Eclipse. hortonworks: MapReduce Ports: http://docs.hortonworks.com/HDPDocuments/HDP1/HDP-1.2.0/bk_reference/content/reference_chap2_2.html; hadoop-1.x cluster default and commonly used configuration: http://www.cnblogs.com/ggjucheng/archive/2012/04/17/2454590.html; Setting up a Hadoop-2.x development environment in Eclipse {good}: http://blog.csdn.n Read more
posted @ 2014-04-09 19:16 GrantYu Views(382) Comments(0) Recommended(0)
Summary: Create a Hadoop Build and Development Environment: http://vichargrave.com/create-a-hadoop-build-and-development-environment-for-hadoop/; Debugging Hadoop Applications with IntelliJ: http://vichargrave.com/debugging-hadoop-applications-with-intellij/ Read more
posted @ 2014-04-07 15:31 GrantYu Views(166) Comments(0) Recommended(0)
Summary: Building the Eclipse plugin for Hadoop-2.3.0. # cd /usr/local/src/hadoop2x-eclipse-plugin-master/src/contrib/eclipse-plugin # ant jar -Dversion=2.3.0 -Declipse.home=/usr/local/eclipse -Dhadoop.home=/home/hm/hadoop The build itself is simple. A common problem: ivy-2.1.0.jar cannot be downloaded because of a proxy issue, so a proxy must be configured. Can't get http://repo2.maven.org/maven2/org/apache/ivy/ivy/2. Read more
posted @ 2014-04-03 15:43 GrantYu Views(1425) Comments(3) Recommended(0)
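Summary: Storm cluster installation and deployment steps [detailed version]. Assumptions: 1. JDK, Python, and unzip are installed; 2. a ZooKeeper cluster is already set up. 1. Install the Storm dependencies on the Nimbus and Supervisor machines. 1.1 ZeroMQ: $ ./configure $ make $ sudo make install 1.2 JZMQ: $ ./autogen.sh $ ./configure $ make $ sudo make install ZMQ and JZMQ are installed under /usr/local/lib by default. 2. Download and unpack a Storm release: https://github.com/nathanmarz/storm Read more
posted @ 2014-04-03 11:48 GrantYu Views(253) Comments(0) Recommended(0)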
Summary: How-to: Use HBase Bulk Loading, and Why: http://blog.cloudera.com/blog/2013/09/how-to-use-hbase-bulk-loading-and-why/ Read more
posted @ 2014-04-03 11:47 GrantYu Views(147) Comments(0) Recommended(0)
Summary: HBase distributed installation (hbase-0.98.0-hadoop2-bin.tar). Prerequisite: Hadoop and ZooKeeper are already installed. Hadoop port: 9000. ZooKeeper port: 2181, dir: /var/lib/zookeeper. [hm@n0 ~]$ tar -zxv... Read more
posted @ 2014-04-03 11:46 GrantYu Views(284) Comments(0) Recommended(0)
Summary: Installing a ZooKeeper cluster from zookeeper-3.4.5.tar.gz. Nodes: n0 10.69.10.193, n1 10.69.10.188, n2 10.69.10.192. Unpack zookeeper-3.4.5.tar.gz on n0: [hm@n0 ~]$ tar -zxvf zookeeper-3.4.5... Read more
posted @ 2014-04-03 11:43 GrantYu Views(383) Comments(0) Recommended(0)
Summary: root/finance, hm/finance. 1. Install the tigervnc-server VNC package: [root@localhost ~]# yum install tigervnc-server [root@localhost ~]# rpm -qc tigervnc-server /etc/sysco... Read more
posted @ 2014-04-01 14:38 GrantYu Views(2261) Comments(0) Recommended(0)
Summary: Apache Hadoop 2.2.0 - How to Install a Three Nodes Cluster: http://tonylixu.blogspot.ca/2014/02/apache-hadoop-how-to-install-three.html; CentOS 6.5 Hadoop 2.2.0 fully distributed installation: http://xjliao.me/2014/03/21/hadoop-2.2.0-cluster-setup.html; cluster: n0, n1, n2; n0: NameNode, ResourceManager; n Read more
posted @ 2014-04-01 14:00 GrantYu Views(412) Comments(0) Recommended(0)