Summary: http://www.coreservlets.com/hadoop-tutorial/
Summary:
1. Download and extract.
2. conf/hbase-env.sh
3. conf/hbase-site.xml
   hbase.rootdir: hdfs://master:54310/hbase
   hbase.cluster.distributed: true
   hbase.zookeeper.quorum: master,slave1,slave2
   ...
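The hbase-site.xml settings in this summary are name/value pairs that Hadoop-style configuration files wrap in `<property>` elements. The following is a minimal sketch of that layout, not the original post's file: `render_site_xml` is a hypothetical helper (not part of HBase), and only the three properties visible in the summary are rendered, since the rest of the file is elided.

```python
from xml.sax.saxutils import escape


def render_site_xml(properties):
    """Render a dict of name -> value as a Hadoop-style <configuration> document."""
    lines = ["<configuration>"]
    for name, value in properties.items():
        lines.append("  <property>")
        lines.append(f"    <name>{escape(name)}</name>")
        lines.append(f"    <value>{escape(value)}</value>")
        lines.append("  </property>")
    lines.append("</configuration>")
    return "\n".join(lines)


# Values copied from the summary; the host names and port are the post's own.
hbase_site = render_site_xml({
    "hbase.rootdir": "hdfs://master:54310/hbase",
    "hbase.cluster.distributed": "true",
    "hbase.zookeeper.quorum": "master,slave1,slave2",
})
print(hbase_site)
```

The printed text could be saved as conf/hbase-site.xml, assuming no other properties are needed on the cluster in question.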
Summary: http://jiajun.iteye.com/blog/899632
I. Introduction
History
Started by Chad Walters and Jim
2006.11 Google releases paper on BigTable
2007.2 Initial HBase prototype created as Hadoop contrib
2007.10 First usable HBase
2008.1 Hadoop becomes an Apache top-level project and HBase becomes a subproject
2008.10 HBase 0.18, 0.19 released
Summary: HBase is a distributed column-oriented database built on top of HDFS. HBase is the Hadoop application to use when you require real-time read/write random access to very large datasets. Applications store data into labeled tables. Tables are made of rows and columns. Table cells, the intersection of row a
Summary:
1. Download pig-0.12.0.tar.gz and extract it.
2. Set the environment variables:
vi .bash_profile
export PIG_HOME=/home/hduser/pig
export PATH=$PIG_HOME/bin:$HADOOP_HOME/bin:$HOME/bin:$PATH
#PATH=$PATH:$HOME/bin
export PATH
export PIG_CLASSPATH=$HADOOP_HOME/conf
$HADOOP_HOME/bin must be on the PATH.
3. pig -x local
A = LOAD '/home/hduser/hadoop/access_log.txt'
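The note that $HADOOP_HOME/bin must be on the PATH can be checked before starting Pig. This is a rough sketch under stated assumptions: `missing_from_path` is a hypothetical helper, the PATH is assumed to be colon-separated as in a bash profile, and the HADOOP_HOME value in the example is a guess, since the summary does not state it.

```python
def missing_from_path(env):
    """Return the bin directories ($PIG_HOME/bin, $HADOOP_HOME/bin) absent from PATH."""
    # Normalise entries so "/x/bin/" and "/x/bin" compare equal.
    entries = {e.rstrip("/") for e in env.get("PATH", "").split(":") if e}
    missing = []
    for var in ("PIG_HOME", "HADOOP_HOME"):
        home = env.get(var)
        if not home or home.rstrip("/") + "/bin" not in entries:
            missing.append("$" + var + "/bin")
    return missing


example_env = {
    "PIG_HOME": "/home/hduser/pig",            # from the summary
    "HADOOP_HOME": "/home/hduser/hadoop",      # assumed, not stated in the summary
    "PATH": "/home/hduser/pig/bin:/home/hduser/hadoop/bin:/home/hduser/bin:/usr/bin",
}
print(missing_from_path(example_env))
```

In a real shell session the same function could be given `dict(os.environ)` after sourcing .bash_profile; an empty list means both directories are present.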
Summary:
1. hdfs-site.xml
   dfs.replication: 2 (Default block replication.)
   dfs.http.address: master:50070
   dfs.secondary.http.address: slave1:50090
2. master records the address of the SecondaryNameNode node
# slave1 is the SecondaryNameNode
slave1
check
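The hdfs-site.xml values in this summary appear flattened from XML. The sketch below shows one plausible reconstruction and reads it back; it assumes the standard Hadoop `<property>` layout and that "Default block replication." was the description of dfs.replication. `read_site_xml` is a hypothetical helper, not a Hadoop API.

```python
import xml.etree.ElementTree as ET

# Assumed reconstruction of the flattened hdfs-site.xml from the summary.
HDFS_SITE_XML = """\
<configuration>
  <property>
    <name>dfs.replication</name>
    <value>2</value>
    <description>Default block replication.</description>
  </property>
  <property>
    <name>dfs.http.address</name>
    <value>master:50070</value>
  </property>
  <property>
    <name>dfs.secondary.http.address</name>
    <value>slave1:50090</value>
  </property>
</configuration>
"""


def read_site_xml(text):
    """Parse a Hadoop-style configuration document into a name -> value dict."""
    root = ET.fromstring(text)
    return {p.findtext("name"): p.findtext("value") for p in root.iter("property")}


settings = read_site_xml(HDFS_SITE_XML)
# The host part of the secondary address is the node expected to run the SecondaryNameNode.
secondary_host = settings["dfs.secondary.http.address"].split(":")[0]
print(settings["dfs.replication"], secondary_host)  # prints "2 slave1"
```

Reading the file back this way is a quick check that the SecondaryNameNode host in hdfs-site.xml matches the host recorded in step 2.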
Summary: Configure SSH
1. Add a dedicated Hadoop system user:
sudo groupadd hadoop
sudo useradd -G hadoop hduser
sudo passwd hduser
2. Generate an SSH private key:
ssh-keyg...
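Summary: Reposted from http://www.linuxidc.com/Linux/2012-06/62304.htm
zookeeper.session.timeout
Default: 3 minutes (180000 ms)
Description: The connection timeout between a RegionServer and ZooKeeper. When the timeout expires, ZooKeeper removes the RegionServer from the RS cluster list. Once the HMaster receives the removal notification, it rebalances the regions that server was responsible for so that the surviving RegionServers take them over.
Tuning: This timeout determines whether a RegionServer can fail over promptly. Setting it to 1 minute or lower can reduce the time lost waiting for the timeout ...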
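The tuning advice is stated in minutes while the property value is in milliseconds, so the conversion is worth spelling out. This is a small sketch only: `minutes_to_ms` and `session_timeout_property` are hypothetical helpers, and the snippet assumes the property is set in hbase-site.xml using the usual `<property>` layout.

```python
def minutes_to_ms(minutes):
    """Convert minutes to whole milliseconds."""
    return int(minutes * 60 * 1000)


DEFAULT_TIMEOUT_MS = minutes_to_ms(3)  # 180000, the default quoted in the summary
TUNED_TIMEOUT_MS = minutes_to_ms(1)    # 60000, the "1 minute" suggestion


def session_timeout_property(timeout_ms):
    """Render the zookeeper.session.timeout entry as a <property> fragment."""
    return (
        "<property>\n"
        "  <name>zookeeper.session.timeout</name>\n"
        f"  <value>{timeout_ms}</value>\n"
        "</property>"
    )


print(session_timeout_property(TUNED_TIMEOUT_MS))
```

A shorter timeout makes failover start sooner, but whether 60000 ms is safe for a given cluster is not something the summary settles.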