Quantcast
Channel: 旁门左道 » Hadoop
Browsing all 10 articles
Browse latest View live

Hive安装Tips

Hive安装 下载地址 http://hive.apache.org/releases.html wget http://labs.renren.com/apache-mirror//hive/hive-0.6.0/hive-0.6.0-bin.tar.gz tar vxzf hive-0.6.0-bin.tar.gz sudo mv hive-0.6.0-bin...

View Article


hadoop thrift client

http://code.google.com/p/hadoop-sharp/ 貌似不给力,pass http://wiki.apache.org/hadoop/HDFS-APIs http://wiki.apache.org/hadoop/MountableHDFS http://wiki.apache.org/hadoop/Hbase/Stargate...

View Article


热门话题,时间及空目录的处理

  先查看hadoop目录的文件数,然后再决定是不是在input里面加上该目录 [dev@platformB dailyrawdata]$  hadoop fs -ls /trendingtopics |wc -l 3 计算时间的方法 [dev@platformB dailyrawdata]$ lastdate=20110619 [dev@platformB dailyrawdata]$ echo...

View Article

Hive derby lock及目录权限错误

FAILED: Error in metadata: javax.jdo.JDOFatalDataStoreException: Cannot get a connection, pool error Could not create a validated object, cause: A read-only user or a user in a read-only database is...

View Article

Hadoop and MapReduce: Big Data Analytics [gartner]

收藏,下载地址:http://dl.medcl.com/get.php?id=29&path=books%2Fgartner%2CHadoop+and+MapReduce+Big+Data+Analytics.7z Hadoop and MapReduce: Big Data Analytics 14 January 2011 Marcus Collins Gartner Burton...

View Article


流计算是什么东东?

  貌似现在正在流行流计算,流计算或流式计算主要用来做实时数据分析,如实时交易数据,广告,查询等,...

View Article

brisk调试部署全纪录

brisk快速测试记录。 参考链接: http://www.datastax.com/docs/0.8/brisk/about_pig 设置环境变量 vi /etc/profile   export BRISK_HOME=/usr/local/brisk-1.0 export PATH=$PATH:$BRISK_HOME/bin 生效 . /etc/profile On linux systems,...

View Article

how 2 run hadoop streaming job over brisk

/usr/local/brisk-1.0/bin/brisk hadoop jar /usr/local/brisk-1.0/resources/hadoop/hadoop-streaming-0.20.203.1-brisk1-beta2.jar \ -file /tmp/testmr/mapper.py \ -file /tmp/testmr/reducer.py \ -reducer...

View Article


clouderaCDH3国内源

贡献一个cloudra CDH3 国内源 #如何使用呢? #yum clean all &yum list updates   wget http://repo.medcl.net/cloudera-chd3-medcl.repo -O /etc/yum.repos.d/cloudera-cdh3-cn.repo   yum search hadoop yum -y install...

View Article


cloudra-manager修改使用自定义源

使用cloudra-manager来管理hadoop集群,但是官方源太慢了,搭本地源呗,另外repo写死在package里面了,将包解开,修改下,替换repo仓库地址为本地源即可。 cloudra-manager默认使用了它自己的源,如果要替换为本地的源,必须修改替换它的rpm包。   需要预先安装的打包工具: yum -y install rpm-build...

View Article
Browsing all 10 articles
Browse latest View live