How to set up a MapReduce HBase Cluster
1) Create an instance at https://openstack.risc-software.at
Image Name: Ubuntu Server 12.04 LTS
Flavor: m1.largediskdrive
Security Groups: <GRP>
Associate Floating IP: <IP>
2) Set passwords, install git, and clone the repository
ssh -i path/to/<GRP> root@<IP>
passwd
passwd cloud
apt-get update
apt-get install git
cd /home/cloud/
mkdir chd5setup
cd chd5setup
git clone <GITREPO>
3) On the Master:
Download the Oracle Java 7 Update 45 archive and save it as ./oracle-java-7-update-45.tar.gz, then run:
python 0_newid.py
4) On every node (including the Master):
Run the install scripts in this order:
sudo python 1_ssh.py
sudo python 2_java.py
sudo python 3_network_conf.py
sudo python 4_chd5.py
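The four install scripts above are numbered, which suggests they are meant to run in order. A minimal sketch of a wrapper that stops at the first failure is shown here. The function name run_install_scripts and the RUNNER variable are assumptions made for illustration (RUNNER only exists so the loop can be dry-run without sudo); neither is part of the repository.

```shell
#!/bin/sh
# Sketch: run the numbered install scripts in order, stopping at the first failure.
# RUNNER is a hypothetical override for dry runs; the default matches the README.
run_install_scripts() {
    runner="${RUNNER:-sudo python}"
    for script in 1_ssh.py 2_java.py 3_network_conf.py 4_chd5.py; do
        echo "==> $script"
        # $runner is left unquoted on purpose so "sudo python" splits into two words
        if ! $runner "$script"; then
            echo "FAILED: $script (later scripts were not run)" >&2
            return 1
        fi
    done
    echo "all install scripts finished"
}
```

After sourcing this in the directory that contains the scripts, calling run_install_scripts runs them for real, while RUNNER=echo run_install_scripts only prints the script names it would run.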
5) On the Master:
sudo cp -r /etc/hadoop/conf.empty /etc/hadoop/conf.my_cluster
sudo cp -r /etc/hbase/conf.dist/ /etc/hbase/conf.benchmark_cluster
Configure everything according to the CDH5 Installation Guide: http://www.cloudera.com/content/cloudera-content/cloudera-docs/CDH5/latest/CDH5-Installation-Guide/CDH5-Installation-Guide.html
6) On every node (including the Master):
Run the configuration distribution script:
sudo python 5_hadoop_conf.py
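Format HDFS once, on the NameNode only (the CDH5 guide runs this command as the hdfs user, i.e. prefixed with sudo -u hdfs):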
hdfs namenode -format
Start the HDFS services:
for x in `cd /etc/init.d ; ls hadoop-hdfs-*` ; do sudo service $x start ; done
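After starting the services, it can help to check that each one is actually running. The sketch below generalizes the loop above to any action that the service command accepts (start, status, stop) and reports which services failed. The function name hdfs_services and the INIT_DIR and SERVICE_CMD variables are assumptions introduced for illustration, so the loop can be tried against a scratch directory; they are not part of the repository or of CDH.

```shell
#!/bin/sh
# Sketch: apply one action (start, status, stop) to every hadoop-hdfs-* init script.
# INIT_DIR and SERVICE_CMD are hypothetical overrides for dry runs.
hdfs_services() {
    action="$1"
    init_dir="${INIT_DIR:-/etc/init.d}"
    service_cmd="${SERVICE_CMD:-sudo service}"
    failed=0
    for path in "$init_dir"/hadoop-hdfs-*; do
        [ -e "$path" ] || continue      # the glob matched nothing
        name=$(basename "$path")
        # $service_cmd is left unquoted so "sudo service" splits into two words
        if ! $service_cmd "$name" "$action"; then
            echo "$name: $action failed" >&2
            failed=1
        fi
    done
    return "$failed"
}
```

For example, hdfs_services start followed by hdfs_services status starts every HDFS service installed on the node and then prints the status of each one; the return code is non-zero if any service reported a failure.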