Step 4 Create a Cluster

After you have successfully created your virtual machine and have installed Ambari, you can now create your cluster.

1. Launch Install Wizard
CreateCluster1
I picked the cluster name “sandbox”.
CreateCluster2
Pick HDP 2.4
CreateCluster3
Note: You’ll need to ssh to the virtual machine as root to get the private key.

cat ~/.ssh/id_dsa

Be sure to use the fully qualified name of hdb.localdomain and the private key from the root account.
CreateCluster4
Installation will take a while.
CreateCluster5
CreateCluster6
In this step, you can pick the services you want for your virtual machine. HAWQ only needs HDFS to run but you can add more services. You can also use YARN with HAWQ for resource management. PXF also supports Hive and Hbase so install those services if you want to test this integration.
CreateCluster7
There is only a single host so this part is easy!
CreateCluster8
CreateCluster9

Under the HDFS tab in the Advanced hdfs-site section, make the following change:

dfs.block.access.token.enable=false

AdvancedHDFSSite

Under the HDFS tab in the Custom hdfs-site section, make the following additions:

dfs.block.local-path-access.user=gpadmin
dfs.client.socket-timeout=300000000
dfs.client.use.legacy.blockreader.local=false
dfs.datanode.handler.count=60

CustomHDFSSite

Under the NameNode Server Threads, change the value to 600.
NameNodeThreads

Under the HDFS tab in the Advanced core-site section, make the following change:

ipc.client.connection.maxidletime=3600000

AdvancedCoreSite

Under the HDFS tab in the Custom core-site section, make the following additions:

ipc.client.connect.timeout=300000
ipc.server.listen.queue.size=3300

CustomCoreSite

Now click on the HAWQ tab and change “Segment Memory Usage Limit” to 4GB.
Click on the Advanced tab and enter the HAWQ Master Port as 5432.
Set the HAWQ System User Password to “changeme”.

Click Next and proceed. You’ll be greeted with a screen warning but proceed anyway.
ProceedAnyway

Before clicking Deploy, make an addition to your /etc/hosts file in the VM. This step is important and required for HAWQ to install properly. Also, this is a “feature” of installing on a single node and isn’t a problem in a multi-node cluster. Use the IP address found with ifconfig.

echo "192.168.175.135 hdb.localdomain" >> /etc/hosts

Deploy
DeploySuccess
DeploySummary