Sap HANA VORA 1.2 consists of the two following main components:
- SAP HANA Vora Engine:
SAP HANA Vora instances hold data in memory and boost the performance. - SAP HANA Vora Spark Extension Library:
- Allows SAP HANA Vora to be accessed through Spark.
- It also makes available additional functionality, such as a hierarchy implementation.
These two components are included inside the Vora packages which are available as follows and you could choose based on your Hadoop distribution.
- SAP HANA Vora for Ambari: VORA_AM<version>.TGZ
- SAP HANA Vora for Cloudera: VORA_CL<version>.TGZ
- SAP HANA Vora for MapR: VORA_MR<VERSION>.TGZ
To download the packages: https://support.sap.com/swdc
Vora 1.2 supports the following operation systems:
- SUSE Linux Enterprise Server (SLES) 11 SP3 (see compatibility pack details below)
- Red Hat Enterprise Linux (RHEL) 6.7 (see compatibility pack details below) and 7.2
You should also follow the Installation and Administration guide for the compatibility pack installations: http://help.sap.com/hana_vora
Following table shows you the combination of operating system, cluster provisioning tool, and Hadoop distribution:
Remember that the minimal setup for Vora 1.2 is :
- 4 cores
- 8 GB of RAM
- 20 GB of free disk space for HDFS data
- Note: You can’t install Vora 1.2 on a single node
In order to have the Vora 1.2 running, you have to have the following Vora services installed and configured and I will walk you through their installment and configurations on the clusters.
- SAP HANA Vora Base: Vora libraries and binaries. Installs on all hosts.
- SAP HANA Vora Catalog: Vora distributed metadata store. Installs on one node and usually on DLOG node.
- SAP HANA Vora Discovery Service: Manages service registrations and installs on all nodes. In server mode installs on 3 nodes(Max 7) and selects the bootstrapping host. In client mode, installs on all remaining nodes. Note: You can’t install DS server and client both on the same node.
- SAP HANA Vora Distributed Log: Provides persistence for Vora Catalog. Install usually on the master node(5 nodes recommended).
- SAP HANA Vora Thriftserver: Gateway compatible with Hive JDBC connector. ,Usually install on the jumpbox where DS, DLOG and Catalog servers are not installed.
- SAP HANA Vora Tools: Web UI for Vora 1.2 modeler. Install on the same node as Vora Thriftserver.
- SAP HANA Vora V2Server: Vora Engine. Installs on all worker nodes ( Datanodes)
The installation and configuration should either happen at the same time for all the services or you should follow the following order to make sure of handling the dependencies:
The following schema shows you the architecture for clusters with 4 nodes and the assignment of different Vora 1.2 services which we will set up in this document:
Step 1) Adding Vora Base: You have to add Vora base on all nodes and they have to be installed as clients as shown below.
— no extra configuration is needed.
— you can click on the proceed button as is shown below even if you get the error since you’re not using MapReduce jobs:
— Click on complete.
— notice that the Vora base is now added to your services:
Step 2) Now we add Vora discovery as 3 Vora discovery servers and one client.
Adding the Vora Discovery client:
-- Vora discovery servers need extra configurations:
— in vora_discovery_bootstrap add the master DNS
— in vora_discovery_servers add your server DNS’s
— proceed and deploy the service
notice that vora discovery service is now installed:
Step 3) Now we add Vora Distributed Log service :
— we install DLOG servers on the same machines where we installed our Discovery Servers.
— No extra configurations are needed.
— click Next-> click Proceed anyway—>click Complete
— Notice that vora DLOG is now added to the services:
Step 4) Next step is to install Vora Catalog:
— Install Catalog on your master node:
— click Next->click Proceed anyway—>click Complete
— Notice that vora Catalog is added to the services:
Step 5) Time to install V2Server as shown below:
— extra configuration: add the Vora V2Server Worker service to worker1 and worker2 nodes and remove it from your server node.
— click Next->click Proceed anyway—>click Complete
— Notice that vora V2Server is now added to the services:
Step 6) Time to install Vora Thriftserver and Vora Tools:
— you have to add more configurations to the thrift server as it’s shown below:
— add vora_thriftserver_java_home = /usr/lib/jvm/java
— add vora_thriftserver_spark_home = /usr/hdp/2.3.4.0-3485/spark
— click Next-> click Proceed anyway—>click Complete
— Notice that vora thriftServer and Vora tools are now added to the services:
Now click on HDFS, MaprReduce2 and YARN services which are in red and restart all affected as shown below:
Congratulations!! You now have Vora 1.2 services installed on your clusters.
Step 7) To validate your Vora:
— SSH to your worker1 node and run:
— source /etc/vora/vora-env.sh
— $VORA_SPARK_HOME/bin/start-spark-shell.sh
and you should now see the SQL contexts bieng available.