MANDY SANDHU’S BLOG

Oracle Big Data Cloud Service – Introduction

Big data is a topic that everyone seems to be talking about, yet many of us still wonder what exactly big data is and which technology provider to use. I have written a couple of blog posts on Apache Hadoop, the Cloudera distribution, and the AWS EMR service. In this post, I’ll go through Oracle Big Data Cloud Service and what is included in the service.

What is Oracle Big Data Cloud Service?

Oracle Big Data Cloud Service is an automated cloud service for big data processing. It is optimized to run a range of workloads, from Hadoop-only workloads (ETL, Spark, Hive) to interactive SQL queries using SQL-on-Hadoop tools. Here are some key features of Oracle Big Data Cloud Service:

Deployment Models:

Oracle provides several big data deployment models.

Oracle BDCS Architecture overview:

Oracle Big Data Cloud Service is a collection of nodes. There are three types of nodes.

The software included in Oracle BDCS is:

How to create an Oracle BDCS instance:

Log in to your Oracle Cloud account and, from the dashboard, click Create Instance. The Create Instance dialog lists all the services that you can create in your Oracle Cloud account for your region. Click Create for the Big Data service.

The Create New Oracle Big Data Cloud Service Instance wizard is displayed. Fill in the following information.

A box pops up and asks for confirmation: I have chosen to create an instance of Big Data Cloud Service, do I wish to continue? I click Create. The administrator is notified by email once the instance has been created.

The status of the instance is now active. We are ready to proceed to the next step and create the cluster.

Create the cluster:

Go to the Service Instance section of the Service Details: Oracle Big Data Cloud Service page and click the Open Service Console link, as highlighted in the screenshot above.

Connect to cluster:

You can connect to the cluster by using PuTTY on Windows, but you need the SSH private key. If you created a new key pair during Oracle BDCS instance creation, the first thing you need to do is convert the key to a format that PuTTY can use. Have a look at this article.

Now find the lowest IP address among the cluster nodes and connect to it using PuTTY. I choose the lowest IP address because it is most likely the primary node. How to connect using a private key is explained in my old post here.
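If the cluster has many nodes, picking the lowest address by eye is error-prone, because a plain text sort would put 10.0.0.12 before 10.0.0.3. Here is a minimal sketch of a numeric, octet-by-octet sort. The function name lowest_ip and the sample addresses are my own placeholders, not something from the Oracle console.

```shell
# Print the numerically lowest IPv4 address from a list given on stdin.
# Each octet is sorted as a number, so 10.0.0.3 comes before 10.0.0.12.
lowest_ip() {
  sort -t . -k1,1n -k2,2n -k3,3n -k4,4n | head -n 1
}

# Hypothetical node addresses; replace them with the ones listed for your cluster.
printf '%s\n' 10.0.0.12 10.0.0.3 10.0.0.21 | lowest_ip
```

This only finds the lowest address. Whether that node really is the primary node still has to be confirmed on the cluster itself.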

I connected successfully using PuTTY. Now check the cluster setup using bdacli. As I mentioned, the node with the lowest IP address is the primary node, and the output below reflects that.

Now I would like to connect to Cloudera Manager, so open the firewall from your primary node. For simplicity I opened access to all IP addresses (0.0.0.0/0). For your organization, please review this and allow only specific IP addresses.

### Access to Cloudera Manager
bash-4.1# bdacli bdcs_whitelist allow cloudera_manager 0.0.0.0/0
BDCS Network Services Firewall & Whitelist
Changes saved.
### Access to Hue
bash-4.1# bdacli bdcs_whitelist allow hue 0.0.0.0/0
BDCS Network Services Firewall & Whitelist
Changes saved.
bash-4.1#
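For a real organization you would replace 0.0.0.0/0 with your own address range. The sketch below only prints the two whitelist commands for a given CIDR block so you can review them before running them on the primary node. It reuses the exact command form shown above. The helper name whitelist_cmds and the range 203.0.113.0/24 (a documentation-only range) are placeholders I made up.

```shell
# Print (do not run) the whitelist commands for one CIDR block,
# using the same "bdacli bdcs_whitelist allow <service> <cidr>" form as above.
whitelist_cmds() {
  cidr="$1"
  for service in cloudera_manager hue; do
    echo "bdacli bdcs_whitelist allow $service $cidr"
  done
}

# Placeholder range; substitute your organization's public CIDR block.
whitelist_cmds 203.0.113.0/24
```

Copy the printed lines into the root shell on the primary node once you are happy with the range.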

Open Cloudera Manager:

You can access Cloudera Manager from the Oracle Big Data Cloud Service console or directly from a browser.

The login page appears. Enter the username admin and the password that you specified during cluster creation.

Administrative Tasks:

Adding Nodes to a cluster:

We can extend a cluster by adding permanent Hadoop nodes, edge nodes, and cluster compute nodes. The recommendation is to add nodes in one-node increments, up to 60 nodes in the cluster. You can add permanent nodes to a cluster after it is created and started.

Go to the service instance that we created for big data, open the menu, and choose the option to modify the instance and add nodes.

Thanks

Mandy

 
