Putting in the Cloudera CDP Personal Cloud Base on IBM Cloud with Ansible – IBM Developer


On this second weblog publish in our collection, we speak about Cloudera Knowledge Platform for IBM Cloud Pak for Knowledge. Very like IBM Cloud Pak for Knowledge, the Cloudera Knowledge Platform is a knowledge and AI platform that may be put in on-premises. In truth, many IBM prospects are additionally Cloudera prospects. IBM Cloud Pak for Knowledge is constructed on Crimson Hat OpenShift and breaks down silos to allow your whole information customers to collaborate from a single, unified interface.

Like most trendy platforms, set up is rather more than simply unzipping a file or clicking a “subsequent” button on a wizard. Fortunately, the Cloudera crew lately introduced it will open supply Ansible playbooks that we are going to leverage to make this entire course of simpler for our personal functions.

This weblog publish is meant to share our expertise in utilizing Ansible to put in Cloudera Knowledge Platform on IBM Cloud. It’s value mentioning that the automation used is open supply and follows the perfect practices beneficial by the Cloudera Skilled Companies crew.

The environment

We used Digital Servers on IBM Cloud because the goal for our Cloudera Knowledge Platform set up. A complete of 8 VMs, every 32 vCPU by 128 GB of RAM working CentOS, have been chosen. We additionally had one other Home windows-based VM to run Lively Listing, to finest mimic what prospects most frequently use of their environments. And a single bastion node was provisioned to simplify the communication between the consumer and the hosts. IBM Cloud Pak for Knowledge was additionally provisioned, however the particulars of which can be out of scope for this publish.

List of virtual servers on IBM Cloud
Determine 1. Listing of digital servers on IBM Cloud

When put collectively, the environment resembled the structure diagram under.

Diagram of environment used for integrating Cloudera Data Platform and IBM Cloud Pak for Data
Determine 2. An structure diagram of the atmosphere used for integrating Cloudera Knowledge Platform and IBM Cloud Pak for Knowledge

The Ansible playbooks

As talked about earlier, to put in Cloudera Knowledge Platform on IBM Cloud, we leveraged present Ansible playbooks that have been open sourced.

The set up takes roughly 30-60 minutes to finish, relying on machine specs. The longest half is when the installer pulls down the mandatory artifacts and pushes them to every host.

Cloudera Manager installing Cloudera Data Platform
Determine 3. A screenshot of the Cloudera Supervisor putting in Cloudera Knowledge Platform

Subsequent steps

Should you’re an IBMer trying to get your palms on Cloudera, or involved in studying extra about utilizing Ansible playbooks to put in Cloudera, take a look at the GitHub repo. Should you loved this, take a look at A technical deep-dive on integrating Cloudera Knowledge Platform and IBM Cloud Pak for Knowledge. You too can study extra in regards to the Cloudera Knowledge Platform for IBM Cloud Pak for Knowledge joint providing.


Leave a Reply

Your email address will not be published. Required fields are marked *