




Understand The Cloudera Cluster Internals



This post explains the internals of a Cloudera cluster. Because Cloudera is a pre-packaged Big Data platform, it is useful to understand what runs where, compared with a manual vanilla installation.

  • The Cloudera distribution consists of two main components: the SCM server (cloudera-scm-server) and the SCM agent (cloudera-scm-agent).
  • cloudera-scm-server ideally runs on the main node.
  • cloudera-scm-agent runs on all the other machines (the slave/data nodes) in the cluster.
  • The SCM agent also runs on the SCM server node.
  • The SCM agents send heartbeats to the SCM server. This is how the SCM server knows which nodes are functioning; if a node stops sending heartbeats, the SCM server treats it as down.
  • The SCM agents also receive instructions from the SCM server about the roles associated with the different running services.
  • Use the following command to check the status of the Cloudera server:

**$ service cloudera-scm-server status**
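
The heartbeat idea described in the list above can be pictured with a small shell sketch. This is only an illustrative model, not Cloudera's actual implementation: the function names `node_state` and `report_nodes`, the 30-second `HEARTBEAT_TIMEOUT`, and the sample hostnames are all assumptions made up for the example.

```shell
#!/bin/sh
# Illustrative model only: this is NOT how cloudera-scm-server is implemented.
# HEARTBEAT_TIMEOUT is an assumed value picked for the example.
HEARTBEAT_TIMEOUT=30

# Print "up" or "down" for one node.
#   $1 = current time in seconds
#   $2 = time of the node's last heartbeat in seconds
node_state() {
  if [ $(( $1 - $2 )) -gt "$HEARTBEAT_TIMEOUT" ]; then
    echo "down"   # no heartbeat within the timeout window
  else
    echo "up"
  fi
}

# Read "hostname last_heartbeat" pairs from stdin and report each node's state.
#   $1 = current time in seconds
report_nodes() {
  while read -r host last_seen; do
    echo "$host $(node_state "$1" "$last_seen")"
  done
}

# Example: at t=100, node1 last reported at t=95 and node2 at t=40.
printf '%s\n' "node1 95" "node2 40" | report_nodes 100
```

With the assumed 30-second timeout, node1 is reported as up and node2 as down, which mirrors how a server could decide that a silent node has failed. On a real cluster, the agent on a given node can usually be checked in the same way as the server, with `service cloudera-scm-agent status`.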
