← Back to team overview

bigdata-dev team mailing list archive

[Bug 1414080] [NEW] Race condition on relation when bundling hdp-hadoop

 

Samuel Cozannet (samuel-cozannet) has assigned this bug to you for hdp-pig in Juju Charms Collection:

I build a demo to run a Machine Learning workshop based on a blog post
made by Hortonworks.

There are 2 sides in the demo: 
* Bundle: https://github.com/SaMnCo/bundle-flight-delay-demo
* Charm: https://github.com/SaMnCo/charm-flight-delay-demo

The bundle comprises:

* YARN Master
* 4x compute nodes
* PIG colocated on YARN
* ipython-notebook colocated on YARN

When I deploy manually using the 00-deploy file provided in the bundle,
everything goes well. However, when trying to deploy the bundle, it
fails at relation creation.

In the attached juju log collected at deployment, line 11347, we see
Hadoop crashing. Then 11402, the crash expands. 11418, we discover the
resource manager is not ready.

I can reproduce the same behavior with a simpler bundle comprising:

* YARN Master
* 4x compute nodes
* PIG colocated on YARN

So it seems to really be related to PIG/YARN/Compute nodes relation.

I can also reproduce the same behavior from juju-deployer, which fails
with :

2015-01-23 16:51:21 [INFO] deployer.import: Adding relations...
2015-01-23 16:51:23 [INFO] deployer.import:  Adding relation c00-yarn-master:namenode <-> c02-compute-node:datanode
2015-01-23 16:51:24 [INFO] deployer.import:  Adding relation c00-yarn-master:resourcemanager <-> c02-compute-node:nodemanager
2015-01-23 16:51:25 [INFO] deployer.import:  Adding relation c04-hdp-pig:namenode <-> c00-yarn-master:namenode
2015-01-23 16:51:25 [INFO] deployer.import:  Adding relation c04-hdp-pig:resourcemanager <-> c00-yarn-master:resourcemanager
2015-01-23 16:51:26 [INFO] deployer.import:  Adding relation c03-flight-delay-demo:notebook <-> c01-ipython-notebook:notebook
2015-01-23 16:51:26 [DEBUG] deployer.import: Waiting for relation convergence 60s
2015-01-23 16:52:29 [ERROR] deployer.env: The following units had errors:
   unit: c00-yarn-master/0: machine: 1 agent-state: error details: hook failed: "namenode-relation-joined"
2015-01-23 16:52:29 [INFO] deployer.cli: Deployment stopped. run time: 583.23

Let me know if you need anything else! 
Thanks!

** Affects: hdp-hadoop (Juju Charms Collection)
     Importance: Undecided
         Status: New

** Affects: hdp-pig (Juju Charms Collection)
     Importance: Undecided
     Assignee: Juju Big Data Development (bigdata-dev)
         Status: New


** Tags: hadoop ipython pig spark yarn
-- 
Race condition on relation when bundling hdp-hadoop
https://bugs.launchpad.net/bugs/1414080
You received this bug notification because you are a member of Juju Big Data Development, which is a bug assignee.