Wednesday, February 13, 2013

Storm "No such process" errors

When you see this:

kill 10747: No such process
kill 10751: No such process
kill 10775: No such process
kill 10761: No such process
kill 10745: No such process

from Storm, it usually means the supervisor or nimbus is acting on stale state information. Either the cluster was shut down improperly, without its topologies being killed first, or state information has gone missing, for example when a remote supervisor was shut down while it still had running processes that Storm expects to be there.

To fix it, go to the /supervisor directory where the metadata is stored and delete the subdirectories under it. Then restart the supervisors on all the nodes.
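If you have to do this on many nodes, the cleanup step can be scripted. Below is a minimal Python sketch, not an official Storm tool. The function name clear_supervisor_state is made up for illustration, and the path in the usage comment is only a placeholder: the supervisor directory normally sits under whatever storm.local.dir points to in your storm.yaml, so check that setting first. It removes only the subdirectories and leaves any plain files alone.

```python
import shutil
from pathlib import Path


def clear_supervisor_state(supervisor_dir):
    """Delete every subdirectory directly under supervisor_dir.

    Returns the sorted names of the subdirectories that were removed.
    Plain files and symlinks in supervisor_dir are left untouched.
    """
    root = Path(supervisor_dir)
    removed = []
    for child in sorted(root.iterdir()):
        # Only real directories are removed; symlinks are skipped so the
        # script never follows a link out of the supervisor directory.
        if child.is_dir() and not child.is_symlink():
            shutil.rmtree(child)
            removed.append(child.name)
    return removed


# Usage (placeholder path, substitute your own <storm.local.dir>/supervisor):
#   clear_supervisor_state("/path/to/storm-local/supervisor")
```

Run it on each node, then restart the supervisors as described above.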

