Force abort of stuck workunit?
I have a workunit that is stuck alternating between aborting and compiling. Is there some way to force it to abort so I can delete it and allow the system to process other jobs.
This is on a single node cluster.
I tried going through the command line using both ECL and ECLPlus but neither of them get the job done.
This is on a single node cluster.
I tried going through the command line using both ECL and ECLPlus but neither of them get the job done.
- BGehalo
- Posts: 13
- Joined: Thu Sep 28, 2017 8:06 pm
In ECL Watch:
- * Open the list of Workunits
* Select the Workunit you want to abort. The Abort action button is now enabled.
* Press the Abort button.
- JimD
- Posts: 160
- Joined: Wed May 18, 2011 1:35 pm
Thanks Jim but that wasn't working either, the workunits were completely unresponsive to client commands whether through ECL, ECL Watch, or ECL Plus.
They worked themselves out in time, just took overnight. I imagine the only way to force an abort when the client tools are unresponsive is to access the underlying OS directly to issue something like a kill -9 command. I'm going to play around with my VM to see what I can do.
They worked themselves out in time, just took overnight. I imagine the only way to force an abort when the client tools are unresponsive is to access the underlying OS directly to issue something like a kill -9 command. I'm going to play around with my VM to see what I can do.
- BGehalo
- Posts: 13
- Joined: Thu Sep 28, 2017 8:06 pm
If this happens again and you are able to reproduce it, you should report it in Jira -- our issue tracking system:
JIRA (https://track.hpccsystems.com).
regards,
Jim
JIRA (https://track.hpccsystems.com).
regards,
Jim
- JimD
- Posts: 160
- Joined: Wed May 18, 2011 1:35 pm
4 posts
• Page 1 of 1
Who is online
Users browsing this forum: No registered users and 1 guest