Sat Aug 18, 2018 11:42 pm
Login Register Lost Password? Contact Us


Force abort of stuck workunit?

Comments or questions specific to the features of ECL Watch

Mon Oct 09, 2017 8:36 pm Change Time Zone

I have a workunit that is stuck alternating between aborting and compiling. Is there some way to force it to abort so I can delete it and allow the system to process other jobs.

This is on a single node cluster.

I tried going through the command line using both ECL and ECLPlus but neither of them get the job done.
BGehalo
 
Posts: 12
Joined: Thu Sep 28, 2017 8:06 pm

Tue Oct 10, 2017 8:09 pm Change Time Zone

In ECL Watch:

    * Open the list of Workunits
    * Select the Workunit you want to abort. The Abort action button is now enabled.
    * Press the Abort button.
Abort.jpg
Abort.jpg (56.57 KiB) Viewed 672 times
JimD
 
Posts: 132
Joined: Wed May 18, 2011 1:35 pm

Tue Oct 10, 2017 8:17 pm Change Time Zone

Thanks Jim but that wasn't working either, the workunits were completely unresponsive to client commands whether through ECL, ECL Watch, or ECL Plus.

They worked themselves out in time, just took overnight. I imagine the only way to force an abort when the client tools are unresponsive is to access the underlying OS directly to issue something like a kill -9 command. I'm going to play around with my VM to see what I can do.
BGehalo
 
Posts: 12
Joined: Thu Sep 28, 2017 8:06 pm

Wed Oct 11, 2017 5:38 pm Change Time Zone

If this happens again and you are able to reproduce it, you should report it in Jira -- our issue tracking system:

JIRA (https://track.hpccsystems.com).

regards,

Jim
JimD
 
Posts: 132
Joined: Wed May 18, 2011 1:35 pm


Return to ECL Watch

Who is online

Users browsing this forum: No registered users and 1 guest

cron