MyThor is not running in cluster

Topics related to recommendations or questions on the design for HPCC Systems clusters

Thu Feb 19, 2015 3:52 pm

Hi,
As I mentioned earlier, whenever I request VMs I get a new set of machines, so the IP addresses have changed this time. Below is a mapping of the old to the new IP addresses. I have also attached the new environment file for your reference.

X.X.X.221 => X.X.X.167
X.X.X.190 => X.X.X.70
X.X.X.255 => X.X.X.240

I executed the command below on X.X.X.190 and X.X.X.221:
sudo /opt/HPCCSystems/bin/daliadmin X.X.X.221 dfsgroup mythor
The output was X.X.X.221 on both machines.

Regards,
Lakshman Naresh C A
Attachments
environment.txt
(37.88 KiB) Downloaded 126 times
lakshmannaresh
 
Posts: 15
Joined: Tue Feb 03, 2015 5:20 am

Thu Feb 19, 2015 4:12 pm

Hi,

Can you send us the output from these two commands:

ifconfig
iptables -L

on each of the 3 machines (.221, .190, .255)

Are .221, .190, and .255 the private or the public IPs?

thanks,
mark
mkellyhpcc
 
Posts: 15
Joined: Mon Mar 10, 2014 2:51 pm

Mon Feb 23, 2015 6:09 pm

Hi Mark,
Below is the mapping for the new set of IPs:
X.X.X.167 => X.X.X.221 => 152.X.X.67
X.X.X.70 => X.X.X.190 => 152.X.X.51
X.X.X.240 => X.X.X.255 => 152.X.X.7

.221, .190, .255 are public IPs.

I have attached the output of the two commands and the environment.xml file for this set of IPs.

Regards,
Lakshman Naresh
Attachments
iptables.txt
iptables -L output
(3.67 KiB) Downloaded 120 times
ifconfig.txt
ifconfig output
(4.16 KiB) Downloaded 123 times
environment.txt
Environment.xml
(37.9 KiB) Downloaded 125 times
lakshmannaresh
 
Posts: 15
Joined: Tue Feb 03, 2015 5:20 am

Wed Feb 25, 2015 10:01 pm

Hi,

It appears you want to use the public IP addresses, but these do not belong to the first interface listed in the ifconfig output. Can you edit the environment.conf file on all machines and change the interface line to:

interface=eth1

That way the interface matches the IP addresses you have specified in the configs.
Stop HPCC, make this change for all 3 machines and then start HPCC up again and let us know the status and log file(s).
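
Before restarting, it can help to confirm on each node what the conf file actually says. The following is a minimal sketch, not an official HPCC tool: the function names get_interface and check_interface are made up for illustration, and the conf path in the usage note is an assumption about a default install.

```shell
# Print the value of the last uncommented "interface=" line in a conf file.
# Lines starting with "#" do not match, so a commented-out setting is ignored.
get_interface() {
    local conf="$1"
    grep -E '^[[:space:]]*interface[[:space:]]*=' "$conf" \
        | tail -n 1 | cut -d= -f2- | tr -d '[:space:]'
}

# Compare the configured interface with the expected one (for example eth1).
# Returns 0 on a match and 1 otherwise.
check_interface() {
    local conf="$1"
    local expected="$2"
    local actual
    actual="$(get_interface "$conf")"
    if [ "$actual" = "$expected" ]; then
        echo "OK: interface=$actual"
    else
        echo "MISMATCH: interface=${actual:-<unset>} (expected $expected)"
        return 1
    fi
}
```

Assuming the file lives at /etc/HPCCSystems/environment.conf, running `check_interface /etc/HPCCSystems/environment.conf eth1` on each of the 3 machines shows whether the edit is in place.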

thanks,
mark
mkellyhpcc
 
Posts: 15
Joined: Mon Mar 10, 2014 2:51 pm

Mon Mar 02, 2015 8:37 pm

Hi Mark,
Even after changing the interface in the config file, the problem was not resolved. I have attached the log files for your reference.

Regards,
Lakshman Naresh
Attachments
thorslave.1.2015_02_03.log
Thor Slave
(1.16 KiB) Downloaded 125 times
thormaster.2015_02_03.log
Thor Master
(12.12 KiB) Downloaded 125 times
environment.txt
Environment.xml
(37.88 KiB) Downloaded 118 times
lakshmannaresh
 
Posts: 15
Joined: Tue Feb 03, 2015 5:20 am

Mon Mar 02, 2015 9:58 pm

Hi,

The xml shows the interface set to *. It should be set to eth1.
These files all look like the ones from last month (02-03 instead of 03-02).
Can you double-check the files? Make sure interface=eth1 is specified in the conf file on all 3 nodes before starting HPCC.

thanks,
mark
mkellyhpcc
 
Posts: 15
Joined: Mon Mar 10, 2014 2:51 pm

Tue Mar 03, 2015 3:51 pm

Hi,
Sorry, I uploaded the wrong files. Please find the latest log files attached.

Regards,
Lakshman
Attachments
thorslave.1.2015_03_02.log
(4.03 KiB) Downloaded 120 times
thormaster.2015_03_02.log
(8.93 KiB) Downloaded 119 times
environment.txt
(37.89 KiB) Downloaded 121 times
lakshmannaresh
 
Posts: 15
Joined: Tue Feb 03, 2015 5:20 am

Tue Mar 03, 2015 4:23 pm

Hi,

It appears from these logs that thor is up and running ok.
There is a failure at first, but then it tries again and looks ok the second time. The failure message was:

0000001D 2015-03-02 15:17:38.240 5488 5488 "CTRL-C detected"

So after about 30 seconds of thor master and slave being up and ok a CTRL-C event was detected, but then thor restarted automatically.
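
To see whether the same event shows up in other Thor logs, something like the sketch below could be used. It assumes every entry follows the field layout of the line quoted above (sequence, date, time, two ids, quoted message). The function name find_ctrl_c is hypothetical.

```shell
# Print the date and time of every "CTRL-C detected" entry in a Thor log.
# Assumes the field layout of the quoted line: fields 2 and 3 are date and time.
find_ctrl_c() {
    local log="$1"
    # -F treats the pattern as a fixed string, including the double quotes.
    grep -F '"CTRL-C detected"' "$log" | awk '{ print $2, $3 }'
}
```

Run against a log containing the line above, `find_ctrl_c thormaster.2015_03_02.log` would print `2015-03-02 15:17:38.240`. No output means the message does not appear in that file.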
How are you starting up HPCC?

Are you able to use HPCC now?

thanks,
mark
mkellyhpcc
 
Posts: 15
Joined: Mon Mar 10, 2014 2:51 pm

Tue Mar 03, 2015 5:17 pm

Can you post the start_thor log file found under /var/log/HPCCSystems/mythor that corresponds to March 2, 15:17?

We're trying to figure out why the control-C got caught in the first place. But again it seems that it's up and working.
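
A small helper can narrow down which start_thor log to post. This is only a sketch: it assumes the start_thor_MM_DD_YYYY_HH_MM_SS.log naming seen in the attachments in this thread, and the function name start_thor_logs_for is made up for illustration.

```shell
# List start_thor logs in a directory for a given date (YYYY-MM-DD).
# Assumes file names of the form start_thor_MM_DD_YYYY_HH_MM_SS.log.
start_thor_logs_for() {
    local dir="$1"
    local date="$2"
    local y m d rest f
    # Split YYYY-MM-DD into its parts with plain parameter expansion.
    y="${date%%-*}"
    rest="${date#*-}"
    m="${rest%%-*}"
    d="${rest#*-}"
    for f in "$dir"/start_thor_"$m"_"$d"_"$y"_*.log; do
        # An unmatched glob stays literal, so only print files that exist.
        if [ -e "$f" ]; then
            printf '%s\n' "$f"
        fi
    done
}
```

For example, `start_thor_logs_for /var/log/HPCCSystems/mythor 2015-03-02` lists that day's candidates. The time of day is in the file name, so the one closest to 15:17 can be picked by eye.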
mgardner
 
Posts: 13
Joined: Tue Jan 20, 2015 9:30 pm

Wed Mar 04, 2015 5:24 pm

Hi,
I have attached the log files.

Regards,
Lakshman
Attachments
start_thor_03_04_2015_11_49_21.log
(740 Bytes) Downloaded 120 times
start_thor_03_04_2015_11_47_51.log
(826 Bytes) Downloaded 120 times
environment.txt
(37.88 KiB) Downloaded 115 times
lakshmannaresh
 
Posts: 15
Joined: Tue Feb 03, 2015 5:20 am
