Upgrade Issue from 1.2.13 to 1.2.14

Started by Christian021, May 21, 2014, 04:34:14 PM

Christian021

Hello,

Today I installed version 1.2.14, but now the NetXMS Core process does not start.
The error message when starting the process: Error 1067: The process terminated unexpectedly.

I checked the database format with nxdbmgr upgrade -t. The response was: your database format is up to date

So what can I do?

Best regards

Victor Kirhenshtein

Hi!

Try running netxmsd in the foreground:

netxmsd -D9

and check the output.

Best regards,
Victor

Christian021

Thanks for your answer, Victor.

I did that and got a crash dump. I will send it to dump(at)netxms.org tomorrow.
So far it is an EXCEPTION C0000005 (Access violation) at 00779259.

The rest of the information will be in the email.

Best regards,
Christian

Christian021

Hi,

Victor sent me a file to replace, and that solved the problem.

Thank you Victor for your fast and professional help.

Thread is now closed.

sperlm

Hello, what was the solution?

I have problems with the upgrade from 1.2.12 to 1.2.14: the server shuts down after a while.
It always happens while I am trying to manage things: twice while uploading an image, once while opening scripts, and once while opening a network map. The actions themselves complete, but the server shuts down a few seconds later.

The only meaningful output in the debug log is:
[27-May-2014 16:48:31.062] [WARN ] Failed to execute filter script for template object 26027 "H3C (-25506-,sys65) - filtr ok - bez DCI": Error 11 in line 13: Function not found
[27-May-2014 16:48:31.062] [WARN ] Failed to execute filter script for template object 26029 "H3C (-25506-,sys47) - filtr ok - bez DCI": Error 11 in line 4: Function not found
[27-May-2014 16:48:34.473] [WARN ] Failed to execute filter script for template object 26027 "H3C (-25506-,sys65) - filtr ok - bez DCI": Error 11 in line 13: Function not found
[27-May-2014 16:48:34.474] [WARN ] Failed to execute filter script for template object 26029 "H3C (-25506-,sys47) - filtr ok - bez DCI": Error 11 in line 4: Function not found
[27-May-2014 16:48:50.606] [WARN ] Failed to execute filter script for template object 26027 "H3C (-25506-,sys65) - filtr ok - bez DCI": Error 11 in line 13: Function not found
[27-May-2014 16:48:50.607] [WARN ] Failed to execute filter script for template object 26029 "H3C (-25506-,sys47) - filtr ok - bez DCI": Error 11 in line 4: Function not found
Segmentation fault


There seem to be some problems with scripts and filters (which were OK in 1.2.12).

I tried searching the forum for the segmentation fault problem but found nothing definite... :(

Attaching whole log.
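
For anyone reading a similar log: the warnings above repeat for the same two template objects, and a short script can make that visible before digging further. This is only a minimal sketch, assuming the warning format is exactly as quoted above; the function name `summarize_filter_warnings` and the regular expression are illustrative and not part of NetXMS.

```python
import re
from collections import Counter

# Pattern assumed from the warning lines quoted in this post (not an official format).
WARNING_RE = re.compile(
    r'Failed to execute filter script for template object (\d+) '
    r'"(.*)": Error (\d+) in line (\d+): (.*)$'
)

def summarize_filter_warnings(lines):
    """Count warnings per (object id, template name, error code, script line, message)."""
    counts = Counter()
    for line in lines:
        match = WARNING_RE.search(line)
        if match:
            obj_id, name, code, line_no, message = match.groups()
            key = (int(obj_id), name, int(code), int(line_no), message.strip())
            counts[key] += 1
    return counts

# Sample lines copied from the log above.
sample = [
    '[27-May-2014 16:48:31.062] [WARN ] Failed to execute filter script for template object 26027 "H3C (-25506-,sys65) - filtr ok - bez DCI": Error 11 in line 13: Function not found',
    '[27-May-2014 16:48:31.062] [WARN ] Failed to execute filter script for template object 26029 "H3C (-25506-,sys47) - filtr ok - bez DCI": Error 11 in line 4: Function not found',
    '[27-May-2014 16:48:34.473] [WARN ] Failed to execute filter script for template object 26027 "H3C (-25506-,sys65) - filtr ok - bez DCI": Error 11 in line 13: Function not found',
]

for key, count in summarize_filter_warnings(sample).items():
    print(count, key)
```

Grouping by object ID and script line shows which filter scripts to inspect first (here, line 13 of the filter for object 26027 and line 4 for object 26029).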

Victor Kirhenshtein

Yes, there is a bug in the script compiler that may cause a crash. Are you using a 32-bit or 64-bit server?

Best regards,
Victor

sperlm

Hello, we are running CentOS release 6.4 (Final), kernel version 2.6.32-358.14.1.el6.x86_64.

It is a 64-bit server.

Victor Kirhenshtein

Did you build it from sources? If yes, try replacing the file src/libnxsl/parser.y with the attached one and recompile. Check that you have flex and bison installed.

Best regards,
Victor

sperlm

Yes.

I installed the missing flex and bison packages, the non-development versions (how come we did not need these before?).
I downloaded netxms-1.2.14.tar.gz, extracted it, swapped in parser.y, ran configure, make, and make install, then started the agent and the server. Same result: a huge number of warnings because of the filters. I tried switching them off, but the server fails again :(

Victor Kirhenshtein

Can you run the server under gdb and send me the stack trace after the crash?

Run it as

gdb /path/to/netxmsd

at the (gdb) prompt enter

run -D5

when it crashes, at the (gdb) prompt enter

bt

and send me its output.

Best regards,
Victor
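
The same steps can also be run non-interactively. The sketch below only builds the command line; it assumes a gdb that supports the standard `-batch`, `-ex`, and `--args` options, and the helper name `gdb_backtrace_command` is made up for illustration. The path is the same placeholder as above.

```python
import shlex

def gdb_backtrace_command(binary, debug_level=5):
    """Build a gdb command that runs the binary and prints a backtrace when it stops."""
    return [
        "gdb", "-batch",      # exit gdb after the listed commands have run
        "-ex", "run",         # same as typing "run" at the (gdb) prompt
        "-ex", "bt",          # same as typing "bt" after the crash
        "--args", binary, f"-D{debug_level}",  # arguments passed to the server
    ]

# Print the command so it can be pasted into a shell.
print(shlex.join(gdb_backtrace_command("/path/to/netxmsd")))
```

Passing the binary after `--args` also loads it into gdb, so no separate `file` command should be needed.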

sperlm

Hello, sorry for the delay.

It's my first time debugging; I used http://www.ibm.com/developerworks/linux/library/l-gdb/ as a reference.
I had to enter "file /opt/netxms/bin/netxmsd" into gdb before "run -D5".

The debug session ended with:
Program received signal SIGSEGV, Segmentation fault.
[Switching to Thread 0x7ffff0132700 (LWP 5879)]
AlarmManager::watchdogThread (this=0x7ffff7dd7e80) at alarm.cpp:1105
1105                            if ((m_pAlarmList[i].ackTimeout != 0) &&


bt output
#0  AlarmManager::watchdogThread (this=0x7ffff7dd7e80) at alarm.cpp:1105
#1  0x00007ffff7aea849 in WatchdogThreadStarter (pArg=<value optimized out>) at alarm.cpp:204
#2  0x00007ffff6137851 in start_thread (arg=0x7ffff0132700) at pthread_create.c:301
#3  0x00007ffff534b90d in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:115


With regards,
Milan Sperl