02-27-2007 01:19 PM
Hi to all,
We have DFM configured to send trap notifications towards an external machine, and we see that sometimes the alarms that arrives to the external NMS are delayed. some o them are sent only days after it were received in CW .
Any clue ??... correlation issues maybe ?
Cheers,
Pedro
Solved! Go to Solution.
04-12-2007 12:39 PM
The attachment didn't come through, but I'm not sure how useful it would have been. It sounds like he sent a screenshot of a sniffer trace which isn't very helpful. We would need to see the binary sniffer trace file. In any event, I think I found an example in the log, and it looks like address resolution might be the culprit.
Basically, DFM takes the IP address that is configured and performs a lookup on the address. So, DFM tries to use the system resolver to resolve 192.168.168.134. This takes about three seconds to complete. The way Windows address resolution works can cause excessive delays when performing these kind of operations. As a workaround, try adding this address to DNS or your server's hosts file, and see if this problem improves.
02-27-2007 01:35 PM
I have not heard of any delay problems with sending trap notifications. However, there are a few problems where notifications are lost. Are you sure the notifications showing up on the external NMS are actually delayed, or is it a new notification that is showing up? How have you configured your trap notification subscription, and what do one of these delayed notifications look like?
02-27-2007 02:12 PM
I'm sure it's the same notification cause the id is the same ...i compare them, i mean, the alarm that appears on DFM and the alarm that appears later on external nms...
P
02-27-2007 09:21 PM
What about the answers to my other two questions?
Something you should try to debug this is to place a sniffer on the LMS server, and filter on UDP 162 traffic to your external NMS. See if the traps are leaving the LMS server in a timely fashion. If not, you can enable Notification Services debugging under DFM > Configuration > Other Configurations > Logging, reproduce this problem, then check the NMSROOT/log/dfmLogs/NOS/nos.log for errors.
02-27-2007 11:50 PM
My DFM (lms2.6; dmf 2.0.8) email notification doesn't work automatically. If i click on notify and fill in the details , the email comes through. But if i manually generate an alarm , the NOS.log file says that it sees it, and its preparing to send but then it doesn't.
28-Feb-2007|09:39:52.453|DEBUG|NOS|Email Notify Thread:Pooled Thread:1|EmailUtility|createAndSendMail()|.|Ready to send
28-Feb-2007|09:39:52.453|ERROR|NOS|Email Notify Thread:Pooled Thread:1|EmailUtility|createAndSendMail()|.|
Mail was not sent to ; pierre.vanvuuren@za.didata.com
02-28-2007 05:51 AM
Hi Pierre,
Is the NOS Server running? Check this by searching for "Common Services -> Server -> Admin -> Processes" the NOSServer process. Is it running?
With DFM 2.0.4 i had an issue with the NOSServer registration.
Navigate to "Common Services -> Server -> Admin -> Processes"
Click on the entry "NOSServer"
The following INformation should appear for 2.0.8:
Process: NOSServer
Path: /opt/CSCOpx/bin/cwjava
Flags: -server -cp:p MDC\tomcat\webapps\triveni\WEB-INF\lib\log4j.jar,
MDC\tomcat\webapps\triveni\WEB-INF\lib\nos.jar,
MDC\tomcat\webapps\triveni\WEB-INF\lib\ctm.jar,
MDC\tomcat\webapps\triveni\WEB-INF\lib\ogs-client.jar,
MDC\tomcat\webapps\triveni\WEB-INF\lib\cogs.jar,
MDC\tomcat\webapps\triveni\WEB-INF\lib\ogs-kc1.0.jar,
MDC\tomcat\webapps\triveni\WEB-INF\lib\ogs-kilnervirtualasa1.0.jar,
MDC\tomcat\webapps\triveni\WEB-INF\lib\activation.jar,
MDC\tomcat\webapps\triveni\WEB-INF\lib\mail.jar,
MDC\tomcat\webapps\triveni\WEB-INF\lib\tis-client.jar,
MDC\tomcat\webapps\triveni\WEB-INF\lib\pm.jar,
lib\classpath\jconn2.jar,lib\classpath\epm-v0-5-0.jar,
lib\classpath\cesar-v0-5-0b.jar,
MDC\tomcat\shared\lib\NATIVE.jar,
MDC\tomcat\shared\lib\MICE.jar,
MDC\bin,objects\nos\config,MDC\tomcat\webapps\triveni\WEB-INF\classes,
-cw:jre D:\PROGRA~1\CSCOpx\MDC\jre -cw:xrs -Djava.compiler=NONE com.cisco.nm.trx.nos.server.NOSServer
Startup: Started automatically at boot.
Dependencies: EPMDbEngine EPMServer INVDbEngine PMServer DFMOGSServer
If there is a difference you might try tu unregister and register NOSServer.
Windows
@ECHO OFF
SET NMSROOT=c:\CSCOpx
CALL %NMSROOT%\bin\pdreg.cmd -u NOSServer
CALL %NMSROOT%\bin\pdreg.cmd -r NOSServer -d "EPMDbEngine,EPMServer,INVDbEngine,PMServer,DFMOGSServer" -e %NMSROOT%\bin\cwjava.exe -f "-server^-cp:p^MDC\tomcat\webapps\triveni\WEB-INF\lib\log4j.jar,MDC\tomcat\webapps\triveni\WEB-INF\lib\nos.jar,MDC\tomcat\webapps\triveni\WEB-INF\lib\ctm.jar,MDC\tomcat\webapps\triveni\WEB-INF\lib\ogs-client.jar,MDC\tomcat\webapps\triveni\WEB-INF\lib\cogs.jar,MDC\tomcat\webapps\triveni\WEB-INF\lib\ogs-kc1.0.jar,MDC\tomcat\webapps\triveni\WEB-INF\lib\ogs-kilnervirtualasa1.0.jar,MDC\tomcat\webapps\triveni\WEB-INF\lib\activation.jar,MDC\tomcat\webapps\triveni\WEB-INF\lib\mail.jar,MDC\tomcat\webapps\triveni\WEB-INF\lib\tis-client.jar,MDC\tomcat\webapps\triveni\WEB-INF\lib\pm.jar,lib\classpath\jconn2.jar,lib\classpath\epm-v0-5-0.jar,lib\classpath\cesar-v0-5-0b.jar,MDC\tomcat\shared\lib\NATIVE.jar,MDC\tomcat\shared\lib\MICE.jar,MDC\bin,objects\nos\config,MDC\tomcat\webapps\triveni\WEB-INF\classes,^-cw:jre D:\PROGRA~1\CSCOpx\MDC\jre^-cw:xrs^-Djava.compiler=NONE com.cisco.nm.trx.nos.server.NOSServer"
Hope that helps a bit. Ohterwise help us an provide a bit more details about your setup (Notification Groups and settings).
Best regards,
Frank
02-28-2007 09:05 AM
Stick a sniffer on the LMS server, and filter for TCP port 25 to your DFM SMTP server. That trace may provide a reason why the mail is not being sent automatically.
04-09-2007 07:20 AM
04-09-2007 08:22 AM
I don't see any problems with delay on the DFM side. I see an alert come in at 10:08:34, and goes out as a trap at 10:08:50. As I said previously, you should start deploying sniffers to watch the trap move from the DFM server to the external NMS.
04-09-2007 08:32 AM
04-09-2007 08:37 AM
There is bug retrieving event details that will be worked out in LMS 3.0, but it shouldn't delay the sending of a notification trap. Without a debug showing the problem, or a sniffer trace showing where in the network the delay is occurring, I cannot offer any other suggestions.
04-12-2007 08:33 AM
Hi JC,
About the delay, every time the system is NOT loaded we see a regular flow of traps being sent from CW to the notification server ( i.e. several packets on the same second), BUT when high load is seen, the flow of traps towards NServer lowers to one trap every 4 seconds...
04-12-2007 08:36 AM
How loaded is loaded? Four seconds could be expected under high load situations. What are the specs on this server?
04-12-2007 09:15 AM
Hi JC,
Loaded means thousands of traps recevived on a certain period of time. The Server is a DL360 and the proccessor load doesn't goes more than 40%. I do believe that it is queueing delay, as that if i receive let's say 10 traps one the same second and i'm sending notifications for those traps one every 4 second's ...with time i will have delay.
I can say that at the moment, when we see load on the DFM, we experience more than 40 minutes delay from a alarm apearing on DFM and being sent to notification server...
P
04-12-2007 09:20 AM
Hi JC,
Loaded means thousands of traps recevived on a certain period of time. The Server is a DL360 and the proccessor load doesn't goes more than 40%. I do believe that it is queueing delay, as that if i receive let's say 10 traps one the same second and i'm sending notifications for those traps one every 4 second's ...with time i will have delay.
I can say that at the moment, when we see load on the DFM, we experience more than 40 minutes delay from a alarm apearing on DFM and being sent to notification server...
P
Discover and save your favorite ideas. Come back to expert answers, step-by-step guides, recent topics, and more.
New here? Get started with these tips. How to use Community New member guide