09-08-2008 06:27 AM
I'm running LMS 2.6 with RME 4.0.6.
The current situation is that the scheduled config polling job simply fails and doesn't appear to have polled anything.
Because I'm not aware of the exact service at fault, I stop and restart the CW daemon. This fixes the issue for a single daily run; every job after that first run fails.
I get a single email:
Hello,
The following is the status of your Change Poller based Collection job:
Job ID : 1015
Status : Job Failed
Description : System config polling job
Details : https://nms.ArkansasElectric.com:443/rme/DcmaJobDetails.do?jobid=1015.192
Start Date and Time : Thu Sep 04 17:00:21 CDT 2008
End Date and Time : Thu Sep 04 17:00:21 CDT 2008
RME Server Name/IP : nms.ArkansasElectric.com
Execution Summary
Pending : 0
NotAttempted : 0
Successfull : 0
Failed : 0
Partial Success : 0
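The notable pattern in this email is that the job is marked failed while every counter in the Execution Summary is zero, i.e. it failed before attempting any device. A minimal Python sketch for spotting that pattern in a saved copy of the notification, assuming the plain-text `Label : value` layout quoted above. The function names are illustrative and not part of RME.

```python
import re

# Counter labels exactly as they appear in the notification email
# (including the product's own "Successfull" spelling).
COUNTERS = ("Pending", "NotAttempted", "Successfull", "Failed", "Partial Success")

def parse_notification(text):
    """Return (status, counts) parsed from a job notification email body."""
    status = None
    counts = {}
    for line in text.splitlines():
        # Split on the first colon only, so URLs and timestamps stay intact.
        label, sep, value = line.partition(":")
        if not sep:
            continue
        label, value = label.strip(), value.strip()
        if label == "Status":
            status = value
        elif label in COUNTERS and re.fullmatch(r"\d+", value):
            counts[label] = int(value)
    return status, counts

def failed_before_polling(status, counts):
    """True when the job failed and every reported counter is zero."""
    return (
        status == "Job Failed"
        and bool(counts)
        and all(value == 0 for value in counts.values())
    )
```

Feeding it the email above should flag the job as having failed before polling, which points at the service or database rather than at any individual device.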
I'm pretty sure this all began after we had a disaster in our data center. I know the server CW runs on went down without being properly shut down, so that probably has something to do with it.
Does anyone have any advice, or knowledge about how to remedy the situation?
Thanks,
Daniel
09-08-2008 11:24 AM
Please post the output of the pdshow command when the polling fails.
09-26-2008 07:54 AM
I've attached the output of the pdshow.
The image I've attached is an error message that I receive when I access "Archive Management". Access works for several attempts in a row, then I receive this message. After that I am unable to access any part of CiscoWorks except the home page. This is also the point at which scheduled jobs fail.
09-26-2008 12:41 PM
Is this installed on Windows or Solaris?
09-26-2008 03:31 PM
Windows Server 2003 Standard
09-26-2008 03:34 PM
It appears there may be a problem with your RME database. It may be corrupt in some way that is triggering a failure, or its connections may be exhausted. Please post your RMEDbMonitor.log, and the contents of the Windows Application Event Viewer.
09-26-2008 04:44 PM
09-26-2008 05:23 PM
It does appear your database is corrupt. If you have a known good backup of LMS, you should restore it. If not, you will need to reinitialize the RME database with the command:
NMSROOT\bin\perl NMSROOT\bin\dbRestoreOrig.pl dsn=rmeng dmprefix=RME
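In the command above, NMSROOT stands for the CiscoWorks installation directory. A small Python sketch that expands it into a full argument list without running anything, so the command can be checked before it wipes the RME data. The path `C:\CSCOpx` in the usage note is only an example; substitute your own install directory. As far as I know the daemon manager should also be stopped (`net stop crmdmgtd`) before the script is run, but confirm that against the LMS documentation.

```python
from pathlib import PureWindowsPath

def build_reinit_command(nmsroot, dsn="rmeng", dmprefix="RME"):
    """Build the dbRestoreOrig.pl argument list for a given install directory.

    This only constructs the command; it never executes it.
    """
    root = PureWindowsPath(nmsroot)
    return [
        str(root / "bin" / "perl"),
        str(root / "bin" / "dbRestoreOrig.pl"),
        f"dsn={dsn}",
        f"dmprefix={dmprefix}",
    ]
```

For example, `build_reinit_command(r"C:\CSCOpx")` returns the four arguments with NMSROOT replaced by `C:\CSCOpx`.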
09-26-2008 07:23 PM
I don't know if my last couple backups are any good. What will that command do to everything I currently have in RME? Will it all be lost?
09-26-2008 07:31 PM
Yes, all your RME data will be lost.
10-01-2008 12:36 PM
I restored from a backup and the restore went fine. Everything is running now, except that the scheduled daily poll seems to fail before it starts. Previously, when it failed, the summary said
Pending : 0
NotAttempted : 0
Successfull : 0
Failed : 0
Partial Success : 0
I guess that was because of the database problem. Now the summary shows all the devices as Pending, but the job still fails. I noticed that manually running syncs would complete successfully, but I also started a job that just kept running and will not stop; it just says "Stop Initiated". Now all RME jobs fail. Are there any logs that show this info?
10-01-2008 12:38 PM
The dcmaservice.log would show any errors. However, if you have a wedged job, you will need to restart ConfigMgmtServer at the very least for new config jobs to run again.
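For digging through dcmaservice.log, a rough Python sketch that pulls out lines likely to matter. The marker strings (`ERROR`, `FATAL`, `Exception`) are assumptions about what the log contains, not a documented format, so adjust them to whatever your log actually shows.

```python
def suspicious_lines(lines, markers=("ERROR", "FATAL", "Exception")):
    """Yield (line_number, text) for every line containing any marker."""
    for number, line in enumerate(lines, start=1):
        if any(marker in line for marker in markers):
            yield number, line.rstrip("\n")
```

Pass it an open file handle for the log (opened with `errors="replace"` in case of odd characters) and print the pairs it yields. For the restart itself, my understanding is that `pdterm ConfigMgmtServer` followed by `pdexec ConfigMgmtServer` is the usual way to bounce a single process.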
10-01-2008 12:59 PM
Thanks, that finally killed that job. I just noticed another problem though: ANIServer fails to run. If I start it, it says
Program started - No mgt msgs received
Then
Running with busy flag set
Then
Failed to run
Any ideas?
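State strings like the ones quoted above can be picked out of a saved pdshow dump programmatically. A minimal Python sketch, assuming the output is laid out as `Key = Value` lines with each process starting at a `Process=` line; that layout is an assumption here, so check it against your own output first.

```python
def parse_process_blocks(text):
    """Split 'Key = Value' status output into one dict per process."""
    processes = []
    current = None
    for line in text.splitlines():
        # Split on the first '=' only, so values may contain '=' themselves.
        key, sep, value = line.partition("=")
        if not sep:
            continue
        key, value = key.strip(), value.strip()
        if key == "Process":
            current = {"Process": value}
            processes.append(current)
        elif current is not None:
            current[key] = value
    return processes

def in_state(processes, state):
    """Return the names of processes whose State equals the given string."""
    return [p["Process"] for p in processes if p.get("State") == state]
```

Calling `in_state(parse_process_blocks(dump), "Failed to run")` lists every process reporting that state, which is quicker than reading the whole dump by eye.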
10-01-2008 01:15 PM
Please start a new thread for the problem with ANIServer.
10-01-2008 01:25 PM
I restarted the Daemon and so far it looks fine. I'll start a new thread if I continue to have problems with it. Thank you for your help.