Basic Troubleshooting

Download Report

Transcript Basic Troubleshooting

VPO Troubleshooting
Section Overview
•
•
•
•
•
•
Possible Trouble Areas
Filesystem structures
Management Server logfiles
Troubleshooting process
Scenarios
VPO Trace facility
2
VPO Troubleshooting
[vpo_troubleshooting]
Possible Trouble Areas
“Message processing”
“Networking”
Managed node
“Database”
Managed server
rpcd
Templates
DB
rpcd
VPO
Agent
Processes
DCE
RPC
VPO
Manager
Processes
Interceptor/
Encapsulator
ipmap
Logfiles
Logfiles
“Processes”
“GUIs”
registration
“OpenView”
“Applications”
“Installation”
3
VPO Troubleshooting
[vpo_troubleshooting]
Filesystem Structure - Binaries
/opt/OV
”
lib
bin
man
OpC
newconfig
OpC
VPO binaries contained
here for Server and
Nodes
install
opcinfo
opcsvinfo
Configuration files
for Node and Server
4
VPO Troubleshooting
[vpo_troubleshooting]
Filesystem Structure - Config data
/etc/opt/OV/share
registration
symbols lrf
conf
OpC
mgmt_sv
Application
registration files
5
VPO Troubleshooting
[vpo_troubleshooting]
Filesystem Structure - Runtime data
/var/opt/OV/
share
log
OpC
mgmt_sv
databases
OpC
tmp
conf
tmp
OpC
OpC
OpC
Managed Node
distribution
area
Managed Node
VPO download
config data files
OpC
opcerror
(Managed Node)
opcerror
(Mgmt Server)
bin
Managed
Node queues
and pipes
Management
Server queues
and pipes
& distrib area
OV Maps &
VPO configuration
scripts
6
VPO Troubleshooting
[vpo_troubleshooting]
Management Server Logfiles
/var/opt/OV/log/OpC/mgmt_sv
$ORACLE_HOME
inst_err.log
inst_sum.log
rdbms
install.log
opcerror
log
opcsw_inst.log
alert_openview.log
dbwr_20629.trc
opcsw_inst_err.log
7
VPO Troubleshooting
[vpo_troubleshooting]
Troubleshooting Process
• Note the precise details of the problem
• Check the Error Message Reference Manual or run
opcerr
• Check the VPO Administrators Reference Manual
• Run itochecker
• Check through the rest of this module
• Use the VPO tracing option
• Check the HP Web site for support information
• Call the HP Response Center
8
VPO Troubleshooting
[vpo_troubleshooting]
Agent/Server Installation & Runtime
• Examples of problems
– “The Management Server will not start”
– “The Managed Node processes will not start”
– “The VPO Oracle database returns errors”
– “My VPO backups aren’t working”
– “I keep getting communication errors to the nodes”
9
VPO Troubleshooting
[vpo_troubleshooting]
Windows and Messages
• Examples of problems
– “My operators can’t login to Java/Motif GUI”
– “A message was supposed to arrive in my browser….it
didn’t!!”
– “I get duplicate messages in my browser”
– “Batches of old messages keep arriving in my browser”
10
VPO Troubleshooting
[vpo_troubleshooting]
Distribution
• Examples of problems
– “A distribution error message keeps appearing in my
browser”
– “I distributed new config, but the messages are in the old
format”
– “I created a new template, did a distribution
and…...nothing!!”
11
VPO Troubleshooting
[vpo_troubleshooting]
Actions, Commands and Monitors
• Examples of problems
– “Automatic Actions keep failing”
– “Some Actions will not stop running”
– “Some of the operators can’t run their applications”
– “My new monitor is giving me weird errors”
– “One of my monitors can’t start because the previous one
is still running”
12
VPO Troubleshooting
[vpo_troubleshooting]
When all else fails…..trace!
Managed node
Management server
/opt/OV/bin/OpC/install/opcinfo
/opt/OV/bin/OpC/install/opcsvinfo
OPC_TRACE TRUE
OPC_TRACE_AREA MSG, ACTN
OPC_TRACE_TRUNC FALSE
OPC_TRACE TRUE
OPC_TRACE_AREA MSG, ACTN
OPC_TRACE_TRUNC FALSE
VPO
agent
processes
VPO
manager
processes
/var/opt/OV/tmp/OpC/trace
/var/opt/OV/share/tmp/OpC/mgmt_sv/trace
opcagt -trace
or
opcagt -start
13
opcsv -trace
or
opcsv -start
VPO Troubleshooting
[vpo_troubleshooting]