Session Title - Andy Pedisich

Download Report

Transcript Session Title - Andy Pedisich

The Complete
Handbook for Domino
Domain Monitoring
Andy Pedisich
Technotics
© 2009 Wellesley Information Services. All rights reserved.
In This Session
• Domino Domain Monitoring (DDM) is a powerful, yet complex tool
that is often overlooked by administrators
• If you are using Domino 6, 7, or 8 you are already a proud owner of
Domino Domain Monitoring Database and could already be using
its powerful functionality
• This Jumpstart will lay out all of the parts to help you see the
connections you need to successfully use this feature in your
R7/R8 domain
1
What We’ll Cover …
• Discovering the basics of Domino Domain Monitoring
• Understanding how DDM fits into your environment
• Crafting a perfect DDM data collection hierarchy
• Looking at simple but important DDM events
• Developing proactive probes
• Mastering enhanced events and corrective actions
• Working with the new Release 8 modular documents
• Testing and debugging Domain Monitoring
• Wrap-up
2
What Is Domino Domain Monitoring?
• Domino Domain Monitoring is part of a suite of tools that can help
you watch over your domain
 They are built into Notes/Domino
 Notes Administrator Client
 Domino server monitor
 Remote server console
 Event generators and handlers
 Statistics collection
 Activity logging
 LOG.NSF
 Domino Domain Monitoring
3
DDM Details
• DDM provides a single location where administrators
can access issues that are affecting multiple servers
and databases
 This is important because it lets you discover and resolve
problems quickly before they cause serious issues
• This helps to reduce the risk of unplanned downtime
4
Can DDM Be Used with All Releases of Domino?
• DDM became available in Notes/Domino Release 7
 Some of its probe features work on R6 servers
We’ll discuss this shortly
 But it is best suited for Releases 7 and 8
• DDM has been improved in Release 8

5
The Domino Domain Monitoring Database
• The DDM database is the central repository of all monitoring data
 This can be data collected by probes that you can configure
• The data in the DDM database can also include
 Result messages from event generators that you configured in
previous releases of Notes
 Results for routine checks that run as part of specific server
tasks, such as the router or replicator
6
Do All Administrators Use DDM?
• Many administrators don’t use the potential of DDM as much as
they should
 They are already overwhelmed by the monitoring features
of Domino
 They don’t understand how DDM fits into the architecture
7
Some Administrators Have Probe-Aphobia
• And some have become intimidated by the configuration of probes
 Probes are not a required part of DDM
 They are nice to have and fun to use, but DDM functions
without them
 You can get started without probes
 Then add them into the configuration when you become
more familiar with how DDM works
• If your Domain is running Release 7 or 8, DDM is working for you
right now
8
What We’ll Cover …
• Discovering the basics of Domino Domain Monitoring
• Understanding how DDM fits into your environment
• Crafting a perfect DDM data collection hierarchy
• Looking at simple but important DDM events
• Developing proactive probes
• Mastering enhanced events and corrective actions
• Working with the new Release 8 modular documents
• Testing and debugging Domain Monitoring
• Wrap-up
9
The Big Relationships in Monitoring
• EVENTS4.NSF – the Monitoring Configuration database is a key
file in your monitoring infrastructure
 It contains all of the specifics for your DDM monitoring
configuration
 For DDM probes
 For the DDM collection hierarchy (more about this later)
• But it also contains all the configuration specifics for:
 Statistic collection
 Event monitoring, such as tracking disk space or ACL changes
 Event notification, such as sending emails when problems
occur or logging issues into the Monitoring Results database
10
Simple Events
• When DDM was introduced, Lotus began to refer to events that
were generated automatically by servers as “simple events”
 Most events logged to the Domino console are simple events
 You’ve seen simple events logged to STATREP.NSF in the past,
and you’ll still find them there in R7 and R8
• You will also find simple events along with the results of probes
you create in DDM.NSF
 These are called “enhanced events”
11
This Pair of Databases Is Created Automatically
• EVENTS4.NSF was automatically built when your first server was
created
 DDM.NSF is created the first time you start an R7 or R8 server
• All EVENTS4.NSFs in your domain must have the same replica ID
 And all DDM.NSFs in your domain must be have the same
replica ID
12
This Configuration Is Extremely Important
• If the replica ID of these databases are not correct, then your
configuration for monitoring won’t be distributed consistently
throughout your domain
 And you won’t be able to collect DDM data centrally
• This business of having EVENTS4.NSFs with different replica IDs
is a very common issue
 Especially among domains that have been around for a while
• And we’ve seen many domains where the DDM database has a
different replica ID on some servers
13
Some System Databases Have Special Replica IDs
• The EVENTS4.NSF and DDM.NSF databases have very specific
replica IDs
 They are a hash of the replica ID of your domain’s NAMES.NSF
14
We Know What the Replica ID Should Be for EVENTS4
• The replica ID of system databases, such as EVENTS4 and
DDM.NSF, is derived from the replica ID of your domain’s
address book
Database
Replica ID
NAMES.NSF
852564AC:004EBCCF
CATALOG.NSF
852564AC:014EBCCF
EVENTS4.NSF
852564AC:024EBCCF
ADMIN4.NSF
852564AC:034EBCCF
DDM.NSF
852564AC:0A4EBCCF
 Notice that the first two numbers after the colon for the
EVENTS4.NSF replica are 02 and OA for DDM.NSF
15
This Configuration Must Be Correct
• Make sure that EVENTS4.NSF and DDM.NSF have the correct
replica ID throughout the domain by opening each and putting it
on your desktop
 The icons should stack, showing they are the same replica
 Check the replica ID by looking at the properties of
the databases
16
Errors You Might See If DDM.NSF Is Not Right
• If there is a DDM.NSF on every server but they aren’t all the same
replica ID, you’ll see the following error on the console every
couple of minutes:
 Unable to replicate with server Server2: None of the selected
databases have a replica on the server
 You’ll get this even if you have a much longer replication
interval scheduled
• To fix problems related to EVENTS4.NSF and DDM.NSF replica IDs,
you must delete the bad databases and restart the server
17
Want to Add Every EVENTS4.NSF to Your Desktop?
• Add this code to a button on your toolbar
 This is courtesy of Thomas Bahn

www.assono.de/blog
 It’s a great tool that I use almost every day
_names := @Subset(@MailDbName; 1) : "names.nsf";
_servers := @PickList([Custom]; _names; "Servers"; "Select
servers"; "Select servers to add database from"; 3);
_db := @Prompt([OkCancelEdit]; "Enter database"; "Enter the file
name and path of the database to add."; "log.nsf");
@For( n := 1; n <= @Elements(_servers); n := n + 1;
@Command([AddDatabase]; _servers[n] : _db) )
18
Add a Database to the Desktop
• This code will prompt you to pick the servers that have the
database you want on your desktop
 Then it will prompt for the name of the database
 And open it on all the servers you’ve selected
• Use it to make sure all the EVENTS4.NSF are the same replica in
your domain
19
Demo – Looking at Simple and Enhanced Events
20
What We’ll Cover …
• Discovering the basics of Domino Domain Monitoring
• Understanding how DDM fits into your environment
• Crafting a perfect DDM data collection hierarchy
• Looking at simple but important DDM events
• Developing proactive probes
• Mastering enhanced events and corrective actions
• Working with the new Release 8 modular documents
• Testing and debugging Domain Monitoring
• Wrap-up
21
Configure DDM for Centralized Data Collection
• DDM.NSF has the most value when it is the central repository for
all issues
 It will contain all of the issues that come from all of the servers
• This does not happen on its own
 There is no collection hierarchy set up by default
 Each domain has different monitoring requirements
22
Collection Hierarchy Is a Must
• Without a collection hierarchy, DDM probes run on a server and
report events to DDM.NSF that are on that server
 Then they remain only on that server’s replica of DDM.NSF
 You have to check the DDM database on each server to
evaluate problems and discover potential issues
 This is time consuming
 It reduces the time you could be spending solving
problems
 And you might miss important issues
23
Aggregate Data Centrally
• A DDM server collection hierarchy lets you aggregate the data
onto a key server or servers
 This must be configured in the EVENTS4.NSF
• The simplest hierarchy is to configure one server to collect from
all servers in the domain
24
More Complex Scenarios Are Possible
• Perhaps as you become more familiar with DDM, you’ll want to roll
up some data regionally
 So that regional administrators receive only information that is
pertinent to the server they maintain
25
Rolling Up the Data
• DDM data rollup propagates the probe results up the DDM server
collection hierarchy
 Data rollup is accomplished using Domino’s selective
replication to transport the data
 The replication formulas are created automatically when
you define your DDM server collection hierarchy
26
Selective Replication Formulas
• Each selective replication formula is specific to each server in the
Domino Domain Monitor replica
• When they are fully populated, the selective replication formula
references the collection server
 And all of its monitored servers
 The selective replication formula filters out all documents
from servers that are not members of the collecting
server’s hierarchy
27
Hierarchy Collection Interval
• The DDM system sets up its own collection interval
 Collection replication occurs about every five minutes
This interval cannot be modified
 It is not controlled through connection documents
• Every five minutes, each collection server uses pull replication to
get updates from the DDM database on each of the monitored
servers

28
What We’ll Cover …
• Discovering the basics of Domino Domain Monitoring
• Understanding how DDM fits into your environment
• Crafting a perfect DDM data collection hierarchy
• Looking at simple but important DDM events
• Developing proactive probes
• Mastering enhanced events and corrective actions
• Working with the new Release 8 modular documents
• Testing and debugging Domain Monitoring
• Wrap-up
29
Severity Levels in DDM
• Each event arriving into DDM.NSF has been assigned a
severity level
 The severity indicates the importance of taking action
 Fatal – Imminent system crash
 Failure – Severe failure that does not cause a system crash
 Warning (high) – Loss of function requiring intervention
 Warning (low) – Performance degradation
 Normal – Status messages
30
Address Issues by Severity Level
• Looking at issues by severity gives you the chance to deal with
the most important issues first
 They are broken out by severity category
31
Looking at a Failure
• This simple event in DDM tells us what the problem was and when
it occurred
 And it offers a possible solution
 This example shows the basics of DDM
32
Not All Simple Events Make It to DDM.NSF
• You’ll notice that not all of the events you see on the console are
delivered to DDM
 This is because of a default filter in EVENTS4.NSF
33
Filtering Specifics
• This default filter is put in place to “turn down the noise”
 Only important Fatal and Failure simple events are let through
This lets you save time by focusing on the real issues
• This filter can be customized by you, based on the requirements
within your own domain

34
Complete Listing of Messages
• There is a complete listing of all simple events and
their severities
 You can find them in the EVENTS4.NSF Monitoring
Configuration database
 Look in the Advanced section
35
Another Helpful View
• Release 8 added a new view to DDM.NSF
 You can see issues by database name

This lets you determine whether a problem is happening
on just one server or on every copy of the database in
the domain
 Very handy information when problem solving
36
What We’ll Cover …
• Discovering the basics of Domino Domain Monitoring
• Understanding how DDM fits into your environment
• Crafting a perfect DDM data collection hierarchy
• Looking at simple but important DDM events
• Developing proactive probes
• Mastering enhanced events and corrective actions
• Working with the new Release 8 modular documents
• Testing and debugging Domain Monitoring
• Wrap-up
37
Proactive Probes
• Remember that there are two kinds of events:
 Simple
 Enhanced
• Enhanced events are:
 Generated by a DDM probe configuration document
 Generated by a Domino Event Generator
• These are both events with specific target information such as the
server, database, agent, or user checked appearing in the DDM
event report header
• It’s time to check out probes
38
Probes Defined
• A probe is the investigative component of DDM
• Probes:
 Need configuration to be useful
 Are configurable by administrators
• A probe is defined as:
 A discrete check, or set of checks, configured to run against
one or more servers, databases, and services
 A probe returns status and results to the Domino Domain
Monitoring Database – DDM.NSF
 The DDM.NSF is created automatically
when the server starts
39
Analysis Probes
• Configure DDM using probe documents in EVENTS4.NSF
 Otherwise known as the Monitoring Configuration database
• You can create multiple probes for each feature area
 And you can individually configure each probe to run:



Selective checks
Against specific servers and/or databases
At specific times
40
What’s in These Probe Documents?
• Probe type and probe subtype
 For example, Security is a probe type

One of its probe subtypes is Best Practices
 This combination of probe type and probe subtype
creates a Security probe
41
Extra Information About the Probe Is Provided
• These probe documents also contain a general description of the
probe, its purpose, and its intended use
42
Configuration Specifics, Too
• Documents can also specify configurable probe targets
 The server(s) that will run the probe
 And in some cases the servers, database, etc., that the probe
runs against
 Where it’s applicable, there is also configurable scheduling
information – but not for all probe types
43
There’s More Inside
• Probe documents can also hold configuration specifics
 What the probe monitors
 What it should report on
 Thresholds to watch for

And what type of severity those thresholds represent
44
Plenty of Cool Probes
• R8 gives us 58 default DDM probes to work with
 R7 gives us 48 – still plenty to get us started
• You can get probing as soon as R7/R8 is up
• Just plug in your server info to get DDM started
 You can also create new probe documents

Define and customize your own probes
45
Many Types of Probes
• There are ten major types of probes in R8, nine in R7
 These probes can run two different ways:
On a schedule that you specify
 As an active monitor of things that happen in the domain
• Some probes can run either way
 On a schedule or as a monitor
 It depends on what you ask them to do
 Some can only run as a monitor or on a schedule

46
Summary Table for Probes
• Here’s a handy table to show how probes can run:
Probes
Administration Probes
Application Probes
Database Probes
Directory Probes
Messaging Probes
Operating System Probes
Replication Probes
Security Probes
Server Probes
Web Probes
Scheduled
Yes
Yes
Yes
Yes
Yes
Yes
Monitor
Yes
Yes
Yes
Yes
Yes
Yes
Yes
Yes
47
Establishing a Schedule
• Scheduled probes can be controlled with great granularity
 Set the probe to run:

Daily, Weekly, Monthly
 Beyond that, specific schedule settings can vary from
probe type to probe type
48
Don’t Worry About Getting Off Schedule
• If a Weekly/Monthly probe is missed, you can specify how you
want the probe to be handled:
 Ignore it completely
 Run the missed probe on startup
 Run the missed probe at the next time range
49
Zeroing in on Probes
• We’re going to focus on four probes that have a high value in
almost every Domino domain:
 Application probes
 Web probes
 Security probes
 Messaging probes
50
Agents Are Tracked by Application Probes
• Application probes monitor agents
 None of these probes can be scheduled
They are all real-time monitors
• Application probes have the following subtypes:
 Agents behind schedule
 Agents ranked by CPU usage
 Agents ranked by memory usage
 Long-running agents

51
Agents Are Tracked by Application Probes (cont.)
• Agents behind schedule
 Monitors agents and detects when an agent starts after its
scheduled time
• Agents ranked by CPU usage
 Evaluates the CPU usage for agents executed by the specified
process (Agent Manager or HTTP)
 Event threshold and severity are adjustable based on
CPU usage
 These have a relatively high overhead
52
Agents Are Tracked by Application Probes (cont.)
• Agents ranked by memory usage
 Evaluates agents memory usage executed by the Agent
Manager or HTTP tasks
 Note that evaluation results for the same agent may differ when
the agent runs in Agent Manager/HTTP
 Also, results from this probe can depend on HTTP settings
• Long-running agents
 Detects agents that run longer than a time you specify
53
Web Probes
• Web probes provide results as a Domino Event document in the
DDM database
• There are two subtypes:
 Configuration probe
 Verifies the accuracy of a Web server’s configuration
 Best Practices probe
 Compares servers’ configurations against a known
good server
• The default schedule for these probes is once a month
54
Web Configuration Probe
• The Web Configuration probe provides feedback related to the
settings of a Web server or list of Web servers
 This feedback is based on a Web profile
 The Web profile you reference could be a real server or a
profile you create that is “perfect” for your environment
55
Web Configuration Probe Details
• If the values in the two profiles differ, the probe recommends
values to either correct the configuration or improve server
performance or security
 This probe reviews field values in the Server document, the
Internet Site document, or the Web Server Configurations
document
 Depending on which documents exist in your domain
56
Web Best Practices Probe
• The Web Best Practices probe monitors HTTP-specific fields in
domain compared to a recommended value
 The HTTP-specific fields are in:
 The Server document
 The Internet Site document (if one exists)
 The Web Server Configuration document (if one exists)
57
The Best Practice DDM Report
• A DDM document details the values that don’t match the Best
Practice values recommended by Lotus
 And provides information on how to resolve any issues
 Could be corrections to server functionality, server
performance, or server security
 This helps improve overall security and Web performance
58
Security Check Probes
• Security probes assess the overall security of servers and
databases in your domain
• There are five subtypes:
 Best Practices
 Configuration
 Database ACL
 Database Review
 Security Review
59
The Five Security Check Probes
• Best Practices
 Compares a set of baseline security configuration settings to
the same settings in a domain
 This probe is a “Best Practices” security audit of the domain
• Configuration
 Compares settings in a specific Server document to settings in
a specified “good” Server doc
 This doc can be real or built by you as an example
 Lotus recommends a daily schedule for these probes
60
The Five Security Check Probes (cont.)
• Database ACL
 Monitors the access control privileges that groups and
individuals have in specified databases
 You designate the acceptable access levels on the
Specifics tab
• Database Review
 Reviews the security
properties for a specified
database
 Generates a report on probe
findings
61
The Five Security Check Probes (cont.)
• Security Review
 Generates a report on the security settings specified in the
Specifics tab of the probe document
 You have the option of selecting the “Directory Profile Note”
and the “Security settings in the Server Configuration
Document”
 And a review of all security settings in a Server doc
 This can really help to tighten your domain’s security
62
Domino Domain Monitoring for Messaging
• DDM includes 10 probe types
for messaging
 Two types have subtypes
 That’s 12 probes in all
• These probes cause events to
be logged in the DDM database
 That means problem solving
can be tracked and
assigned to an
administration team
• First three probes shown here
deal with mail flow:
 Mail DSN
 Mail Flow Statistic Check
63
The Mail DSN Probe
• This uses the Delivery Status Notification (DSN) functionality to
test SMTP mail flow
 Good for determining if a site is reachable; however, domain
must support DSN (SMTP extensions)
 Domino supports these extensions, so if the target SMTP
domain is a Domino domain, you can easily use this probe
64
Preparation Is Required for a DSN Probe
• The probe’s configuration must include an actual email address to
probe and the message is delivered!
 Make sure you specify a test user
 Messages delivered by this probe to the mail recipient will
not be automatically deleted from the mail file
 And remember that Yahoo mail does not support
DSN extensions
65
Mail Flow Statistic Check Probe
• This monitors the quantity of mail
 Generates an event in the Domino Domain Monitor when
pending messages meet or exceed the configured values
• Slack percentage reports a large quantity of mail in the system or
when there is more mail in mail.box than the router can process
 Slack percentage = (Mail.TotalPending - Mail.Dead - Mail.Held Mail.Waiting) / Mail.Waiting
66
The Mail Reflector Probes
• Reflector tests mail routing to both Domino and
non-Domino systems
 You must specify a mail recipient
 The recipient must be configured so that messages
received from this probe are sent back to the originator
 The subject of the original message must be contained
in the subject of the returned message
 Messages delivered by this probe to the mail recipient will not
be automatically deleted from the mail file
 Be sure to use a test user account
67
Mail Reflector Target User Is Not Hard to Configure
• Configuration can be done easily in the Notes environment
 Create a rule to have the recipient autoforward the message to
the ISpy mail-in database on the server executing the probe
 The messages are not deleted automatically
68
Scheduled Messaging Probes
• Message retrieval process state
 Verifies health of IMAP and POP3 tasks on specified servers
• Message retrieval TCP Port Health
 Verifies IMAP and POP3 are processing protocol requests
• NRPC Routing Status
 Tests local NRPC calls to verify mail routing is working
69
More Scheduled Messaging Probes
• Router Process State
 Monitors status of
the router
• SMTP Process State
 Monitors status of SMTP
process
• SMTP TCP Port Health
 SMTP successfully
processes protocol
requests
• The default schedule is once
per minute for these
“Process State” and “Port
Health” probes
70
This Probe Monitors and Is Not Scheduled
• Transfer Queue Check
 Monitors mail flow to
individual destinations
 Checks all
destinations
 Or just specific
destinations that you
would like to
troubleshoot
71
More on Transfer Queue Checks
• Transfer queue checks generate an event based on a threshold
you set for message retry and message count
 “Message retry” is the number of attempts to send a message
 “Message count” is the quantity of mail pending delivery
• Transfer queue check is a real-time monitor – the schedule’s not
configurable
72
What We’ll Cover …
• Discovering the basics of Domino Domain Monitoring
• Understanding how DDM fits into your environment
• Crafting a perfect DDM data collection hierarchy
• Looking at simple but important DDM events
• Developing proactive probes
• Mastering enhanced events and corrective actions
• Working with the new Release 8 modular documents
• Testing and debugging Domain Monitoring
• Wrap-up
73
The DDM Database
• The Domino Domain Monitor database holds events reported by
the probes
 It also holds events reported by:
 Event generators
 Certain server tasks, like the router and replicator
• It’s all placed into the DDM database
 But that’s not all!
74
The DDM Database Has More Than What’s on the Surface
• The DDM DB lets you review probable causes and possible
solutions for reported DDM events
 And it lets you assign them to an administrator for resolution
 You can open a link to the appropriate database from which
you can resolve a reported event
75
Explaining the Issue
• In some cases, the event will give details on the problem and
provide a link to help you get to the place where the problem
occurred
 One document might represent several occurrences of the
same event
76
Resolving Problems
• In some cases a Choose Solution button is available
 Use it to help you automatically implement a particular solution
to resolve an event
77
Here Is a Security Case Result From a Best Practice Probe
• This security probe was
configured to check the ACLs
of two databases
 NAMES and EVENTS4
• On a specific server
 ADMIN/Servers/DomLab
• We wanted to make sure
certain problem individuals
did not have more privileges
than they were supposed to
78
More Granularity
• We wanted to make sure that
 Eric has no greater than Designer access
 Kyle has no greater than Editor access
 Kenny has no greater than Author access
• We want to run the security check every day at 18:00
79
DDM Detects a Violation
• The DDM document produced by the probe indicates when the
probe detected the violation
 The doclink takes me to the probe document
 The probable cause is given as
 The ACL was modified in violation of company security
guidelines
80
Get the Details
• Clicking on Details takes me to the explanation of the issue
 It is exactly what was asked for in the security probe document
81
Here’s a Nice Feature
• A possible solution is offered by DDM
 Modify the ACL
• The corrective action section presents a button
 Clicking on the button opens the ACL of the database

I can easily set Eric’s privileges back to the
corporate guideline
82
The Major Features Explored
• This demonstrated the power behind DDM
 We were effortlessly shown the issue based on the probe
we configured
 We were shown when it occurred, the probable cause, and
a possible solution
 And were given direct access to corrective action
83
Assigning DDM Messaging Events for Resolution
• Events reported back to the DDM application become part of the
workflow
 One of the best parts about DDM is that it lets you assign
someone to resolve the problem
84
Using the Assign Button
• You can assign the event to a team member and add comments
about the task using the “Assign” button
 Or you can simply assign the event to yourself
85
Changes Are Tracked
• All changes you make to the event are tracked in the Event
Change History for easy reference
 Finally there is an easy, built-in process for tracking problem
resolution in your environment
86
Closing the Event
• When the event has been resolved, or at least when you are sick of
looking at it, you can close the event
 Closing the event removes it from the Open Events View
87
ACLs on DDM
• Take care in setting the ACLs on the DDM DB
 Make sure the settings match the requirements of your team’s
workflow responsibilities
Action
Minimum access and required roles
Read event documents
Read access
Add comments to your own events
Author access
Add comments to another
administrator’s event
Editor access
Assign or reassign your own events
Author access and the [Assign Events] role
Assign or reassign events regardless Editor access and the [Assign Events] role
of owner
Change state of your own events
Author access and the [Change State] role
Change state of event regardless
of owner
Editor access and the [Change State] role
88
Demo
89
Common Actions Available in Release 8
• In R8, all Domino Domain Monitor (DDM) events now have a
Common Actions button
 Gives you access to the most commonly performed actions for
investigating events
 Plus it adds context-sensitive help when appropriate
90
Looking at a Server NOTES.INI
• The common action that lets you look at a server’s NOTES.INI is
very quick and valuable
91
Console Access on the Fly
• The remote console access gets you to the server console
 You can send commands or get a live console
92
What We’ll Cover …
• Discovering the basics of Domino Domain Monitoring
• Understanding how DDM fits into your environment
• Crafting a perfect DDM data collection hierarchy
• Looking at simple but important DDM events
• Developing proactive probes
• Mastering enhanced events and corrective actions
• Working with the new Release 8 modular documents
• Testing and debugging Domain Monitoring
• Wrap-up
93
Modular Documents Are New in Domino 8
• Modular documents are a welcome addition to DDM
 But to understand why, we must first go back in time to see how
similar functionality was accomplished in Release 7
• In R7 DDM, some events could have automated solutions
 These automated solutions were hard-coded into the Events
documents in the Monitoring Configuration application
EVENTS4.NSF
94
R8 DDM Lets You Re-Use Solutions
• Modular documents are new
reference documents for
Probable Cause, Possible
Solution, and Corrective
Actions
 Otherwise known as PC,
PS,
and CA
• When you create an Events
document, the PC, PS, and CA
statements that you choose to
include in the document are
referenced from modular
documents
95
Re-Use Modular Documents
• The big benefit is that you only need to define these statements
once, and can use them multiple times
 Domino 8 comes with over 1,000 modular documents
96
Add Your Own
• If you have a solution to some event specific to your
own environment, you can create a custom modular document
 You can include:
 Formula language code
 LotusScript code
 Or run an agent in a database
97
Actions Are Matched with Events
• Match up the modular document with the event in the Monitoring
Configuration application
98
Changes Might Take Time
• Events and modular documents are cached
 You might find that updates to events and modular documents
are not reflected in DDM.NSF right away
 If you’re impatient, restart the Event task to ensure
updates to Events and modular documents are reflected
in DDM.NSF immediately
99
Updated Events Documents
• You will notice that there is a new button in the upper right-hand
corner of the events documents
 It toggles from Display Stock Entries to Hide Stock Entries
 This represents a very important modification to the
process of applying action through DDM
100
Multiple Entry Types Mean Greater Flexibility
• Event documents have three categories of Probable
Cause/Possible Solution and Corrective Actions
• The first tab contains the Lotus Entries
 These are, of course, provided by Lotus
 Do not modify or delete these entries
 If you want to disassociate the entry with the event, simply
edit the document and uncheck the Enabled box
101
Your Custom Entries
• Add your own references to PC, PS, and CA on the Custom
Entries tab
 The interface looks similar to the Lotus Entries tab, but you can
only add up to two Probable Cause/Possible Solution actions
 And there is no Enabled setting as with the Lotus entries
• These settings will be retained as you move forward upgrading
from Domino 8.x
102
Stock Entries for Release 7
• Stock entries are pre-Domino 8 entries for PC, PS, and CA
 These are available for backward compatibility
Lotus entries have replaced the stock entries for Domino 8
and all subsequent releases
• If you are creating a custom PC, PS, and CA with the
Domino 8 Monitoring Configuration template (EVENTS4.NTF) on
Domino 7 servers, use the stock entries
 If customizing on Domino 8 servers, use custom entries

103
Merging Old into New
• If you have made modifications to Events documents on Domino
7, those modifications can be overwritten when you upgrade to
Domino 8
 Do you want to merge your old Domino 7 stock entries into
Domino 8 stock entries?
 Then follow the nine-step process detailed in the
Administrator Help application
 Do you want to merge your old Domino 7 stock entries into
Domino 8 modular documents?
 Then follow the nine-step process detailed in the
Administrator Help application
104
Where to Find the Help You Need
• You’ll find this information in the Monitoring section of
Administrator Help
 Domino Domain Monitoring subheading
 Event-related documents
105
New Role in DDM ACL
• Many events have corrective actions associated
with them
 Only users with the Execute CA role in the DDM ACL are able
to access the command actions and the corrective action
text and links
 This ensures that only qualified team members will be able
to make the changes
106
What We’ll Cover …
• Discovering the basics of Domino Domain Monitoring
• Understanding how DDM fits into your environment
• Crafting a perfect DDM data collection hierarchy
• Looking at simple but important DDM events
• Developing proactive probes
• Mastering enhanced events and corrective actions
• Working with the new Release 8 modular documents
• Testing and debugging Domain Monitoring
• Wrap-up
107
Why Debug DDM?
• You might want to test DDM to determine if you’ve got it set up
correctly
 But who wants to wait around for an error condition to
actually happen?
 In fact, no one wants to wait
 We’re engineers! We’re impatient!
• That’s why Lotus built testing and debugging into DDM
108
Get the Schedule for Your Probes
• Issue this command from the console to get a schedule for
your probes
 Show schedule –ddm
• You can shorten the command to this one
 Sh sch -ddm
109
Getting a Probe
• You can make the Event task tell you which probes have been
enabled using the following console command
 Tell Event dumpprobes
• The Event task will show which probes are enabled
 If a probe is the type that can be scheduled, it will also show
when the probe will run next
110
Running Probes On-Demand
• You can also run your probes whenever you’d like using
the following console command
 Tell Event Run Probe ####
Substitute your probe’s ID number for the #### above
 Use what was displayed with dumpprobes command

111
Getting the Probe IDs for Duplicate Probe Types
• If you have duplicate probe types, getting to the ID is a little
more work
• You must go to the probe document and look at the Note ID on the
document properties page
 Use a calculator to convert that hex number to a decimal
112
Getting a Probe Dump
• Want to know more about your events?
• Get the event task to just dump it all with this console command
 Tell event dump
113
A Helpful NOTES.INI Parameter
• Add the following parameter to the server’s NOTES.INI
 DEBUG_EVENT=1
• This debug variable can be used for all processes under the Event
task
 This includes probes, schedules, and other information that
you can use to test your DDM configuration
• This is a global debug variable for the Event task and includes
information about probes
114
You’ll See Results When You Restart the Server
• The parameter will cause the Event task to display what’s scheduled
right on the console without entering a console command
115
What We’ll Cover …
• Discovering the basics of Domino Domain Monitoring
• Understanding how DDM fits into your environment
• Crafting a perfect DDM data collection hierarchy
• Looking at simple but important DDM events
• Developing proactive probes
• Mastering enhanced events and corrective actions
• Working with the new Release 8 modular documents
• Testing and debugging Domain Monitoring
• Wrap-up
116
Resources
• Lotus Education on Demand: Domino Domain
Monitoring (DDM)
 www-1.ibm.com/support/docview.wss?uid=swg27007060
• Notes/Domino Best Practices: Domino Domain Monitoring (DDM)
 www-1.ibm.com/support/docview.wss?uid=swg27009312
• Redpaper: Lotus Domino Domain Monitoring
 www.redbooks.ibm.com/abstracts/REDP4089.html?Open
• What is Domino Domain Monitoring (DDM)?
 www-01.ibm.com/support/docview.wss?uid=swg21215726
117
7 Key Points to Take Home
• Make sure that EVENTS4.NSF and DDM.NSF each have their own
•
•
•
•
•
•
special replica IDs in your domain
Start with a flat data collection hierarchy and make it more
complex only if your requirements call for it
Addressing DDM documents with the highest severity level first
ensures that you’ll tackle the most critical issues daily
Adjust the DDM filter to keep the noise at a minimum
Use probes proactively to keep your domain running smoothly
Assigning issues to team members for follow up lets them grow
and lets you focus on the more difficult issues
Write your own corrective actions for issues that are unique to
your enterprise
118
Your Turn!
How to contact me:
Andy Pedisich
[email protected]
http://www.technotics.com
http://www.andypedisich.com
119