What is ITCAM for Response Time

ITCAM for Web Resources (WR)
ITCAM for Response Time (RT)
Brent Dorenkamp
IT Specialist
[email protected]
TIVOLI BOARD OF ADVISORS
© 2007 IBM Corporation
Tivoli Software
ITCAM – Comprehensive Application Management Solution
Monitor
End User Response
Time Monitoring
Monitor
Transactions
Root-Cause Problem
Analysis & Resolution
Application Tracking
&
Topology Analysis
L3 SME and
Diagnostics
Tivoli
2
Application Resource
Monitoring
LOB / Operations Buyer
Problem Identification
Health Monitoring
Basic SLA reporting
Application Owner / Operations Buyer
Problem Isolation
Response time monitoring across
multiple components
SME Buyer
Root Problem Determination
Response time within application server
Take action to resolve problem
Composite Application Manager
© 2007 IBM Corporation
Tivoli Software
ITCAM for Web Resources Introduction
IT Operations via TEP
Monitor OS
Monitor
ITM
Monitor App Resources & J2EE
Applications
ITCAM for Web Resources
(J2EE App & Web Servers)
SME via Web UI
Deep Dive & Fix
ITCAM for Websphere/J2EE
 ITCAM for WR provides a more affordable and less complex J2EE application
monitoring solution for IT Operations who want:
 To quickly identify, isolate problems and route to the appropriate SME.
 Resource monitoring to be proactive in problem identification and eliminate the need
for cross-SME teams to resolve issues.
 To centralize monitoring and reduce reliance on SMEs.
 ITCAM for WR provides resource and application monitoring with the Tivoli
Enterprise Portal (TEP) user interface.
 ITCAM for WR has easy out of the box installation - installs and deploys in a few
hours!
 ITCAM for WR has improved operations dashboards with needed data to quickly
identify the source of the problem.
3
© 2007 IBM Corporation
Tivoli Software
ITCAM for Web Resources – New Feature Overview
 Simplified Installation
–Installation launch pad for all supported resources and silent install of infrastructure (TEP/DB2)
 Auto Learning of Thresholds
–Used to benchmark performance for a new application (or a changed environment) to understand where
thresholds should be established
 Application dashboard
–Summary view of applications to give operators and administrators, at a glance, the ability to see where the
problem is located (client tier, application tier or backend tier). These out of the box views are per
application server; however, logical views can be customized to include two or more application servers, and
can be customized to include a view of the entire application stack, to include databases, web servers and
OS resources.
 Java Standard Edition (J2SE) Workspaces
–monitors stand alone java applications
 Best practices documentation for logical views (Delivered via OPAL)
–Logical views allow disparate resources to be grouped together
–Resources can be viewed by geographic, functional, or relationship-based groups.
–Scalable background images/bitmaps
 Situation Scripting:
– Predefined situations that include preset thresholds, sampling intervals, boolean logic, and expert advice
(Delivered via OPAL)
–Creation of new situations available for customers to correlate any related data that will alert operations to
a potential problem. For example:
Correlate average response time alerts with an increase in CPU usage, to alert operations to a
possible memory leak.
Correlate growth in HTTP sessions with an increase in average response time to an increase in JVM
memory usage. Alert operations to check timeout sessions for HTTP sessions
4
© 2007 IBM Corporation
Tivoli Software
Supported Platforms & Workspaces
Application Servers
– WebSphere
– Tomcat
– JBOSS
– Weblogic
– Oracle
– SAP Netweaver
– Websphere ESB
– WebSphere Portal Server
– WebSphere Process Server
– Lotus Workplace Server
Web Servers
– Apache
– Internet Information Services (IIS)
– iPlanet
Application Server TEP Workspaces
–
–
–
–
–
–
–
–
Application Health Summary
–Cache Analysis
Client Tier Analysis
–Workload
Management
Application Tier Analysis
–Web Services
Backend Tier Analysis
(WebSphere only)
Application Health History
–J2SE
Application Configuration
Server Health Summary
Request Analysis
 Datasources
 JMS Summary
 Web Applications
 EJB Containers
– Pool Analysis
 DB2 Connection Pools
 J2C Connection Pools
 Thread Pools
– Garbage Collection and Allocation Failure Analysis
Web Server TEP Workspaces
– Web Server Summary
– Active Server Pages (ASP)
– Web Sites
5
© 2007 IBM Corporation
Tivoli Software
ITCAM for Web Resources Workspaces
6
© 2007 IBM Corporation
Tivoli Software
Server Health Summary
Enhanced workspace to
include paging rate, GC
rate, pool size, thread pool
usage
Metric summary per
application server. Create
logical views to
compare/contrast clusters.
Summary statistics
indicate overall health
of the application
server.
7
© 2007 IBM Corporation
Tivoli Software
Application Health Summary Dashboard
NEW workspace for views
by application
Mouse
over to
view
events
At a glance, operators can see what is going
wrong with each application. Quick indication if
the problem is in the application tier
(EJB/JMS/ORB), client tier (servlet) or backend
tier (JBDC).
8
Thresholds fire situations to
indicated good, fair, poor status.
Sampling rates can be
configured per application.
© 2007 IBM Corporation
Tivoli Software
Drill Down to the Client Tier (Servlet)
NEW workspace for views
by client tier per
application
Top 5 delays
and
completion
rates per
application
JMS
summary
by app.
server
HTTP
Session and
Web
Container per
app. server
9
© 2007 IBM Corporation
Tivoli Software
Drill Down to the Application Tier (EJB, ORB, JMS)
NEW
workspace for
views by
application tier
per application
Top 5 delays
and
completion
rates per
application
JMS
summary
by app.
server
ORB
Container
and
Transactions
by app. sever
10
© 2007 IBM Corporation
Tivoli Software
Drill Down to the Backend Tier (JDBC)
NEW workspace for views
by backend tier per
application
Worst delays
and most
used JDBC
and JMS
Resources by
Application
JMS
summary
by app.
server
JDBC and
JCA Pool
Usage by
Application
Server
11
© 2007 IBM Corporation
Tivoli Software
Baseline Threshholds
NEW workspace for setting
thresholds based on
historical analysis
Use out of the
box thresholds
for good, fair and
poor response
times or set
thresholds based
on the
application
baseline, i.e.,
what is normal
for that
application.
12
© 2007 IBM Corporation
Tivoli Software
Request Analysis
 Operations teams track the average response time for requests processed on the app
server, and can quickly detect issues when delays increase over time or spike. This
workspace shows the worst average response times broken down by Java Component
Response Times: Application,
JCA, JMS, JNDI, SQL
connection/query/update
Tabular data set with drill-down
response time values for JCA,
JMS, JDBC
13
© 2007 IBM Corporation
Tivoli Software
Pool Analysis
 J2EE resource pools are critical in terms of providing availability to commonly accessed
services such as database access and other container pool types. This workspace
enhances PMI data with configuration data to provide a comprehensive overview of
requests flowing through WebSphere “funnel”.
Comparison of
recent active
threads in ORB pool
Visual correlation of
CPU utilization vs.
pool consumption
Web container pool
statistics showing # times
at maximum capacity
DB2 and J2C
connection pools at
full saturation
14
© 2007 IBM Corporation
Tivoli Software
Garbage Collection Analysis
 Garbage Collection (GC) metrics such as frequency and time to complete can have a large
effect on application server performance (during this time no other application processing
can take place). This workspace shows a detailed breakdown of GC behavior and provides
an complete analysis of GC performance metrics.
`
Collection Rate # GC’s per Minute
15
Recent JVM Heap
Usage Trend
Detailed Analysis
of Recent GC
Performance
% Time Spent in
GC Cycle
© 2007 IBM Corporation
Tivoli Software
Cache Analysis
 Highlights in-memory cache sizes, a shows cache templates with highest miss
rates
Detailed tabular views
of cache metrics for
analysis and tuning
Miss rates correlated
with recent cache size
trends
16
IBM Confidential
© 2007 IBM Corporation
Tivoli Software
Single Console Allows Faster Incident Management
DB2 Activity and
System Memory
charts
WAS Response
Time, Throughput,
and JVM and
System CPU
Usage charts
WAS Request
Breakdown,
Database Wait and
Processing Times,
and Heap Usage
charts
–In the TEP you can customize workspaces to view OS, Database, Messaging, and
Application server resources to quickly pinpoint common resource problems.
17
© 2007 IBM Corporation
Tivoli Software
Correlation of events/automate take action using scripts
Correlate
situations, and
assign expert
advice and
actions on a
situation.
18
 Correlation of situations – how
to & example situations – will
be available on OPAL in May.
 Examples of situation
correlation include:
–
Correlate average response time
alerts with and increase in CPU
usage, to alert operators to
investigate a a memory leak.
–
Correlate alerts for increased
activity on the server to ensure
JVM memory usage is not
impacted. Correlate growth in
HTTP sessions with an increase
in average response time to an
increase in JVM memory usage.
Alert to check timeout sessions
for HTTP sessions set to
appropriate level to help reduce
impact on memory.
© 2007 IBM Corporation
ITCAM for Response Time v6.2
TIVOLI BOARD OF ADVISORS
© 2007 IBM Corporation
Tivoli Software
Current State Of IT Application Management
IT organizations are under tremendous pressure to deliver results.
IT Challenge
Business Impact
Majority of IT problems are still being
identified by end user complaints
Lost business when problems go
undetected; impacts customer
satisfaction
Composite application performance
problems can take many hours to isolate
Support budget spent isolating problems
instead of fixing them
Once problem is identified a large team of
specialists is required to resolve the
problem
Large amount of money from both
development and support spent solving
problems
Most application problems go back to
development group, which cannot recreate
the problem
Development budget spent fixing problems
instead of building new applications
… What we need is comprehensive application management
20
© 2007 IBM Corporation
Tivoli Software
End User Monitoring is the Best Way to Determine what IT
Problems need Attention
What is the Customer
Experience?...
Web server thread pools
are full - is it impacting
the customer response
time?
… Do you know what your customers are experiencing? Or are they calling your help
desk to tell you that you have an availability or response time problem. Effective
application management requires monitoring both application resources and response
times to find the real problems that are impacting your business
21
© 2007 IBM Corporation
Tivoli Software
Workflow for Managing Composite Applications Problems
Sense
Detect that a
threshold has been
breached and that a
problem occurred, or
is about to happen
Isolate
Diagnose
Repair
Pinpoint the problem
to a specific part of
the environment and
hand-off to the
appropriate specialist
Drill down into
the details and
get to the root
cause of the
problem
Fix the faulty
component,
validate the fix
and roll back into
production
ITCAM for RT: End-User Response Monitoring
22
© 2007 IBM Corporation
Tivoli Software
ITCAM for Response Time
What is it? and Why I Need it!

What is ITCAM for Response Time (RT)
–

23
Tivoli Enterprise Portal (TEP) based solution that provides IT Operations with both real time and robotic
monitoring of the end user response time experience to help quickly identify SLA breaches and to help
proactively prevent future violations.
Highlights
–
Provides comprehensive response time coverage for both Web and Windows applications using a variety
of robotic and real-time analysis
–
Integrates with the Tivoli Enterprise Portal (TEP), a portal-based customizable user interface that can bring
together response time data from ITCAM for RT and IBM Tivoli Monitor (ITM) resource data in an easy-touse interface to quickly identify what resource bottlenecks are impacting the end user experience
–
Integrates with other IBM Service Management (ISM) and Tivoli products to provide complete automated
end-to-end management of your applications
© 2007 IBM Corporation
Tivoli Software
ITCAM for Response Time
The Best of Both Types of Response Time Monitoring in One Integrated UI
Synthetic
Transactions
–
 Synthetic playback of all robotic scripts
 Via Rational Robot, RPT, Mercury LoadRunner, Command Line
Interface
Real End User
Transactions
–
24
Robotic Response Time Monitoring
Web Response Time Monitoring
 Monitors real end user web transactions (HTTP/S)
–
Client Response Time Monitoring
 Monitor real end user client Windows application transactions
 i.e. Lotus Notes, Microsoft Outlook, SAP, 3270, etc
© 2007 IBM Corporation
Tivoli Software
Availability Dashboard
Shows overall
enterprise status
for all Clients and
Applications
Quickly identify the Top 5
worst performing Clients and
Applications, and drill down to
identify the problem…
25
© 2007 IBM Corporation
Tivoli Software
Application Service Level Metrics
View availability over
time to quickly see when
a problem started
Compare with
application load to see if
a spike in volume could
have contributed to the
problem
Is your Application’s
Uptime and Downtime
improving or getting
worse?
26
© 2007 IBM Corporation
Tivoli Software
Applications
Quickly identify the Top 5
worst performing
applications
Drill into a specific
Application for more
details…
27
© 2007 IBM Corporation
Tivoli Software
Application Summary
How has the application
been performing over
time?
28
© 2007 IBM Corporation
Tivoli Software
Transaction Summary
View the application’s
business transactions
and how they have been
performing over time
Drill into the worst
performing transaction
for more details…
29
© 2007 IBM Corporation
Tivoli Software
ITCAM for Response Time v6.2 Release Highlights
Unified Infrastructure and User Interface
• Single infrastructure built on ITM
• Single, consolidated user interface built on Tivoli Enterprise Portal (TEP)
Improved Consumability to Enhance Ease of Use and Time to Value
• Fully customizable dashboard, reports and workspaces
• Simplified configuration, including default Situations
• Simplified installation
• Intelligent alerting based on ITM powerful situations editor
• Configurable data aggregation as low as every 5 minutes
Enhanced Response Time Monitoring
• Report & alert on any real time or historical response time metric
• Identify response time bottlenecks by Client, Network or Server times
• Identify, report & alert on individual clients or locations
• Discover, report & alert backend server resources
• Improved robotic monitoring w/ Rational Performance Tester (RPT)
• Immediate playback of robotic scripts
• Custom ARM application response time monitoring
• Improved CLI functions to edit configuration
Deliver IBM Service Management Foundation Elements
• CCMDB discovery & real time status of Business Processes & Business
Activities
30
You can install ITCAM
for RT and have it
show real web
response time data
within minutes!
(Really!)
ITCAM for RT is so
easy to run on top of
ITM that even a
(you guessed it)
A Neanderthal could
do it
ITCAM for
Response Time
© 2007 IBM Corporation
Tivoli Software
Questions?
31
© 2007 IBM Corporation