SlideShare ist ein Scribd-Unternehmen logo
1 von 5
Exadata Cell metrics
Exadata CELLSRV periodically records important runtime properties,
called metrics, for cell components such as CPUs, cell disks, grid
disks, flash cache, and IORM statistics. These metrics are recorded
in memory. Based on its own metric collection schedule, the
Management Server (MS) gets the set of metric data accumulated by
CELLSRV.
Management Server (MS) provides Exadata cell management and
configuration functions. MS is responsible for sending alerts and
collects some statistics in addition to those collected by CELLSRV.
Each cell is individually managed with Exadata cell command-line
interface (CellCLI).
Locate the MS process
-------------------------------$ ps -ef | grep ms.err
1000
3940 3723 0 01:42 pts/0
00:00:00 grep ms.err
root
24541 24540
0 Sep28 ?
00:01:32
/usr/java/jdk1.5.0_15/bin/java
-Xms256m
-Xmx512m
Djava.library.path=/opt/oracle/
Check the Alert History
-----------------------MS triggers an alert when it discovers a:
Cell hardware issue
Cell software or configuration issue
CELLSRV internal error
Metric that has exceeded a threshold defined in the cell
CellCLI> list alerthistory
1
2013-09-26T22:51:15-04:00
critical
"ORA-00700: soft internal error, arguments: [main_6a], [3], [Invalid
IP addresses in cellinit.ora file], [], [], [], [], [], [], [], [],
[]"
2_1
2013-09-26T22:52:07-04:00
warning
"Hugepage allocation failure in service cellsrv. Number of
Hugepages allocated is 0, failed to allocate 110"
3
2013-09-26T22:54:08-04:00
critical
"ORA-00700: soft internal error, arguments: [main_6a], [3], [Invalid
IP addresses in cellinit.ora file], [], [], [], [], [], [], [], [],
[]"
4
2013-09-28T13:05:21-04:00
critical
"RS-7445 [Serv RS_BACKUP is absent] [It will be restarted] [] [] []
[] [] [] [] [] [] []"
5
2013-09-28T22:05:38-04:00
critical
"RS-7445 [Serv CELLSRV is absent] [It will be restarted] [] [] [] []
[] [] [] [] [] []"
Create and check for disk I/O errors
---------------------------ellCLI> create threshold CD_IO_ERRS_MIN comparison='>', warning=0, > occurrences=1, observation=1
Threshold CD_IO_ERRS_MIN successfully created
CellCLI> list threshold CD_IO_ERRS_MIN detail
name:
CD_IO_ERRS_MIN
comparison:
>
observation:
1
occurrences:
1
warning:
0.0
ellCLI> list alerthistory where severity='warning';
2_1
2013-09-26T23:02:12-04:00
warning
"Hugepage allocation failure in service cellsrv. Number of
Hugepages allocated is 0, failed to allocate 110"
CellCLI> list alerthistory where severity='critical';
1
2013-09-26T23:01:18-04:00
critical
"ORA-00700: soft internal error, arguments: [main_6a], [3],
IP addresses in cellinit.ora file], [], [], [], [], [], [],
[]"
3
2013-09-26T23:04:11-04:00
critical
"ORA-00700: soft internal error, arguments: [main_6a], [3],
IP addresses in cellinit.ora file], [], [], [], [], [], [],
[]"
4
2013-10-01T06:42:39-04:00
critical
"RS-7445 [Serv CELLSRV is absent] [It will be restarted] []
[] [] [] [] [] []"

[Invalid
[], [],
[Invalid
[], [],
[] [] []

CellCLI> list alerthistory where severity='clear';
CellCLI> list alerthistory where severity='info';
MetricType:
- cumulative: Cumulative statistics since the metric was created
- instantaneous: Value at the time that the metric is collected
- rate: Rates computed by averaging statistics over observation
periods
- transition: Collected at the time when the value of the metrics
has changed, and typically captures important transitions in
hardware status
CellCLI> list metriccurrent attributes
name,metrictype,metricobjectname,metricvalue,collectionTime where
metrictype='Rate'

Monitoring Exadata with Active Requests
---------------------------------------CellCLI> LIST ACTIVEREQUEST WHERE IoType = 'predicate pushing'
DETAIL
ioType identifies the type of active request file initialization
Possible values are read, write, predicate pushing, filtered backup
read, predicate push read
Check retention period for metric and alert history
------------------------------------------------------CellCLI> list cell attributes metricHistoryDays
7
CellCLI> alter cell metrichistorydays=5
Cell qr03cel02 successfully altered
CellCLI> list cell attributes metrichistorydays
5
CellCLI> list cell attributes name,interconnectCount
qr03cel02
2
configure the cell to automatically send an email and/or SNMP
message to a designated set of Exadata administrator.
-----------------------------------------------------------------------------------------------------------------alter cell smtpServer='my_mail.example.com', smtpFromAddr='monowar.mukul@example.com', smtpFrom='monowar mukul', smtpToAddr='jane.smith@example.com', notificationPolicy='critical,warning,clear', notificationMethod='mail'
Watching for Undelivered Alerts
--------------------------------It is important to periodically check the storage servers just to
make sure that raised alerts have actually been delivered (via email
and/or to Grid or Cloud Control).
CellCLI>LIST
examinedBy=''

ALERTHISTORY

where

notificationState

dcli
-g
cell_group
cellcli
-e
"LIST
notificationState != 1 and examinedBy='' "

!=

ALERTHISTORY

1
2013-09-26T23:01:18-04:00
critical
"ORA-00700: soft internal error, arguments: [main_6a], [3],
IP addresses in cellinit.ora file], [], [], [], [], [], [],
[]"
2_1
2013-09-26T23:02:12-04:00
warning
"Hugepage allocation failure in service cellsrv. Number of
Hugepages allocated is 0, failed to allocate 110"
3
2013-09-26T23:04:11-04:00
critical
"ORA-00700: soft internal error, arguments: [main_6a], [3],
IP addresses in cellinit.ora file], [], [], [], [], [], [],
[]"
4
2013-10-01T06:42:39-04:00
critical
"RS-7445 [Serv CELLSRV is absent] [It will be restarted] []
[] [] [] [] [] []"
Drop Alert History
--------------------CellCLI> drop alerthistory all
Alert 1 successfully dropped
Alert 2_1 successfully dropped
Alert 3 successfully dropped
Checking Threshold
------------------CellCLI> list threshold
cl_fsut./

1

and
where

[Invalid
[], [],

[Invalid
[], [],
[] [] []
cl_fsut./u01
CellCLI> create threshold cl_tst."/u01" comparison='>', warning=80
Threshold cl_fsut."/u01" successfully created
CellCLI> list threshold detail
name:
comparison:
warning:
name:
comparison:
warning:

cl_fsut./
>
70.0
cl_fsut./u01
>
80.0

CellCLI> alter threshold cl_fsut."/" comparison='>', warning=50
Threshold cl_fsut."/" successfully altered
CellCLI> list threshold detail
name:
comparison:
warning:
name:
comparison:
warning:

cl_fsut./
>
50.0
cl_fsut./u01
>
80.0

Execute the following command inside the cell operating system. It
creates a 512-MB file on the root file system which will increase
the utilization metric. After the metric crosses the threshold , an
alert will be generated.
$ dd if=/dev/zero of=/tmp/file.out bs=1024 count=500000
[celladmin@qr03cel02 ~]$ dd if=/dev/zero of=/tmp/file.out bs=1024
count=500000
500000+0 records in
500000+0 records out
512000000 bytes (512 MB) copied, 4.25551 seconds, 120 MB/s
[celladmin@qr03cel02 ~]$ cellcli
CellCLI: Release 11.2.3.1.0 - Production on Mon Sep 30 01:36:45 EDT
2013
Copyright (c) 2007, 2011, Oracle.
Cell Efficiency Ratio: 26M

All rights reserved.

CellCLI> list alerthistory
1_1
2013-09-30T01:32:46-04:00
warning
"The warning threshold for the following metric has been crossed.
Metric Name
: CL_FSUT
Metric Description : Percentage of
total space on this file system that is currently used Object Name
: / Current Value
: 56.0 % Threshold Value
: 50.0 % "
CellCLI> alter alerthistory 1_1 examinedby='investigator'
Alert 1_1 successfully altered
CellCLI> list alerthistory detail
name:
1_1
alertMessage:
"The warning threshold for the
following metric has been crossed. Metric Name
: CL_FSUT
Metric Description : Percentage of total space on this file system
that is currently used Object Name
: / Current Value
:
56.0 % Threshold Value
: 50.0 % "
alertSequenceID:
1
alertShortName:
CL_FSUT
alertType:
Stateful
beginTime:
2013-09-30T01:32:46-04:00
endTime:
examinedBy:
investigator
metricObjectName:
"/"
metricValue:
56.0
notificationState:
0
sequenceBeginTime:
2013-09-30T01:32:46-04:00
severity:
warning
alertAction:
"Examine the metric value that is
violating the specified threshold, and take appropriate actions if
needed."
The value of the name attribute is a composite of abbreviations.
• CL_ (cell)
• CD_ (cell disk)
• GD_ (grid disk)
• FC_ (flash cache)
• DB_ (database)
• CG_ (consumer group)
• CT_ (category)
• N_ (interconnect network)
-- Monitoring IORM with cellcli command.
I/O-related metric:
• IO_RQ (number of requests)
• IO_BY (number of MB)
• IO_TM (I/O latency)
• IO_WT (I/O wait time)
_R for read
_W for write.
_SM small I/O
_LG large I/O
_SEC signify per second
_RQ to signify per request
• CD_IO_WT_R_SM is the cell disk (CD_) I/O wait time (IO_WT) to read
(_R) small blocks (_SM).
• GD_IO_RQ_W_LG_SEC is the grid disk (GD_) number of requests
(IO_RQ) to write (_W) of large block (_LG) I/O per second (_SEC) on
a grid disk.

Weitere ähnliche Inhalte

Was ist angesagt?

Trouble shooting apachecloudstack
Trouble shooting apachecloudstackTrouble shooting apachecloudstack
Trouble shooting apachecloudstack
Sailaja Sunil
 
Group3 sap nw3 7.0 install
Group3  sap nw3 7.0 installGroup3  sap nw3 7.0 install
Group3 sap nw3 7.0 install
dkeerthan
 
Upgrade 10204-to-10205 on-2-node_rac_linux_x86_64_detail-steps_v0.1
Upgrade 10204-to-10205 on-2-node_rac_linux_x86_64_detail-steps_v0.1Upgrade 10204-to-10205 on-2-node_rac_linux_x86_64_detail-steps_v0.1
Upgrade 10204-to-10205 on-2-node_rac_linux_x86_64_detail-steps_v0.1
Raheel Syed
 
Kernel在多核机器上的负载均衡机制
Kernel在多核机器上的负载均衡机制Kernel在多核机器上的负载均衡机制
Kernel在多核机器上的负载均衡机制
Hao(Robin) Dong
 
Understanding Query Optimization with ‘regular’ and ‘Exadata’ Oracle
Understanding Query Optimization with ‘regular’ and ‘Exadata’ OracleUnderstanding Query Optimization with ‘regular’ and ‘Exadata’ Oracle
Understanding Query Optimization with ‘regular’ and ‘Exadata’ Oracle
Guatemala User Group
 

Was ist angesagt? (17)

Wp intelli cache_reduction_iops_xd5.6_fp1_xs6.1
Wp intelli cache_reduction_iops_xd5.6_fp1_xs6.1Wp intelli cache_reduction_iops_xd5.6_fp1_xs6.1
Wp intelli cache_reduction_iops_xd5.6_fp1_xs6.1
 
Basic MySQL Troubleshooting for Oracle DBAs
Basic MySQL Troubleshooting for Oracle DBAsBasic MySQL Troubleshooting for Oracle DBAs
Basic MySQL Troubleshooting for Oracle DBAs
 
Demystifying cost based optimization
Demystifying cost based optimizationDemystifying cost based optimization
Demystifying cost based optimization
 
pstack, truss etc to understand deeper issues in Oracle database
pstack, truss etc to understand deeper issues in Oracle databasepstack, truss etc to understand deeper issues in Oracle database
pstack, truss etc to understand deeper issues in Oracle database
 
Riyaj: why optimizer_hates_my_sql_2010
Riyaj: why optimizer_hates_my_sql_2010Riyaj: why optimizer_hates_my_sql_2010
Riyaj: why optimizer_hates_my_sql_2010
 
Dataguard fsfo-implementation
Dataguard fsfo-implementationDataguard fsfo-implementation
Dataguard fsfo-implementation
 
8i standby
8i standby8i standby
8i standby
 
Trouble shooting apachecloudstack
Trouble shooting apachecloudstackTrouble shooting apachecloudstack
Trouble shooting apachecloudstack
 
Platform administration guide-nos_v3_5
Platform administration guide-nos_v3_5Platform administration guide-nos_v3_5
Platform administration guide-nos_v3_5
 
Command reference nos-v3_5
Command reference nos-v3_5Command reference nos-v3_5
Command reference nos-v3_5
 
Group3 sap nw3 7.0 install
Group3  sap nw3 7.0 installGroup3  sap nw3 7.0 install
Group3 sap nw3 7.0 install
 
Upgrade 10204-to-10205 on-2-node_rac_linux_x86_64_detail-steps_v0.1
Upgrade 10204-to-10205 on-2-node_rac_linux_x86_64_detail-steps_v0.1Upgrade 10204-to-10205 on-2-node_rac_linux_x86_64_detail-steps_v0.1
Upgrade 10204-to-10205 on-2-node_rac_linux_x86_64_detail-steps_v0.1
 
Kernel在多核机器上的负载均衡机制
Kernel在多核机器上的负载均衡机制Kernel在多核机器上的负载均衡机制
Kernel在多核机器上的负载均衡机制
 
Installation Errors PSCAD v4.2 PSCAD X4 (v4.3, v4.4, v4.5, v4.6)
Installation Errors PSCAD v4.2 PSCAD X4 (v4.3, v4.4, v4.5, v4.6)Installation Errors PSCAD v4.2 PSCAD X4 (v4.3, v4.4, v4.5, v4.6)
Installation Errors PSCAD v4.2 PSCAD X4 (v4.3, v4.4, v4.5, v4.6)
 
A MySQL Odyssey - A Blackhole Crossover
A MySQL Odyssey - A Blackhole CrossoverA MySQL Odyssey - A Blackhole Crossover
A MySQL Odyssey - A Blackhole Crossover
 
MySQL Replication: Demo Réplica en Español
MySQL Replication: Demo Réplica en EspañolMySQL Replication: Demo Réplica en Español
MySQL Replication: Demo Réplica en Español
 
Understanding Query Optimization with ‘regular’ and ‘Exadata’ Oracle
Understanding Query Optimization with ‘regular’ and ‘Exadata’ OracleUnderstanding Query Optimization with ‘regular’ and ‘Exadata’ Oracle
Understanding Query Optimization with ‘regular’ and ‘Exadata’ Oracle
 

Andere mochten auch

12c: Testing audit features for Data Pump (Export & Import) and RMAN jobs
12c: Testing audit features for Data Pump (Export & Import) and RMAN jobs12c: Testing audit features for Data Pump (Export & Import) and RMAN jobs
12c: Testing audit features for Data Pump (Export & Import) and RMAN jobs
Monowar Mukul
 
Oracle 12c far sync standby instance
Oracle 12c far sync standby instanceOracle 12c far sync standby instance
Oracle 12c far sync standby instance
Monowar Mukul
 
Oracle 12c RAC (Advanced installation - Flex ASM)
Oracle 12c RAC (Advanced installation - Flex ASM)Oracle 12c RAC (Advanced installation - Flex ASM)
Oracle 12c RAC (Advanced installation - Flex ASM)
Monowar Mukul
 
Advanced installation 12c rac
Advanced installation 12c racAdvanced installation 12c rac
Advanced installation 12c rac
Monowar Mukul
 
Edtc6341 63 esther_sauceda_practice_test5
Edtc6341 63 esther_sauceda_practice_test5Edtc6341 63 esther_sauceda_practice_test5
Edtc6341 63 esther_sauceda_practice_test5
esauceda12
 
Copyright Crash Course 1st revised ppt 6340.64 Sonia Aldape
Copyright Crash Course 1st revised ppt 6340.64 Sonia AldapeCopyright Crash Course 1st revised ppt 6340.64 Sonia Aldape
Copyright Crash Course 1st revised ppt 6340.64 Sonia Aldape
soniaaldape
 

Andere mochten auch (20)

Exadata I/O Resource Manager (Exadata IORM)
Exadata I/O Resource Manager (Exadata IORM)Exadata I/O Resource Manager (Exadata IORM)
Exadata I/O Resource Manager (Exadata IORM)
 
1247g3hg1238 2011
1247g3hg1238 20111247g3hg1238 2011
1247g3hg1238 2011
 
Exadata - BULK DATA LOAD Testing on Database Machine
Exadata - BULK DATA LOAD Testing on Database Machine Exadata - BULK DATA LOAD Testing on Database Machine
Exadata - BULK DATA LOAD Testing on Database Machine
 
12c: Testing audit features for Data Pump (Export & Import) and RMAN jobs
12c: Testing audit features for Data Pump (Export & Import) and RMAN jobs12c: Testing audit features for Data Pump (Export & Import) and RMAN jobs
12c: Testing audit features for Data Pump (Export & Import) and RMAN jobs
 
12c database migration from ASM storage to NON-ASM storage
12c database migration from ASM storage to NON-ASM storage12c database migration from ASM storage to NON-ASM storage
12c database migration from ASM storage to NON-ASM storage
 
SOA Fusion Middleware installation
SOA Fusion Middleware installationSOA Fusion Middleware installation
SOA Fusion Middleware installation
 
Oracle 12c far sync standby instance
Oracle 12c far sync standby instanceOracle 12c far sync standby instance
Oracle 12c far sync standby instance
 
Oracle 12c RAC (Advanced installation - Flex ASM)
Oracle 12c RAC (Advanced installation - Flex ASM)Oracle 12c RAC (Advanced installation - Flex ASM)
Oracle 12c RAC (Advanced installation - Flex ASM)
 
Moving 12c database from NON-ASM to ASM
Moving 12c database from NON-ASM to ASMMoving 12c database from NON-ASM to ASM
Moving 12c database from NON-ASM to ASM
 
Advanced installation 12c rac
Advanced installation 12c racAdvanced installation 12c rac
Advanced installation 12c rac
 
Edtc6341 63 esther_sauceda_practice_test5
Edtc6341 63 esther_sauceda_practice_test5Edtc6341 63 esther_sauceda_practice_test5
Edtc6341 63 esther_sauceda_practice_test5
 
Upgrade database using cloud_control Provisioning
Upgrade database using cloud_control Provisioning Upgrade database using cloud_control Provisioning
Upgrade database using cloud_control Provisioning
 
Copyright Crash Course 1st revised ppt 6340.64 Sonia Aldape
Copyright Crash Course 1st revised ppt 6340.64 Sonia AldapeCopyright Crash Course 1st revised ppt 6340.64 Sonia Aldape
Copyright Crash Course 1st revised ppt 6340.64 Sonia Aldape
 
TimesTen in memory database Creation
TimesTen in memory database Creation TimesTen in memory database Creation
TimesTen in memory database Creation
 
12c Flex ASM: Moving to Flex ASM
12c Flex ASM: Moving to Flex ASM12c Flex ASM: Moving to Flex ASM
12c Flex ASM: Moving to Flex ASM
 
Migration Database from SQL SERVER 2012 to Oracle12c with Pluggable Database
Migration Database from SQL SERVER 2012 to Oracle12c with Pluggable Database  Migration Database from SQL SERVER 2012 to Oracle12c with Pluggable Database
Migration Database from SQL SERVER 2012 to Oracle12c with Pluggable Database
 
4. java intro class
4. java intro class4. java intro class
4. java intro class
 
Oracle AWR baseline - Basic
Oracle AWR baseline - BasicOracle AWR baseline - Basic
Oracle AWR baseline - Basic
 
GMO OILBOOSTER Proposal (Eng)
GMO OILBOOSTER Proposal (Eng)GMO OILBOOSTER Proposal (Eng)
GMO OILBOOSTER Proposal (Eng)
 
SMS notification setup using EM12c
SMS notification setup using EM12cSMS notification setup using EM12c
SMS notification setup using EM12c
 

Ähnlich wie Exadata Cell metrics

Thomas+Niewel+ +Oracletuning
Thomas+Niewel+ +OracletuningThomas+Niewel+ +Oracletuning
Thomas+Niewel+ +Oracletuning
afa reg
 
你所不知道的Oracle后台进程Smon功能
你所不知道的Oracle后台进程Smon功能你所不知道的Oracle后台进程Smon功能
你所不知道的Oracle后台进程Smon功能
maclean liu
 
Oracle Open World Thursday 230 ashmasters
Oracle Open World Thursday 230 ashmastersOracle Open World Thursday 230 ashmasters
Oracle Open World Thursday 230 ashmasters
Kyle Hailey
 
BRKRST-3066 - Troubleshooting Nexus 7000 (2013 Melbourne) - 2 Hours.pdf
BRKRST-3066 - Troubleshooting Nexus 7000 (2013 Melbourne) - 2 Hours.pdfBRKRST-3066 - Troubleshooting Nexus 7000 (2013 Melbourne) - 2 Hours.pdf
BRKRST-3066 - Troubleshooting Nexus 7000 (2013 Melbourne) - 2 Hours.pdf
aaajjj4
 

Ähnlich wie Exadata Cell metrics (20)

Troubleshooting Complex Performance issues - Oracle SEG$ contention
Troubleshooting Complex Performance issues - Oracle SEG$ contentionTroubleshooting Complex Performance issues - Oracle SEG$ contention
Troubleshooting Complex Performance issues - Oracle SEG$ contention
 
Mod03 linking and accelerating
Mod03 linking and acceleratingMod03 linking and accelerating
Mod03 linking and accelerating
 
Thomas+Niewel+ +Oracletuning
Thomas+Niewel+ +OracletuningThomas+Niewel+ +Oracletuning
Thomas+Niewel+ +Oracletuning
 
Rmoug ashmaster
Rmoug ashmasterRmoug ashmaster
Rmoug ashmaster
 
Troubleshooting Complex Oracle Performance Problems with Tanel Poder
Troubleshooting Complex Oracle Performance Problems with Tanel PoderTroubleshooting Complex Oracle Performance Problems with Tanel Poder
Troubleshooting Complex Oracle Performance Problems with Tanel Poder
 
Why is my_oracle_e-biz_database_slow_a_million_dollar_question
Why is my_oracle_e-biz_database_slow_a_million_dollar_questionWhy is my_oracle_e-biz_database_slow_a_million_dollar_question
Why is my_oracle_e-biz_database_slow_a_million_dollar_question
 
Your tuning arsenal: AWR, ADDM, ASH, Metrics and Advisors
Your tuning arsenal: AWR, ADDM, ASH, Metrics and AdvisorsYour tuning arsenal: AWR, ADDM, ASH, Metrics and Advisors
Your tuning arsenal: AWR, ADDM, ASH, Metrics and Advisors
 
Scylla Summit 2017: Cry in the Dojo, Laugh in the Battlefield: How We Constan...
Scylla Summit 2017: Cry in the Dojo, Laugh in the Battlefield: How We Constan...Scylla Summit 2017: Cry in the Dojo, Laugh in the Battlefield: How We Constan...
Scylla Summit 2017: Cry in the Dojo, Laugh in the Battlefield: How We Constan...
 
Techno-Fest-15nov16
Techno-Fest-15nov16Techno-Fest-15nov16
Techno-Fest-15nov16
 
My First 100 days with an Exadata (PPT)
My First 100 days with an Exadata (PPT)My First 100 days with an Exadata (PPT)
My First 100 days with an Exadata (PPT)
 
Rac 12c optimization
Rac 12c optimizationRac 12c optimization
Rac 12c optimization
 
More than 12 More things about Oracle Database 12c
More than 12 More things about Oracle Database 12cMore than 12 More things about Oracle Database 12c
More than 12 More things about Oracle Database 12c
 
你所不知道的Oracle后台进程Smon功能
你所不知道的Oracle后台进程Smon功能你所不知道的Oracle后台进程Smon功能
你所不知道的Oracle后台进程Smon功能
 
Oracle Open World Thursday 230 ashmasters
Oracle Open World Thursday 230 ashmastersOracle Open World Thursday 230 ashmasters
Oracle Open World Thursday 230 ashmasters
 
An introduction to_rac_system_test_planning_methods
An introduction to_rac_system_test_planning_methodsAn introduction to_rac_system_test_planning_methods
An introduction to_rac_system_test_planning_methods
 
Oracle analysis 101_v1.0_ext
Oracle analysis 101_v1.0_extOracle analysis 101_v1.0_ext
Oracle analysis 101_v1.0_ext
 
OSMC 2015: Linux Performance Profiling and Monitoring by Werner Fischer
OSMC 2015: Linux Performance Profiling and Monitoring by Werner FischerOSMC 2015: Linux Performance Profiling and Monitoring by Werner Fischer
OSMC 2015: Linux Performance Profiling and Monitoring by Werner Fischer
 
OSMC 2015 | Linux Performance Profiling and Monitoring by Werner Fischer
OSMC 2015 | Linux Performance Profiling and Monitoring by Werner FischerOSMC 2015 | Linux Performance Profiling and Monitoring by Werner Fischer
OSMC 2015 | Linux Performance Profiling and Monitoring by Werner Fischer
 
BRKRST-3066 - Troubleshooting Nexus 7000 (2013 Melbourne) - 2 Hours.pdf
BRKRST-3066 - Troubleshooting Nexus 7000 (2013 Melbourne) - 2 Hours.pdfBRKRST-3066 - Troubleshooting Nexus 7000 (2013 Melbourne) - 2 Hours.pdf
BRKRST-3066 - Troubleshooting Nexus 7000 (2013 Melbourne) - 2 Hours.pdf
 
OOW13 JB KP ASH Deep Dive
OOW13 JB KP ASH Deep DiveOOW13 JB KP ASH Deep Dive
OOW13 JB KP ASH Deep Dive
 

Kürzlich hochgeladen

The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 
An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdf
SanaAli374401
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
Chris Hunter
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 

Kürzlich hochgeladen (20)

Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdf
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 

Exadata Cell metrics

  • 1. Exadata Cell metrics Exadata CELLSRV periodically records important runtime properties, called metrics, for cell components such as CPUs, cell disks, grid disks, flash cache, and IORM statistics. These metrics are recorded in memory. Based on its own metric collection schedule, the Management Server (MS) gets the set of metric data accumulated by CELLSRV. Management Server (MS) provides Exadata cell management and configuration functions. MS is responsible for sending alerts and collects some statistics in addition to those collected by CELLSRV. Each cell is individually managed with Exadata cell command-line interface (CellCLI). Locate the MS process -------------------------------$ ps -ef | grep ms.err 1000 3940 3723 0 01:42 pts/0 00:00:00 grep ms.err root 24541 24540 0 Sep28 ? 00:01:32 /usr/java/jdk1.5.0_15/bin/java -Xms256m -Xmx512m Djava.library.path=/opt/oracle/ Check the Alert History -----------------------MS triggers an alert when it discovers a: Cell hardware issue Cell software or configuration issue CELLSRV internal error Metric that has exceeded a threshold defined in the cell CellCLI> list alerthistory 1 2013-09-26T22:51:15-04:00 critical "ORA-00700: soft internal error, arguments: [main_6a], [3], [Invalid IP addresses in cellinit.ora file], [], [], [], [], [], [], [], [], []" 2_1 2013-09-26T22:52:07-04:00 warning "Hugepage allocation failure in service cellsrv. Number of Hugepages allocated is 0, failed to allocate 110" 3 2013-09-26T22:54:08-04:00 critical "ORA-00700: soft internal error, arguments: [main_6a], [3], [Invalid IP addresses in cellinit.ora file], [], [], [], [], [], [], [], [], []" 4 2013-09-28T13:05:21-04:00 critical "RS-7445 [Serv RS_BACKUP is absent] [It will be restarted] [] [] [] [] [] [] [] [] [] []" 5 2013-09-28T22:05:38-04:00 critical "RS-7445 [Serv CELLSRV is absent] [It will be restarted] [] [] [] [] [] [] [] [] [] []" Create and check for disk I/O errors ---------------------------ellCLI> create threshold CD_IO_ERRS_MIN comparison='>', warning=0, > occurrences=1, observation=1
  • 2. Threshold CD_IO_ERRS_MIN successfully created CellCLI> list threshold CD_IO_ERRS_MIN detail name: CD_IO_ERRS_MIN comparison: > observation: 1 occurrences: 1 warning: 0.0 ellCLI> list alerthistory where severity='warning'; 2_1 2013-09-26T23:02:12-04:00 warning "Hugepage allocation failure in service cellsrv. Number of Hugepages allocated is 0, failed to allocate 110" CellCLI> list alerthistory where severity='critical'; 1 2013-09-26T23:01:18-04:00 critical "ORA-00700: soft internal error, arguments: [main_6a], [3], IP addresses in cellinit.ora file], [], [], [], [], [], [], []" 3 2013-09-26T23:04:11-04:00 critical "ORA-00700: soft internal error, arguments: [main_6a], [3], IP addresses in cellinit.ora file], [], [], [], [], [], [], []" 4 2013-10-01T06:42:39-04:00 critical "RS-7445 [Serv CELLSRV is absent] [It will be restarted] [] [] [] [] [] [] []" [Invalid [], [], [Invalid [], [], [] [] [] CellCLI> list alerthistory where severity='clear'; CellCLI> list alerthistory where severity='info'; MetricType: - cumulative: Cumulative statistics since the metric was created - instantaneous: Value at the time that the metric is collected - rate: Rates computed by averaging statistics over observation periods - transition: Collected at the time when the value of the metrics has changed, and typically captures important transitions in hardware status CellCLI> list metriccurrent attributes name,metrictype,metricobjectname,metricvalue,collectionTime where metrictype='Rate' Monitoring Exadata with Active Requests ---------------------------------------CellCLI> LIST ACTIVEREQUEST WHERE IoType = 'predicate pushing' DETAIL ioType identifies the type of active request file initialization Possible values are read, write, predicate pushing, filtered backup read, predicate push read Check retention period for metric and alert history ------------------------------------------------------CellCLI> list cell attributes metricHistoryDays 7 CellCLI> alter cell metrichistorydays=5
  • 3. Cell qr03cel02 successfully altered CellCLI> list cell attributes metrichistorydays 5 CellCLI> list cell attributes name,interconnectCount qr03cel02 2 configure the cell to automatically send an email and/or SNMP message to a designated set of Exadata administrator. -----------------------------------------------------------------------------------------------------------------alter cell smtpServer='my_mail.example.com', smtpFromAddr='monowar.mukul@example.com', smtpFrom='monowar mukul', smtpToAddr='jane.smith@example.com', notificationPolicy='critical,warning,clear', notificationMethod='mail' Watching for Undelivered Alerts --------------------------------It is important to periodically check the storage servers just to make sure that raised alerts have actually been delivered (via email and/or to Grid or Cloud Control). CellCLI>LIST examinedBy='' ALERTHISTORY where notificationState dcli -g cell_group cellcli -e "LIST notificationState != 1 and examinedBy='' " != ALERTHISTORY 1 2013-09-26T23:01:18-04:00 critical "ORA-00700: soft internal error, arguments: [main_6a], [3], IP addresses in cellinit.ora file], [], [], [], [], [], [], []" 2_1 2013-09-26T23:02:12-04:00 warning "Hugepage allocation failure in service cellsrv. Number of Hugepages allocated is 0, failed to allocate 110" 3 2013-09-26T23:04:11-04:00 critical "ORA-00700: soft internal error, arguments: [main_6a], [3], IP addresses in cellinit.ora file], [], [], [], [], [], [], []" 4 2013-10-01T06:42:39-04:00 critical "RS-7445 [Serv CELLSRV is absent] [It will be restarted] [] [] [] [] [] [] []" Drop Alert History --------------------CellCLI> drop alerthistory all Alert 1 successfully dropped Alert 2_1 successfully dropped Alert 3 successfully dropped Checking Threshold ------------------CellCLI> list threshold cl_fsut./ 1 and where [Invalid [], [], [Invalid [], [], [] [] []
  • 4. cl_fsut./u01 CellCLI> create threshold cl_tst."/u01" comparison='>', warning=80 Threshold cl_fsut."/u01" successfully created CellCLI> list threshold detail name: comparison: warning: name: comparison: warning: cl_fsut./ > 70.0 cl_fsut./u01 > 80.0 CellCLI> alter threshold cl_fsut."/" comparison='>', warning=50 Threshold cl_fsut."/" successfully altered CellCLI> list threshold detail name: comparison: warning: name: comparison: warning: cl_fsut./ > 50.0 cl_fsut./u01 > 80.0 Execute the following command inside the cell operating system. It creates a 512-MB file on the root file system which will increase the utilization metric. After the metric crosses the threshold , an alert will be generated. $ dd if=/dev/zero of=/tmp/file.out bs=1024 count=500000 [celladmin@qr03cel02 ~]$ dd if=/dev/zero of=/tmp/file.out bs=1024 count=500000 500000+0 records in 500000+0 records out 512000000 bytes (512 MB) copied, 4.25551 seconds, 120 MB/s [celladmin@qr03cel02 ~]$ cellcli CellCLI: Release 11.2.3.1.0 - Production on Mon Sep 30 01:36:45 EDT 2013 Copyright (c) 2007, 2011, Oracle. Cell Efficiency Ratio: 26M All rights reserved. CellCLI> list alerthistory 1_1 2013-09-30T01:32:46-04:00 warning "The warning threshold for the following metric has been crossed. Metric Name : CL_FSUT Metric Description : Percentage of total space on this file system that is currently used Object Name : / Current Value : 56.0 % Threshold Value : 50.0 % " CellCLI> alter alerthistory 1_1 examinedby='investigator' Alert 1_1 successfully altered CellCLI> list alerthistory detail name: 1_1
  • 5. alertMessage: "The warning threshold for the following metric has been crossed. Metric Name : CL_FSUT Metric Description : Percentage of total space on this file system that is currently used Object Name : / Current Value : 56.0 % Threshold Value : 50.0 % " alertSequenceID: 1 alertShortName: CL_FSUT alertType: Stateful beginTime: 2013-09-30T01:32:46-04:00 endTime: examinedBy: investigator metricObjectName: "/" metricValue: 56.0 notificationState: 0 sequenceBeginTime: 2013-09-30T01:32:46-04:00 severity: warning alertAction: "Examine the metric value that is violating the specified threshold, and take appropriate actions if needed." The value of the name attribute is a composite of abbreviations. • CL_ (cell) • CD_ (cell disk) • GD_ (grid disk) • FC_ (flash cache) • DB_ (database) • CG_ (consumer group) • CT_ (category) • N_ (interconnect network) -- Monitoring IORM with cellcli command. I/O-related metric: • IO_RQ (number of requests) • IO_BY (number of MB) • IO_TM (I/O latency) • IO_WT (I/O wait time) _R for read _W for write. _SM small I/O _LG large I/O _SEC signify per second _RQ to signify per request • CD_IO_WT_R_SM is the cell disk (CD_) I/O wait time (IO_WT) to read (_R) small blocks (_SM). • GD_IO_RQ_W_LG_SEC is the grid disk (GD_) number of requests (IO_RQ) to write (_W) of large block (_LG) I/O per second (_SEC) on a grid disk.