AFS-Monitor

 view release on metacpan or  search on metacpan

pod/afsmon_stats.pod  view on Meta::CPAN

#------------------------------------------------------------------------------
# afsmonitor.pod
#
# Copyright © 2004 Alf Wachsmann <alfw@slac.stanford.edu> and
#                  Elizabeth Cassell <e_a_c@mailsnare.net>
#
# $Revision: 1.4 $ $Date: 2004/08/05 20:52:03 $ $Author: alfw $
#
# This library is free software; you can redistribute it and/or modify it
# under the same terms as Perl itself.
#------------------------------------------------------------------------------

=head1 NAME

B<afsmonitor Program Statistics> - a list of the statistics you can gather with the afsmonitor program.

=head1 DESCRIPTION

A complete listing of Cache Manager and File Server statistics gathered by the
L<afsmonitor(1)> function. The statistics are grouped by category and section,
with a brief description of each field, group, and section. For more information
about the L<afsmonitor(1)> function, see the L<afsmonitor(1)> documentation.

=head1 The Cache Manager Statistics

Cache Manager statistics fields are classified into the following sections and
groups:

=over

=item *

PerfStats_section - Performance Statistics Section.

=over

=item *

PerfStats_group - Performance Statistics Group.

=item *

misc_group - Miscellaneous Group.

=back

=item *

Server_UpDown_section - Server Up/Down Statistics Section.

=over

=item *

FS_upDown_SC_group - File Server Up/Down Statistics in Same Cell Group.

=item *

FS_upDown_OC_group - File Server Up/Down Statistics in Other Cells Group.

=item *

VL_upDown_SC_group - VL Server Up/Down Statistics in Same Cell Group.

=item *

VL_upDown_OC_group - VL Server Up/Down Statistics in Other Cells Group.

=back

=item *

RPCop_section - RPC Operation Measurements Section.

=over

=item *

FS_RPCopTimes_group - File Server RPC Operation Timings Group.

=item *

FS_RPCopErrors_group - File Server RPC Operation Errors Group.

=item *

FS_RPCopBytes_group - File Server RPC Transfer Timings Group.

=item *

CM_RPCopTimes_group - Cache Manager RPC Operation Timings Group.

=back

=item *

Auth_Access_section - Authentication and Replicated File Access Section.

=over

=item *

Auth_Stats_group - Authentication Information for Cache Manager Group.

=item *

Access_Stats_group - Unreplicated File Access Group.

=back

=back

All Cache Manager variables categorized under these sections and groups names
are listed below.

=head2 Performance Statistics Section (PerfStats_section)

Performance Statistics Group (PerfStats_group)

=over

=item *

dlocalAccesses: Number of data accesses to files within local cell.

=item *

vlocalAccesses: Number of stat accesses to files within local cell.

=item *

dremoteAccesses: Number of data accesses to files outside of local cell.

=item *

vremoteAccesses: Number of stat accesses to files outside of local cell.

=item *

cacheNumEntries: Number of cache entries.

=item *

cacheBlocksTotal: Number of (1K) blocks configured for cache.

=item *

cacheBlocksInUse: Number of cache blocks actively in use.

=item *

cacheBlocksOrig: Number of cache blocks at bootup.

=item *

cacheMaxDirtyChunks: Maximum number of dirty cache chunks tolerated.

=item *

cacheCurrDirtyChunks: Current number of dirty cache chunks.

=item *

dcacheHits: Number of data files found in local cache.

=item *

vcacheHits: Number of stat entries found in local cache.

=item *

dcacheMisses: Number of data files not found in local cache.

=item *

vcacheMisses: Number of stat entries not found in local cache.

pod/afsmon_stats.pod  view on Meta::CPAN

=item *

OutStandingAllocs: Outstanding osi_allocs (no osi_frees yet).

=item *

CallBackAlloced: Number of callback structures allocated.

=item *

CallBackFlushes: Number of callback flush operations performed.

=item *

srvRecords: Number of servers currently on record.

=item *

srvRecordsHWM: Server record high water mark.

=item *

srvNumBuckets: Number of server hash chain buckets.

=item *

srvMaxChainLength: Maximum server hash chain length.

=item *

srvMaxChainLengthHWM: Server hash chain high water mark.

=item *

sysName_ID: Sysname ID for host hardware.

=back

Miscellaneous Group (misc_group)

=over

=item *

numPerfCalls: Number of performance calls received.

=item *

epoch: Cache Manager epoch time.

=item *

numCellsVisible: Number of cells we know about.

=item *

numCellsContacted: Number of cells contacted.

=back

=head2 Server Up/Down Statistics Section (Server_UpDown_section)

File Server Up/Down Statistics in Same Cell Group (FS_upDown_SC_group)

Note: The records referred to in this section are the internal records kept by the afsmonitor program to track the processes from which data is being gathered.

=over

=item *

fs_sc_numTtlRecords: Number of fileserver records, active or inactive.

=item *

fs_sc_numUpRecords: Number of (active) fileserver records currently marked up.

=item *

fs_sc_numDownRecords: Number of (active) fileserver records currently marked down.

=item *

fs_sc_sumOfRecordAges: Sum of fileserver record lifetimes.

=item *

fs_sc_ageOfYoungestRecord: Age of youngest fileserver record.

=item *

fs_sc_ageOfOldestRecord: Age of oldest fileserver record.

=item *

fs_sc_numDowntimeIncidents: Number of (completed) downtime incidents.

=item *

fs_sc_numRecordsNeverDown: Number of fileserver records never marked down.

=item *

fs_sc_maxDowntimesInARecord: Maximum downtimes seen by any fileserver record.

=item *

fs_sc_sumOfDowntimes: Sum of all (completed) downtimes, in seconds.

=item *

fs_sc_shortestDowntime: Shortest downtime, in seconds.

=item *

fs_sc_longestDowntime: Longest downtime, in seconds.

=item *

fs_sc_down_0_10_min: Down time incidents: 0-10 minutes.

=item *

fs_sc_down_10_30_min: Down time incidents: 10-30 minutes.

=item *

fs_sc_down_half_1_hr: Down time incidents: 30-60 minutes.

=item *

fs_sc_down_1_2_hr: Down time incidents: 1-2 hours.

=item *

fs_sc_down_2_4_hr: Down time incidents: 2-4 hours.

=item *

fs_sc_down_4_8_hr: Down time incidents: 4-8 hours.

=item *

fs_sc_down_more_8_hr: Down time incidents: more than 8 hours.

=item *

fs_sc_downDst_0: Down time incidents: 0 times.

=item *

fs_sc_downDst_1: Down time incidents: 1 time.

=item *

fs_sc_downDst_2_5: Down time incidents: 2-5 times.

=item *

fs_sc_downDst_6_10: Down time incidents: 6-10 times.

=item *

fs_sc_downDst_10_50: Down time incidents: 10-50 times.

=item *

fs_sc_downDst_more_50: Down time incidents: more than 50 times.

=back

File Server Up/Down Statistics in Other Cells Group (FS_upDown_OC_group)

=over

=item *

fs_oc_numTtlRecords: Number of fileserver records, active or inactive.

=item *

fs_oc_numUpRecords: Number of (active) fileserver records currently marked up.

=item *

fs_oc_numDownRecords: Number of (active) fileserver records currently marked down.

=item *

fs_oc_sumOfRecordAges: Sum of server record lifetimes.

=item *

fs_oc_ageOfYoungestRecord: Age of youngest fileserver record.

=item *

fs_oc_ageOfOldestRecord: Age of oldest fileserver record.

=item *

fs_oc_numDowntimeIncidents: Number of (completed) downtime incidents.

=item *

fs_oc_numRecordsNeverDown: Number of fileserver records never marked down.

=item *

fs_oc_maxDowntimesInARecord: Maximum downtimes seen by any fileserver.

=item *

fs_oc_sumOfDowntimes: Sum of all (completed) downtimes, in seconds.

=item *

fs_oc_shortestDowntime: Shortest downtime, in seconds.

=item *

fs_oc_longestDowntime: Longest downtime, in seconds.

=item *

fs_oc_down_0_10_min: Down time incidents: 0-10 minutes.

=item *

fs_oc_down_10_30_min: Down time incidents: 10-30 minutes.

=item *

fs_oc_down_half_1_hr: Down time incidents: 30-60 minutes.

=item *

fs_oc_down_1_2_hr: Down time incidents: 1-2 hours.

=item *

fs_oc_down_2_4_hr: Down time incidents: 2-4 hours.

=item *

fs_oc_down_4_8_hr: Down time incidents: 4-8 hours.

=item *

fs_oc_down_more_8_hr: Down time incidents: more than 8 hours.

=item *

fs_oc_downDst_0: Down time incidents: 0 times.

=item *

fs_oc_downDst_1: Down time incidents: 1 time.

=item *

fs_oc_downDst_2_5: Down time incidents: 2-5 times.

=item *

fs_oc_downDst_6_10: Down time incidents: 6-10 times.

=item *

fs_oc_downDst_10_50: Down time incidents: 10-50 times.

=item *

fs_oc_downDst_more_50: Down time incidents: more than 50 times.

=back

VL Server Up/Down Statistics in Same Cell Group (VL_upDown_SC_group)

=over

=item *

vl_sc_numTtlRecords: Number of vlserver records, active or inactive.

=item *

vl_sc_numUpRecords: Number of (active) vlserver records currently marked up.

=item *

vl_sc_numDownRecords: Number of (active) vlserver records currently marked down.

=item *

vl_sc_sumOfRecordAges: Sum of vlserver record lifetimes.

=item *

vl_sc_ageOfYoungestRecord: Age of youngest vlserver record.

=item *

vl_sc_ageOfOldestRecord: Age of oldest vlserver record.

=item *

vl_sc_numDowntimeIncidents: Number of (completed) downtime incidents.

=item *

vl_sc_numRecordsNeverDown: Number of vlserver records never marked down.

=item *

vl_sc_maxDowntimesInARecord: Maximum downtimes seen by any vlserver record.

=item *

vl_sc_sumOfDowntimes: Sum of all (completed) downtimes, in seconds.

=item *

vl_sc_shortestDowntime: Shortest downtime, in seconds.

=item *

vl_sc_longestDowntime: Longest downtime, in seconds.

=item *

vl_sc_down_0_10_min: Down time incidents: 0-10 minutes.

=item *

vl_sc_down_10_30_min: Down time incidents: 10-30 minutes.

=item *

vl_sc_down_half_1_hr: Down time incidents: 30-60 minutes.

=item *

vl_sc_down_1_2_hr: Down time incidents: 1-2 hours.

=item *

vl_sc_down_2_4_hr: Down time incidents: 2-4 hours.

=item *

vl_sc_down_4_8_hr: Down time incidents: 4-8 hours.

=item *

vl_sc_down_more_8_hr: Down time incidents: more than 8 hours.

=item *

vl_sc_downDst_0: Down time incidents: 0 times.

=item *

vl_sc_downDst_1: Down time incidents: 1 time.

=item *

vl_sc_downDst_2_5: Down time incidents: 2-5 times.

=item *

vl_sc_downDst_6_10: Down time incidents: 6-10 times.

=item *

vl_sc_downDst_10_50: Down time incidents: 10-50 times.

=item *

vl_sc_downDst_more_50: Down time incidents: more than 50 times.

=back

VL Server Up/Down Statistics in Other Cells Group (VL_upDown_DC_group)

=over

=item *

vl_oc_numTtlRecords: Number of vlserver records, active or inactive.

=item *

vl_oc_numUpRecords: Number of (active) vlserver records currently marked up.

=item *

vl_oc_numDownRecords: Number of (active) vlserver records currently marked down.

=item *

vl_oc_sumOfRecordAges: Sum of vlserver record lifetimes.

=item *

vl_oc_ageOfYoungestRecord: Age of youngest vlserver record.

=item *

vl_oc_ageOfOldestRecord: Age of oldest vlserver record.

=item *

vl_oc_numDowntimeIncidents: Number of (completed) downtime incidents.

=item *

vl_oc_numRecordsNeverDown: Number of vlserver records never marked down.

=item *

vl_oc_maxDowntimesInARecord: Maximum downtimes seen by any vlserver record.

=item *

vl_oc_sumOfDowntimes: Sum of all (completed) downtimes, in seconds.

=item *

vl_oc_shortestDowntime: Shortest downtime, in seconds.

=item *

vl_oc_longestDowntime: Longest downtime, in seconds.

=item *

vl_oc_down_0_10_min: Down time incidents: 0-10 minutes.

=item *

vl_oc_down_10_30_min: Down time incidents: 10-30 minutes.

=item *

vl_oc_down_half_1_hr: Down time incidents: 30-60 minutes.

=item *

vl_oc_down_1_2_hr: Down time incidents: 1-2 hours.

=item *

vl_oc_down_2_4_hr: Down time incidents: 2-4 hours.

=item *

vl_oc_down_4_8_hr: Down time incidents: 4-8 hours.

=item *

vl_oc_down_more_8_hr: Down time incidents: more than 8 hours.

=item *

vl_oc_downDst_0: Down time incidents: 0 times.

=item *

vl_oc_downDst_1: Down time incidents: 1 time.

=item *

vl_oc_downDst_2_5: Down time incidents: 2-5 times.

=item *

vl_oc_downDst_6_10: Down time incidents: 6-10 times.

=item *

vl_oc_downDst_10_50: Down time incidents: 10-50 times.

=item *

vl_oc_downDst_more_50: Down time incidents: more than 50 times.

=back

=head2 RPC Operation Measurements Section (RPCop_section)

File Server RPC Operation Timings Group (FS_RPCopTimes_group)

=over

=item *

FetchData_ops: Number of FetchData operations executed.

=item *

FetchData_ops_ok: Number of successful FetchData operations.

=item *

FetchData_sum: Sum of timings for FetchData operations.

=item *

FetchData_sqr: Sum of squares of sample timings for FetchData operations.

=item *

FetchData_min: Minimum execution time observed for FetchData operations.

=item *

FetchData_max: Maximum execution time observed for FetchData operations.

=item *

FetchACL_ops: Number of FetchACL operations executed.

=item *

FetchACL_ops_ok: Number of successful FetchACL operations.

=item *

FetchACL_sum: Sum of timings for FetchACL operations.

=item *

FetchACL_sqr: Sum of squares of sample timings for FetchACL operations.

=item *

FetchACL_min: Minimum execution time observed for FetchACL operations.

=item *

FetchACL_max: Maximum execution time observed for FetchACL operations.

=item *

FetchStatus_ops: Number of FetchStatus operations executed.

=item *

FetchStatus_ops_ok: Number of successful FetchStatus operations.

pod/afsmon_stats.pod  view on Meta::CPAN

GetCE_ops_sqr: Sum of squares of sample timings for GetCE operations.

=item *

GetCE_ops_min: Minimum execution time observed for GetCE operations.

=item *

GetCE_ops_max: Maximum execution time observed for GetCE operations.

=item *

XStatsVersion_CM_ops: Number of XStatsVersion operations executed.

=item *

XStatsVersion_CM_ops_ok: Number of successful XStatsVersion operations.

=item *

XStatsVersion_CM_ops_sum: Sum of timings for XStatsVersion operations.

=item *

XStatsVersion_CM_ops_sqr: Sum of squares of sample timings for XStatsVersion operations.

=item *

XStatsVersion_CM_ops_min: Minimum execution time observed for XStatsVersion operations.

=item *

XStatsVersion_CM_ops_max: Maximum execution time observed for XStatsVersion operations.

=item *

GetXStats_CM_ops: Number of GetXStats operations executed.

=item *

GetXStats_CM_ops_ok: Number of successful GetXStats operations.

=item *

GetXStats_CM_ops_sum: Sum of timings for GetXStats operations.

=item *

GetXStats_CM_ops_sqr: Sum of squares of sample timings for GetXStats operations.

=item *

GetXStats_CM_ops_min: Minimum execution time observed for GetXStats operations.

=item *

GetXStats_CM_ops_max: Maximum execution time observed for GetXStats operations.

=back

=head2 Authentication and Replicated File Access Section (Auth_Access_section)

Authentication Information for Cache Manager Group (Auth_Stats_group)

=over

=item *

curr_PAGs: Current number of PAGs.

=item *

curr_Records: Current number of records in table.

=item *

curr_AuthRecords: Current number of of authenticated records (with valid ticket).

=item *

curr_UnauthRecords: Current number of of unauthenticated records (without any ticket at all).

=item *

curr_MaxRecordsInPAG: Maximum records for a single PAG.

=item *

curr_LongestChain: Length of longest current hash chain.

=item *

PAGCreations: Number of PAG creations.

=item *

TicketUpdates: Number of ticket additions/refreshes.

=item *

HWM_PAGS: High water mark - number of PAGs.

=item *

HWM_Records: High water mark - number of records.

=item *

HWM_MaxRecordsInPAG: High water mark - maximum records for a single PAG.

=item *

HWM_LongestChain: High water mark - longest hash chain.

=back

Unreplicated File Access Group (Access_Stats_group)

=over

=item *

unreplicatedRefs: Number of references to unreplicated data.

=item *

replicatedRefs: Number of references to replicated data.

=item *

numReplicasAccessed: Number of replicas accessed.

=item *

maxReplicasPerRef: Maximum number of replicas accessed per reference.

=item *

refFirstReplicaOK: Number of references satisfied by 1st replica.

=back

=head1 The File Server Statistics


File Server statistics are classified into the following sections and groups:

=over

=item *

PerfStats_section: Performance Statistics Section.

=over

=item *

VnodeCache_group: Vnode Cache Group.

=item *

Directory_group: Directory Package Group.

=item *

Rx_group: Rx Group.

=item *

HostModule_group: Host Module Fields Group.

=item *

misc_group: Miscellaneous Variables Group.

=back

=item *

RPCop_section: RPC Operations Section.

=over

=item *

RPCopTimes_group: Individual RPC Operation Timings.

=item *

RPCopBytes_group: Byte Information for Certain RPC Operations.

=back

=back

All File Server variables categorized under the above sections and groups names are listed below.

=head2 Performance Statistics Section (PerfStats_section)

Vnode Cache Group (VnodeCache_group)

=over

=item *

vcache_L_Entries: Number of entries in LARGE vnode cache.

=item *

vcache_L_Allocs: Number of allocs (large).

=item *

vcache_L_Gets: Number of gets (large).

=item *

vcache_L_Reads: Number of reads (large).

=item *

vcache_L_Writes: Number of writes (large).

=item *

vcache_S_Entries: Number of entries in SMALL vnode cache.

=item *

vcache_S_Allocs: Number of allocs (small).

=item *

vcache_S_Gets: Number of gets (small).

=item *

vcache_S_Reads: Number of reads (small).

=item *

vcache_S_Writes: Number of writes (small).

=item *

vcache_H_Entries: Number of entries in HEADER vnode cache.

=item *

vcache_H_Gets: Number of gets (header)

=item *

vcache_H_Replacements: Number of replacements (header)

=back

Directory Package Group (Directory_group)

pod/afsmon_stats.pod  view on Meta::CPAN

rx_dataPacketsRead: Unique data packets read off wire.

=item *

rx_ackPacketsRead: ACK packets read.

=item *

rx_dupPacketsRead: Duplicate data packets read.

=item *

rx_spuriousPacketsRead: Inappropriate packets read.

=item *

rx_packetsSent_RcvClass: Packets sent (receive class).

=item *

rx_packetsSent_SendClass: Packets sent (send class).

=item *

rx_packetsSent_SpecialClass: Packets sent (special class).

=item *

rx_ackPacketsSent: ACK packets sent.

=item *

rx_pingPacketsSent: Ping packets sent.

=item *

rx_abortPacketsSent: Abort packets sent.

=item *

rx_busyPacketsSent: Busy packets sent.

=item *

rx_dataPacketsSent: Unique data packets sent.

=item *

rx_dataPacketsReSent: Retransmissions sent.

=item *

rx_dataPacketsPushed: Retransmissions pushed by NACK.

=item *

rx_ignoreAckedPacket: Packets with ACKed flag on rxi_Start.

=item *

rx_totalRtt_Sec and rx_totalRtt_Usec: Total round trip time (in seconds and milliseconds).

=item *

rx_minRtt_Sec and rx_minRtt_Usec: Minimum round trip time (in seconds and milliseconds).

=item *

rx_maxRtt_Sec and rx_maxRtt_Usec: Maximum round trip time (in seconds and milliseconds).

=item *

rx_nRttSamples: Round trip samples.

=item *

rx_nServerConns: Total server connections.

=item *

rx_nClientConns: Total client connections.

=item *

rx_nPeerStructs: Total peer structures.

=item *

rx_nCallStructs: Total call structures.

=item *

rx_nFreeCallStructs: Total free call structures.

=back

Host Module Fields Group (HostModule_group)

=over

=item *

host_NumHostEntries: Number of host entries.

=item *

host_HostBlocks: Blocks in use for hosts.

=item *

host_NonDeletedHosts: Non-deleted hosts.

=item *

host_HostsInSameNetOrSubnet: Hosts in same subnet as server.

=item *

host_HostsInDiffSubnet: Hosts in different subnet than server.

=item *

host_HostsInDiffNetwork: Hosts in different network than server.

=item *

host_NumClients: Number of client entries.

=item *

host_ClientBlocks: Blocks in use for clients.

=back

Miscellaneous Variables Group (misc_group)

=over

=item *

numPerfCalls: Number of performance calls received.

=back

=head2 RPC Operations Section (RPCop_section)

Individual RPC Operation Timings Group (RPCopTimes_group)

=over

=item *

epoch: Time when data collection began.

=item *

FetchData_ops: Number of FetchData operations executed.

=item *

FetchData_ops_ok: Number of successful FetchData operations.

=item *

FetchData_sum: Sum of timings for FetchData operations.

=item *

FetchData_sqr: Sum of squares of sample timings for FetchData operations.

=item *

FetchData_min: Minimum execution time observed for FetchData operations.

=item *

FetchData_max: Maximum execution time observed for FetchData operations.

=item *

FetchACL_ops: Number of FetchACL operations executed.

=item *

FetchACL_ops_ok: Number of successful FetchACL operations.

=item *

FetchACL_sum: Sum of timings for FetchACL operations.

=item *

FetchACL_sqr: Sum of squares of sample timings for FetchACL operations.

=item *

FetchACL_min: Minimum execution time observed for FetchACL operations.

=item *

FetchACL_max: Maximum execution time observed for FetchACL operations.

=item *

FetchStatus_ops: Number of FetchStatus operations executed.



( run in 2.969 seconds using v1.01-cache-2.11-cpan-39bf76dae61 )