Google Kubernetes Engine (GKE) is a managed, production-ready environment for deploying containerized applications. It brings Google's latest innovations in developer productivity, resource efficiency, automated operations, and open source flexibility to accelerate your time to market.

Use the OpsRamp Google public cloud integration to discover the Google service and collect metrics for it.

Setup

To set up the Google integration and discover the Google service, go to the Google Integration Discovery Profile and select Container Cluster.

Supported metrics

| OpsRamp Metric | Google Metric | Metric Display Name | Unit | Description |
| --- | --- | --- | --- | --- |
| google_container_container_accelerator_duty_cycle | kubernetes.io/container/accelerator/duty_cycle | Accelerator duty cycle | % | Percent of time over the past sample period (10s) during which the accelerator was actively processing. Values are integers between 0 and 100. Sampled every 60 seconds. |
| google_kubernetes_container_accelerator_memory_bandwidth_utilization | kubernetes.io/container/accelerator/memory_bandwidth_utilization | Memory bandwidth utilization | % | Current percentage of the accelerator memory bandwidth that is being used. Computed by dividing the memory bandwidth used over a sample period by the maximum supported bandwidth over the same sample period. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_container_container_accelerator_memory_total | kubernetes.io/container/accelerator/memory_total | Accelerator memory total | bytes | Total accelerator memory in bytes. Sampled every 60 seconds. |
| google_container_container_accelerator_memory_used | kubernetes.io/container/accelerator/memory_used | Accelerator memory used | bytes | Total accelerator memory allocated in bytes. Sampled every 60 seconds. |
| google_container_container_accelerator_request | kubernetes.io/container/accelerator/request | Request accelerators | devices | Number of accelerator devices requested by the container. Sampled every 60 seconds. |
| google_kubernetes_container_accelerator_tensorcore_utilization | kubernetes.io/container/accelerator/tensorcore_utilization | Tensorcore utilization | % | Current percentage of the Tensorcore that is utilized. Computed by dividing the Tensorcore operations that were performed over a sample period by the supported number of Tensorcore operations over the same sample period. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_container_container_cpu_core_usage_time | kubernetes.io/container/cpu/core_usage_time | CPU usage time | CPU seconds | Cumulative CPU usage on all cores used by the container in seconds. Sampled every 60 seconds. |
| google_container_container_cpu_limit_cores | kubernetes.io/container/cpu/limit_cores | Limit cores | CPU cores | CPU cores limit of the container. Sampled every 60 seconds. |
| google_container_container_cpu_limit_utilization | kubernetes.io/container/cpu/limit_utilization | CPU limit utilization | count | The fraction of the CPU limit that is currently in use on the instance. This value can be greater than 1 as a container might be allowed to exceed its CPU limit for extended periods of time. Sampled every 60 seconds. After sampling, data is not visible for up to 240 seconds. |
| google_container_container_cpu_request_cores | kubernetes.io/container/cpu/request_cores | Request cores | CPU cores | Number of CPU cores requested by the container. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_container_container_cpu_request_utilization | kubernetes.io/container/cpu/request_utilization | CPU request utilization | count | The fraction of the requested CPU that is currently in use on the instance. This value can be greater than 1 as usage can exceed the request. Sampled every 60 seconds. After sampling, data is not visible for up to 240 seconds. |
| google_container_container_ephemeral_storage_limit_bytes | kubernetes.io/container/ephemeral_storage/limit_bytes | Ephemeral storage limit | bytes | Local ephemeral storage limit in bytes. Sampled every 60 seconds. |
| google_container_container_ephemeral_storage_request_bytes | kubernetes.io/container/ephemeral_storage/request_bytes | Ephemeral storage request | bytes | Local ephemeral storage request in bytes. Sampled every 60 seconds. |
| google_container_container_ephemeral_storage_used_bytes | kubernetes.io/container/ephemeral_storage/used_bytes | Ephemeral storage usage | bytes | Local ephemeral storage usage in bytes. Sampled every 60 seconds. |
| google_container_container_memory_limit_bytes | kubernetes.io/container/memory/limit_bytes | Memory limit | bytes | Memory limit of the container in bytes. Sampled every 60 seconds. |
| google_container_container_memory_limit_utilization | kubernetes.io/container/memory/limit_utilization | Memory limit utilization | count | The fraction of the memory limit that is currently in use on the instance. This value cannot exceed 1 as usage cannot exceed the limit. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_container_container_memory_page_fault_count | kubernetes.io/container/memory/page_fault_count | Page faults | count | Number of page faults, broken down by type: major and minor. Sampled every 60 seconds. |
| google_container_container_memory_request_bytes | kubernetes.io/container/memory/request_bytes | Memory request | bytes | Memory request of the container in bytes. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_container_container_memory_request_utilization | kubernetes.io/container/memory/request_utilization | Memory request utilization | count | The fraction of the requested memory that is currently in use on the instance. This value can be greater than 1 as usage can exceed the request. Sampled every 60 seconds. After sampling, data is not visible for up to 240 seconds. |
| google_kubernetes_container_memory_swap_used_bytes | kubernetes.io/container/memory/swap_used_bytes | Container swap used bytes | bytes | Bytes of swap memory used by the container. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_container_container_memory_used_bytes | kubernetes.io/container/memory/used_bytes | Memory usage | bytes | Memory usage in bytes. Sampled every 60 seconds. |
| google_kubernetes_container_multislice_accelerator_device_to_host_transfer_latencies | kubernetes.io/container/multislice/accelerator/device_to_host_transfer_latencies | Device to Host transfer latencies | microseconds | Distribution of device to host transfer latency for each chunk of data for multislice traffic. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_kubernetes_container_multislice_accelerator_host_to_device_transfer_latencies | kubernetes.io/container/multislice/accelerator/host_to_device_transfer_latencies | Host to Device transfer latencies | microseconds | Distribution of host to device transfer latency for each chunk of data for multislice traffic. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_kubernetes_container_multislice_network_collective_end_to_end_latencies | kubernetes.io/container/multislice/network/collective_end_to_end_latencies | Collective latencies | microseconds | Distribution of end to end collective latency for multislice traffic. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_kubernetes_container_multislice_network_dcn_transfer_latencies | kubernetes.io/container/multislice/network/dcn_transfer_latencies | DCN (Data Center Network) transfer latencies | microseconds | Distribution of network transfer latencies for multislice traffic. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_kubernetes_container_multislice_network_grpc_tcp_delivery_rates | kubernetes.io/container/multislice/network/grpc_tcp_delivery_rates | GRPC TCP Delivery Rates | Mbps | Distribution of gRPC TCP delivery rates for multislice traffic. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_kubernetes_container_multislice_network_grpc_tcp_min_round_trip_times | kubernetes.io/container/multislice/network/grpc_tcp_min_round_trip_times | GRPC TCP Minimum Round Trip Times | microseconds | Distribution of gRPC TCP min round trip times for multislice traffic. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_container_container_restart_count | kubernetes.io/container/restart_count | Restart count | count | Number of times the container has restarted. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_container_container_uptime | kubernetes.io/container/uptime | Uptime | seconds | Time in seconds that the container has been running. Sampled every 60 seconds. |
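
The utilization metrics above are fractions rather than percentages, and the CPU usage time metric is cumulative, so both usually need a small conversion before they are charted or compared with a threshold. The following is a minimal sketch of those two conversions, not OpsRamp or Google code. The function names, the sample values, and the handling of a counter reset are assumptions made for illustration; the 60-second default interval comes from the sampling period stated in the table.

```python
def utilization(used: float, reference: float) -> float:
    """Return used / reference as a fraction, where 1.0 means 100%.

    `reference` is either the container's limit or its request.
    Hypothetical helper, not part of any OpsRamp or Google API.
    """
    if reference <= 0:
        raise ValueError("reference must be positive")
    return used / reference


def average_cores_used(prev_cpu_seconds: float,
                       curr_cpu_seconds: float,
                       interval_seconds: float = 60.0) -> float:
    """Average number of cores busy between two samples of a cumulative
    CPU-seconds counter such as the CPU usage time metric.

    Assumption: a counter that goes backwards means the container
    restarted, so the current value is treated as the whole delta.
    """
    if interval_seconds <= 0:
        raise ValueError("interval_seconds must be positive")
    delta = curr_cpu_seconds - prev_cpu_seconds
    if delta < 0:
        delta = curr_cpu_seconds
    return delta / interval_seconds


if __name__ == "__main__":
    mib = 1024 * 1024
    used_bytes = 384 * mib  # illustrative value for memory usage

    # Against a 512 MiB limit the fraction stays at or below 1.
    print(utilization(used_bytes, 512 * mib))  # 0.75

    # Against a 256 MiB request the fraction can exceed 1.
    print(utilization(used_bytes, 256 * mib))  # 1.5

    # 30 CPU seconds consumed over a 60-second sample period.
    print(average_cores_used(1200.0, 1230.0))  # 0.5
```

Multiplying a utilization fraction by 100 gives a percentage, which may be easier to read next to the accelerator metrics that are already reported in %.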

Event support

  • Supported
  • Configurable in OpsRamp Google Integration Discovery Profile.

External reference