Cloud Dataproc is a managed Apache Spark and Apache Hadoop service that lets you use open source data tools for batch processing, querying, streaming, and machine learning.

Cloud Dataproc automation helps you create clusters quickly, manage them easily, and save money by turning clusters off when you do not need them. With less time and money spent on administration, you can focus on your jobs and your data.

Setup

To set up the Google integration and discover the Google service, go to Google Integration Discovery Profile and select GOOGLE/Dataproc Cluster.

Supported metrics

| New OpsRamp Metric | Google Metric | Metric Display Name | Unit | Description |
| --- | --- | --- | --- | --- |
| google_dataproc_cluster_capacity_deviation | dataproc.googleapis.com/cluster/capacity_deviation | Cluster capacity deviation | count | Difference between the expected node count in the cluster and the actual active YARN node managers. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_hdfs_datanodes | dataproc.googleapis.com/cluster/hdfs/datanodes | HDFS DataNodes | count | Indicates the number of HDFS DataNodes that are running inside a cluster. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_hdfs_storage_capacity | dataproc.googleapis.com/cluster/hdfs/storage_capacity | HDFS capacity | GB | Indicates the capacity, in GB, of the HDFS system running on the cluster. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_hdfs_storage_utilization | dataproc.googleapis.com/cluster/hdfs/storage_utilization | HDFS storage utilization | count | The percentage of HDFS storage currently used. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_hdfs_unhealthy_blocks | dataproc.googleapis.com/cluster/hdfs/unhealthy_blocks | Unhealthy HDFS blocks by status | count | Indicates the number of unhealthy blocks inside the cluster. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_job_completion_time | dataproc.googleapis.com/cluster/job/completion_time | Job duration | seconds | The time jobs took to complete, from the time the user submits a job to the time Dataproc reports it is completed. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_job_duration | dataproc.googleapis.com/cluster/job/duration | Job state duration | seconds | The time jobs have spent in a given state. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_job_failed_count | dataproc.googleapis.com/cluster/job/failed_count | Failed jobs | count | Indicates the number of jobs that have failed on a cluster. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_job_running_count | dataproc.googleapis.com/cluster/job/running_count | Running jobs | count | Indicates the number of jobs that are running on a cluster. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_job_submitted_count | dataproc.googleapis.com/cluster/job/submitted_count | Submitted jobs | count | Indicates the number of jobs that have been submitted to a cluster. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_mig_instances_failed_count | dataproc.googleapis.com/cluster/mig_instances/failed_count | Dataproc Managed Instance Group Instance Errors | count | Indicates the number of instance failures for a managed instance group. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_nodes_expected | dataproc.googleapis.com/cluster/nodes/expected | Expected Nodes | count | Indicates the number of nodes that are expected in a cluster. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_nodes_failed_count | dataproc.googleapis.com/cluster/nodes/failed_count | Failed Nodes | count | Indicates the number of nodes that have failed in a cluster. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_nodes_recovered_count | dataproc.googleapis.com/cluster/nodes/recovered_count | Recovered Nodes | count | Indicates the number of nodes that are detected as failed and have been successfully removed from the cluster. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_nodes_running | dataproc.googleapis.com/cluster/nodes/running | Running Nodes | count | Indicates the number of nodes in the running state. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_operation_completion_time | dataproc.googleapis.com/cluster/operation/completion_time | Operation duration | seconds | The time operations took to complete, from the time the user submits an operation to the time Dataproc reports it is completed. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_operation_duration | dataproc.googleapis.com/cluster/operation/duration | Operation state duration | seconds | The time operations have spent in a given state. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_operation_failed_count | dataproc.googleapis.com/cluster/operation/failed_count | Failed operations | count | Indicates the number of operations that have failed on a cluster. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_operation_running_count | dataproc.googleapis.com/cluster/operation/running_count | Running operations | count | Indicates the number of operations that are running on a cluster. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_operation_submitted_count | dataproc.googleapis.com/cluster/operation/submitted_count | Submitted operations | count | Indicates the number of operations that have been submitted to a cluster. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_yarn_allocated_memory_percentage | dataproc.googleapis.com/cluster/yarn/allocated_memory_percentage | YARN allocated memory percentage | count | The percentage of YARN memory that is allocated. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_yarn_apps | dataproc.googleapis.com/cluster/yarn/apps | YARN active applications | count | Indicates the number of active YARN applications. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_yarn_containers | dataproc.googleapis.com/cluster/yarn/containers | YARN containers | count | Indicates the number of YARN containers. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_yarn_memory_size | dataproc.googleapis.com/cluster/yarn/memory_size | YARN memory size | GB | Indicates the YARN memory size in GB. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_yarn_nodemanagers | dataproc.googleapis.com/cluster/yarn/nodemanagers | YARN NodeManagers | count | Indicates the number of YARN NodeManagers running inside the cluster. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_yarn_pending_memory_size | dataproc.googleapis.com/cluster/yarn/pending_memory_size | YARN pending memory size | GB | The current memory request, in GB, that is pending to be fulfilled by the scheduler. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
| google_dataproc_cluster_yarn_virtual_cores | dataproc.googleapis.com/cluster/yarn/virtual_cores | YARN virtual cores | count | Indicates the number of virtual cores in YARN. Sampled every 60 seconds. After sampling, data is not visible for up to 120 seconds. |
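
The OpsRamp metric names listed above appear to follow a simple pattern: the `dataproc.googleapis.com/` prefix becomes `google_dataproc_`, and the remaining slashes become underscores. The following Python sketch illustrates that pattern for the metrics in this list. The function name `to_opsramp_name` and the two constants are hypothetical helpers introduced here for illustration; they are not part of any OpsRamp or Google API, and the pattern is inferred from the list rather than documented behavior.

```python
# Illustrative only: these names are assumptions, not an OpsRamp or Google API.
GOOGLE_PREFIX = "dataproc.googleapis.com/"
OPSRAMP_PREFIX = "google_dataproc_"


def to_opsramp_name(google_metric: str) -> str:
    """Derive the OpsRamp-style name from a Google Dataproc metric type."""
    if not google_metric.startswith(GOOGLE_PREFIX):
        raise ValueError(f"not a Dataproc metric type: {google_metric}")
    # Drop the service prefix, then flatten the path separators.
    path = google_metric[len(GOOGLE_PREFIX):]
    return OPSRAMP_PREFIX + path.replace("/", "_")


if __name__ == "__main__":
    print(to_opsramp_name("dataproc.googleapis.com/cluster/hdfs/datanodes"))
```

A helper like this can be handy when cross-referencing a Google Cloud Monitoring metric type against the OpsRamp name, but treat the list above as the authoritative mapping.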

Event support

  • Not supported

External reference