How to integrate Ganglia for Spark 2.1 Job metrics, Spark ignoring Ganglia metrics -
i trying integrate spark 2.1 job's metrics ganglia.
my spark-default.conf looks like
*.sink.ganglia.class org.apache.spark.metrics.sink.gangliasink *.sink.ganglia.name name *.sink.ganglia.host $masterip *.sink.ganglia.port $port *.sink.ganglia.mode unicast *.sink.ganglia.period 10 *.sink.ganglia.unit seconds when submit job can see warn
warning: ignoring non-spark config property: *.sink.ganglia.host=host warning: ignoring non-spark config property: *.sink.ganglia.name=name warning: ignoring non-spark config property: *.sink.ganglia.mode=unicast warning: ignoring non-spark config property: *.sink.ganglia.class=org.apache.spark.metrics.sink.gangliasink warning: ignoring non-spark config property: *.sink.ganglia.period=10 warning: ignoring non-spark config property: *.sink.ganglia.port=8649 warning: ignoring non-spark config property: *.sink.ganglia.unit=seconds my environment details are
hadoop : amazon 2.7.3 - emr-5.7.0 spark : spark 2.1.1, ganglia: 3.7.2 if have inputs or other alternative of ganglia please reply.
from page: https://spark.apache.org/docs/latest/monitoring.html
spark supports ganglia sink not included in default build due licensing restrictions: gangliasink: sends metrics ganglia node or multicast group. **to install gangliasink you’ll need perform custom build of spark**. note embedding library include lgpl-licensed code in spark package. sbt users, set spark_ganglia_lgpl environment variable before building. maven users, enable -pspark-ganglia-lgpl profile. in addition modifying cluster’s spark build user
Comments
Post a Comment