This methods fires at the end of the stage and collects metrics flattened into the stageMetricsData ListBuffer Note all reported times are in ms, cpu time and shuffle write time are originally in nanoseconds, thus in the code are divided by 1 million to normalize them to milliseconds
This methods fires at the end of the stage and collects metrics flattened into the stageMetricsData ListBuffer Note all reported times are in ms, cpu time and shuffle write time are originally in nanoseconds, thus in the code are divided by 1 million to normalize them to milliseconds
StageInfoRecorderListener: this listener gathers metrics with Stage execution granularity It is based on the Spark Listener interface Stage metrics are stored in memory and use to produce a report that aggregates resource consumption they can also be consumed "raw" (transformed into a DataFrame and/or saved to a file) See StageMetrics