Monday, 23 July 2018

AWS Glue now offers additional Apache Spark metrics for ETL jobs

You can now profile and debug ETL jobs more effectively with the addition of Apache Spark metrics. With this update, you can track runtime metrics from the AWS Glue console, including memory usage and CPU load of the driver and executors, data shuffles among executors, and bytes read and written. The additional ETL job metrics help you plan CPU capacity, debug code, and identify data issues. You can collect the metrics from your AWS Glue jobs and visualize them in Amazon CloudWatch and the AWS Glue console to identify and fix issues, and you can set up CloudWatch alarms on specific job conditions.
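As a rough illustration of setting an alarm on a job condition, the Python sketch below builds the parameters for a CloudWatch alarm on driver heap usage. It rests on assumptions not stated in this post: that job metrics are published under the `Glue` namespace with the metric name `glue.driver.jvm.heap.usage` and the dimensions `JobName`, `JobRunId` and `Type`, and that boto3 is installed with credentials configured. The helper names, the job name and the threshold are hypothetical, so check them against the AWS Glue documentation before relying on them.

```python
def build_heap_alarm(job_name, threshold=0.8):
    """Return keyword arguments for CloudWatch put_metric_alarm.

    The namespace, metric name and dimensions are assumptions about how
    AWS Glue publishes job metrics; verify them for your account.
    """
    return {
        "AlarmName": f"{job_name}-driver-heap-high",
        "Namespace": "Glue",                          # assumed namespace
        "MetricName": "glue.driver.jvm.heap.usage",   # assumed metric name
        "Dimensions": [
            {"Name": "JobName", "Value": job_name},
            {"Name": "JobRunId", "Value": "ALL"},     # aggregate over all runs
            {"Name": "Type", "Value": "gauge"},
        ],
        "Statistic": "Average",
        "Period": 300,               # seconds per evaluation window
        "EvaluationPeriods": 2,      # two consecutive breaches trigger the alarm
        "Threshold": threshold,      # fraction of driver heap in use
        "ComparisonOperator": "GreaterThanThreshold",
    }


def create_heap_alarm(job_name, threshold=0.8):
    """Create the alarm in CloudWatch (requires boto3 and AWS credentials)."""
    import boto3  # imported lazily so the builder works without boto3

    cloudwatch = boto3.client("cloudwatch")
    cloudwatch.put_metric_alarm(**build_heap_alarm(job_name, threshold))
```

Calling `create_heap_alarm("nightly-etl")` would register the alarm for a hypothetical job named `nightly-etl`. Keeping the parameter builder separate from the API call makes the settings easy to inspect before anything is sent to AWS.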


Amazon Athena now supports querying data in Amazon S3 Requester Pays buckets

Amazon Athena is an interactive query service that makes it easy to analyze data directly in Amazon Simple Storage Service (Amazon S3)...