Monday, 14 May 2018

Stream Real-Time Data in Apache Parquet or ORC Format with Amazon Kinesis Data Firehose

AWS has added support for the Apache Parquet and Apache ORC formats in Amazon Kinesis Data Firehose, so you can now stream real-time data into Amazon Simple Storage Service (Amazon S3) in a form suited to analytics and cost-effective storage. Parquet and ORC are columnar data formats that let you store and query data more efficiently and at lower cost than row-oriented formats. A Kinesis Data Firehose delivery stream can be set up to convert incoming data to Parquet or ORC automatically before delivering it to the S3 bucket. You don't have to write any conversion code, and because the data lands in a columnar format you can query it much faster with Amazon Athena and Amazon Redshift Spectrum, which reduces both query cost and storage use.
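The same setup can also be scripted instead of configured in the console. The following is a minimal sketch using boto3, not a definitive configuration: it assumes the incoming records are JSON, that a table describing their schema already exists in the AWS Glue Data Catalog, and that an IAM role with access to the bucket and the Glue table is available. The function names and every ARN, database, table, and stream name shown are hypothetical placeholders.

```python
def build_parquet_destination(bucket_arn, role_arn, glue_database, glue_table, region):
    """Build an ExtendedS3DestinationConfiguration that converts JSON to Parquet.

    All argument values are supplied by the caller; nothing here is taken
    from the original post.
    """
    return {
        "RoleARN": role_arn,
        "BucketARN": bucket_arn,
        # Format conversion works on larger batches, so use a large buffer.
        "BufferingHints": {"SizeInMBs": 128, "IntervalInSeconds": 300},
        # Compression is chosen in the Parquet serializer below, not here.
        "CompressionFormat": "UNCOMPRESSED",
        "DataFormatConversionConfiguration": {
            "Enabled": True,
            # Firehose reads the record schema from a Glue Data Catalog table.
            "SchemaConfiguration": {
                "RoleARN": role_arn,
                "DatabaseName": glue_database,
                "TableName": glue_table,
                "Region": region,
                "VersionId": "LATEST",
            },
            # Assumption: incoming records are JSON.
            "InputFormatConfiguration": {"Deserializer": {"OpenXJsonSerDe": {}}},
            # Swap ParquetSerDe for OrcSerDe to write ORC instead.
            "OutputFormatConfiguration": {
                "Serializer": {"ParquetSerDe": {"Compression": "SNAPPY"}}
            },
        },
    }


def create_stream(stream_name, destination):
    """Create the delivery stream; requires boto3 and AWS credentials."""
    import boto3  # imported here so the config builder has no dependencies

    client = boto3.client("firehose")
    return client.create_delivery_stream(
        DeliveryStreamName=stream_name,
        DeliveryStreamType="DirectPut",
        ExtendedS3DestinationConfiguration=destination,
    )
```

A call might look like `create_stream("example-stream", build_parquet_destination("arn:aws:s3:::example-bucket", "arn:aws:iam::123456789012:role/example-firehose-role", "example_db", "example_table", "us-east-1"))`, with each placeholder replaced by a real resource in your account.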

