Monday, 14 May 2018

Stream Real-Time Data in Apache Parquet or ORC Format with Amazon Kinesis Data Firehose

AWS has added support for the Apache Parquet and Apache ORC formats in Amazon Kinesis Data Firehose, so you can now stream real-time data into Amazon Simple Storage Service (Amazon S3) in a form suited to analytics and cost-effective storage. Parquet and ORC are columnar data formats, which let you store and query data more efficiently and at lower cost than row-oriented formats. A Kinesis Data Firehose delivery stream can be set up to convert incoming data to Parquet or ORC automatically before delivering it to the S3 bucket. You don't have to write any conversion code, and you can query the resulting S3 data much faster with Amazon Redshift Spectrum and Amazon Athena, which reduces both query cost and storage footprint.
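As a rough illustration of what setting up such a delivery stream can look like from code rather than the console, here is a minimal Python sketch that builds the destination settings boto3's `create_delivery_stream` expects. It assumes the incoming records are JSON and that the target schema is already defined as a table in the AWS Glue Data Catalog. The helper names (`build_destination`, `create_stream`), the buffering values, and every ARN, database, and table name are hypothetical placeholders, not values from this post.

```python
def build_destination(bucket_arn, role_arn, glue_database, glue_table,
                      region, output_format="parquet"):
    """Build an ExtendedS3DestinationConfiguration dict with format conversion.

    All argument values are supplied by the caller; nothing here is taken
    from a real account.
    """
    serializers = {
        "parquet": {"ParquetSerDe": {}},
        "orc": {"OrcSerDe": {}},
    }
    if output_format not in serializers:
        raise ValueError("output_format must be 'parquet' or 'orc'")

    return {
        "RoleARN": role_arn,
        "BucketARN": bucket_arn,
        # Assumed buffering values; larger buffers give larger columnar files.
        "BufferingHints": {"SizeInMBs": 128, "IntervalInSeconds": 300},
        "DataFormatConversionConfiguration": {
            "Enabled": True,
            # The schema used for conversion is read from a Glue table.
            "SchemaConfiguration": {
                "RoleARN": role_arn,
                "DatabaseName": glue_database,
                "TableName": glue_table,
                "Region": region,
                "VersionId": "LATEST",
            },
            # Incoming records are assumed to be JSON.
            "InputFormatConfiguration": {
                "Deserializer": {"OpenXJsonSerDe": {}}
            },
            "OutputFormatConfiguration": {
                "Serializer": serializers[output_format]
            },
        },
    }


def create_stream(stream_name, destination):
    """Create the delivery stream (requires boto3 and AWS credentials)."""
    import boto3  # imported here so the builder above works without boto3

    firehose = boto3.client("firehose")
    return firehose.create_delivery_stream(
        DeliveryStreamName=stream_name,
        DeliveryStreamType="DirectPut",
        ExtendedS3DestinationConfiguration=destination,
    )
```

Calling `build_destination(...)` with `output_format="orc"` switches the serializer to ORC; passing the result to `create_stream("my-stream", destination)` would then create the stream, assuming the IAM role can write to the bucket and read the Glue table.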

