AWS Certified Big Data - Specialty Exam: Transformation Methods for VPC Flow Logs

Question

Your company has just enabled VPC Flow Logs for a large number of network interfaces.

They need to stream the data into an S3 bucket and are using Amazon Kinesis Data Firehose for this purpose.

The data must be transformed before it can be used for analysis.

Which of the following can be used to perform the transformation?

Answers

Explanations

A. AWS Lambda
B. Kinesis KCL
C. Kinesis KPL
D. Amazon SQS

Answer - A.

An example of this is given in the AWS Documentation.

#######

3. Decompress records with AWS Lambda.

There may be situations where you want to transform or enrich streaming data before writing it to its final destination. In this solution, we must decompress the data that is streamed from CloudWatch Logs. With the Amazon Kinesis Data Firehose Data Transformation feature, we can decompress the data with an AWS Lambda function. Kinesis Data Firehose manages the invocation of the function. Inside the function, the data is decompressed and returned to Kinesis Data Firehose. The complete source code for the Lambda function can be found here.

#######
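The decompression step described in the excerpt can be sketched roughly as follows. This is a minimal illustration, not the source code the excerpt refers to. It assumes the records arrive from a CloudWatch Logs subscription (base64-encoded, gzip-compressed JSON with a `messageType` field and a `logEvents` list), and the helper name `transform_record` is made up for this example.

```python
import base64
import gzip
import json


def transform_record(record):
    """Decompress one Firehose record that carries CloudWatch Logs data."""
    # Firehose passes the payload base64-encoded; the CloudWatch Logs
    # subscription has gzip-compressed it before that.
    payload = json.loads(gzip.decompress(base64.b64decode(record["data"])))

    # Control messages carry no log data, so they are dropped.
    if payload.get("messageType") != "DATA_MESSAGE":
        return {"recordId": record["recordId"], "result": "Dropped"}

    # Each log event holds one flow log line; newline-delimit them for S3.
    lines = "".join(event["message"] + "\n" for event in payload["logEvents"])
    return {
        "recordId": record["recordId"],
        "result": "Ok",
        "data": base64.b64encode(lines.encode("utf-8")).decode("utf-8"),
    }


def lambda_handler(event, context):
    # Firehose expects every incoming recordId back, each with a result.
    return {"records": [transform_record(r) for r in event["records"]]}
```

Firehose matches each returned entry to its input by `recordId`, which is why the handler returns one entry per incoming record, including the dropped ones.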

Option B is invalid since this is used to consume data from Kinesis streams.

Option C is invalid since this is used to send data to Kinesis streams.

Option D is invalid since this is a queuing service.

For more information on this use case, refer to the following URL:

https://aws.amazon.com/blogs/big-data/analyze-and-visualize-your-vpc-network-traffic-using-amazon-kinesis-and-amazon-athena/

In this scenario, the requirement is to transform VPC Flow Logs data before it is stored in an S3 bucket. Kinesis Data Firehose has a built-in data transformation feature that invokes an AWS Lambda function on the buffered incoming records, so the transformation runs serverlessly as part of the delivery stream.
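As an illustration of what "transforming" a flow log record for analysis might involve, the sketch below turns one space-separated line in the default (version 2) flow log format into a dictionary with typed fields. The function name `parse_flow_log` and the underscore-style keys are choices made for this example (the AWS field names use hyphens, such as `account-id`), and custom log formats would need a different field list.

```python
from typing import Dict, Optional, Union

# Field order of the default (version 2) VPC Flow Logs record format.
FLOW_LOG_FIELDS = (
    "version", "account_id", "interface_id", "srcaddr", "dstaddr",
    "srcport", "dstport", "protocol", "packets", "bytes",
    "start", "end", "action", "log_status",
)
# Fields that hold integers when a value is present.
INT_FIELDS = {
    "version", "srcport", "dstport", "protocol",
    "packets", "bytes", "start", "end",
}


def parse_flow_log(line: str) -> Dict[str, Optional[Union[int, str]]]:
    """Parse one default-format flow log line into a dict."""
    values = line.split()
    if len(values) != len(FLOW_LOG_FIELDS):
        raise ValueError(f"expected {len(FLOW_LOG_FIELDS)} fields, got {len(values)}")

    record: Dict[str, Optional[Union[int, str]]] = {}
    for name, value in zip(FLOW_LOG_FIELDS, values):
        if value == "-":
            # NODATA and SKIPDATA records use "-" where no value exists.
            record[name] = None
        elif name in INT_FIELDS:
            record[name] = int(value)
        else:
            record[name] = value
    return record
```

A transformation function could call something like this on each line and emit JSON, which tends to be easier to query downstream than raw space-separated text.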

Option A: AWS Lambda - Correct. Kinesis Data Firehose can be configured to invoke a Lambda function on the incoming records. The function returns the transformed records to Firehose, which then delivers them to the S3 bucket.
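To make the integration concrete, the sketch below builds the part of a delivery stream definition that attaches a Lambda function as a processor. The helper names are invented for this example and every ARN is a placeholder; the dictionary keys follow the shape that the Firehose `CreateDeliveryStream` API accepts for an extended S3 destination.

```python
def lambda_processing_configuration(lambda_arn: str) -> dict:
    """Build a ProcessingConfiguration that runs records through a Lambda."""
    return {
        "Enabled": True,
        "Processors": [
            {
                "Type": "Lambda",
                "Parameters": [
                    {"ParameterName": "LambdaArn", "ParameterValue": lambda_arn},
                ],
            }
        ],
    }


def extended_s3_destination(role_arn: str, bucket_arn: str, lambda_arn: str) -> dict:
    """Build an S3 destination whose records are transformed by a Lambda."""
    return {
        "RoleARN": role_arn,      # role Firehose assumes to write to S3
        "BucketARN": bucket_arn,  # target bucket for the transformed data
        "ProcessingConfiguration": lambda_processing_configuration(lambda_arn),
    }


# Placeholder ARNs, for illustration only.
destination = extended_s3_destination(
    role_arn="arn:aws:iam::123456789012:role/example-firehose-role",
    bucket_arn="arn:aws:s3:::example-flow-logs-bucket",
    lambda_arn="arn:aws:lambda:us-east-1:123456789012:function:example-transform",
)
```

With boto3 installed and credentials configured, a dictionary like this would be passed as the `ExtendedS3DestinationConfiguration` argument of the Firehose client's `create_delivery_stream` call, alongside a `DeliveryStreamName`.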

Option B: Kinesis KCL - The Kinesis Client Library (KCL) is used for building consumer applications that read and process records from Kinesis data streams. It is not a transformation option for a Firehose delivery stream.

Option C: Kinesis KPL - The Kinesis Producer Library (KPL) is used for building applications that write data to Kinesis data streams. It plays no role in transforming records inside Firehose.

Option D: Amazon SQS - Amazon Simple Queue Service (SQS) is a fully managed message queuing service that enables decoupling and scaling of microservices, distributed systems, and serverless applications. It is a queue, not a data transformation feature.

Therefore, the correct answer is Option A: AWS Lambda.