Tick-Bank: Web Traffic Analytics for User Navigational Behavior | BDS-C00 Exam Question

Library for Aggregation Mechanism

Question

Tick-Bank is a privately held Internet retailer of both physical and digital products founded in 2008

The company has more than six-million clients worldwide.

Tick-Bank aims to serve as a connection between digital content makers and affiliate dealers, who then promote them to clients.

Tick-Bank's technology aids in payments, tax calculations and a variety of customer service tasks.

Tick-Bank assists in building perceptibility and revenue making opportunities for entrepreneurs. Tick-Bank runs multiple java based web applications running on windows based EC2 machines in AWS managed by internal IT Java team, to serve various business functions.

Tick-Bank is looking to enable web-site traffic analytics there by understanding user navigational behavior, preferences and other click related info.

The amount of data captured per click is in tens of bytes.

Tick-Bankhas the following objectives in mind for the solution. Tick-Bank has multiple data streams supporting various business processes.

To optimize the costs, Tick-Bank wants to use batching methods (aggregation and collection) appropriate library to process the data.

Which library supports aggregation mechanism.

select 1 option.

Answers

Explanations

Click on the arrows to vote for the correct answer

A. B. C. D.

Answer: A.

Option A is correct - KPL supports batching of records.

Batching refers to performing a single action on multiple items instead of repeatedly performing the action on each individual item.

In this context, the "item" is a record, and the action is sending it to Kinesis Data Streams.

In a non-batching situation, you would place each record in a separate Kinesis Data Streams record and make one HTTP request to send it to Kinesis Data Streams.

With batching, each HTTP request can carry multiple records instead of just one.

The KPL supports two types of batching:

Aggregation - Storing multiple records within a single Kinesis Data Streams record.

Collection - Using the API operation PutRecords to send multiple Kinesis Data Streams records to one or more shards in your Kinesis data stream.

The two types of KPL batching are designed to coexist and can be turned on or off independently of one another.

By default, both are turned on.

https://docs.aws.amazon.com/streams/latest/dev/kinesis-kpl-concepts.html

Option B is incorrect -Develop producers using the Amazon Kinesis Data Streams API with the AWS SDK for Java.

Batching is not supported/

https://docs.aws.amazon.com/streams/latest/dev/developing-producers-with-sdk.html

Option C is incorrect -Kinesis Agent is a stand-alone Java software application that offers an easy way to collect and send data to Kinesis Data Streams.

The agent continuously monitors a set of files and sends new data to your stream.

The agent handles file rotation, checkpointing, and retry upon failures.

It delivers all of your data in a reliable, timely, and simple manner.

It also emits Amazon CloudWatch metrics to help you better monitor and troubleshoot the streaming process.

Batching is not supported.

https://docs.aws.amazon.com/streams/latest/dev/writing-with-agents.html

The correct answer is C. Amazon Kinesis Data Streams API supports batching of user records, both aggregation and collection.

Amazon Kinesis Data Streams is a fully managed service that allows real-time processing of streaming data at a large scale. It is used to collect, process, and analyze data in real-time, enabling the application of machine learning and other real-time analytics to derive insights from data.

Kinesis Data Streams API supports batching of user records, both aggregation, and collection, allowing users to collect and process data in batches to optimize the costs. Batching enables efficient use of resources and reduces the number of requests made to Kinesis Data Streams, thus reducing costs.

The Kinesis Producer Library (KPL) is a library that makes it easy to produce data to Amazon Kinesis Data Streams. It provides batching, aggregation, and automatic retries, among other features. However, KPL is a producer library, and it is used for producing data, not processing data. Therefore, option A is incorrect.

The Kinesis Agent is a service that simplifies and automates the process of collecting and sending data to Amazon Kinesis Data Streams. However, it is used for data collection and not data processing. Therefore, option B is incorrect.

In conclusion, option C, Amazon Kinesis Data Streams API supports batching of user records, both aggregation and collection, is the correct answer to the given question.