AWS DynamoDB: Storage Options for Log Files

Storage Options for Log Files

Question

Currently a company is using AWS DynamoDB since they want a serverless architecture and they don't want to manage the underlying infrastructure.

They use the DynamoDB service to ingest and store large amounts of metric related data.

They now want to start storing a large number of log files which can be used for log analytics.

Which of the following can be used for the storage of the log files?

Answers

Explanations

Click on the arrows to vote for the correct answer

A. B. C. D.

Answer - B.

The AWS Documentation mentions the following.

Amazon Elasticsearch Service (Amazon ES) is a managed service that makes it easy to deploy, operate, and scale Elasticsearch clusters in the AWS Cloud.

Elasticsearch is a popular open-source search and analytics engine for use cases such as log analytics, real-time application monitoring, and clickstream analysis.

With Amazon ES, you get direct access to the Elasticsearch APIs; existing code and applications work seamlessly with the service.

All other options are invalid since the best way to work with search capabilities is to use AWS ElasticSearch.

For more information on what is Elasticsearch, please refer to the below URL.

https://docs.aws.amazon.com/elasticsearch-service/latest/developerguide/what-is-amazon-elasticsearch-service.html

For the storage of log files, the most suitable service from the given options is Amazon S3 (Simple Storage Service).

Amazon S3 is a highly scalable object storage service that allows storing and retrieving data from anywhere on the web. It is designed for 99.999999999% durability and can store virtually unlimited amounts of data.

On the other hand, DynamoDB is a fully managed NoSQL database service that is designed to deliver high performance, reliability, and scalability. It is optimized for the storage and retrieval of structured data, and not recommended for storing large binary data such as log files.

AWS ElasticSearch is a managed service that enables real-time search and analysis of data. It is built on the open-source Elasticsearch and Kibana software and is best suited for search, analysis, and visualization of log data. It can be used in conjunction with S3, which can store the log data and ElasticSearch can index and query the data.

AWS Redshift is a data warehouse service that is optimized for querying and analyzing structured data using SQL. It is not an ideal choice for storing unstructured data like log files.

Therefore, the most appropriate service for storing large numbers of log files is Amazon S3. S3 provides an object storage facility that is cost-effective, highly scalable, durable, and secure.