10-Year Log Retention for Cost-Optimized Troubleshooting - AWS Exam Prep

Cost-Optimized Technique for 10-Year Log Retention and Troubleshooting

Prev Question Next Question

Question

You have been given a business requirement to retain log files for your application for 10 years.

You need to regularly retrieve the most recent logs for troubleshooting.

Your logging system must be able to store the given large volume of logs.

What cost-optimized technique should you use to meet these requirements?

Answers

Explanations

Click on the arrows to vote for the correct answer

A. B. C. D.

Correct answer is.

C.Option A is invalid, because although it can store the large volume of logs, it is NOT a "cost-optimized technique" that the question asks for.

Option B is invalid, because it won't server the purpose of regularly retrieve the most recent logs for troubleshooting.

Option D is invalid, because it is not an ideal storage option.

You can define lifecycle configuration rules for objects that have a well-defined lifecycle.

For example:

If you are uploading periodic logs to your bucket, your application might need these logs for a week or a month after creation, and after that you might want to delete them.

Some documents are frequently accessed for a limited period of time.

After that, these documents are less frequently accessed.

Over time, you might not need real-time access to these objects, but your organization or regulations might require you to archive them for a longer period and then optionally delete them later.

You might also upload some types of data to Amazon S3 primarily for archival purposes, for example digital media archives, financial and healthcare records, raw genomics sequence data, long-term database backups, and data that must be retained for regulatory compliance.

For more information on Lifecycle management please refer to the below link:

http://docs.aws.amazon.com/AmazonS3/latest/dev/object-lifecycle-mgmt.html

The most cost-effective technique to meet the given requirements is to store the logs in Amazon S3 and use lifecycle policies to archive them to Amazon Glacier.

Option A - Store your logs in Amazon CloudWatch Logs: Amazon CloudWatch Logs is a service used to monitor, store, and access log files from various AWS resources. However, it is not recommended for long-term storage because of the high cost and limitation of retention periods up to 10 years. Retrieving logs from CloudWatch Logs can also be expensive.

Option B - Store your logs in Amazon Glacier: Amazon Glacier is a low-cost object storage service for data archival and long-term backup. It is optimized for infrequently accessed data, and retrieval times can range from minutes to hours, making it unsuitable for regularly retrieving logs for troubleshooting.

Option C - Store your logs in Amazon S3, and use lifecycle policies to archive to Amazon Glacier: Amazon S3 is an object storage service that provides scalable, durable, and highly available storage for any type of data. By using lifecycle policies, you can automate the transition of objects between different storage classes based on their age and access patterns. You can start by storing your logs in Amazon S3, and after a certain period, transition them to the more cost-effective Amazon Glacier for long-term archival. This method is cost-effective and allows for quick retrieval of logs for troubleshooting when required.

Option D - Store your logs on Amazon EBS, and use Amazon EBS snapshots to archive them: Amazon Elastic Block Store (EBS) is a high-performance block storage service used to store persistent data for EC2 instances. While EBS provides a reliable and performant storage option for logs, it is not designed for long-term archival storage. Amazon EBS snapshots can be used to create incremental backups of EBS volumes, but they are not ideal for storing log files for long periods.

In summary, option C - storing logs in Amazon S3 and using lifecycle policies to archive to Amazon Glacier - is the most cost-effective technique to meet the given requirements, as it provides scalable storage, automated transitions to lower-cost storage, and quick retrieval of logs for troubleshooting.