Amazon BDS-C00: AWS Certified Big Data - Specialty Exam | Data Store Options for Range Searches, Term Boosting, and Faceting

Data Store Options for Range Searches, Term Boosting, and Faceting

Question

A company wants to have a data store on AWS that can be used to store a large number of log files.

In addition, the service should have the following capabilities:

Range searches

Term boosting

Faceting

Which of the following can be used as the data store?

Answers

A. Amazon S3

B. Amazon CloudSearch

C. Amazon Redshift

D. Amazon DynamoDB

Explanations

Answer - B.

The AWS documentation states the following:

You can use Amazon CloudSearch to index and search both structured data and plain text.

Amazon CloudSearch features:

Full text search with language-specific text processing.

Boolean search.

Prefix searches.

Range searches.

Term boosting.

Faceting.

Highlighting.

Autocomplete Suggestions.

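Because the documentation lists range searches, term boosting, and faceting among the features of Amazon CloudSearch, and none of the other options offers all three as built-in features, the other options are incorrect.

For more information on Amazon CloudSearch, refer to the following URL: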

https://docs.aws.amazon.com/cloudsearch/latest/developerguide/what-is-cloudsearch.html

Based on the requirements given, the best option for a data store on AWS is Amazon CloudSearch (Option B).

Here's why:

Amazon S3 (Option A) is an object storage service that can store any type of data, including log files. However, it has no built-in index over object contents, so it does not provide range searches, term boosting, or faceting. Searching data in S3 requires running a separate search or query tool on top of it, which adds cost and operational overhead for large-scale log search.

Amazon Redshift (Option C) is a data warehousing service designed for large-scale analytics. Its SQL interface can express range predicates and pattern matching, but it is built for complex queries against structured data rather than relevance-ranked search over log text, and it has no term boosting.

Amazon DynamoDB (Option D) is a NoSQL database service that provides fast, low-latency, key-based access to items. Although it can hold log records, its queries are driven by primary keys and indexes, and it does not provide full-text search, term boosting, or faceting.

Amazon CloudSearch (Option B) is a managed search service that provides full-text search and indexing. It can index a large volume of log data and supports range searches, term boosting, and faceting, along with Boolean and prefix searches. It also scales with demand and provides language-specific text processing.
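To make the three required capabilities concrete, here is a minimal sketch of how a single CloudSearch search request could combine them using the structured query parser. The endpoint and the index field names (`message`, `status_code`, `service`) are hypothetical and assume a domain whose index has been configured with those fields; the code only builds the request URL and does not call AWS.

```python
import json
from urllib.parse import parse_qs, urlencode, urlsplit

# Hypothetical search endpoint; a real domain has its own endpoint.
SEARCH_ENDPOINT = "https://search-logs-example.us-east-1.cloudsearch.amazonaws.com"


def quote_term(value):
    """Wrap a value in single quotes, escaping backslashes and single quotes."""
    escaped = value.replace("\\", "\\\\").replace("'", "\\'")
    return f"'{escaped}'"


def build_log_search_url(term, boosted_term, status_min, status_max,
                         facet_field="service"):
    """Build a search URL combining term boosting, a range search, and a facet.

    Field names (message, status_code) are assumptions for illustration.
    """
    query = (
        "(and "
        # Term boosting: matches on boosted_term weigh more than matches on term.
        f"(or (term field=message boost=3 {quote_term(boosted_term)}) "
        f"(term field=message {quote_term(term)})) "
        # Range search: inclusive bounds on an integer field.
        f"(range field=status_code [{status_min},{status_max}]))"
    )
    params = {
        "q": query,
        "q.parser": "structured",
        # Faceting: return the five most frequent values of facet_field.
        f"facet.{facet_field}": json.dumps({"sort": "count", "size": 5}),
        "size": "10",
    }
    return f"{SEARCH_ENDPOINT}/2013-01-01/search?{urlencode(params)}"


url = build_log_search_url("error", "timeout", 500, 599)
decoded = parse_qs(urlsplit(url).query)
print(decoded["q"][0])
```

Sending this URL as an HTTP GET to a real search endpoint would return matching documents together with facet counts for `service`. None of the other three options accepts a request of this shape natively.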

In summary, Amazon CloudSearch is the best option here because it is the only one of the four that provides range searches, term boosting, and faceting as built-in features.