Determining if Reviews are Authentic: AWS Machine Learning Classification

Identifying Authentic Customer Reviews

Question

A company wants to start using AWS Machine Learning.

The main goal of the program is to determine whether each review posted on the company's website was written by a customer or by a bot.

Which of the following ML problem types would you use?

Answers

Explanations

A. B. C. D.

Answer - C.

The AWS documentation states the following:

You can use supervised ML approaches for these specific machine learning tasks: binary classification (predicting one of two possible outcomes), multiclass classification (predicting one of more than two outcomes) and regression (predicting a numeric value).

Examples of binary classification problems:

Will the customer buy this product or not buy this product?

Is this email spam or not spam?

Is this product a book or a farm animal?

Is this review written by a customer or a robot?

For more information on the different problem types, refer to the URL below.

https://docs.aws.amazon.com/machine-learning/latest/dg/machine-learning-problems-in-amazon-machine-learning.html

The appropriate type of ML task for determining whether a review was written by a customer or a bot is binary classification.

Binary classification is a supervised learning task in which each data point is assigned to one of two classes. In this case, the two classes are "customer" and "bot." A learning algorithm is trained on a labeled dataset, where each example is already tagged with one of the two classes.

During training, the algorithm learns to identify patterns and features in the data that are indicative of each class. Once the training is complete, the algorithm can be used to predict the class of new data points. In this case, the algorithm would analyze the text of a review and determine whether it was written by a customer or a bot.
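The train-then-predict flow described above can be sketched in a few lines of Python. This is only an illustration: it uses a small Naive Bayes word-count classifier, which is not necessarily the algorithm AWS uses, and the training reviews, the function names `train` and `predict`, and the sample inputs are all invented for this example.

```python
import math
from collections import Counter

# Hypothetical labeled reviews, invented purely for illustration.
TRAINING_DATA = [
    ("the blender arrived on time and works great", "customer"),
    ("battery life is shorter than i expected but it is fine", "customer"),
    ("my daughter loves this toy and plays with it daily", "customer"),
    ("click here for free gift cards best deal now", "bot"),
    ("best deal now visit this link for free prizes", "bot"),
    ("free free free click this link now", "bot"),
]


def train(examples):
    """Count words per class and examples per class from labeled data."""
    word_counts = {"customer": Counter(), "bot": Counter()}
    class_counts = Counter()
    for text, label in examples:
        class_counts[label] += 1
        word_counts[label].update(text.lower().split())
    vocabulary = set(word_counts["customer"]) | set(word_counts["bot"])
    return word_counts, class_counts, vocabulary


def predict(text, model):
    """Return the more likely of the two classes for a new review."""
    word_counts, class_counts, vocabulary = model
    total_examples = sum(class_counts.values())
    scores = {}
    for label in ("customer", "bot"):
        # Log prior: how common this class was in the training data.
        score = math.log(class_counts[label] / total_examples)
        total_words = sum(word_counts[label].values())
        for word in text.lower().split():
            # Add-one smoothing so an unseen word does not zero out the score.
            likelihood = (word_counts[label][word] + 1) / (total_words + len(vocabulary))
            score += math.log(likelihood)
        scores[label] = score
    return max(scores, key=scores.get)


model = train(TRAINING_DATA)
print(predict("click this link for a free gift", model))                   # bot
print(predict("the toy arrived on time and my daughter loves it", model))  # customer
```

Whatever the input, the output is always one of exactly two labels, which is what makes this a binary classification problem. A real system would need far more training data and a held-out set to measure accuracy.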

The other options are not appropriate for this use case.

Single-class classification applies when the training data contains examples of only one class. Multiclass classification applies when there are more than two classes. Regression predicts a numeric value rather than a category.
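One way to see the difference between these options is to look at the target column the model is asked to predict. The sketch below is a simplifying heuristic, not part of any AWS API; the helper name `problem_type` and its rules are assumptions made for illustration (for instance, it treats any all-numeric target as regression, which would be wrong for numeric class IDs).

```python
def problem_type(labels):
    """Guess the supervised task from the values in the target column."""
    distinct = set(labels)
    if not distinct:
        raise ValueError("at least one label is required")
    # bool is a subclass of int in Python, so exclude it from the numeric check.
    if all(isinstance(v, (int, float)) and not isinstance(v, bool) for v in distinct):
        return "regression"
    if len(distinct) == 1:
        return "single-class classification"
    if len(distinct) == 2:
        return "binary classification"
    return "multiclass classification"


print(problem_type(["customer", "bot", "customer"]))  # binary classification
print(problem_type(["book", "toy", "appliance"]))     # multiclass classification
print(problem_type([4.5, 3.0, 5.0]))                  # regression
```

The review question has exactly two possible labels, "customer" and "bot", so it falls in the binary case.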

In summary, binary classification is the appropriate choice for determining whether a review was written by a customer or a bot.