Unstructured Data Classification MCQ Answer



26.In document classification, each document has to be converted from full text to a document vector

Answer : True

27.A technique used to depict the performance in a tabular form that has 2 dimensions namely actual and predicted sets of data is _

Answer : Confusion Matrix
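
As an illustration of the actual-versus-predicted table, here is a minimal sketch in plain Python. The labels and the helper name `confusion_matrix` are made up for this example; rows are actual classes and columns are predicted classes, which is one common convention but not the only one.

```python
from collections import Counter

def confusion_matrix(actual, predicted, labels):
    """Return a nested list: rows are actual classes, columns are predicted classes."""
    counts = Counter(zip(actual, predicted))
    return [[counts[(a, p)] for p in labels] for a in labels]

# Hypothetical labels for six emails (not taken from any real dataset).
actual    = ["spam", "spam", "ham", "ham", "ham", "spam"]
predicted = ["spam", "ham",  "ham", "ham", "spam", "spam"]

matrix = confusion_matrix(actual, predicted, labels=["spam", "ham"])
for label, row in zip(["spam", "ham"], matrix):
    print(label, row)
```

The diagonal cells count correct predictions; the off-diagonal cells count the two kinds of mistakes.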

28.Which NLP technique uses a lexical knowledge base to obtain the correct base form of the words?

Answer : Lemmatization
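
To make the contrast with stemming concrete, here is a toy sketch. The four-entry `LEXICON` is a hypothetical stand-in for a real lexical knowledge base, and `naive_stem` is a deliberately crude suffix stripper written for this example, not any published stemming algorithm.

```python
# Toy lexicon standing in for a real lexical knowledge base (hypothetical entries).
LEXICON = {"mice": "mouse", "better": "good", "studies": "study", "running": "run"}

def naive_stem(word):
    """Strip a few common suffixes without consulting any vocabulary."""
    for suffix in ("ing", "ies", "es", "s"):
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)]
    return word

def lemmatize(word):
    """Look the word up in the lexicon; fall back to the word itself."""
    return LEXICON.get(word, word)

words = ["mice", "better", "studies", "running"]
stems = [naive_stem(w) for w in words]    # rule-based, may produce non-words
lemmas = [lemmatize(w) for w in words]    # lookup-based, returns dictionary forms
print(stems)
print(lemmas)
```

The stemmer leaves irregular forms untouched and can cut regular ones down to non-words, while the lookup returns a valid base form for every word the lexicon covers.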

29.Which numerical statistic is used to identify the importance of a rare word in a document?

Answer : TF-IDF
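
A minimal sketch of why TF-IDF favours rare words, using the textbook form tf × log(N / df). The documents and the helper name `tf_idf` are assumptions for this example; library implementations usually apply smoothing and normalisation, so their numbers will differ.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Return one {term: weight} dict per document.

    tf  = count of term in document / number of tokens in document
    idf = log(N / number of documents containing the term)
    """
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents does each term appear?
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        weights.append({
            term: (count / len(tokens)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights

# Hypothetical three-document corpus.
docs = ["free prize inside", "meeting notes inside", "project meeting inside"]
weights = tf_idf(docs)
print(weights[0])
```

Here "inside" appears in every document and gets weight zero, while "free" appears in only one document and gets the largest weight.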

30.Which type of cross-validation is used for an imbalanced dataset?

Answer : Stratified K-Fold (plain K-Fold does not keep the class ratio the same in every fold)
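
With imbalanced classes, a common approach is to stratify the split so that each fold keeps roughly the same class ratio as the full dataset. The sketch below shows the idea in plain Python; the function name `stratified_folds` and the labels are hypothetical, and it skips the shuffling that a real implementation would normally do first.

```python
from collections import defaultdict

def stratified_folds(labels, k):
    """Assign each sample index to one of k folds, keeping class ratios similar."""
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)
    folds = [[] for _ in range(k)]
    for indices in by_class.values():
        # Deal each class's samples round-robin across the folds.
        for position, index in enumerate(indices):
            folds[position % k].append(index)
    return folds

# Hypothetical imbalanced labels: 8 negatives, 4 positives.
labels = [0] * 8 + [1] * 4
folds = stratified_folds(labels, k=4)
positives_per_fold = [sum(labels[i] for i in fold) for fold in folds]
print(positives_per_fold)
```

Every fold ends up with one positive sample, so no validation fold is left without the minority class.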

31.Cross-validation causes over-fitting.

Answer : False

32.Is there a class imbalance problem in the given data set?

Answer : Yes

33.SVM is a _

Answer : Supervised learning algorithm

34.In a Term Document Matrix (TDM), each row represents __

Answer : A term (each column represents a document; the cells hold counts or TF-IDF values)

35.Imagine you have just finished training a decision tree for spam classification, and it is showing abnormally bad performance on both your training and test sets. Assume that your implementation has no bugs. What could be the reason for this problem?

Answer : All the options


36.In a Document Term Matrix (DTM), each row represents

Answer : A document (each column represents a term; the cells hold counts or TF-IDF values)
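
For reference, here is a small sketch of the usual layout of the two matrices: in a document-term matrix the rows are documents and the columns are terms, and the term-document matrix is its transpose. The cells hold raw counts here, although TF-IDF weights could be stored instead. The documents and the helper name `document_term_matrix` are made up for this example.

```python
def document_term_matrix(docs):
    """Return (vocabulary, matrix) where matrix[i][j] counts term j in document i."""
    tokenized = [doc.lower().split() for doc in docs]
    vocabulary = sorted({term for tokens in tokenized for term in tokens})
    matrix = [[tokens.count(term) for term in vocabulary] for tokens in tokenized]
    return vocabulary, matrix

# Hypothetical two-document corpus.
docs = ["win a free prize", "free free offer"]
vocabulary, dtm = document_term_matrix(docs)

# Transposing swaps the roles: rows become terms, columns become documents.
tdm = [list(column) for column in zip(*dtm)]

print(vocabulary)
print(dtm)
print(tdm)
```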

37.Email spam data is an example of __

Answer : Unstructured data

38.High classification accuracy always indicates a good classifier.

Answer : False
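
A quick sketch of why accuracy alone can mislead on imbalanced data. The 95/5 split and the "always predict ham" classifier are hypothetical, chosen only to make the arithmetic obvious.

```python
def accuracy(actual, predicted):
    """Fraction of predictions that match the actual labels."""
    return sum(a == p for a, p in zip(actual, predicted)) / len(actual)

def recall(actual, predicted, positive):
    """Fraction of actual positives that were predicted as positive."""
    hits = sum(a == positive and p == positive for a, p in zip(actual, predicted))
    total = sum(a == positive for a in actual)
    return hits / total

# Hypothetical data: 95 legitimate emails and 5 spam emails.
actual = ["ham"] * 95 + ["spam"] * 5
# A trivial classifier that labels everything as "ham".
predicted = ["ham"] * 100

acc = accuracy(actual, predicted)
spam_recall = recall(actual, predicted, positive="spam")
print(acc, spam_recall)
```

The classifier scores 95% accuracy while catching no spam at all, which is why measures such as recall, precision, or the confusion matrix are usually checked alongside accuracy.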

39.__ directly achieves multi-class classification (without the support of binary classifiers).

Answer : K Nearest Neighbor

40.A classifier that can compute using numeric as well as categorical values is __

Answer : Random Forest Classifier

41.Lemmatization offers better precision than stemming.

Answer : True

42.The following are pre-processing methods used for unstructured data classification, except______

Answer : Confusion_matrix

43.TF-IDF is a feature extraction technique.

Answer : True

44.The higher value of which of the following hyperparameters is better for the decision tree algorithm?

Answer : Cannot say

45.What kind of classification is the given case study (Sentiment Analysis dataset)?

Answer : Binary classification

46.Which of the following commands is used to view the dataset SIZE, and what is the value returned?

Answer : sentiment_analysis_data.shape, (6918, 2)

