Databricks Certified Professional Data Scientist Exam Practice Test

Page: 1 / 14
Total 138 questions
Question 1

In which of the following scenarios should you apply Bayes' theorem?



Answer : D


Question 2

You have used k-means clustering to classify the behavior of 100,000 customers for a retail store. You decide to use household income, age, gender, and yearly purchase amount as measures. You have chosen to use 8 clusters and notice that 2 clusters have only 3 customers assigned. What should you do?



Answer : C


Question 3

You are creating a classification process whose inputs are a customer's income, education, and current debt. What could be a possible output of this process?



Answer : D


Question 4

What is one modeling or descriptive statistical function in MADlib that is typically not provided in a standard relational database?



Answer : C


Question 5

Regularization is an important technique in machine learning for preventing overfitting. Optimizing with an L1 regularization term is harder than with an L2 regularization term because



Answer : A


Question 6

Scenario: Suppose that Bob can go to work by one of three modes of transportation: car, bus, or commuter train. Because of high traffic, if he goes by car, there is a 50% chance he will be late. If he goes by bus, which has special reserved lanes but is sometimes overcrowded, the probability of being late is only 20%. The commuter train is almost never late, with a probability of only 1%, but is more expensive than the bus.

Suppose that Bob is late one day, and his boss wishes to estimate the probability that he drove to work that day by car. Since he does not know which mode of transportation Bob usually uses, he assigns a prior probability of 1/3 to each of the three possibilities. Which of the following methods will the boss use to estimate the probability that Bob drove to work?



Answer : A
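
As a worked illustration of the numbers in this scenario, the sketch below computes the posterior probability of each mode of transportation given that Bob was late, assuming Bayes' theorem is applied with the stated priors (1/3 each) and lateness probabilities (50%, 20%, 1%). The function name `posterior` and the dictionary names are hypothetical, chosen only for this example.

```python
# Priors and likelihoods taken from the scenario in Question 6.
priors = {"car": 1 / 3, "bus": 1 / 3, "train": 1 / 3}
p_late_given = {"car": 0.50, "bus": 0.20, "train": 0.01}


def posterior(priors, likelihoods):
    """Apply Bayes' theorem: P(mode | late) = P(late | mode) * P(mode) / P(late)."""
    # Total probability of being late, summed over all modes.
    evidence = sum(priors[m] * likelihoods[m] for m in priors)
    return {m: priors[m] * likelihoods[m] / evidence for m in priors}


post = posterior(priors, p_late_given)
for mode, p in post.items():
    print(f"P({mode} | late) = {p:.4f}")
# P(car | late) = 0.7042
# P(bus | late) = 0.2817
# P(train | late) = 0.0141
```

Because the priors are equal, they cancel out, and the posterior for the car reduces to 0.50 / (0.50 + 0.20 + 0.01), which is about 0.70.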


Question 7

You are working on a problem where you have to predict whether a claim is valid or not. You find that the manually filled claim forms of most invalid claims contain more spelling errors and corrections than those of honest claims. Which of the following techniques is suitable for finding out whether a claim is valid or not?



Answer : D

