Snowflake SnowPro Advanced: Data Scientist Certification DSA-C02 Exam Practice Test

Page: 1 / 14
Total 65 questions
Question 1

As Data Scientist looking out to use Reader account, Which ones are the correct considerations about Reader Accounts for Third-Party Access?



Answer : D

Data sharing is only supported between Snowflake accounts. As a data provider, you might want to share data with a consumer who does not already have a Snowflake account or is not ready to be-come a licensed Snowflake customer.

To facilitate sharing data with these consumers, you can create reader accounts. Reader accounts (formerly known as ''read-only accounts'') provide a quick, easy, and cost-effective way to share data without requiring the consumer to become a Snowflake customer.

Each reader account belongs to the provider account that created it. As a provider, you use shares to share databases with reader accounts; however, a reader account can only consume data from the provider account that created it.

So, Data Sharing is possible between Snowflake & Non-snowflake accounts via Reader Account.


Question 2

What Can Snowflake Data Scientist do in the Snowflake Marketplace as Consumer?



Answer : A, B, C, D

As a consumer, you can do the following:

* Discover and test third-party data sources.

* Receive frictionless access to raw data products from vendors.

* Combine new datasets with your existing data in Snowflake to derive new business insights.

* Have datasets available instantly and updated continually for users.

* Eliminate the costs of building and maintaining various APIs and data pipelines to load and up-date data.

* Use the business intelligence (BI) tools of your choice.


Question 3

Which of the learning methodology applies conditional probability of all the variables with respec-tive the dependent variable?



Answer : A

Supervised learning methodology applies conditional probability of all the variables with respective the dependent variable and generally conditional probability of variables is nothing but a basic method of estimating the statistics for few random experiments.

Conditional probability is thus the likelihood of an event or outcome occurring based on the occurrence of some other event or prior outcome. Two events are said to be independent if one event occurring does not affect the probability that the other event will occur.


Question 4

Which of the following method is used for multiclass classification?



Answer : A

Binary vs. Multi-Class Classification

Classification problems are common in machine learning. In most cases, developers prefer using a supervised machine-learning approach to predict class tables for a given dataset. Unlike regression, classification involves designing the classifier model and training it to input and categorize the test dataset. For that, you can divide the dataset into either binary or multi-class modules.

As the name suggests, binary classification involves solving a problem with only two class labels. This makes it easy to filter the data, apply classification algorithms, and train the model to predict outcomes. On the other hand, multi-class classification is applicable when there are more than two class labels in the input train data. The technique enables developers to categorize the test data into multiple binary class labels.

That said, while binary classification requires only one classifier model, the one used in the multi-class approach depends on the classification technique. Below are the two models of the multi-class classification algorithm.

One-Vs-Rest Classification Model for Multi-Class Classification

Also known as one-vs-all, the one-vs-rest model is a defined heuristic method that leverages a binary classification algorithm for multi-class classifications. The technique involves splitting a multi-class dataset into multiple sets of binary problems. Following this, a binary classifier is trained to handle each binary classification model with the most confident one making predictions.

For instance, with a multi-class classification problem with red, green, and blue datasets, binary classification can be categorized as follows:

Problem one: red vs. green/blue

Problem two: blue vs. green/red

Problem three: green vs. blue/red

The only challenge of using this model is that you should create a model for every class. The three classes require three models from the above datasets, which can be challenging for large sets of data with million rows, slow models, such as neural networks and datasets with a significant number of classes.

The one-vs-rest approach requires individual models to prognosticate the probability-like score. The class index with the largest score is then used to predict a class. As such, it is commonly used for classification algorithms that can naturally predict scores or numerical class membership such as perceptron and logistic regression.


Question 5

What is the formula for measuring skewness in a dataset?



Answer : C

Since the normal curve is symmetric about its mean, its skewness is zero. This is a theoretical expla-nation for mathematical proofs, you can refer to books or websites that speak on the same in detail.


Question 6

Which one is the incorrect option to share data in Snowflake?



Answer : B

Options for Sharing in Snowflake

You can share data in Snowflake using one of the following options:

* a Listing, in which you offer a share and additional metadata as a data product to one or more ac-counts,

* a Direct Share, in which you directly share specific database objects (a share) to another account in your region,

* a Data Exchange, in which you set up and manage a group of accounts and offer a share to that group.


Question 7

Which ones are the correct rules while using a data science model created via External function in Snowflake?



Answer : A, B, C, D

From the perspective of a user running a SQL statement, an external function behaves like any other UDF . External functions follow these rules:

External functions return a value.

External functions can accept parameters.

An external function can appear in any clause of a SQL statement in which other types of UDF can appear. For example:

1. select my_external_function_2(column_1, column_2)

2. from table_1;

1. select col1

2. from table_1

3. where my_external_function_3(col2) < 0;

1. create view view1 (col1) as

2. select my_external_function_5(col1)

3. from table9;

An external function can be part of a more complex expression:

1. select upper(zipcode_to_city_external_function(zipcode))

2. from address_table;

The returned value can be a compound value, such as a VARIANT that contains JSON.

External functions can be overloaded; two different functions can have the same name but different signatures (different numbers or data types of input parameters).


Page:    1 / 14   
Total 65 questions