Databricks Certified Data Analyst Associate Exam Questions

Page: 1 / 14
Total 65 questions
Question 1

Which of the following approaches can be used to ingest data directly from cloud-based object storage?



Answer : E


Question 2

In which circumstance will there be a substantial difference between the variable's mean and median values?



Answer : D

The mean is sensitive to extreme values, often called outliers, which can significantly skew the average away from the true center of the data. The median, however, is a measure of central tendency that is resistant to such outliers because it only considers the middle value(s) when the data is ordered. Therefore, when a variable contains many extreme outliers, there will be a substantial difference between the mean and the median. According to Databricks data analysis materials, this is a fundamental concept when choosing summary statistics for reporting.


Question 3

A data scientist has asked a data analyst to create histograms for every continuous variable in a data set. The data analyst needs to identify which columns are continuous in the data set.

What describes a continuous variable?



Answer : C


Question 4

A data analyst needs to share a Databricks SQL dashboard with stakeholders that are not permitted to have accounts in the Databricks deployment. The stakeholders need to be notified every time the dashboard is refreshed.

Which approach can the data analyst use to accomplish this task with minimal effort/



Answer : B


Question 5

Query History provides Databricks SQL users with a lot of benefits. A data analyst has been asked to share all of these benefits with their team as part of a training exercise. One of the benefit statements the analyst provided to their team is incorrect.

Which statement about Query History is incorrect?



Answer : C

Query History in Databricks SQL is intended for reviewing executed queries, understanding their execution plans, and identifying performance issues or errors for debugging purposes. It allows users to analyze query duration, resources used, and potential bottlenecks. However, Query History does not provide any capability to automate the execution of queries across multiple warehouses; automation must be handled through jobs or external orchestration tools, not through the Query History feature itself.


Question 6

Where in the Databricks SQL workspace can a data analyst configure a refresh schedule for a query when the query is not attached to a dashboard or alert?



Answer : C


Question 7

A data analyst is processing a complex aggregation on a table with zero null values and the query returns the following result:

Which query did the analyst execute in order to get this result?

A)

B)

C)

D)



Answer : B


Page:    1 / 14   
Total 65 questions