What is an intended application of the MapReduce framework?
Answer : A
Consider dataset that resides in HDFS. Which tool natively provides the capability to run a Random Forests model against this data?
Answer : A
What is a property of a good color model for ordinal data?
Answer : D
In which step in the visualization lifecycle would you determine how the raw data is stored?
Answer : B
You are analyzing written transcripts of focus groups conducted on product X. You approach is to use TF-IDF for your analysis.
What combination of TF-IDF scores should you examine to ensure you only report on the most important terms?
Answer : C
What are two visualization tools used for trivariate data?
Answer : B
Which library is NOT part of the Apache Spark distribution?
Answer : B