IBM Big Data Architect C2090-102 Exam Questions

Page: 1 / 14
Total 55 questions
Question 1

You need to provision a Hadoop cluster to perform data analysis on customer sales data to predict which products are more popular. Which of the following solutions will let you set up your cluster with the most stability in the platform?



Answer : D

References:

http://hortonworks.com/innovation/open-data-platform/


Question 2

Which data format stores all of the data in a binary format making the files more compact, and will even add in markers to help Map Reduce jobs determine where to break large files for more efficient processing?



Answer : B

References:

http://www.ibmbigdatahub.com/blog/how-succeed-using-avro-storage-format


Question 3

The CAP Theorem states that it is not possible for a distributed computer system to guarantee all three of these?



Answer : B


Question 4

The analysis layer reads the data digested by the layer massaging and store layer. In some cases, the analysis layer accesses the data directly from the data source. Designing the analysis layer requires careful forethought and planning. Decisions must be made with regard to how to manage the tasks to do which of the following?



Answer : B

References:

http://www.ibm.com/developerworks/library/bd-archpatterns3/


Question 5

If the recovery point objective (RPO) is low, which of the following techniques would be the most appropriate?



Answer : A

References:

http://whatis.techtarget.com/definition/recovery-point-objective-RPO


Question 6

A manufacturing company has decided they need to capture and analyze the log files of their software automation system. Their business users are still trying to define the use cases but would want to start capturing as they have had frequent outages. Given this, which of the following is the best software design recommendation?



Answer : D


Question 7

Company K is designing their Big Data system. In their enterprise, they anticipate every 9 months there will be a big spike of new data on the order of multiple TB. Their company policy also dictates that data older than one year will be archived with a major clean up every 5 years. Cost is also a big issue. Which of the following provides the best design for these requirements?



Answer : A


Page:    1 / 14   
Total 55 questions