Hortonworks HDPCD Exam Practice Test Instant Access

Question 1

On a cluster running MapReduce v1 (MRv1), a TaskTracker sends a heartbeat to the JobTracker and reports that it has an open map task slot.

What determines how the JobTracker assigns each map task to a TaskTracker?

A. The amount of RAM installed on the TaskTracker node.

B. The amount of free disk space on the TaskTracker node.

C. The number and speed of CPU cores on the TaskTracker node.

D. The average system load on the TaskTracker node over the past fifteen (15) minutes.

E. The location of the InputSplit to be processed in relation to the location of the node.

Answer : E

Each TaskTracker sends heartbeat messages to the JobTracker every few seconds to confirm that it is still alive. These messages also report the number of available slots, so the JobTracker stays up to date on where in the cluster work can be delegated. When the JobTracker schedules a map task, it first looks for an empty slot on the same server that hosts the DataNode containing the data; failing that, it looks for an empty slot on a machine in the same rack.
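The preference order described above (same node, then same rack, then anywhere) can be illustrated with a small plain-Java model. This is only a sketch: the class `LocalitySketch`, its methods, and the `rack-node` host naming scheme are assumptions made for illustration and are not part of Hadoop's scheduler code.

```java
import java.util.Arrays;
import java.util.List;

// Plain-Java model of locality-aware task selection. All names here are
// hypothetical; this is not the JobTracker's actual scheduler code.
final class LocalitySketch {
    // Ordered from most to least preferred, so compareTo() ranks them.
    enum Locality { NODE_LOCAL, RACK_LOCAL, OFF_RACK }

    // An input split and the hosts that store replicas of its block.
    static final class Split {
        final String name;
        final List<String> replicaHosts;

        Split(String name, String... replicaHosts) {
            this.name = name;
            this.replicaHosts = Arrays.asList(replicaHosts);
        }
    }

    // Assumed naming scheme "rack-node" (for example "r1-n2"). A real cluster
    // resolves racks through its configured topology mapping instead.
    static String rackOf(String host) {
        int dash = host.indexOf('-');
        return dash < 0 ? host : host.substring(0, dash);
    }

    // How close is the heartbeating tracker to the data of this split?
    static Locality classify(String trackerHost, Split split) {
        if (split.replicaHosts.contains(trackerHost)) {
            return Locality.NODE_LOCAL;
        }
        for (String host : split.replicaHosts) {
            if (rackOf(host).equals(rackOf(trackerHost))) {
                return Locality.RACK_LOCAL;
            }
        }
        return Locality.OFF_RACK;
    }

    // Choose the pending split with the best locality for this tracker.
    // Ties keep the earlier split; returns null when nothing is pending.
    static Split pickFor(String trackerHost, List<Split> pending) {
        Split best = null;
        for (Split s : pending) {
            if (best == null
                    || classify(trackerHost, s).compareTo(classify(trackerHost, best)) < 0) {
                best = s;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        List<Split> pending = Arrays.asList(
                new Split("split-0", "r2-n1"),
                new Split("split-1", "r1-n2"),
                new Split("split-2", "r1-n1"));
        System.out.println(pickFor("r1-n1", pending).name); // split-2 (node-local)
        System.out.println(pickFor("r1-n3", pending).name); // split-1 (rack-local)
    }
}
```

None of the node's hardware characteristics (RAM, disk, CPU, load) appear in the model, which is the point of answer E.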

Question 2

You have user profile records in your OLTP database that you want to join with web logs you have already ingested into the Hadoop file system. How will you obtain these user records?

A. HDFS command

B. Pig LOAD command

C. Sqoop import

D. Hive LOAD DATA command

E. Ingest with Flume agents

F. Ingest with Hadoop Streaming

Answer : C

Question 3

Examine the following Pig commands:

Which one of the following statements is true?

A. The SAMPLE command generates an 'unexpected symbol' error

B. Each MapReduce task will terminate after executing for 0.2 minutes

C. The reducers will only output the first 20% of the data passed from the mappers

D. A random sample of approximately 20% of the data will be output

Answer : D

Question 4

You need to run the same job many times with minor variations. Rather than hardcoding all job configuration options in your driver code, you've decided to have your Driver subclass org.apache.hadoop.conf.Configured and implement the org.apache.hadoop.util.Tool interface.

Identify which invocation correctly passes mapred.job.name with a value of Example to Hadoop.

A. hadoop "mapred.job.name=Example" MyDriver input output

B. hadoop MyDriver mapred.job.name=Example input output

C. hadoop MyDriver -D mapred.job.name=Example input output

D. hadoop setproperty mapred.job.name=Example MyDriver input output

E. hadoop setproperty ("mapred.job.name=Example") MyDriver input output

Answer : C

Configure the property using the -D key=value notation:

-D mapred.job.name='My Job'

You can list the available options by calling the streaming jar with only the -info argument.
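To see why the `-D key=value` form works while a bare `key=value` argument does not, it may help to look at a simplified stand-in for the generic option handling that runs before a Tool's run() method receives its arguments. The sketch below uses plain Java only; the class `GenericOptionsSketch` and its fields are hypothetical and do not reproduce Hadoop's real parser, which also understands other generic options. It assumes that generic options come before the application arguments.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Simplified, hypothetical stand-in for generic option parsing.
// It only understands leading "-D key=value" pairs.
final class GenericOptionsSketch {
    // Properties collected from -D options, in the order given.
    final Map<String, String> properties = new LinkedHashMap<>();
    // Everything else, handed on to the application unchanged.
    final List<String> remainingArgs = new ArrayList<>();

    GenericOptionsSketch(String[] args) {
        int i = 0;
        // Consume "-D" followed by "key=value" while they appear at the front.
        while (i + 1 < args.length && args[i].equals("-D")) {
            String pair = args[i + 1];
            int eq = pair.indexOf('=');
            if (eq > 0) {
                properties.put(pair.substring(0, eq), pair.substring(eq + 1));
            }
            i += 2;
        }
        // The first non-option argument ends option parsing.
        for (; i < args.length; i++) {
            remainingArgs.add(args[i]);
        }
    }

    public static void main(String[] args) {
        // Arguments as the driver sees them for:
        //   hadoop MyDriver -D mapred.job.name=Example input output
        GenericOptionsSketch parsed = new GenericOptionsSketch(
                new String[] {"-D", "mapred.job.name=Example", "input", "output"});
        System.out.println(parsed.properties.get("mapred.job.name")); // Example
        System.out.println(parsed.remainingArgs);                     // [input, output]
    }
}
```

With the arguments from option B, the same sketch leaves the property map empty and treats `mapred.job.name=Example` as an ordinary application argument.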

Question 5

You have the following key-value pairs as output from your Map task:

(the, 1)

(fox, 1)

(faster, 1)

(than, 1)

(the, 1)

(dog, 1)

How many keys will be passed to the Reducer's reduce method?

A. Six

B. Five

C. Four

D. Two

E. One

F. Three

Answer : B

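The two (the, 1) pairs share the same key, so the framework groups them into a single reduce call. The reduce method is therefore invoked for five distinct keys: the, fox, faster, than, and dog.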
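The grouping step can be reproduced in a few lines of plain Java. This is an illustrative sketch, not Hadoop code: the class `ShuffleGroupingSketch` and the method `groupByKey` are hypothetical names, and a sorted map stands in for the sort-and-group phase that runs between map and reduce.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.SortedMap;
import java.util.TreeMap;

// Hypothetical model of the grouping that happens before reduce() is called.
final class ShuffleGroupingSketch {
    // Collect all values that share a key; keys end up in sorted order.
    static SortedMap<String, List<Integer>> groupByKey(String[] keys, int[] values) {
        SortedMap<String, List<Integer>> grouped = new TreeMap<>();
        for (int i = 0; i < keys.length; i++) {
            grouped.computeIfAbsent(keys[i], k -> new ArrayList<>()).add(values[i]);
        }
        return grouped;
    }

    public static void main(String[] args) {
        String[] keys = {"the", "fox", "faster", "than", "the", "dog"};
        int[] values = {1, 1, 1, 1, 1, 1};
        SortedMap<String, List<Integer>> grouped = groupByKey(keys, values);
        // One reduce() call per entry: six pairs in, five distinct keys out.
        System.out.println(grouped.size());     // 5
        System.out.println(grouped.get("the")); // [1, 1]
    }
}
```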

Question 6

Given a directory of files with the following structure: line number, tab character, string:

Example:

1 abialkjfjkaoasdfjksdlkjhqweroij

2 kadfjhuwqounahagtnbvaswslmnbfgy

3 kjfteiomndscxeqalkzhtopedkfsikj

You want to send each line as one record to your Mapper. Which InputFormat should you use to complete the line conf.setInputFormat(____.class);?

A. SequenceFileAsTextInputFormat

B. SequenceFileInputFormat

C. KeyValueTextInputFormat

D. BDBInputFormat

Answer : C

http://stackoverflow.com/questions/9721754/how-to-parse-customwritable-from-text-in-hadoop
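Hadoop's KeyValueTextInputFormat treats each line as one record and splits it into key and value at the first separator, which is a tab by default. The plain-Java sketch below imitates that split on a single line. The class `KeyValueLineSketch` and its method are hypothetical stand-ins for illustration, not the Hadoop implementation.

```java
// Hypothetical illustration of splitting a line at the first tab character.
final class KeyValueLineSketch {
    // Returns {key, value}. A line without a tab becomes the key,
    // and the value is the empty string.
    static String[] splitAtFirstTab(String line) {
        int tab = line.indexOf('\t');
        if (tab < 0) {
            return new String[] {line, ""};
        }
        return new String[] {line.substring(0, tab), line.substring(tab + 1)};
    }

    public static void main(String[] args) {
        String[] record = splitAtFirstTab("1\tabialkjfjkaoasdfjksdlkjhqweroij");
        System.out.println(record[0]); // 1
        System.out.println(record[1]); // abialkjfjkaoasdfjksdlkjhqweroij
    }
}
```

For the files in this question, the line number becomes the key and the string becomes the value.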

Question 7

You want to populate an associative array in order to perform a map-side join. You've decided to put this information in a text file, place that file into the DistributedCache and read it in your Mapper before any records are processed.

Identify which method in the Mapper you should use to implement code for reading the file and populating the associative array.

A. combine

B. map

C. init

D. configure

Answer : D
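In the older org.apache.hadoop.mapred API, configure(JobConf) runs once per task before the first call to map(), which makes it the natural place to load a lookup table. The lifecycle can be sketched in plain Java without any Hadoop classes. Everything below is a hypothetical stand-in: `MapSideJoinSketch` mimics the method names only, and a `Reader` takes the place of the file retrieved from the DistributedCache.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.HashMap;
import java.util.Map;

// Hypothetical lifecycle model: load once in configure(), look up in map().
final class MapSideJoinSketch {
    // The associative array: user id -> profile data.
    private final Map<String, String> lookup = new HashMap<>();

    // Stand-in for configure(JobConf): called once, before any record.
    // Expects lines of the form "id<TAB>profile".
    void configure(Reader cachedFile) {
        try (BufferedReader in = new BufferedReader(cachedFile)) {
            String line;
            while ((line = in.readLine()) != null) {
                int tab = line.indexOf('\t');
                if (tab > 0) {
                    lookup.put(line.substring(0, tab), line.substring(tab + 1));
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Stand-in for map(): joins one log record against the in-memory table.
    // Returns null when the user id has no matching profile.
    String map(String userId, String logLine) {
        String profile = lookup.get(userId);
        return profile == null ? null : userId + "\t" + profile + "\t" + logLine;
    }

    public static void main(String[] args) {
        MapSideJoinSketch mapper = new MapSideJoinSketch();
        mapper.configure(new StringReader("u1\talice\nu2\tbob\n"));
        System.out.println(mapper.map("u2", "GET /index.html"));
    }
}
```

Loading the table inside map() instead would repeat the file read for every input record, which is why the one-time hook is preferred.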
