Q.1
____________ specifies the number of segments on disk to be merged at the same time.
  • a) mapred.job.shuffle.merge.percent
  • b) mapred.job.reduce.input.buffer.percen
  • c) mapred.inmem.merge.threshold
  • d) io.sort.factor
Q.2
Point out the correct statement.
  • a) The number of sorted map outputs fetched into memory before being merged to disk
  • b) The memory threshold for fetched map outputs before an in-memory merge is finished
  • c) The percentage of memory relative to the maximum heap size in which map outputs may not be retained during the reduce
  • d) None of the mentioned
Q.3
Map output larger than ___________ percent of the memory allocated to copying map outputs.
  • a) 10
  • b) 15
  • c) 25
  • d) 35
Q.4
Jobs can enable task JVMs to be reused by specifying the job configuration _________
  • a) mapred.job.recycle.jvm.num.tasks
  • b) mapissue.job.reuse.jvm.num.tasks
  • c) mapred.job.reuse.jvm.num.tasks
  • d) all of the mentioned
Q.5
Point out the wrong statement.
  • a) The task tracker has local directory to create localized cache and localized job
  • b) The task tracker can define multiple local directories
  • c) The Job tracker cannot define multiple local directories
  • d) None of the mentioned
Q.6
During the execution of a streaming job, the names of the _______ parameters are transformed.
  • a) vmap
  • b) mapvim
  • c) mapreduce
  • d) mapred
Q.7
The standard output (stdout) and error (stderr) streams of the task are read by the TaskTracker and logged to _________
  • a) ${HADOOP_LOG_DIR}/user
  • b) ${HADOOP_LOG_DIR}/userlogs
  • c) ${HADOOP_LOG_DIR}/logs
  • d) None of the mentioned
Q.8
____________ is the primary interface by which user-job interacts with the JobTracker.
  • a) JobConf
  • b) JobClient
  • c) JobServer
  • d) All of the mentioned
Q.9
The _____________ can also be used to distribute both jars and native libraries for use in the map and/or reduce tasks.
  • a) DistributedLog
  • b) DistributedCache
  • c) DistributedJars
  • d) None of the mentioned
Q.10
__________ is used to filter log files from the output directory listing.
  • a) OutputLog
  • b) OutputLogFilter
  • c) DistributedLog
  • d) DistributedJars
0 h : 0 m : 1 s