Q.1
Mahout provides ____________ libraries for common and primitive Java collections.
  • a) Java
  • b) Javascript
  • c) Perl
  • d) Python
Q.2
Point out the correct statement.
  • a) Mahout is distributed under a commercially friendly Apache Software License
  • b) Mahout is a library of scalable machine-learning algorithms, implemented on top of Apache Hadoop® and using the MapReduce paradigm
  • c) Apache Mahout is a project of the Apache Software Foundation to produce free implementations of distributed or otherwise scalable machine learning algorithms
  • d) None of the mentioned
Q.3
_________ does not restrict contributions to Hadoop based implementations.
  • a) Mahout
  • b) Oozie
  • c) Impala
  • d) All of the mentioned
Q.4
Mahout provides an implementation of a ______________ identification algorithm which scores collocations using the log-likelihood ratio.
  • a) collocation
  • b) compaction
  • c) collection
  • d) none of the mentioned
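
For readers unfamiliar with the log-likelihood ratio (LLR) score named in this question, the following is a minimal sketch of how such a score can be computed from a 2x2 table of counts, using the entropy-based form of Dunning's statistic. The class name `LlrSketch`, its method names, and the count parameters `k11`, `k12`, `k21`, `k22` are assumptions made for illustration. They are not taken from the quiz and are not presented as Mahout's API.

```java
// Illustrative sketch only: LlrSketch is a hypothetical class, not part of Mahout.
final class LlrSketch {
    private LlrSketch() {}

    // x * ln(x), with the usual convention that 0 * ln(0) = 0.
    static double xLogX(long x) {
        return x == 0 ? 0.0 : x * Math.log(x);
    }

    // Unnormalized Shannon entropy of a set of counts.
    static double entropy(long... counts) {
        long sum = 0;
        double sumOfXLogX = 0.0;
        for (long c : counts) {
            sumOfXLogX += xLogX(c);
            sum += c;
        }
        return xLogX(sum) - sumOfXLogX;
    }

    // k11: both events occur together; k12: first event without the second;
    // k21: second event without the first; k22: neither event occurs.
    static double logLikelihoodRatio(long k11, long k12, long k21, long k22) {
        double rowEntropy = entropy(k11 + k12, k21 + k22);
        double columnEntropy = entropy(k11 + k21, k12 + k22);
        double matrixEntropy = entropy(k11, k12, k21, k22);
        // Rounding can push the result slightly below zero, so clamp it.
        return Math.max(0.0, 2.0 * (rowEntropy + columnEntropy - matrixEntropy));
    }

    public static void main(String[] args) {
        // Independent events score near zero; strongly associated events score high.
        System.out.println(logLikelihoodRatio(1, 1, 1, 1));
        System.out.println(logLikelihoodRatio(10, 0, 0, 10));
    }
}
```

A higher score means the two events co-occur more often than their individual frequencies would suggest under independence.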
Q.5
Point out the wrong statement.
  • a) The ‘Taste’ collaborative-filtering recommender component of Mahout was originally a separate project and can run standalone without Hadoop
  • b) Integration of Mahout with initiatives such as the Pregel-like Giraph is actively under discussion
  • c) Calculating the LLR is very straightforward
  • d) None of the mentioned
Q.6
The tokens are passed through a Lucene ____________ to produce NGrams of the desired length.
  • a) ShngleFil
  • b) ShingleFilter
  • c) SingleFilter
  • d) Collfilter
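
To make "NGrams of the desired length" concrete, here is a minimal sketch in plain Java that slides a window of `n` tokens over a token list. It deliberately does not use Lucene. The class `NGramSketch` and the method `ngrams` are hypothetical names introduced for illustration only.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch only: a plain-Java sliding window, not a Lucene filter.
final class NGramSketch {
    private NGramSketch() {}

    // Returns every run of n consecutive tokens, joined by single spaces.
    static List<String> ngrams(List<String> tokens, int n) {
        if (n < 1) {
            throw new IllegalArgumentException("n must be >= 1");
        }
        List<String> out = new ArrayList<>();
        for (int i = 0; i + n <= tokens.size(); i++) {
            out.add(String.join(" ", tokens.subList(i, i + n)));
        }
        return out;
    }

    public static void main(String[] args) {
        // Bigrams over a four-token sentence.
        System.out.println(ngrams(List.of("the", "quick", "brown", "fox"), 2));
    }
}
```

A token list shorter than `n` yields an empty result, since no full window fits.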
Q.7
The _________ collocation identifier is integrated into the process that is used to create vectors from sequence files of text keys and values.
  • a) lbr
  • b) lcr
  • c) llr
  • d) lar
Q.8
____________ generates NGrams and counts frequencies for NGrams and for their head and tail subgrams.
  • a) CollocationDriver
  • b) CollocDriver
  • c) CarDriver
  • d) All of the mentioned
Q.9
A key of type ___________ is generated which is used later to join ngrams with their heads and tails in the reducer phase.
  • a) GramKey
  • b) Primary
  • c) Secondary
  • d) None of the mentioned
Q.10
________ phase merges the counts for unique ngrams or ngram fragments across multiple documents.
  • a) CollocCombiner
  • b) CollocReducer
  • c) CollocMerger
  • d) None of the mentioned