🧪 Apache Hadoop MCQ Quiz Hub

Hadoop Multiple Choice Questions

Choose a topic to test your knowledge and improve your Apache Hadoop skills

What does commodity hardware mean in the Hadoop world?





✅ Correct Answer: 4

Which of the following are NOT big data problem(s)?





✅ Correct Answer: 4

What does “Velocity” in Big Data mean?





✅ Correct Answer: 4

The term Big Data first originated from:





✅ Correct Answer: 3

Which of the following batch processing instances is NOT an example of Big Data batch processing?





✅ Correct Answer: 4

Which of the following are example(s) of Real Time Big Data Processing?





✅ Correct Answer: 4

Sliding window operations typically fall into the category of __________.





✅ Correct Answer: 3

What is HBase used as?





✅ Correct Answer: 4

What is Hive used as?





✅ Correct Answer: 4

Which of the following are NOT true for Hadoop?





✅ Correct Answer: 4

Which of the following are the core components of Hadoop?





✅ Correct Answer: 4

Hadoop is open source.





✅ Correct Answer: 2

Hive can be used for real time queries.





✅ Correct Answer: 2

What is the default HDFS block size?





✅ Correct Answer: 4

What is the default HDFS replication factor?





✅ Correct Answer: 3

Which of the following is NOT a type of metadata in NameNode?





✅ Correct Answer: 3

Which of the following is/are correct?





✅ Correct Answer: 4

The mechanism used to create replicas in HDFS is ____________.





✅ Correct Answer: 3

NameNode tries to keep the first copy of data nearest to the client machine.





✅ Correct Answer: 3

Where is the HDFS replication factor controlled?





✅ Correct Answer: 4

Which of the following Hadoop config files is used to define the heap size?





✅ Correct Answer: 3

Which of the following is not a valid Hadoop config file?





✅ Correct Answer: 2

Read the statement: NameNodes are usually high storage machines in the clusters.





✅ Correct Answer: 2

From the options listed below, select the suitable data sources for Flume.





✅ Correct Answer: 4

Read the statement and select the correct option: the distcp command ALWAYS needs fully qualified HDFS paths.





✅ Correct Answer: 1

Which of the following statement(s) are true about the distcp command?





✅ Correct Answer: 1

Which of the following is NOT a component of Flume?





✅ Correct Answer: 2

Which of the following is the correct sequence of the MapReduce flow?





✅ Correct Answer: 3

Which of the following can be used to control the number of part files in a MapReduce program's output directory?





✅ Correct Answer: 2

Which of the following operations can’t use the Reducer as a combiner?





✅ Correct Answer: 4

Which of the following is/are true about combiners?





✅ Correct Answer: 4

Reduce side join is useful for





✅ Correct Answer: 4

Distributed Cache can be used in





✅ Correct Answer: 4

What is the optimal size of a file for distributed cache?





✅ Correct Answer: 3

Number of mappers is decided by the





✅ Correct Answer: 4

Which of the following types of joins can be performed in a reduce-side join operation?





✅ Correct Answer: 4

What should be the upper limit for counters of a MapReduce job?





✅ Correct Answer: 4

Which of the following classes is responsible for converting inputs to key-value pairs in MapReduce?





✅ Correct Answer: 4

Which of the following Writables can be used to know the value from a mapper/reducer?





✅ Correct Answer: 3

A MapReduce job can be written in:





✅ Correct Answer: 4

Pig is a:





✅ Correct Answer: 2

Pig is good for:





✅ Correct Answer: 4

Which of the following is the correct representation to access ‘Skill’ from the Bag {‘Skills’, 55, (‘Skill’, ‘Speed’), {2, (‘San’, ‘Mateo’)}}?





✅ Correct Answer: 1

Maximum size allowed for small dataset in replicated join is:





✅ Correct Answer: 3

Parameters could be passed to Pig scripts from:





✅ Correct Answer: 4

The schema of a relation can be examined through:





✅ Correct Answer: 2

Data can be supplied to PigUnit tests from:





✅ Correct Answer: 3

Which of the following constructs are valid Pig Control Structures?





✅ Correct Answer: 4

Which of the following is the return data type of a Filter UDF?





✅ Correct Answer: 3

Which of the following are not possible in Hive?





✅ Correct Answer: 4

Who will initiate the mapper?





✅ Correct Answer: 1

Which of the following are candidates for Big Data solutions?





✅ Correct Answer: 4

Hadoop is a framework that allows the distributed processing of:





✅ Correct Answer: 3

Which of the following are NOT metadata items?





✅ Correct Answer: 4

What decides the number of mappers for a MapReduce job?





✅ Correct Answer: 3

The NameNode monitors the block replication process.





✅ Correct Answer: 2

Which of the following are true for Hadoop Pseudo Distributed Mode?





✅ Correct Answer: 3

Which of the following statement(s) are correct?





✅ Correct Answer: 3

Which of the following is true for Hive?





✅ Correct Answer: 3

Which of the following is the highest level of Data Model in Hive?





✅ Correct Answer: 3

Hive query response time is in the order of





✅ Correct Answer: 3

Managed tables in Hive:





✅ Correct Answer: 4