Olete.in
Articles
Mock Tests
🧪 Bigdata MCQ Quiz Hub
Introduction to Bigdata Mcq Set 1
Choose a topic to test your knowledge and improve your Bigdata skills
1. The overall percentage of the world’s total data has been created just withinthe past two years is ?
80%
85%
90%
95%
2. According to analysts, for what can traditional IT systems provide afoundation when they’re integrated with big data technologies like Hadoop?
Big data management and data mining
Data warehousing and business intelligence
Management of Hadoop clusters
Collecting and storing unstructured data
3. All of the following accurately describe Hadoop, EXCEPT ____________
Open-source
Real-time
Java-based
Distributed computing approach
4. Big data analysis does the following except?
Collects data
Spreads data
Organizes data
Analyzes data
5. The new source of big data that will trigger a Big Data revolution in theyears to come is?
Business transactions
Social media
Transactional data and sensor data
RDBMS
6. Listed below are the three steps that are followed to deploy a Big DataSolution except
Data Processing
Data dissemination
Data Storage
Data Ingestion
7. Who popularized bigdata term?
John deere
John Mashey
johny Mashe
Jhon Mash
8. Numbers ,text, image, audio and video data is ____
Volume
Value
Varity
Variety
9. Real time data is ______.
Field
Primary Key
unique
record
10. __ is the term that is used to describe data that is high volume , highvelocity and /or high variety.
Analytics
Bigdata
Hadoop Data
Bigdata analytics
11. ________ has the world’s largest Hadoop cluster.
Apple
Datamatics
Facebook
None of the above
12. Facebook Tackles Big Data With _______ based on Hadoop.
Project Prism
Prism
Project Big
Project Data
13. _________ is general-purpose computing model and runtime system fordistributed data analytics.
Mapreduce
Drill
Oozie
None of the above
14. The examination of large amounts of data to see what patterns or otheruseful information can be found is known as
Data examination
Information analysis
Big data analytics
Data analysis
15. Point out the wrong statement.
Hardtop processing capabilities are huge and its real advantage lies in the ability to process terabytes & petabytes of data
Hardtop processing capabilities are huge and its real advantage lies in the ability to process terabytes & petabytes of data
The programming model, MapReduce, used by Hadoop is difficult to write and test
All of the above
16. _______ can best be described as a programming model used to developHadoop-based applications that can process massive amounts of data.
MapReduce
Mahout
Oozie
All of the mentioned
17. Facebook Tackles Big Data With _______ based on Hadoop.
‘Project Prism’
Prism
‘Project Big’
‘Project Data’
18. Data science is the process of diverse set of data through ?
organizing data
processing data
analysing data
All of the above
19. The modern conception of data science as an independent discipline issometimes attributed to?
William S.
John McCarthy
Arthur Samuel
Satoshi Nakamoto
20. Which of the following language is used in Data science?
C
C++
R
Ruby
21. Which of the following is false?
Subsetting can be used to select and exclude variables and observations
Raw data should be processed only one time.
Merging concerns combining datasets on the same observations to produce a result with more variables
None Of the above
22. What is the work of Data Architect?
utilize large data sets to gather information that meets their company's needs
work with businesses to determine the best usage of the information yielded from data
build data solutions that are optimized for performance and design applications
All of the above
23. Which of the following is correct skills for a Data Scientist?
Probability & Statistics
Machine Learning / Deep Learning
Data Wrangling
All of the above
24. Which of the following are correct component for data science?
Data Engineering
Advanced Computing
Domain expertise
All of the above
25. Which of the following is not a part of data science process?
Discovery
Model Planning
Communication Building
Operationalize
26. Which of the following are the Data Sources in data science?
Structured
Unstructured
Both A and B
None Of the above
27. Which of the following is not a application for data science?
Recommendation Systems
Image & Speech Recognition
Online Price Comparison
Privacy Checker
28. Point out the correct statement.
Raw data is original source of data B. D.
Preprocessed data is original source of data C.
Raw data is the data obtained after processing steps
None of the above
29. Which of the following is one of the key data science skills?
Statistics
Machine Learning
Data Visualization
All of the above
30. Which of the following is a key characteristic of a hacker?
Afraid to say they don't know the answer
Willing to find answers on their own
Not Willing to find answers on their own
All of the above
31. Raw data should be processed only one time.
True
False
Can be true or false
Can not say
32. Raw data should be processed only one time.
True
False
Can be true or false
Can not say
33. Which of the following is the common goal of statistical modelling?
Inference
Summarizing
Subsetting
None of the above
34. Which of the following model is usually a gold standard for data analysis?
Inferential
Descriptive
Causal
All of the above
35. Which of the following can be used to create sub–samples using a maximumdissimilarity approach?
minDissim
maxDissim
inmaxDissim
All of the Mentioned
36. Causal analysis is commonly applied to census data.
True
False
Can be true or false
Can not say
37. Which of the following is a revision control system?
Git
Numpy
Scipy
Slidify
38. Which of the following step is performed by data scientist after acquiringthe data?
Data Cleaning
Data Integration
Data Replication
All of the above
39. Which of the following focuses on the discovery of (previously) unknownproperties on the data?
Data mining
BigData
Data wrangling
Machine Learning
40. Which of the following can be used to impute data sets based only on informationin the training set?
postprocess
preProcess
process
All of the Mentioned
41. Which of the following model model include a backwards elimination featureselection routine?
MCV
MARS
MCRS
All of the Mentioned
42. Which of the following is a categorical outcome?
RMSE
RSquared
Accuracy
All of the Mentioned
43. What is true about Machine Learning?
Machine Learning (ML) is that field of computer science
ML is a type of artificial intelligence that extract patterns out of raw data by using an algorithm or method.
The main focus of ML is to allow computer systems learn from experience without being explicitly programmed or human intervention.
All of the above
44. ML is a field of AI consisting of learning algorithms that?
Improve their performance
At executing some task
Over time with experience
All of the above
45. p → 0q is not a?
hack clause
horn clause
structural clause
system clause
46. The action _______ of a robot arm specify to Place block A on block B.
STACK(A,B)
LIST(A,B)
QUEUE(A,B)
ARRAY(A,B)
47. A__________ begins by hypothesizing a sentence (the symbol S) and successively predicting lower level constituents until individual preterminal symbols are written.
bottow-up parser
top parser
top-down parser
bottom parser
48. A model of language consists of the categories which does not include________.
System Unit
structural units.
data units
empirical units
49. The model will be trained with data in one single batch is known as ?
Batch learning
Offline learning
Both A and B
None of the above
50. Which of the following are ML methods?
based on human supervision
supervised Learning
semi-reinforcement Learning
All of the above
Submit