🧪 Databricks MCQ Quiz Hub

Databricks Mcq Question Set 2

Choose a topic to test your knowledge and improve your Databricks skills

1. The code below should return a new DataFrame with 50 percent of random records from DataFrame df without replacement.




2. Which of the following DataFrame commands will NOT generate a shuffle of data from each executor across the cluster?




3. Which of the following DataFrame commands is a narrow transform?




4. Which of the following DataFrame commands is a wide transform?




5. When Spark runs in Cluster Mode, which of the following statements about nodes is correct ?




6. The DataFrame df includes a time string column named timestamp_1. Which is the correct syntax that creates a new DataFrame df1 that is just made by the time string field converted to a unix timestamp?




7. If you wanted to: 1. Cache a df as SERIALIZED Java objects in the JVM and; 2. If the df does not fit in memory, store the partitions that don’t fit on disk, and read them from there when they’re needed; 3. Replicate each partition on two cluster nodes. which command would you choose ?




8. Spark is best suited for ______ data.




9. Which of the following Features of Apache Spark?




10. In how many ways Spark uses Hadoop?




11. When was Apache Spark developed ?




12. Which of the following is incorrect way for Spark deployment?




13. _____ is a distributed graph processing framework on top of Spark.




14. Point out the correct statement.




15. Which of the following is True regarding Azure Cosmos DB?




16. Which one of the following is a data model supported by Azure Cosmos DB?




17. Which container is supported in Azure Cosmos DB?




18. What is the maximum size of graph DB that a Fixed Container in cosmos DB can store?




19. Select the API from the following options that Azure Cosmos DB support?




20. Which one of the following logical partitions a single container in cosmos DB cannot have?




21. Elastically scalable throughput and storage is possible in:




22. In which of the APIs of Azure Cosmos DB, is automatic indexing possible?




23. What is the maximum latency limit for reads and writes in case of Azure Cosmos DB Table API?




24. Which one of the following is the feature of Azure Cosmos DB Graph API?




25. Which one of the following is not correct regarding Azure storage?




26. Which one of the following is the data service provided by Azure Storage platform?




27. Which one of the following provides block level storage volumes for Azure VMs?




28. Which one of the following is most preferred for storing streaming videos and audios?




29. How many replication options are there while creating an Azure storage account?




30. While creating Azure Storage account, which replication option is the cheapest one?




31. How many copies of data are created in case of geo redundant storage replication?




32. Choose the incorrect option regarding Zone redundant storage replication.




33. What can be the maximum size of a queue message?




34. Choose the correct option regarding Azure Storage.




35. Which one of the following is an orchestration software which can be used for scaling containers?




36. What is the basic operational unit of Kubernetes?




37. Which one of the following can be done for a container based application using Azure Kubernetes?




38. Which one of the following helps to set up cluster autoscaler for adding capacity as per demand?




39. Which one of the following is incorrect regarding Azure Kubernetes?




40. Choose the correct option.




41. Choose the wrong statement regarding Azure Kubernetes.




42. Which one of the following is correct regarding clusters of Azure Kubernetes?




43. Choose the correct option.




44. Which one of the following can be considered as the primary data store of Kubernetes?




45. Which of the following Azure services is used for performing high performance parallel computing jobs in the cloud?




46. What are compute nodes in Azure Batch?




47. How data files stored in Azure blob storage are being accessed by Azure Batch?




48. For which of the following options, Azure Batch can be used?




49. What does Azure Batch provide by default for parallelization?




50. Which one of the following is incorrect regarding Azure Batch?




51. With which of the following can Azure Batch integrate for fetching data?




52. Which type of node is not supported by Azure Batch?




53. Choose the correct option.




54. Choose the correct option.