πŸ§ͺ High Performance Computing (HPC) MCQ Quiz Hub

High Performance Computing (HPC) MCQ Set 1

Choose a topic to test your knowledge and improve your High Performance Computing (HPC) skills

A CUDA program is comprised of two primary components: a host and a _____.





βœ… Correct Answer: 1

The kernel code is dentified by the ________qualifier with void return type





βœ… Correct Answer: 2

Calling a kernel is typically referred to as _________.





βœ… Correct Answer: 4

the BlockPerGrid and ThreadPerBlock parameters are related to the ________ model supported by CUDA.





βœ… Correct Answer: 3

_______ is Callable from the device only





βœ… Correct Answer: 3

____ is Callable from the host





βœ… Correct Answer: 2

______ is Callable from the host





βœ… Correct Answer: 1

CUDA supports ____________ in which code in a single thread is executed by all other threads.





βœ… Correct Answer: 3

. In CUDA, a single invoked kernel is referred to as a _____.





βœ… Correct Answer: 3

A grid is comprised of ________ of threads.





βœ… Correct Answer: 1

A block is comprised of multiple _______.





βœ… Correct Answer: 1

a solution of the problem in representing the parallelismin algorithm is





βœ… Correct Answer: 4

Host codes in a CUDA application can not Reset a device





βœ… Correct Answer: 2

Any condition that causes a processor to stall is called as _____.





βœ… Correct Answer: 1

The time lost due to branch instruction is often referred to as _____.





βœ… Correct Answer: 3

___ method is used in centralized systems to perform out of order execution.





βœ… Correct Answer: 2

The computer cluster architecture emerged as an alternative for ____.





βœ… Correct Answer: 3

NVIDIA CUDA Warp is made up of how many threads?





βœ… Correct Answer: 4

Out-of-order instructions is not possible on GPUs.





βœ… Correct Answer: 2

CUDA supports programming in ....





βœ… Correct Answer: 3

FADD, FMAD, FMIN, FMAX are ----- supported by Scalar Processors of NVIDIA GPU.





βœ… Correct Answer: 1

Each streaming multiprocessor (SM) of CUDA herdware has ------ scalar processors (SP).





βœ… Correct Answer: 4

Each NVIDIA GPU has ------ Streaming Multiprocessors





βœ… Correct Answer: 4

CUDA provides ------- warp and thread scheduling. Also, the overhead of thread creation is on the order of ----.





βœ… Correct Answer: 2

Each warp of GPU receives a single instruction and β€œbroadcasts” it to all of its threads. It is a ---- operation.





βœ… Correct Answer: 2

Limitations of CUDA Kernel





βœ… Correct Answer: 2

What is Unified Virtual Machine





βœ… Correct Answer: 1

_____ became the first language specifically designed by a GPU Company to facilitate general purpose computing on ____.





βœ… Correct Answer: 3

The CUDA architecture consists of --------- for parallel computing kernels and functions.





βœ… Correct Answer: 4

CUDA stands for --------, designed by NVIDIA.





βœ… Correct Answer: 3

The host processor spawns multithread tasks (or kernels as they are known in CUDA) onto the GPU device. State true or false.





βœ… Correct Answer: 1

The NVIDIA G80 is a ---- CUDA core device, the NVIDIA G200 is a ---- CUDA core device, and the NVIDIA Fermi is a ---- CUDA core device





βœ… Correct Answer: 1

NVIDIA 8-series GPUs offer -------- .





βœ… Correct Answer: 1

IADD, IMUL24, IMAD24, IMIN, IMAX are ----------- supported by Scalar Processors of NVIDIA GPU.





βœ… Correct Answer: 2

CUDA Hardware programming model supports: a) fully generally data-parallel archtecture; b) General thread launch; c) Global load-store; d) Parallel data cache; e) Scalar architecture; f) Integers, bit operation





βœ… Correct Answer: 4

In CUDA memory model there are following memory types available: a) Registers; b) Local Memory; c) Shared Memory; d) Global Memory; e) Constant Memory; f) Texture Memory.





βœ… Correct Answer: 3

What is the equivalent of general C program with CUDA C: int main(void) { printf("Hello, World! "); return 0; }





βœ… Correct Answer: 2

Which function runs on Device (i.e. GPU): a) __global__ void kernel (void ) { } b) int main ( void ) { ... return 0; }





βœ… Correct Answer: 1

If variable a is host variable and dev_a is a device (GPU) variable, to allocate memory to dev_a select correct statement:





βœ… Correct Answer: 3

If variable a is host variable and dev_a is a device (GPU) variable, to copy input from variable a to variable dev_a select correct statement:





βœ… Correct Answer: 2

Triple angle brackets mark in a statement inside main function, what does it indicates?





βœ… Correct Answer: 1

What makes a CUDA code runs in parallel





βœ… Correct Answer: 4

In ___________, the number of elements to be sorted is small enough to fit into the process's main memory.





βœ… Correct Answer: 1

_____________ algorithms use auxiliary storage (such as tapes and hard disks) for sorting because the number of elements to be sorted is too large to fit into memory.





βœ… Correct Answer: 3

____ can be comparison-based or noncomparison-based.





βœ… Correct Answer: 2

The fundamental operation of comparison-based sorting is ________.





βœ… Correct Answer: 1

The performance of quicksort depends critically on the quality of the ______-.





βœ… Correct Answer: 2

The main advantage of ______ is that its storage requirement is linear in the depth of the state space being searched.





βœ… Correct Answer: 2

___ algorithms use a heuristic to guide search.





βœ… Correct Answer: 1

Graph search involves a closed list, where the major operation is a _______





βœ… Correct Answer: 3

Breadth First Search is equivalent to which of the traversal in the Binary Trees?





βœ… Correct Answer: 3

Time Complexity of Breadth First Search is? (V – number of vertices, E – number of edges)





βœ… Correct Answer: 1

Which of the following is not an application of Breadth First Search?





βœ… Correct Answer: 2

In BFS, how many times a node is visited?





βœ… Correct Answer: 3

Which of the following is not a stable sorting algorithm in its typical implementation.





βœ… Correct Answer: 3

Which of the following is not true about comparison based sorting algorithms?





βœ… Correct Answer: 4

mathematically efficiency is





βœ… Correct Answer: 1

Cost of a parallel system is sometimes referred to____ of product





βœ… Correct Answer: 3

Scaling Characteristics of Parallel Programs Ts is





βœ… Correct Answer: 2

Speedup tends to saturate and efficiency _____ as a consequence of Amdahl’s law.





βœ… Correct Answer: 3

Speedup obtained when the problem size is _______ linearlywith the number of processing elements.





βœ… Correct Answer: 1

The n Γ— n matrix is partitioned among n processors, with each processor storing complete ___ of the matrix.





βœ… Correct Answer: 1

cost-optimal parallel systems have an efficiency of ___





βœ… Correct Answer: 1

The n Γ— n matrix is partitioned among n2 processors such that each processor owns a _____ element.





βœ… Correct Answer: 3

how many basic communication operations are used in matrix vector multiplication





βœ… Correct Answer: 3

In DNS algorithm of matrix multiplication it used





βœ… Correct Answer: 3

In the Pipelined Execution, steps contain





βœ… Correct Answer: 4

the cost of the parallel algorithm is higher than the sequential run time by a factor of __





βœ… Correct Answer: 1

The load imbalance problem in Parallel Gaussian Elimination: can be alleviated by using a ____ mapping





βœ… Correct Answer: 2

A parallel algorithm is evaluated by its runtime in function of





βœ… Correct Answer: 4

For a problem consisting of W units of work, p__W processors can be used optimally.





βœ… Correct Answer: 1

C(W)__Θ(W) for optimality (necessary condition).





βœ… Correct Answer: 4

many interactions in oractical parallel programs occur in _____ pattern





βœ… Correct Answer: 1

efficient implementation of basic communication operation can improve





βœ… Correct Answer: 1

efficient use of basic communication operations can reduce





βœ… Correct Answer: 1

Group communication operations are built using_____ Messenging primitives.





βœ… Correct Answer: 1

one processor has a piece of data and it need to send to everyone is





βœ… Correct Answer: 1

the dual of one -to-all is





βœ… Correct Answer: 1

Data items must be combined piece-wise and the result made available at





βœ… Correct Answer: 1