DW and ETL Testing MCQ


Hello friends in this post we are going to discuss about DW and ETL Testing Multiple Choice Questions| DW and ETL Testing Objective Type Questions| DW and ETL Testing MCQ with answers

Q1. Which of the following language is not supported by Spark?

A. Python

B. Scala

C. Java

D. Pascal

Ans: d

Q2. Hive is a RDBMS in Hadoop ecosystem

Ans – True

Q3. _______is a distributed graph processing framework on top of Spark.

A. MLlib

B. None of the above

C. Spark Streaming

D. GraphX

Ans: d

Q4. Identify the interactive visualisation tool in AWS

A. AWS Glue

B. Amazon Athena

C. Quicksight

D. Redshift

Ans: c

Q5. Which of the following are examples of ETL tools?

A. SQL server

B. IBM Infoshphere Datastage

C. Oracle

D. Amazon $3

Ans: b

Q6.What are the different types of partitioning Methods?

A. load process and row based

B. load process and column based

C. load process and view based

D. row and column based

Ans: c

Q7. Which Talend components are used to process input/output for delimited files?

A. tFileInputDelimited

B. tFileOuputDelimited

C. tFileInputXML

D. tMySQLInput

Ans: a b d

Q8. Fact Tables are which of the following?

A. Completely denoralized

B. Completely normalized

C. Partially normalized

D. Partially denoralized

Ans: c

Q9. Which of the followings are a valid aggregate functions?

A. COUNT

B. MAX

C. SUM

D. COMPUTE

Ans: adc

Q10. Which of the following is not a DDL command?

A. TRUNCATE

B. UPDATE

C. CREATE

D. ALTER

Ans: b

Q11. Select True or False. Primary Key column supports NULL values.

ans – false

Q12. Which of the following are TCL (Transaction Control Language) commands?

A. COMMIT and ROLLBACK

B. GRANT and REVOKE

C. SELECT and INSERT

D. UPDATE and TRUNCATE

Ans: a

Q13. Which type of Programming does Python support?

A. object-oriented programming

B. functional programming

C. all of the mentioned

D. structured programming

Ans: c

Q14. What will be the output of the following Python code snippet x <<2 if x = 1

Ans – 2

Q15. Decision support systems and data lakes are essentially the same technology. True/False

Ans – False

Q16. Select True or False. We can secure data in a datalake with mechanisms such as authentication and authorization.

Ans – True

Q17. What are the features of a datalake?

A. Supports data ingestion.

B. Provides data storage

C. Contributes to data discovery

D. Enables data audit

Ans: a b

Q18. The different types of analysis which are supported through data lakes are:

A. Continuous

B. Realtime

C. Interactive

D. Batch

Ans: b c

Q19. What is the best example of an additive measure? Choose the best option

A. A date

B. A sales amount

C. A percentage value

D. A product category

Ans: b

Q20. How to ensure the data warehouse (DWH) that is initially loaded, will continue to get recentday-to-day transactions data regularly? Choose the best option

A. DWH admin has to manually extract it from the sources and keep loading them.

B. A complex software interface betwoon DWH & operational system has to be built for this.

C. Appropriate ETL jobs have to run regularly to keep refreshing DWH data.

D. Not possible DWH always works only with initially loaded data

Ans: c

Q21. Which of the following statement is true related to ‘non-volatile’ feature of a data warehouse(DWH)? Choose the best option

A.In DWH, data is written once and ready many times

B. In a DWH all CRUD operations are done; but data marts are only read

C.In a DWH, data is only written, never read

D. All types of OLTP operations are done in a DWH

Ans: d

Q22. While loading the data to the data warehouse, if the data is already present in it, what option you will choose? Choose the best option

A. Overwrite the existing data


B. apply change data capture to overwite or keeps history

C. delete the old data

D. keeps the history of data

Ans: b

Q23. Differences between data warehouse (DWH) and data mart (DM) are? Choose the best option.

A. DWH holds disparate data, DM holds specific department data

B. DWH is easy to maintain, DM is difficult to maintain

C. Only (ii) & (i) are correct

D. Only (i) is correct

E.Data held by DWH is specific, held by DM is detailed

F. Only (iii) & (ii) are correct

Ans: d

Q24. Which of the following is an ETL Bug?Choose the option that best answers the question.

A. Duplicate data loaded

B. Trim and Null issue

C. index is not created

D. data precission issue

E. All the above

Ans: e

Q25. Which statement about dimension tables is true? Choose the best option

A. A dimension table can be utilized across several reports, and it is typically reusable

B. Dimension tables are used to describe measures, and therefore, they contain dimension keys, values, and attributes

C. A dimension table is a collection of references to a non-measurable table or tables

D. Dimension tables often have more records compared to facts tables

Ans: d

Q26. Which of the following statements regarding OLTP are true?Choose all options that best answer the question.

A. OLTPS processes huge, slow and non-responsive workloads that expects faster response

B. OLTPs processes small, quick and interactive workloads that expects faster response time

C. OLTPS are often referred to as operational systems

D. OLTPS are often referred to as transaction systems

Ans: b d

Q27. While using ETL tools which of the options can be considered as best practices ?Choose all options that best answer the question.

A. Maximize data input

B. Try to cleanse 100% of the data before you begin the process

C. Try to automate as much as possible

D. Use incremental data updates

E. Maximize data quality

F. Use parallel processing

Ans: a e f

Q28. Joiner transformation is an example of

A. Active Tranformation

B. Passive Transformation

C. None of the above

Ans: a

Q29. Which schema would you choose?You need to create a data warehousing schema that consumes the least amount of memory.

A. The hybrid schema

B. It doesn’t matter which schema you use as memory consumption does not depend on the

C. schema type

D. The dimensional schema

E. The star schema

F. The snowflake schema

Ans: f

Q30. Which of these statements accurately describe statement testing?Choose all options that best answer the question.

A. It is considered a white-box testing technique

B. Exercises executable statements in the code

C. 100% statement coverage guarantees 100% decision coverage

D. It is considered a black-box testing technique

Ans: a b

Q31. Which of these accurately describe equivalence testing?Choose all options that best answer the question

A. Invalid test partitions should be tested collectively

B. Each value.may only belong to only one partition

C. Invalid test partitions should be tested individually

D. A value may appear in multiple partitions

Ans: c b

Q32. Which of these statements accurately describe boundary value analysis?Choose all options that best answer the question

A. Applicable only at component and component integration test levels

B. Typically used to test requirements using a range of numbers

C. Coverage calculation is normally expressed as a percentage

D. Effectively tests values only at the edges of equivalence classes

Ans: c b

Q33. Identify some of the essential phases of software development lifecycle. Choose all options that best answer the question.

A. Coding

B. Testing

C. Integration

D. Staging

Ans: a b

Q34. Which sector(s) are well suited for the Waterfall approach?Choose all options that best answer the question.

A. Startups

B. Unregulated industries

C. Gaming industry

D. Pharmaceuticals

E. Government

Ans: e d c

Q35. Which of these elements do we need to specify in a test case template for manualtesting ?Choose all options that best answer the question

A. Test case number

B. Attribute details

C. Test steps

D. Test data

Ans: a d

Q36. Given a data entry form that allows a single numeric character that may consist of integers from 0 to 9 and valid values from 3 to 6, which of these sets of values would cover all three partitions? Choose the best option

Ans – 1,5,8

Q37. Identify some of the critical phases of the Waterfall model.Choose all options that best answer the question.

A. Engineering

B. Programmed implementation and testing

C. Requirement gathering and analysis

D. Risk analysis

Ans: b c d

Q38. Which of the following items apply to the SDLC project initiation phase?Choose all options that best answer the question.

A.Fuzz testing

B. Conduct risk assessment

C. Define business requirements

D. Peer review

Ans: b c


Leave a Reply

Your email address will not be published. Required fields are marked *