Hello friends in this post we are going to discuss about DW and ETL Testing Multiple Choice Questions| DW and ETL Testing Objective Type Questions| DW and ETL Testing MCQ with answers
Q1. Which of the following language is not supported by Spark?
Q2. Hive is a RDBMS in Hadoop ecosystem
Ans – True
Q3. _______is a distributed graph processing framework on top of Spark.
B. None of the above
C. Spark Streaming
Q4. Identify the interactive visualisation tool in AWS
A. AWS Glue
B. Amazon Athena
Q5. Which of the following are examples of ETL tools?
A. SQL server
B. IBM Infoshphere Datastage
D. Amazon $3
Q6.What are the different types of partitioning Methods?
A. load process and row based
B. load process and column based
C. load process and view based
D. row and column based
Q7. Which Talend components are used to process input/output for delimited files?
Ans: a b d
Q8. Fact Tables are which of the following?
A. Completely denoralized
B. Completely normalized
C. Partially normalized
D. Partially denoralized
Q9. Which of the followings are a valid aggregate functions?
Q10. Which of the following is not a DDL command?
Q11. Select True or False. Primary Key column supports NULL values.
ans – false
Q12. Which of the following are TCL (Transaction Control Language) commands?
A. COMMIT and ROLLBACK
B. GRANT and REVOKE
C. SELECT and INSERT
D. UPDATE and TRUNCATE
Q13. Which type of Programming does Python support?
A. object-oriented programming
B. functional programming
C. all of the mentioned
D. structured programming
Q14. What will be the output of the following Python code snippet x <<2 if x = 1
Ans – 2
Q15. Decision support systems and data lakes are essentially the same technology. True/False
Ans – False
Q16. Select True or False. We can secure data in a datalake with mechanisms such as authentication and authorization.
Ans – True
Q17. What are the features of a datalake?
A. Supports data ingestion.
B. Provides data storage
C. Contributes to data discovery
D. Enables data audit
Ans: a b
Q18. The different types of analysis which are supported through data lakes are:
Ans: b c
Q19. What is the best example of an additive measure? Choose the best option
A. A date
B. A sales amount
C. A percentage value
D. A product category
Q20. How to ensure the data warehouse (DWH) that is initially loaded, will continue to get recentday-to-day transactions data regularly? Choose the best option
A. DWH admin has to manually extract it from the sources and keep loading them.
B. A complex software interface betwoon DWH & operational system has to be built for this.
C. Appropriate ETL jobs have to run regularly to keep refreshing DWH data.
D. Not possible DWH always works only with initially loaded data
Q21. Which of the following statement is true related to ‘non-volatile’ feature of a data warehouse(DWH)? Choose the best option
A.In DWH, data is written once and ready many times
B. In a DWH all CRUD operations are done; but data marts are only read
C.In a DWH, data is only written, never read
D. All types of OLTP operations are done in a DWH
Q22. While loading the data to the data warehouse, if the data is already present in it, what option you will choose? Choose the best option
A. Overwrite the existing data
B. apply change data capture to overwite or keeps history
C. delete the old data
D. keeps the history of data
Q23. Differences between data warehouse (DWH) and data mart (DM) are? Choose the best option.
A. DWH holds disparate data, DM holds specific department data
B. DWH is easy to maintain, DM is difficult to maintain
C. Only (ii) & (i) are correct
D. Only (i) is correct
E.Data held by DWH is specific, held by DM is detailed
F. Only (iii) & (ii) are correct
Q24. Which of the following is an ETL Bug?Choose the option that best answers the question.
A. Duplicate data loaded
B. Trim and Null issue
C. index is not created
D. data precission issue
E. All the above
Q25. Which statement about dimension tables is true? Choose the best option
A. A dimension table can be utilized across several reports, and it is typically reusable
B. Dimension tables are used to describe measures, and therefore, they contain dimension keys, values, and attributes
C. A dimension table is a collection of references to a non-measurable table or tables
D. Dimension tables often have more records compared to facts tables
Q26. Which of the following statements regarding OLTP are true?Choose all options that best answer the question.
A. OLTPS processes huge, slow and non-responsive workloads that expects faster response
B. OLTPs processes small, quick and interactive workloads that expects faster response time
C. OLTPS are often referred to as operational systems
D. OLTPS are often referred to as transaction systems
Ans: b d
Q27. While using ETL tools which of the options can be considered as best practices ?Choose all options that best answer the question.
A. Maximize data input
B. Try to cleanse 100% of the data before you begin the process
C. Try to automate as much as possible
D. Use incremental data updates
E. Maximize data quality
F. Use parallel processing
Ans: a e f
Q28. Joiner transformation is an example of
A. Active Tranformation
B. Passive Transformation
C. None of the above
Q29. Which schema would you choose?You need to create a data warehousing schema that consumes the least amount of memory.
A. The hybrid schema
B. It doesn’t matter which schema you use as memory consumption does not depend on the
C. schema type
D. The dimensional schema
E. The star schema
F. The snowflake schema
Q30. Which of these statements accurately describe statement testing?Choose all options that best answer the question.
A. It is considered a white-box testing technique
B. Exercises executable statements in the code
C. 100% statement coverage guarantees 100% decision coverage
D. It is considered a black-box testing technique
Ans: a b
Q31. Which of these accurately describe equivalence testing?Choose all options that best answer the question
A. Invalid test partitions should be tested collectively
B. Each value.may only belong to only one partition
C. Invalid test partitions should be tested individually
D. A value may appear in multiple partitions
Ans: c b
Q32. Which of these statements accurately describe boundary value analysis?Choose all options that best answer the question
A. Applicable only at component and component integration test levels
B. Typically used to test requirements using a range of numbers
C. Coverage calculation is normally expressed as a percentage
D. Effectively tests values only at the edges of equivalence classes
Ans: c b
Q33. Identify some of the essential phases of software development lifecycle. Choose all options that best answer the question.
Ans: a b
Q34. Which sector(s) are well suited for the Waterfall approach?Choose all options that best answer the question.
B. Unregulated industries
C. Gaming industry
Ans: e d c
Q35. Which of these elements do we need to specify in a test case template for manualtesting ?Choose all options that best answer the question
A. Test case number
B. Attribute details
C. Test steps
D. Test data
Ans: a d
Q36. Given a data entry form that allows a single numeric character that may consist of integers from 0 to 9 and valid values from 3 to 6, which of these sets of values would cover all three partitions? Choose the best option
Ans – 1,5,8
Q37. Identify some of the critical phases of the Waterfall model.Choose all options that best answer the question.
B. Programmed implementation and testing
C. Requirement gathering and analysis
D. Risk analysis
Ans: b c d
Q38. Which of the following items apply to the SDLC project initiation phase?Choose all options that best answer the question.
B. Conduct risk assessment
C. Define business requirements
D. Peer review
Ans: b c