Breaking News

Data Engineering Quiz Answers

Introduction to Data Engineering Week 01 Quiz Answers

Graded Quiz 01 Answers

Q1. A modern data ecosystem includes a network of continually evolving entities. It includes ___?

Ans. Data sources, enterprises data repository, business stakeholders and tools, applications and infrastructure to manage data.

Q2. Data Engineers work within the data ecosystem to ___?

Ans. Analyze data for deriving insights.

Q3. The goal of data engineering is to make quality data available for fact-finding and decision-making. Which one of these statements capture the process of data engineering?

Ans. Collecting, processing, storing and making data available to users securely.

Q4. Data extracted from disparate sources can be stored in __?

Ans. Databases, data warehouses, data lakes or any other type of data repository.

Q5. From the provided list, select the three emerging technologies that are shaping today’s data ecosystem?

Ans. Cloud Computing, Machine Learning and Big Data.



Graded Quiz 02 Answers

Q1. Which one of these functional skills is essential to the role of a Data Engineer?

Ans. The ability to work with the software development lifecycle.

Q2. Oracle Exadata, IBM Db2 Warehouse on Cloud, IBM Netezza Performance Server and Amazon RedShift are some of the popular______ in use today.

Ans. Data Warehouses

Q3. Data Engineers manage the infrastructure required for the ingestion, processing and storage of data. (True/False)

Ans. 


Introduction to Data Engineering Week 02 Quiz Answers

Graded Quiz 01 Answers

Q1. There are two main types of data repositories- Transactional and Analytical. For high-volume day-to-day operational data such as banking transactions, transactional or OLTP, systems are the ideal choice. (True/False)

Ans. True

Q2. Which of the following is an example of unstructured data?

Ans. Video and Audio files.

Q3. Which one of these file formats is independent of software, hardware and operating systems and can be viewed the same way on any device?

Ans.PDF

Q4. Which data source can return data in plain text, XMT, HTML OR JSON among others?

Ans. PDF

Q5. In the data engineer’s ecosystem, languages are classified by type. What are shell and scripting languages most commonly used for?

Ans. Automating repetitive operational tasks.


Graded Quiz 02 Answers

Q1. Data Marts and Data Warehouses have typically been relational, but the emergence of what technology has helped to let these be used for non-relational data?

Ans. NoSQL

Q2. What is one of the most significant advantages of an RDBMS?

Ans. Is ACID-Compliant

Q3. Which one of the NoSQL database types uses a graphical model to represent and store data, and is particularly useful for visualizing, analyzing and finding connections between different pieces of data?

Ans. Graph-based

Q4. Which of the data repositories serves as a pool of raw data and stores large amounts of structured, semi-structured and unstructured data in their native formats?

Ans. Data Lakes

Q5. While data integration combines disparate data into a unified view of the data, a data pipeline convers the entire data movement journey from source to destination systems, and ETL is a process within data integration. (True/False)

Ans. True



  




Graded Quiz 03 Answers

Q1. What does the attribute “Veracity” imply in the context of Big Data?

Ans. Accuracy and Conformity of data to facts.

Q2. _____________, in the context of Big Data, is the speed at which data accumulates.

Ans. Velocity

Q3. What does the attribute “Value” imply in the context of Big Data?

Ans. Our ability and need to turn data into value

Q4. Apache Spark is a general-purpose data processing engine designed to extract and process Big Data for a wide range of applications. What is one of its key use cases?

Ans. Scalable and reliable Big Data storage.

Q5. Which of the Big Data processing tools is used for reading, writing and managing large data set files that are stored in either HDFS or Apache HBase?

Ans. Hive


Introduction to Data Engineering Week 03 Quiz Answers

Graded Quiz 01 Answers

Q1. Which one of these steps is an intrinsic part of the “Data Processing Layer” of a data platform?

Ans. Read data in batch or streaming modes from storage and apply transformations.

Q2. Systems that are used for capturing high-volume transactional data need to be designed for high-speed read, write and update operations. (True/False)

Ans. True

Q3. What is the role of “Network Access Control” systems in the area of network security?

Ans. To ensure endpoint security by allowing only authorized devices to connect to the network.

Q4. ________ ensures that users access information based on their roles and the privileges assigned to their roles.

Ans. Authorization

Q5. Security Monitoring and Intelligence systems:

Ans. Create an audit history for triage and compliance purposes



Graded Quiz 02 Answers

Q1. Web scraping is used to extract what type of data?

Ans. Data from news sites and NoSQL databases

Q2. __________ focuses on cleaning database of unused data and reducing redundancy and inconsistency.

Ans. Normalization

Q3. Open Refine is an open-source tool that allows you to:

Ans. Transform data into a variety of formats such as TSV, CSV, XLS, XML and JSON.

Q4. When you’re combining rows of data from multiple source tables into a single table, what kind of data transformation are you performing?

Ans. Unions

Q5. When you detect a value in your data set that is vastly different from other observations in the same data set, what would you report that as?

Ans. Outlier

 


Graded Quiz 03 Answers

Q1. What are some of the querying techniques you can apply to identify extreme values in a data column?

Ans. Maximum and Minimum values in a data column

Q2. You can perform partial matches of data values in a data column using:

Ans. Average function

Q3. Tools for ______ break up a job into a series of logical steps which are monitored for completion and time to completion.

Ans. Job-Level Runtime Monitoring

Q4. Database partitioning helps optimize databases for performance. It does this by ______?

Ans. Reducing inconsistencies and anomalies in data

Q5. Database normalization is a design technique that helps reduce inconsistencies and anomalies from data. (True/False)

Ans. True


Graded Quiz 04 Answers

Q1. In which phase of the data lifecycle do you establish the data you need, the amount of data you need and how you intend to use the data you are collecting?

Ans. Data Acquisition

Q2. The process of ________ abstracts the presentation layer without changing the data in the database physically.

Ans. Anonymization

 

Introduction to Data Engineering Week 0 Quiz Answers

Graded Quiz Answers

Q1. Data Engineering is a highly technical field. While communication, collaboration and project management skills are somewhat useful, you don’t need these skills in order to grow in your role as a data engineer. (True/False)

Ans. False

Q2. As a Lead Data Engineer what are some of the things you may be responsible for in addition to your hands-on-skills?

Ans. Converting business requirements into technical specifications.

Q3. What are some of the factors that influence your growth on your journey from an Associate Data Engineer to a Principal Data Engineer role?

Ans. The amount of experience you gain within your chosen area of specialization and your understanding of other arears within data engineering.

Q4. If you are an IT Support Specialist or a Software Tester gaining an entry into the field of data engineering will not be possible for you. (True/False)

Ans. False

Q5. If you have basic familiarity with coding, you can develop some baseline technical skills that can get you started on your journey as a Data Engineer. What are some of these baseline skills?

Ans. Data Engineers manage the infrastructure required for the ingestion, processing and storage of data


No comments