DEA-C01 Exam Question 101
A company is migrating its database servers from Amazon EC2 instances that run Microsoft SQL Server to Amazon RDS for Microsoft SQL Server DB instances. The company's analytics team must export large data elements every day until the migration is complete. The data elements are the result of SQL joins across multiple tables. The data must be in Apache Parquet format. The analytics team must store the data in Amazon S3.
Which solution will meet these requirements in the MOST operationally efficient way?
Which solution will meet these requirements in the MOST operationally efficient way?
DEA-C01 Exam Question 102
A data engineer must ingest a source of structured data that is in .csv format into an Amazon S3 data lake. The .csv files contain 15 columns. Data analysts need to run Amazon Athena queries on one or two columns of the dataset. The data analysts rarely query the entire file.
Which solution will meet these requirements MOST cost-effectively?
Which solution will meet these requirements MOST cost-effectively?
DEA-C01 Exam Question 103
A company uses Apache Airflow to orchestrate the company's current on-premises data pipelines. The company runs SQL data quality check tasks as part of the pipelines. The company wants to migrate the pipelines to AWS and to use AWS managed services.
Which solution will meet these requirements with the LEAST amount of refactoring?
Which solution will meet these requirements with the LEAST amount of refactoring?
DEA-C01 Exam Question 104
Which of the following System keeps the following characteristics?
a. It will keep in it all the raw data.
b. Generally, the users of it is data scientists and data developers.
c. Flat architecture
d. Highly agile
a. It will keep in it all the raw data.
b. Generally, the users of it is data scientists and data developers.
c. Flat architecture
d. Highly agile
DEA-C01 Exam Question 105
A security company stores IoT data that is in JSON format in an Amazon S3 bucket. The data structure can change when the company upgrades the IoT devices. The company wants to create a data catalog that includes the IoT data. The company's analytics department will use the data catalog to index the data.
Which solution will meet these requirements MOST cost-effectively?
Which solution will meet these requirements MOST cost-effectively?
