
DAS-C01 Exam Prep: Study Materials & Mock Tests, AWS Certified Data Analytics | SPOTO

Prepare effectively for the DAS-C01 Exam with SPOTO's Study Materials & Mock Tests for AWS Certified Data Analytics - Specialty (DAS-C01). This certification is crucial for professionals in data analytics roles, affirming their ability to utilize AWS services for designing, securing, and managing analytics solutions. Our study materials cover a range of topics including exam questions and answers, practice tests, sample questions, and exam dumps, ensuring a thorough understanding of key concepts. Access exam materials, answers, and an exam simulator for realistic practice scenarios. Benefit from online exam questions and mock exams to assess your readiness and improve exam performance. SPOTO provides the resources and support needed to excel in the AWS Certified Data Analytics - Specialty (DAS-C01) exam and advance your career in data analytics.


Question #1
A company that monitors weather conditions from remote construction sites is setting up a solution to collect temperature data from the following two weather stations: Station A, which has 10 sensors, and Station B, which has five sensors. These weather stations were placed by onsite subject-matter experts. Each sensor has a unique ID. The data collected from each sensor will be collected using Amazon Kinesis Data Streams. Based on the total incoming and outgoing data throughput, a single Amazon Kinesis data stream with two shards is created, and two partition keys are created based on the station names. During testing, there is a bottleneck on data coming from Station A, but not from Station B. Upon review, it is confirmed that the total stream throughput is still less than the allocated Kinesis Data Streams throughput. How can this bottleneck be resolved without increasing the overall cost and complexity of the solution, while retaining the data collection quality requirements?
A. Increase the number of shards in Kinesis Data Streams to increase the level of parallelism
B. Create a separate Kinesis data stream for Station A with two shards, and stream Station A sensor data to the new stream
C. Modify the partition key to use the sensor ID instead of the station name
D. Reduce the number of sensors in Station A from 10 to 5 sensors
Correct Answer: C
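The listed answer works because Kinesis hashes the partition key to pick a shard; with only two station-name keys, all of Station A's traffic can land on one shard. As a hedged sketch (the stream name, region, and payload fields below are hypothetical), a boto3 producer keyed on the sensor ID might look like this:

```python
import json

import boto3

kinesis = boto3.client("kinesis", region_name="us-east-1")  # region is an assumption

def publish_reading(sensor_id: str, station: str, temperature_c: float) -> None:
    """Send one temperature reading, partitioned by sensor ID rather than station name."""
    record = {"sensor_id": sensor_id, "station": station, "temperature_c": temperature_c}
    kinesis.put_record(
        StreamName="weather-telemetry",   # hypothetical stream name
        Data=json.dumps(record).encode("utf-8"),
        PartitionKey=sensor_id,           # 15 distinct keys hash across both shards
    )

# Example usage
publish_reading("station-a-sensor-07", "Station A", 21.4)
```

Because there are 15 distinct sensor IDs instead of two station names, records spread across both shards without adding shards or a second stream.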
Question #2
An insurance company has raw data in JSON format that is sent without a predefined schedule through an Amazon Kinesis Data Firehose delivery stream to an Amazon S3 bucket. An AWS Glue crawler is scheduled to run every 8 hours to update the schema in the data catalog of the tables stored in the S3 bucket. Data analysts analyze the data using Apache Spark SQL on Amazon EMR set up with AWS Glue Data Catalog as the metastore. Data analysts say that, occasionally, the data they receive is stale. A data engineer needs to provide access to the most current data. Which solution meets these requirements?
A. Create an external schema based on the AWS Glue Data Catalog on the existing Amazon Redshift cluster to query new data in Amazon S3 with Amazon Redshift Spectrum
B. Use Amazon CloudWatch Events with the rate (1 hour) expression to execute the AWS Glue crawler every hour
C. Using the AWS CLI, modify the execution schedule of the AWS Glue crawler from 8 hours to 1 minute
D. Run the AWS Glue crawler from an AWS Lambda function triggered by an S3:ObjectCreated:* event notification on the S3 bucket
Correct Answer: B
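To show how the hourly schedule in the listed answer could be wired up, here is a sketch using boto3. The rule name, Lambda ARN, and crawler name are placeholders, and the target Lambda function is assumed to already exist with permission to call glue:StartCrawler.

```python
import boto3

events = boto3.client("events")
glue = boto3.client("glue")

# CloudWatch Events (EventBridge) rule that fires every hour.
events.put_rule(
    Name="hourly-glue-crawler",        # hypothetical rule name
    ScheduleExpression="rate(1 hour)",
    State="ENABLED",
)

# Point the rule at a Lambda function that starts the crawler.
events.put_targets(
    Rule="hourly-glue-crawler",
    Targets=[{
        "Id": "start-crawler",
        "Arn": "arn:aws:lambda:us-east-1:123456789012:function:start-json-crawler",  # placeholder ARN
    }],
)

# Lambda handler: kick off the crawler so the Data Catalog stays at most an hour behind.
def handler(event, context):
    glue.start_crawler(Name="raw-json-crawler")  # hypothetical crawler name
```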
Question #3
A human resources company maintains a 10-node Amazon Redshift cluster to run analytics queries on the company’s data. The Amazon Redshift cluster contains a product table and a transactions table, and both tables have a product_sku column. The tables are over 100 GB in size. The majority of queries run on both tables. Which distribution style should the company use for the two tables to achieve optimal query performance?
A. An EVEN distribution style for both tables
B. A KEY distribution style for both tables
C. An ALL distribution style for the product table and an EVEN distribution style for the transactions table
D. An EVEN distribution style for the product table and a KEY distribution style for the transactions table
Correct Answer: C
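The answer choices differ only in the DISTSTYLE clause of each table's DDL. As a neutral syntax sketch (the cluster, database, user, and column definitions are placeholders), the three styles can be declared like this through the Redshift Data API:

```python
import boto3

redshift_data = boto3.client("redshift-data")

# Each statement demonstrates one distribution style; the answer choices combine these.
ddl_statements = [
    # ALL: a full copy of the table is stored on every node.
    """CREATE TABLE product_all (
           product_sku VARCHAR(32),
           product_name VARCHAR(256)
       ) DISTSTYLE ALL;""",
    # EVEN: rows are spread round-robin across slices, regardless of column values.
    """CREATE TABLE transactions_even (
           transaction_id BIGINT,
           product_sku VARCHAR(32)
       ) DISTSTYLE EVEN;""",
    # KEY: rows with the same product_sku land on the same slice, so joins on
    # product_sku between tables sharing this DISTKEY avoid data redistribution.
    """CREATE TABLE transactions_key (
           transaction_id BIGINT,
           product_sku VARCHAR(32)
       ) DISTSTYLE KEY DISTKEY (product_sku);""",
]

redshift_data.batch_execute_statement(
    ClusterIdentifier="analytics-cluster",  # placeholder cluster identifier
    Database="dev",                         # placeholder database
    DbUser="admin",                         # placeholder database user
    Sqls=ddl_statements,
)
```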
Question #4
A company has a business unit uploading .csv files to an Amazon S3 bucket. The company’s data platform team has set up an AWS Glue crawler to do discovery, and create tables and schemas. An AWS Glue job writes processed data from the created tables to an Amazon Redshift database. The AWS Glue job handles column mapping and creating the Amazon Redshift table appropriately. When the AWS Glue job is rerun for any reason in a day, duplicate records are introduced into the Amazon Redshift table. Which solution will update the Redshift table without duplicates when jobs are rerun?
A. Modify the AWS Glue job to copy the rows into a staging table. Add SQL commands to replace the existing rows in the main table as postactions in the DynamicFrameWriter class.
B. Load the previously inserted data into a MySQL database in the AWS Glue job. Perform an upsert operation in MySQL, and copy the results to the Amazon Redshift table.
C. Use Apache Spark’s DataFrame dropDuplicates() API to eliminate duplicates and then write the data to Amazon Redshift.
D. Use the AWS Glue ResolveChoice built-in transform to select the most recent value of the column.
Correct Answer: A
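As background for the listed answer, the staging-table pattern is usually expressed through the preactions/postactions connection options when a Glue job writes to Amazon Redshift. The sketch below assumes it runs inside a Glue job; the catalog database, Glue connection, table names, and key column are placeholders.

```python
# Runs inside an AWS Glue job; the awsglue libraries are provided by the Glue runtime.
from awsglue.context import GlueContext
from pyspark.context import SparkContext

glue_context = GlueContext(SparkContext.getOrCreate())

# Read the crawler-created table (placeholder database/table names).
dyf = glue_context.create_dynamic_frame.from_catalog(
    database="csv_db", table_name="applications"
)

# After loading into the staging table, merge into the main table and drop the staging copy.
merge_sql = (
    "BEGIN;"
    "DELETE FROM public.applications USING public.applications_staging "
    "  WHERE public.applications.record_id = public.applications_staging.record_id;"
    "INSERT INTO public.applications SELECT * FROM public.applications_staging;"
    "DROP TABLE public.applications_staging;"
    "END;"
)

glue_context.write_dynamic_frame.from_jdbc_conf(
    frame=dyf,
    catalog_connection="redshift-connection",           # placeholder Glue connection
    connection_options={
        "dbtable": "public.applications_staging",       # write to the staging table first
        "database": "dev",
        "postactions": merge_sql,                       # then replace rows in the main table
    },
    redshift_tmp_dir="s3://example-temp-bucket/glue/",  # placeholder temp location
)
```

Because the merge runs as a postaction of the same write, rerunning the job replaces matching rows instead of appending duplicates.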
Question #5
A hospital uses wearable medical sensor devices to collect data from patients. The hospital is architecting a near-real-time solution that can ingest the data securely at scale. The solution should also be able to remove the patient’s protected health information (PHI) from the streaming data and store the data in durable storage. Which solution meets these requirements with the least operational overhead?
A. Ingest the data using Amazon Kinesis Data Streams, which invokes an AWS Lambda function using the Kinesis Client Library (KCL) to remove all PHI. Write the data in Amazon S3.
B. Ingest the data using Amazon Kinesis Data Firehose to write the data to Amazon S3. Have Amazon S3 trigger an AWS Lambda function that parses the sensor data to remove all PHI in Amazon S3.
C. Ingest the data using Amazon Kinesis Data Streams to write the data to Amazon S3. Have the data stream launch an AWS Lambda function that parses the sensor data and removes all PHI in Amazon S3.
D. Ingest the data using Amazon Kinesis Data Firehose to write the data to Amazon S3. Implement a transformation AWS Lambda function that parses the sensor data to remove all PHI.
Correct Answer: A
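Several of the options revolve around an AWS Lambda function that strips PHI before the records reach durable storage. A minimal sketch of such a function is shown below, assuming JSON records arriving from a Kinesis stream; the PHI field names and bucket are hypothetical.

```python
import base64
import json
import uuid

import boto3

s3 = boto3.client("s3")

# Field names treated as PHI here are assumptions for illustration only.
PHI_FIELDS = {"patient_name", "date_of_birth", "medical_record_number"}

def handler(event, context):
    """Triggered by a Kinesis stream; removes PHI fields and stores sanitized records in S3."""
    sanitized = []
    for record in event["Records"]:
        payload = json.loads(base64.b64decode(record["kinesis"]["data"]))
        for field in PHI_FIELDS:
            payload.pop(field, None)
        sanitized.append(payload)

    s3.put_object(
        Bucket="example-sanitized-sensor-data",   # placeholder bucket
        Key=f"readings/{uuid.uuid4()}.json",
        Body=json.dumps(sanitized).encode("utf-8"),
    )
```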
Question #6
Three teams of data analysts use Apache Hive on an Amazon EMR cluster with the EMR File System (EMRFS) to query data stored within each team’s Amazon S3 bucket. The EMR cluster has Kerberos enabled and is configured to authenticate users from the corporate Active Directory. The data is highly sensitive, so access must be limited to the members of each team. Which steps will satisfy the security requirements?
A. For the EMR cluster Amazon EC2 instances, create a service role that grants no access to Amazon S3. Create three additional IAM roles, each granting access to each team’s specific bucket. Add the additional IAM roles to the cluster’s EMR role for the EC2 trust policy. Create a security configuration mapping for the additional IAM roles to Active Directory user groups for each team.
B. For the EMR cluster Amazon EC2 instances, create a service role that grants no access to Amazon S3. Create three additional IAM roles, each granting access to each team’s specific bucket. Add the service role for the EMR cluster EC2 instances to the trust policies for the additional IAM roles. Create a security configuration mapping for the additional IAM roles to Active Directory user groups for each team.
Correct Answer: B
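The team-level isolation in this question comes from an EMR security configuration that maps IAM roles to Active Directory groups for EMRFS requests. A rough sketch of creating one with boto3 follows; the role ARNs, group names, and configuration name are placeholders.

```python
import json

import boto3

emr = boto3.client("emr")

# EMRFS role mappings: requests from members of each AD group assume that team's IAM role,
# which is scoped to that team's S3 bucket.
security_configuration = {
    "AuthorizationConfiguration": {
        "EmrFsConfiguration": {
            "RoleMappings": [
                {
                    "Role": "arn:aws:iam::123456789012:role/TeamA-S3-Access",  # placeholder ARN
                    "IdentifierType": "Group",
                    "Identifiers": ["TeamA"],
                },
                {
                    "Role": "arn:aws:iam::123456789012:role/TeamB-S3-Access",
                    "IdentifierType": "Group",
                    "Identifiers": ["TeamB"],
                },
                {
                    "Role": "arn:aws:iam::123456789012:role/TeamC-S3-Access",
                    "IdentifierType": "Group",
                    "Identifiers": ["TeamC"],
                },
            ]
        }
    }
}

emr.create_security_configuration(
    Name="per-team-emrfs-access",   # placeholder configuration name
    SecurityConfiguration=json.dumps(security_configuration),
)
```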
Question #7
A company launched a service that produces millions of messages every day and uses Amazon Kinesis Data Streams as the streaming service. The company uses the Kinesis SDK to write data to Kinesis Data Streams. A few months after launch, a data analyst found that write performance is significantly reduced. The data analyst investigated the metrics and determined that Kinesis is throttling the write requests. The data analyst wants to address this issue without significant changes to the architecture. Which action should the data analyst take to resolve this issue?
A. Increase the Kinesis Data Streams retention period to reduce throttling
B. Replace the Kinesis API-based data ingestion mechanism with Kinesis Agent
C. Increase the number of shards in the stream using the UpdateShardCount API
D. Choose partition keys in a way that results in a uniform record distribution across shards
E. Customize the application code to include retry logic to improve performance
Correct Answer: D
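The listed answer targets "hot shard" throttling: if the partition key distribution is skewed, a single shard hits its 1 MB/s or 1,000 records-per-second write limit even though the stream as a whole has spare capacity. A hedged sketch of writing with high-cardinality keys follows; the stream name and message shape are placeholders.

```python
import json
import uuid

import boto3

kinesis = boto3.client("kinesis")

def put_batch(messages):
    """Write a batch of messages with a random UUID partition key per record,
    so writes hash evenly across all shards instead of concentrating on one."""
    kinesis.put_records(
        StreamName="service-messages",              # placeholder stream name
        Records=[
            {
                "Data": json.dumps(message).encode("utf-8"),
                "PartitionKey": str(uuid.uuid4()),  # uniform distribution across shards
            }
            for message in messages
        ],
    )

# Example usage
put_batch([{"event": "signup", "user_id": 1}, {"event": "login", "user_id": 2}])
```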
Question #8
A company has 1 million scanned documents stored as image files in Amazon S3. The documents contain typewritten application forms with information including the applicant first name, applicant last name, application date, application type, and application text. The company has developed a machine learning algorithm to extract the metadata values from the scanned documents. The company wants to allow internal data analysts to analyze and find applications using the applicant name, application date, or application text. Which solution meets these requirements?
A. For each image, use object tags to add the metadata. Use Amazon S3 Select to retrieve the files based on the applicant name and application date.
B. Index the metadata and the Amazon S3 location of the image file in Amazon Elasticsearch Service.
C. Store the metadata and the Amazon S3 location of the image file in an Amazon Redshift table. Allow the data analysts to run ad-hoc queries on the table.
D. Store the metadata and the Amazon S3 location of the image files in an Apache Parquet file in Amazon S3, and define a table in the AWS Glue Data Catalog.
Correct Answer: B
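As a sketch of the listed answer, the snippet below indexes the extracted metadata together with each image's S3 location and then searches by applicant name. The domain endpoint, index name, and field names are assumptions, and a real Amazon Elasticsearch Service domain would normally require request signing or fine-grained access control, which is omitted here.

```python
import requests

ES_ENDPOINT = "https://search-applications-xxxx.us-east-1.es.amazonaws.com"  # placeholder endpoint
INDEX = "applications"

def index_application(doc_id, metadata, s3_location):
    """Store the extracted metadata plus a pointer back to the scanned image in S3."""
    document = {**metadata, "s3_location": s3_location}
    requests.put(f"{ES_ENDPOINT}/{INDEX}/_doc/{doc_id}", json=document, timeout=10)

def find_by_applicant(first_name, last_name):
    """Full-text search over the indexed metadata fields."""
    query = {
        "query": {
            "bool": {
                "must": [
                    {"match": {"applicant_first_name": first_name}},
                    {"match": {"applicant_last_name": last_name}},
                ]
            }
        }
    }
    return requests.get(f"{ES_ENDPOINT}/{INDEX}/_search", json=query, timeout=10).json()

# Example usage
index_application(
    "app-000001",
    {
        "applicant_first_name": "Jane",
        "applicant_last_name": "Doe",
        "application_date": "2020-01-15",
        "application_type": "mortgage",
        "application_text": "Sample scanned form text",
    },
    "s3://example-scanned-forms/app-000001.tif",
)
```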
