A company collects data from parking garages. Analysts have requested the ability to run reports in near real time about the number of vehicles in each garage.
The company wants to build an ingestion pipeline that loads the data into an Amazon Redshift cluster. The solution must alert operations personnel when the number of vehicles in a particular garage exceeds a specific threshold. The alerting query will use garage threshold values as a static reference. The threshold values are stored in
Amazon S3.
What is the MOST operationally efficient solution that meets these requirements?
Answer : A
This solution meets the requirements because:
It uses Amazon Kinesis Data Firehose to collect and deliver data to Amazon Redshift in near real time, without requiring any coding or server management1.
It uses Amazon Kinesis Data Analytics to process and analyze streaming data using SQL queries or Apache Flink applications2.It can also create a reference data source that allows joining streaming data with static data stored in Amazon S33. This way, it can compare the number of vehicles in each garage with the corresponding threshold value from the reference data source.
It uses AWS Lambda to create a serverless function that can be triggered by Kinesis Data Analytics as an output destination4.The Lambda function can then publish an Amazon SNS notification to alert operations personnel when the number of vehicles exceeds the threshold5.
A manufacturing company is storing data from its operational systems in Amazon S3. The company's business analysts need to perform one-time queries of the data in Amazon S3 with Amazon Athen
a. The company needs to access the Athena service from the on-premises network by using a JDBC connection. The company has created a VPC. Security policies mandate that requests to AWS services cannot traverse the internet.
Which combination of steps should a data analytics specialist take to meet these requirements? (Select TWO.)
Answer : A, D
AWS Direct Connect is a service that establishes a dedicated network connection between your on-premises network and AWS1.It can help you reduce network costs, increase bandwidth throughput, and provide a more consistent network experience than internet-based connections1. It can also help you meet the security policy that requires requests to AWS services not to traverse the internet.
An interface VPC endpoint is a type of VPC endpoint that enables you to privately connect your VPC to supported AWS services and VPC endpoint services powered by AWS PrivateLink2.It is represented by one or more Elastic Network Interfaces (ENIs) with private IP addresses in your VPC subnets2. It can also help you meet the security policy that requires requests to AWS services not to traverse the internet.
Amazon Athena now provides an interface VPC endpoint that allows you to connect directly to Athena through an interface VPC endpoint in your VPC3.You can create an interface VPC endpoint to connect to Athena using the AWS console or AWS CLI commands4.You can also configure the JDBC connection to use the interface VPC endpoint for Athena by specifying the endpoint URL as the JDBC URL5.
An online retailer is rebuilding its inventory management system and inventory reordering system to automatically reorder products by using Amazon Kinesis Data Streams. The inventory management system uses the Kinesis Producer Library (KPL) to publish data to a stream. The inventory reordering system uses the Kinesis Client Library (KCL) to consume data from the stream. The stream has been configured to scale as needed. Just before production deployment, the retailer discovers that the inventory reordering system is receiving duplicated data.
Which factors could be causing the duplicated data? (Choose two.)
Answer : B, D
A company developed a new elections reporting website that uses Amazon Kinesis Data Firehose to deliver full logs from AWS WAF to an Amazon S3 bucket. The company is now seeking a low-cost option to perform this infrequent data analysis with visualizations of logs in a way that requires minimal development effort.
Which solution meets these requirements?
Answer : A
https://aws.amazon.com/blogs/big-data/analyzing-aws-waf-logs-with-amazon-es-amazon-athena-and-amazon-quicksight/
A company wants to use an automatic machine learning (ML) Random Cut Forest (RCF) algorithm to visualize complex real-world scenarios, such as detecting seasonality and trends, excluding outers, and imputing missing values.
The team working on this project is non-technical and is looking for an out-of-the-box solution that will require the LEAST amount of management overhead.
Which solution will meet these requirements?
Answer : A
A business intelligence (Bl) engineer must create a dashboard to visualize how often certain keywords are used in relation to others in social media posts about a public figure. The Bl engineer extracts the keywords from the posts and loads them into an Amazon Redshift table. The table displays the keywords and the count corresponding
to each keyword.
The Bl engineer needs to display the top keywords with more emphasis on the most frequently used keywords.
Which visual type in Amazon QuickSight meets these requirements?
Answer : B
A financial services company is building a data lake solution on Amazon S3. The company plans to use analytics offerings from AWS to meet user needs for one-time querying and business intelligence reports. A portion of the columns will contain personally identifiable information (Pll). Only authorized users should be able to see
plaintext PII data.
What is the MOST operationally efficient solution that meets these requirements?
Answer : B
This solution meets the requirements because:
AWS Lake Formation is a fully managed service that allows you to build, secure, and manage data lakes on AWS1.You can use Lake Formation to register your S3 locations as data sources and catalog your data using AWS Glue1.
AWS Lake Formation provides fine-grained data permissions that enable you to control access to your data at the column or row level1.You can use Lake Formation to create two IAM roles and grant them different Select permissions based on the PII status of the columns1.
AWS Lake Formation integrates with various analytics services from AWS, such as Amazon Athena, Amazon Redshift, Amazon EMR, and Amazon QuickSight1.You can use these services to query and visualize your data in S3 using the IAM roles and permissions defined by Lake Formation1.