|
46 | 46 |
|
47 | 47 | ### A Machine Learning Specialist is building a model that will perform time series forecasting using Amazon SageMaker. The Specialist has finished training the model and is now planning to perform load testing on the endpoint so they can configure Auto Scaling for the model variant. Which approach will allow the Specialist to review the latency, memory utilization, and CPU utilization during the load test? |
48 | 48 |
|
49 | | -- [x] Review SageMaker logs that have been written to Amazon S3 by leveraging Amazon Athena and Amazon QuickSight to visualize logs as they are being produced. |
| 49 | +- [ ] Review SageMaker logs that have been written to Amazon S3 by leveraging Amazon Athena and Amazon QuickSight to visualize logs as they are being produced. |
50 | 50 | - [x] Generate an Amazon CloudWatch dashboard to create a single view for the latency, memory utilization, and CPU utilization metrics that are outputted by Amazon SageMaker. |
51 | 51 | - [ ] Build custom Amazon CloudWatch Logs and then leverage Amazon ES and Kibana to query and visualize the log data as it is generated by Amazon SageMaker. |
52 | | -- [ ] Send Amazon CloudWatch Logs that were generated by Amazon SageMaker to Amazon ES and use Kibana to query and visualize the log data |
| 52 | +- [ ] Send Amazon CloudWatch Logs that were generated by Amazon SageMaker to Amazon ES and use Kibana to query and visualize the log data. |
53 | 53 |
|
54 | 54 | ### A manufacturing company has structured and unstructured data stored in an Amazon S3 bucket. A Machine Learning Specialist wants to use SQL to run queries on this data. Which solution requires the LEAST effort to be able to query this data? |
55 | 55 |
|
|
81 | 81 |
|
82 | 82 | ### A Machine Learning Specialist is working with a large company to leverage machine learning within its products. The company wants to group its customers into categories based on which customers will and will not churn within the next 6 months. The company has labeled the data available to the Specialist. Which machine learning model type should the Specialist use to accomplish this task? |
83 | 83 |
|
84 | | -- [x] Linear regression. |
| 84 | +- [ ] Linear regression. |
85 | 85 | - [x] Classification. |
86 | | -- [x] Clustering. |
| 86 | +- [ ] Clustering. |
87 | 87 | - [ ] Reinforcement learning. |
88 | 88 |
|
89 | 89 | ### The displayed graph is from a forecasting model for testing a time series. Considering the graph only, which conclusion should a Machine Learning Specialist make about the behavior of the model? |
|
404 | 404 |
|
405 | 405 |  |
406 | 406 |
|
407 | | -- [ ] Early stopping. |
| 407 | +- [x] Early stopping. |
408 | 408 | - [ ] Random initialization of weights with appropriate seed. |
409 | | -- [x] Increasing the number of epochs. |
| 409 | +- [ ] Increasing the number of epochs. |
410 | 410 | - [ ] Adding another layer with the 100 neurons. |
411 | 411 |
|
412 | 412 | ### A company wants to predict the sale prices of houses based on available historical sales data. The target variable in the company's dataset is the sale price. The features include parameters such as the lot size, living area measurements, non-living area measurements, number of bedrooms, number of bathrooms, year built, and postal code. The company wants to use multi-variable linear regression to predict house sale prices. Which step should a machine learning specialist take to remove features that are irrelevant for the analysis and reduce the model's complexity? |
|
663 | 663 |
|
664 | 664 | ### A Data Scientist is developing a machine learning model to classify whether a financial transaction is fraudulent. The labeled data available for training consists of 100,000 non-fraudulent observations and 1,000 fraudulent observations. The Data Scientist applies the XGBoost algorithm to the data, resulting in the following confusion matrix when the trained model is applied to a previously unseen validation dataset. The accuracy of the model is 99.1%, but the Data Scientist has been asked to reduce the number of false negatives. Which combination of steps should the Data Scientist take to reduce the number of false positive predictions by the model? (Choose two.) |
665 | 665 |
|
666 | | - |
| 666 | + |
667 | 667 |
|
668 | 668 | - [ ] Change the XGBoost eval_metric parameter to optimize based on rmse instead of error. |
669 | 669 | - [ ] Increase the XGBoost scale_pos_weight parameter to adjust the balance of positive and negative weights. |
@@ -826,20 +826,6 @@ Data Firehose for clickstream analytics; AWS Glue to generate personalizedproduc |
826 | 826 | for delivery to Amazon ES for clickstream analytics; Amazon EMR togenerate personalized product recommendations. |
827 | 827 | - [ ] Amazon Athena as the data catalog; Amazon Kinesis Data Streams and Amazon Kinesis Data Analytics for historical data insights; Amazon DynamoDB streams for clickstream analytics; AWS Glue to generate personalized productrecommendations. |
828 | 828 |
|
829 | | -### A Machine Learning Specialist is working with a large company to leverage machine learning within its products. The company wants to group its customers into categories based on which customers will and will not churn within the next 6 months. The company has labeled the data available to the Specialist. Which machine learning model type should the Specialist use to accomplish this task? |
830 | | - |
831 | | -- [ ] Linear regression |
832 | | -- [x] Classification |
833 | | -- [ ] Clustering |
834 | | -- [ ] Reinforcement learning |
835 | | - |
836 | | -### A Machine Learning Specialist is building a model that will perform time series forecasting using Amazon SageMaker. The Specialist has finished training the model and is now planning to perform load testing on the endpoint so they can configure Auto Scaling for the model variant. Which approach will allow the Specialist to review the latency, memory utilization, and CPU utilization during the load test? |
837 | | - |
838 | | -- [ ] Review SageMaker logs that have been written to Amazon S3 by leveraging Amazon Athena and Amazon QuickSight to visualize logs as they are being produced. |
839 | | -- [x] Generate an Amazon CloudWatch dashboard to create a single view for the latency, memory utilization, and CPU utilization metrics that are outputted by Amazon SageMaker. |
840 | | -- [ ] Build custom Amazon CloudWatch Logs and then leverage Amazon ES and Kibana to query and visualize the log data as it is generated by Amazon SageMaker. |
841 | | -- [ ] Send Amazon CloudWatch Logs that were generated by Amazon SageMaker to Amazon ES and use Kibana to query and visualize the log data. |
842 | | - |
843 | 829 | ### A manufacturing company has structured and unstructured data stored in an Amazon S3 bucket. A Machine Learning Specialist wants to use SQL to run queries on this data Which solution requires the LEAST effort to be able to query this data? |
844 | 830 |
|
845 | 831 | - [ ] Use AWS Data Pipeline to transform the data and Amazon RDS to run queries. |
|
0 commit comments