70-773 | Replace 70-773 Exam Study Guides With New Update Exam Questions


Q11. You have one-class support vector machines (SVMs).

You have a large dataset, but you do not have enough training time to fully test the model. What is an alternative method to validate the model?

A. Use Principal Component Analysis (PCA)-Based Anomaly Detection.

B. Replace the SVMs with two-class SVMs.

C. Perform feature selection.

D. Use outlier detection.

Answer: A
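The idea behind option A can be sketched in a few lines. This is a minimal illustration only, not the Azure Machine Learning module: it assumes a tiny synthetic 2-D dataset, keeps a single principal component, and uses squared reconstruction error as the anomaly score. The function names (`mean_center`, `top_component`, `anomaly_score`) are hypothetical helpers invented for this example.

```python
import math
import random


def mean_center(rows):
    """Return the column means and the mean-centered rows."""
    d = len(rows[0])
    mu = [sum(r[j] for r in rows) / len(rows) for j in range(d)]
    return mu, [[r[j] - mu[j] for j in range(d)] for r in rows]


def top_component(centered, iters=200):
    """Leading principal direction via power iteration on the covariance matrix."""
    d = len(centered[0])
    n = len(centered)
    cov = [[sum(r[i] * r[j] for r in centered) / n for j in range(d)]
           for i in range(d)]
    v = [1.0 / math.sqrt(d)] * d
    for _ in range(iters):
        w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    return v


def anomaly_score(x, mu, v):
    """Squared reconstruction error after projecting onto the leading component."""
    c = [xj - mj for xj, mj in zip(x, mu)]
    t = sum(cj * vj for cj, vj in zip(c, v))
    return sum((cj - t * vj) ** 2 for cj, vj in zip(c, v))


random.seed(1)
# Assumed "normal" data: points close to the line x1 = 2 * x0.
train = []
for _ in range(300):
    t = random.gauss(0, 1)
    train.append([t, 2 * t + random.gauss(0, 0.1)])

mu, centered = mean_center(train)
v = top_component(centered)

# A point on the line reconstructs well; a point far off the line does not.
normal_score = anomaly_score([1.0, 2.0], mu, v)
outlier_score = anomaly_score([1.0, -2.0], mu, v)
```

Training here amounts to one covariance computation and a short power iteration, which is why a PCA-based detector can be attractive when training time is limited.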

Q12. You have an Apache Hadoop Hive data warehouse. RevoScaleR is not installed. You need to sort the data according to the variables in the dataset.

What should you do?

A. Connect to the database by using an ODBC connection, and then use the rxSort function.

B. Create a table in the ORC file format.

C. Connect to the database by using an ODBC connection, and then use the rxDataStep function.

D. Execute a Hive query that sorts the data, and then read the results.

Answer: D

Q13. Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution. After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.

You use dplyrXdf and you discover that after you exit the session, the output files that were created were deleted. You need to prevent the files from being deleted.

Solution: You use dplyrXdf with the outFile parameter and specify a path other than the working directory for dplyrXdf.

Does this meet the goal?

A. Yes

B. No

Answer: A

Q15. You need to build a model that estimates the probability of an outcome. You must be able to balance L1 and L2 regularization.

Which classification method should you use?

A. Two-Class Neural Network

B. Two-Class Support Vector Machine

C. Two-Class Decision Forest

D. Two-Class Logistic Regression

Answer: D
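Option D pairs a probabilistic output with penalties that can be weighted between L1 and L2. A minimal sketch of that combination follows, assuming synthetic data and a hand-written proximal gradient descent; it is not the Azure Machine Learning implementation, and the names `train_logistic`, `predict_proba`, `l1`, and `l2` are illustrative choices for this example.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def train_logistic(X, y, l1=0.0, l2=0.0, lr=0.1, epochs=500):
    """Fit weights by batch proximal gradient descent with L1 and L2 penalties."""
    n, d = len(X), len(X[0])
    w = [0.0] * d
    b = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * d
        grad_b = 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j in range(d):
                grad_w[j] += err * xi[j]
            grad_b += err
        for j in range(d):
            # Gradient step on the log-loss plus the smooth L2 term.
            wj = w[j] - lr * (grad_w[j] / n + l2 * w[j])
            # Soft-thresholding handles the non-smooth L1 term.
            w[j] = math.copysign(max(abs(wj) - lr * l1, 0.0), wj)
        b -= lr * grad_b / n
    return w, b


def predict_proba(w, b, x):
    """Estimated probability that x belongs to the positive class."""
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b)


random.seed(0)
# Synthetic data: the label depends on feature 0 only; feature 1 is noise.
X = [[random.gauss(0, 1), random.gauss(0, 1)] for _ in range(200)]
y = [1 if x[0] > 0 else 0 for x in X]

w_plain, b_plain = train_logistic(X, y)
w_reg, b_reg = train_logistic(X, y, l1=0.05, l2=0.05)
```

Raising `l1` pushes small weights toward exactly zero, while raising `l2` shrinks all weights smoothly; adjusting the two values shifts the balance between those effects.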

Q16. You have the following regression forest.

 

Which variable contributes the most to the dependent variable?

A. stack.loss

B. Water.Temp

C. Air.Flow

D. Acid.Conc

Answer: A

Q17. You are running a parallel function that uses the following R code segment. (Line numbers are included for reference only.)

 

You need to complete the R code. The solution must support chunking. Which function should you insert at line 02?

A. rxBTrees

B. rxExec

C. rxDForest

D. rxDTree

Answer: C

Q18. You have cloud and on-premises resources that include Microsoft SQL Server and a big data environment in Apache Hadoop.

You have 50 billion fact records.

You need to build time series models to execute forecasting reports on the fact records. What should you use?

A. RxSpark on the Hadoop cluster

B. RxHadoopMR on the Hadoop cluster

C. RxLocalSeq on the SQL Server database

D. RxLocalParallel on the SQL Server database

Answer: A

Q19. Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution. After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.

You use dplyrXdf and you discover that after you exit the session, the output files that were created were deleted. You need to prevent the files from being deleted.

Solution: You use dplyrXdf with the persist verb.

Does this meet the goal?

A. Yes

B. No

Answer: A
