# Mastering Data Analysis in Excel Week 2 Quiz Answer

## Mastering Data Analysis Week 2 Quiz Answer Coursera

### Binary Classification (graded)

Question 1)

A test for “driving while intoxicated” was given 100 times. 20 people tested were actually intoxicated, and 10 people were mis-classified as intoxicated. What would the False Positive rate be?

• 10%
• 12.5%
• 50%
• 30%

Question 2)

If a fire alarm malfunctions and fails to go off when there actually is a fire, that is a:

• True Positive
• True Negative
• False Negative
• False Positive

Question 3)

Use the Binary Classification Metrics Spreadsheet Definitions to answer the following:

If the “classification incidence/test incidence” is 10% for the whole population, and the true “condition incidence” is 12% for the whole population, the True Positive rate:

• can be 100%
• must be 0%
• must be 100%
• cannot be 100%

Question 4)

Use the Cancer Diagnosis Spreadsheet to answer Questions 4 to 6.

Cancer Diagnosis.xlsx

Keep the cost per False Positive test set at \$500. Use MS Solver to determine the maximum cost per False Negative test that permits an average cost per test of \$100.

• \$17,082
• \$12,262

Question 5)

Assume a cost of \$15,000 per False Negative (FN) and \$100 per False Positive (FP). What is the minimum average cost per test?

• \$1.00
• \$25.98
• \$16,551
• \$259,800

Question 6)

If, instead of assuming a cost \$15,000 per FN and \$100 per FP, the costs are assumed to be \$7,500 per FN and \$50 per FP, what changes?

• The True Positive Rate
• The False Positive Rate
• The minimum Cost per Test
• The minimum cost threshold of 16,551.930

Question 7)

Use logic and the definition in the Binary Performance Metrics Spreadsheet to answer the following question.

Binary Performance Metrics.xlsx

In general, increasing the cost per FN while keeping the cost per FP constant will cause the cost-minimizing threshold score to:

• Decrease
• Increase
• Stay the Same

Question 8)

Make a copy of the Bombers and Seagulls Spreadsheet to answer questions 8-10.

Bombers and Seagulls.xlsx

Modify the spreadsheet data so that there are 4 bombers instead of 3, and 16 seagulls instead of 17, by changing the actual condition for the radar score of 66 from a 0 to a 1 in cell D43.

What is the new Area Under the Curve:

• 0.72
• 0.78
• 0.75
• 0.824

Question 9)

Assuming the costs for classification errors are 5 million pounds per FN and 4 million pounds per FP, how much does changing the value at Cell D43 from 0 to 1 change the minimum cost per event?

• Unknown
• Increases by 950,000 pounds
• Increases by 5 million pounds.
• Increases by 250,000 pounds

Question 10)

Change the cost per FN to 50 million pounds. How does changing the data in cell D43 from a 0 to a 1 change the cost-minimizing threshold?

• Decreases it from 75 to 70
• Increases it from 66 to 75.
• Decreases it from 75 to 66
• Decreases it from 75 to 62

Question 11)

Use the Binary Performance Metrics Spreadsheet definitions to answer the following question.

Binary Performance Metrics.xlsx

A population tested for “driving while intoxicated” has a Condition incidence of 20%. If the test has a true positive rate of 70% and a false positive rate of 10%, what is the test’s Positive Predictive Value (PPV)?

• 0.36
• 0.50
• 0.60
• 0.64

Question 12)

Use the Soldier Performance Spreadsheet to answer question 12.

Forecasting Soldier Performance.xlsx

Rank the outcomes using soldier’s age as the score, with the oldest at the top. A threshold of 24 years represents what point on the ROC Curve?

• .5, .5
• .25, .75
• .33, .67
• .67, .33