Analysis of Variance and Regression
Question 1
Choice of statistical analysis is based largely on the way in which the variables have been measured. Consider the following variable and identify if it is more likely to be measured on a metric or categorical scale:
Variable: Type of transport (bus, train, tram)
is the answer (please highlight the answer or remove the wrong answer)
Metric or Categorical
Question 2
Choice of statistical analysis is based largely on the way in which the variables have been measured. Consider the following variable and identify if it is more likely to be measured on a metric or categorical scale:
Variable: Cost of ticket ($)
is the answer (please highlight the answer or remove the wrong answer)
Metric or Categorical
Question 3
Choice of statistical analysis is based largely on the way in which the variables have been measured. Consider the following variable and identify if it is more likely to be measured on a metric or categorical scale:
Variable: Was the carriage/tram/bus overcrowded? (Yes/No)
is the answer (please highlight the answer or remove the wrong answer)
Metric or Categorical
Question 4
Choice of statistical analysis is based largely on the way in which the variables have been measured. Consider the following variable and identify if it is more likely to be measured on a metric or categorical scale:
Variable: Time taken (in minutes) to reach destination
is the answer (please highlight the answer or remove the wrong answer)
Metric or Categorical
Question 5
Apart from describing our variables as metric or categorical, we can indicate the level of measurement of said variables. Consider the following variables and select the most appropriate level of measurement:
On a scale from 1 to 5, how would you rate your satisfaction with the service provided?
1 (Very Dissatisfied)   2   3 (Neutral)   4   5 (Very Satisfied)
is the answer (please highlight the answer or remove the wrong answers)
Interval or Ordinal or Nominal or None of the above
Question 6
Apart from describing our variables as metric or categorical, we can indicate the level of measurement of said variables. Consider the following variables and select the most appropriate level of measurement:
Make of car (Holden, Ford, Mitsubishi, Other)
is the answer (please highlight the answer or remove the wrong answers)
Interval or Ordinal or Nominal or None of the above
Question 7
The following SPSS output was produced:
Choose the most appropriate statement from one of the following options:
is the answer (please highlight the answer or remove the wrong answers)
It is appropriate to use Pearson’s r.
or
It is not appropriate to use Pearson’s r because the relationship is curved.
or
It is not appropriate to use Pearson’s r because there are outliers.
Question 8
Give the regression coefficient (slope) correct to three (3) decimal places.
Answer =
Question 9
What is the best interpretation of the regression coefficient?
is the answer (please highlight the answer or remove the wrong answers)
For each additional hour of sleep, on average, people made 2.72 fewer errors.
or
For each error made, on average, people had 2.72 hours less sleep.
or
For each additional 2.72 hours of sleep, on average, people made 1 error more.
or
For each additional 2.72 errors, on average, sleep was reduced by 1 hour.
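As a rough illustration of how a regression slope is read, the sketch below fits a least-squares line to invented sleep and error values. These are not the SPSS data from this question, which are not reproduced here; the variable names and numbers are assumptions chosen so that the arithmetic is easy to follow.

```python
# Hypothetical data: hours of sleep (predictor) and number of errors (response).
# These values are invented for illustration only.
hours = [4.0, 5.0, 6.0, 7.0, 8.0]
errors = [20.0, 17.5, 15.0, 12.5, 10.0]


def least_squares(x, y):
    """Return (slope, intercept) of the ordinary least-squares line y = a + b*x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-products and sum of squares about the means
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


slope, intercept = least_squares(hours, errors)
print(f"slope = {slope:.3f}, intercept = {intercept:.3f}")
```

The slope is the average change in the response for each one-unit increase in the predictor, so its units are "response units per predictor unit" (here, errors per hour of sleep).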
Question 10
What is the best interpretation of the 95% confidence interval for the correlation?
is the answer (please highlight the answer or remove the wrong answers)
We can be 95% confident that the strength of the correlation between amount of sleep and number of errors is between -0.54 and -0.75.
or
We can be 95% confident that the proportion of errors made is between -0.54 and 0.75.
or
We can be 95% confident that the sample correlation between amount of sleep and number of errors is between -0.54 and 0.75.
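One common way to obtain a confidence interval for a correlation is the Fisher z transformation; whether SPSS used this method for the output in this question is an assumption. The sketch below uses a hypothetical sample correlation and sample size, not values taken from the output.

```python
import math


def correlation_ci(r, n, z_crit=1.96):
    """Approximate CI for a population correlation using Fisher's z transformation."""
    z = math.atanh(r)               # transform r to the z scale
    se = 1.0 / math.sqrt(n - 3)     # standard error on the z scale
    lower = math.tanh(z - z_crit * se)  # back-transform the limits to the r scale
    upper = math.tanh(z + z_crit * se)
    return lower, upper


# Hypothetical sample correlation and sample size (illustration only)
r_sample, n_sample = -0.65, 100
lower, upper = correlation_ci(r_sample, n_sample)
print(f"95% CI for the population correlation: ({lower:.2f}, {upper:.2f})")
```

The interval describes plausible values for the population correlation; the sample correlation itself is a known number and needs no interval.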
Question 11
A convenient way to identify the response codes used in this data set is to open the SPSS file in SPSS and use the Utilities-Variables command sequence. This command will provide a window where each variable’s label, value codes, and value labels can be seen. The “measure” aspect of each variable has been identified as “nominal” or “scale,” with “scale” pertaining to interval or ratio scaling assumptions.
1. Determine what variables are categorical (either nominal or ordinal scales), perform the appropriate descriptive analysis, and interpret it.
2. Determine what variables are scale variables (either interval or ratio scales), perform the appropriate descriptive analysis, and interpret it.
3. What are the population estimates for each of the following?
a) Preference for “easy listening” radio programming
b) Viewing of 10 p.m. local news on TV
c) Subscribe to City Magazine
d) Average age of heads of households
e) Average price paid for an evening meal entrée
4. Because Jeff Dean’s restaurant will be upscale, it will appeal to high income consumers. Jeff hopes that at least 25% of the households have an income level of $100,000 or higher. Test this hypothesis.
5. With respect to those who are “very likely” to patronize the Hobbit’s Choice Restaurant, Jeff believes that they will either “very strongly” or “somewhat” prefer each of the following:
(a) wait staff with tuxedos,
(b) unusual desserts,
(c) large variety of entrees,
(d) unusual entrees,
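The income hypothesis in item 4 can be examined with a one-sample z test for a proportion. The sketch below is a minimal version of that test; the counts are hypothetical (they do not come from the survey file), and framing the test as H0: p >= 0.25 against H1: p < 0.25 is an assumption about how Jeff's hope should be translated into hypotheses.

```python
import math


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def proportion_z_test(successes, n, p0):
    """One-sample z test for a proportion; returns (p_hat, z, lower-tail p-value)."""
    p_hat = successes / n
    se = math.sqrt(p0 * (1.0 - p0) / n)   # standard error under the null value p0
    z = (p_hat - p0) / se
    p_lower = normal_cdf(z)               # P(Z <= z), evidence that p is below p0
    return p_hat, z, p_lower


# Hypothetical counts: 88 of 400 households report income of $100,000 or higher
p_hat, z, p_value = proportion_z_test(successes=88, n=400, p0=0.25)
print(f"p_hat = {p_hat:.3f}, z = {z:.2f}, one-sided p = {p_value:.3f}")
```

A small lower-tail p-value would count as evidence that fewer than 25% of households reach the income level; a large one means the data are compatible with Jeff's target.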
Question 12
Measures of Central Tendency and Dispersion with SPSS
To prepare for this Application:
Review Chapter 15 and Appendix D in the course text Research Methods in the Social Sciences.
Review the video programs for this week, located in the Learning Resources.
Review Lessons 20 and 21 in the course text Using SPSS for Windows and Macintosh: Analyzing and Understanding Data.
Access the gss04student_corrected dataset in the Course Information area of the classroom to use for this Application.
Select one variable that is measured as a continuous or metric variable (age, Likert scale item, etc.) and one that is measured on a nominal scale (marital status, ethnicity, etc.).
The assignment:
State the statistical assumptions of this test.
Using the data set and variables you have selected, use SPSS to calculate the following:
Mean, Median, Mode
Range
Minimum
Maximum
Standard Deviation
Generate syntax and output files in SPSS. You will need to copy and paste these into your Application document.
Use one kind of chart (any kind) to describe the data.
Based on your SPSS analysis, craft up to a one-page double-spaced write up of the statistical results (include any additional pages needed for any APA tables or graphs and the SPSS syntax and output).
Your report should include:
SPSS syntax and output files
1 chart
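The assignment itself requires SPSS, but the same summary measures can be cross-checked by hand or in another tool. The sketch below uses Python's standard library on invented values; the `ages` and `marital` lists are hypothetical and are not drawn from the gss04student_corrected dataset.

```python
import statistics
from collections import Counter

# Hypothetical metric variable (age in years), for illustration only
ages = [23, 35, 35, 41, 52, 29, 35, 60, 47, 41]

summary = {
    "mean": statistics.mean(ages),
    "median": statistics.median(ages),
    "mode": statistics.mode(ages),
    "minimum": min(ages),
    "maximum": max(ages),
    "range": max(ages) - min(ages),
    "std_dev": statistics.stdev(ages),  # sample standard deviation (n - 1 denominator)
}
for name, value in summary.items():
    print(f"{name}: {value}")

# Hypothetical nominal variable: only counts and the mode are meaningful
marital = ["married", "single", "married", "divorced", "single", "married"]
frequencies = Counter(marital)
print(frequencies.most_common())
```

For the nominal variable, a frequency table and the modal category are the appropriate summaries; a mean or standard deviation of category codes would not be interpretable.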
Question 13
This assignment aims to develop an understanding of various qualitative and quantitative research methodologies and
techniques. Its other general purposes are:
1. Explain how statistical techniques can solve business problems
2. Identify and evaluate valid statistical techniques in a given scenario to solve business problems
3. Explain and justify the results of a statistical analysis in the context of critical reasoning for business
problem solving
4. Apply statistical knowledge to summarize data graphically and statistically, either manually or via a
computer package
5. Justify and interpret statistical/analytical scenarios that best fit the business solution
Question 14
The article “Measuring and Understanding the Aging of Kraft Insulating Paper in Power Transformers” contained the following observations on degree of polymerization for paper specimens for which viscosity times concentration fell in a certain middle range:
340 344 356 386 332 402 362 322 318 360 362 354 340 372 338 375 364 355 324 370
Assume that these samples constitute a random sample, and that the population is normally distributed.
(a) Calculate a two-sided 95% confidence interval for true average degree of polymerization.
(b) Does the interval suggest that 440 is a plausible value for true average degree of polymerization? Why?
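With a normal population and unknown variance, the usual interval is the one-sample t interval, mean ± t × s/√n. The sketch below applies it to the 20 observations listed in the question; the critical value 2.093 is the 0.975 quantile of the t distribution with 19 degrees of freedom, taken from a standard t table rather than computed.

```python
import math
import statistics

# Degree-of-polymerization observations as listed in the question
dp = [340, 344, 356, 386, 332, 402, 362, 322, 318, 360,
      362, 354, 340, 372, 338, 375, 364, 355, 324, 370]

n = len(dp)
mean = statistics.mean(dp)
sd = statistics.stdev(dp)            # sample standard deviation (n - 1 denominator)
se = sd / math.sqrt(n)               # standard error of the mean

T_CRIT_19DF = 2.093                  # t table value: 0.975 quantile, 19 degrees of freedom
margin = T_CRIT_19DF * se
lower, upper = mean - margin, mean + margin

print(f"n = {n}, mean = {mean:.1f}, sd = {sd:.2f}")
print(f"95% CI: ({lower:.1f}, {upper:.1f})")
print("440 inside interval:", lower <= 440 <= upper)
```

A value is considered plausible for the true mean at this confidence level only if it lies inside the interval, so part (b) reduces to checking whether 440 falls between the two limits.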
Question 15
Your task is to write the statistical methods and results sections of a hypothetical manuscript that
focuses on health data in a sample of individuals who participated in the Framingham Heart Study
which commenced in 1948 in the community of Framingham, Massachusetts, USA. The SPSS data set
you are required to analyse and the specific questions you need to answer are available on the
Assessment 2 page of Blackboard. Please note this is an individual assignment and that you must be
logged into Blackboard under your username to access the assessment tasks and data set which
have been assigned to you. The data coding manual for the data set is provided at the end of this
document.
You will be graded on both your analysis and interpretation, so you should avoid simple yes/no
answers to the questions. Rather, explain how you reached your answer citing any relevant results.
Only statistical methods taught in this unit should be used in the assessment. Your grade will be
negatively affected if alternate statistical methods are used. All questions can be answered using
the methods taught.
There are eight questions/tasks which need to be completed, plus one optional question. The
optional question must be completed in order to receive a grade of 6 or 7 for this assessment.
However, completing the optional question does not guarantee you will receive a grade of 6 or 7.