Assignment 1

  1. (Please read Faraway (2006 or 2016), Chapter 1, before you do this question.) The teengamb data has 47 rows and 5 columns. The data were obtained from a survey conducted to study teenage gambling in Britain. The 5 columns in the data are:

    sex: 0=male, 1=female

    status: Socioeconomic status score based on parents' occupation

    income: in pounds per week

    verbal: verbal score, the number of words out of 12 correctly defined

    gamble: expenditure on gambling in pounds per year

    Use gamble as the response and the other variables as explanatory variables to perform a data analysis. Your data analysis should consist of:

    1. an initial data analysis that explores the numerical and graphical characteristics of the data

    2. an exploration of transformations to improve the fit of the model

    3. diagnostics to check the assumptions of your model

    4. some predictions of future observations for interesting values of the predictors

    5. an interpretation of the meaning of the model (parameters) with respect to the particular area of application

    Notice that there is always some freedom in deciding which methods to use, in what order to apply them, and how to interpret the results. So there may not be one clear right answer, and good analysts may come up with different models.
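
As a rough illustration of what the model-fitting, prediction, and interpretation steps involve, here is a minimal sketch of a least-squares fit in plain Python. It is not a model answer. The Faraway text works in R, so Python is used here only for illustration. The helper names (fit_ols, predict, r_squared) are hypothetical, and the five toy observations are made up for the sketch; they are NOT the teengamb data. The toy response is built exactly as 1 + 2*sex + 3*income, so the fit should recover those coefficients.

```python
def fit_ols(rows, y):
    """Least-squares coefficients for y ~ intercept + predictors (normal equations)."""
    X = [[1.0] + [float(v) for v in r] for r in rows]  # prepend intercept column
    p = len(X[0])
    # Augmented system [X'X | X'y]
    A = [[sum(x[i] * x[j] for x in X) for j in range(p)]
         + [sum(x[i] * yi for x, yi in zip(X, y))]
         for i in range(p)]
    # Gauss-Jordan elimination with partial pivoting
    for c in range(p):
        piv = max(range(c, p), key=lambda r: abs(A[r][c]))
        A[c], A[piv] = A[piv], A[c]
        for r in range(p):
            if r != c:
                f = A[r][c] / A[c][c]
                A[r] = [a - f * b for a, b in zip(A[r], A[c])]
    return [A[i][p] / A[i][i] for i in range(p)]


def predict(beta, row):
    """Fitted value for one observation (row excludes the intercept)."""
    return beta[0] + sum(b * v for b, v in zip(beta[1:], row))


def r_squared(beta, rows, y):
    """Proportion of variation in y explained by the fitted model."""
    ybar = sum(y) / len(y)
    rss = sum((yi - predict(beta, r)) ** 2 for r, yi in zip(rows, y))
    tss = sum((yi - ybar) ** 2 for yi in y)
    return 1.0 - rss / tss


# Made-up toy observations (sex, income); NOT the teengamb data.
rows = [(0, 2.0), (1, 3.0), (0, 5.0), (1, 1.0), (0, 4.0)]
# Toy response constructed exactly as 1 + 2*sex + 3*income.
y = [1 + 2 * s + 3 * inc for s, inc in rows]

beta = fit_ols(rows, y)
print("coefficients (intercept, sex, income):", beta)
print("R^2:", r_squared(beta, rows, y))
print("prediction for sex=1, income=2.5:", predict(beta, (1, 2.5)))
```

In this sketch each slope is read as the change in the fitted response for a one-unit change in that predictor with the other held fixed, which is the kind of statement item 5 asks for. The real analysis would also need the exploratory plots, transformations, and diagnostics that the sketch leaves out.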

  2. In the following examples, distinguish between response and explanatory variables.
    1. Attitude toward abortion on demand (favor, oppose); gender (male, female).

    2. Cholesterol level; heart disease (yes, no).

    3. Race (white, nonwhite); gender (male, female); vote for President (Republican, Democrat, Other); income.

    4. Hospital (A, B); treatment (T1, T2); patient outcome (survive, die).

  3. Identify each variable as nominal, ordinal, or interval.
    1. Political party affiliation (Democrat, Republican, other)
    2. Location of hospital in which data collected (London, Boston, Madison, Rochester, Toronto)
    3. Highest degree obtained (none, high school, bachelor's, master's, doctorate)
    4. Favorite beverage (beer, juice, milk, soft drink, wine, other)
    5. Patient condition (good, fair, serious, critical)
    6. Patient survival (in number of months)
    7. Rating of a movie with 1 to 5 stars, representing (hated it, didn't like it, liked it, really liked it, loved it)
