1.
The file P02_01.xlsx indicates the gender and nationality of the MBA incoming class in two successive years at the Kelley School of Business at Indiana University.

For each year, create tables of counts of gender and of nationality. Then create column charts of these counts. Do they indicate any noticeable change in the composition of the two classes?

Repeat part a for nationality, but recode this variable so that all nationalities that have counts of 1 or 2 are classified as Other.

2.
The file P02_02.xlsx contains information on 256 movies that grossed at least $1 million in 2017.

Create two column charts of counts, one of the different genres and one of the different distributors.

Recode the Genre column so that all genres with a count of 10 or less are lumped into a category called Other. Then create a column chart of counts for this recoded variable. Repeat similarly for the Distributor variable.

3.
The file P02_03.xlsx contains data from a survey of 399 people regarding a government environmental policy.

Which of the variables in this data set are categorical? Which of these are nominal; which are ordinal?

For each categorical variable, create a column chart of counts.

Recode the data into a new data set, making four transformations: (1) change Gender to list “Male” or “Female”; (2) change Children to list “No children” or “At least one child”; (3) change Salary to be categorical with categories “Less than $40K,” “Between $40K and $70K,” “Between $70K and $100K,” and “Greater than $100K ” (where you can treat the breakpoints however you like); and (4) change Opinion to be a numeric code from 1 to 5 for Strongly Disagree to Strongly Agree. Then create a column chart of counts for the new Salary variable.

4.
The file P02_04.xlsx contains salary data on all Major League Baseball players for each year from 2012 to 2018. For any three selected years, create a table of counts of the various positions, expressed as percentages of all players for the year. Then create a column chart of these percentages for these years. Do they remain fairly constant from year to year?

Published by
superadmin
View all posts