Data science
2 years ago
15
assignmentdatascience.docx
assignmentdatascience.docx
Note: Read the resources posted in week 12 folder and the chapter 7 of the ebook in the week 12 folder (Python for Data Analysis).
Solve the following problems 1 to 3, work with the Nutrition_subset data set. The data set contains the weight in grams along with the amount of saturated fat and the amount of cholesterol for a set of 961 foods. Use Python.
1. The elements in the data set are food items of various sizes, ranging from a teaspoon of cinnamon to an entire carrot cake.
a. Sort the data set by the saturated fat (saturated_fat) and produce a listing of the five food items highest in saturated fat.
b. Comment on the validity of comparing food items of different sizes.
2. Derive a new variable, saturated_fat_per_gram, by dividing the amount of saturated fat by the weight in grams.
a. Sort the data set by saturated_fat_per_gram and produce a listing of the five food items highest in saturated fat per gram.
b. Which food has the most saturated fat per gram?
3.Derive a new variable, cholesterol_per_gram.
a. Sort the data set by cholesterol_per_gram and produce a listing of the five food items highest in cholesterol fat per gram.
b. Which food has the most cholesterol fat per gram?
Solve the following problems 4 to 6, work with the adult_ch3_training data set. The response is whether income exceeds $50,000. Use Python.
4. Add a record index field to the data set
5. Determine whether any outliers exist for the education field.
6. Do the following for the age field. a. Standardize the variable. b. Identify how many outliers there are and identify the most extreme outlier.
- Consumer behaviour home work
- statistics
- This week we'll discuss marginal revenue, marginal cost, and marginal benefit, all of which is known as Marginal Analysis. This study examines the response of incremental changes in a cost/benefit format. Is there a better way to determine the most optima
- For Prof Geek Only: #4
- IT Help
- You are trying to determine the statistical significance of an equation. Given the following
- Having developed the null and alternative hypotheses in the previous module, write a 2- to 3-page essay in which you: Identify a test statistic to help you assess the evidence against the null hypothesis you developed in the previous module. Explain why
- Suppose that a 2012 National Health Interview Survey gives the number of adults in the United States which gives the number of adults in the United States (reported in thousands) classified by their age group, and whether or not respondents have ever been
- In house or Outsourcing
- Statistics 2