Lab 07 - Exploring Case Study 02

Published

December 1, 2023

Introduction

This week in class we’ve been discussing data related to predicting annual air polution. You’ve seen the data in class and maybe have started to work with it, but now is your chance to really fully get to understand the dataset. While the case study report will be submitted in groups, this will be submitted individually.The idea is that while you are still encouraged to work together during lab, you and your groupmates may come up with separate ideas. This will allow you to have more ideas when you get together and start working on the case study together.

Getting started

To get started, accept the lab07 assignment (link on Canvas), clone the repo (using SSH) into RStudio on datahub. Update the author name at the top of the .Rmd file in the YAML to be your name. And, then you’re ready to go!

Packages

In this lab we will work with the tidyverse package. Be sure to load that in before continuing with the lab. You are allowed to load in additional packages for this lab if needed.

The data

In this lab you will first read in the data, which has been provided to you in an RData file in the data folder: pm25_data.csv. This is the dataset we worked with in class.

Exercises

Part 1: Exploratory Data Analysis (EDA)

Create at least two (2) visualizations or tables that help you learn more about these data beyond what was presented in class. (This is intentionally vague. We want you to look at the data and figure out what would be most helpful to visualize from the provided data. These could be different variables than what we looked at in class. Data could be faceted. Something totally different!) These do not have to be fully polished visualizations, but it should be clear from the visualization and accompanying text what’s to be learned from the visualization.

Part 2: Possible extensions?

Think about the data you have access to, the EDA/analysis presented in class, and the questions we said we’re going to address. What possible extensions to this analysis would you be interested in carrying out? This is a space for brainstorming. Include any possible thoughts you have here, even if they aren’t “good” or you aren’t sure if they are “possible.” This can be used as a jumping off point for when you start discussing analysis extensions with your group.