Motivate...
In the previous lesson, we learned how to calculate the strength and sign of a linear relationship between two datasets. The correlation coefficient, itself, is not a directly useful metric. The correlation coefficient represents the linear dependence of the two datasets. If the correlation is strong, a linear model would be a good fit for the data, pending an inspection into the data and confirming whether a linear relationship is plausible.
Fitting a model to data is beneficial for many reasons. Modeling allows us to predict one variable based on another variable, which will be discussed in more detail later on. In this lesson, we are going to focus on fitting linear models to our data using the approach called linear regression. We will build off our previous findings in Lesson 4 on the relationship between drought, temperature, and precipitation. In addition, we will work through an example focused on climate - how the atmosphere behaves or changes on a long-term scale, whereas weather is used to describe short time periods [Source: NASA - What's the Difference Between Weather and Climate?(link is external)]. As the world becomes more concerned with the effects of climate and as more companies prepare for the long-term impacts, climate analytic positions will only continue to grow. Being able to perform such an analysis will be beneficial to you and your current or future employers.
Newsstand
- Latham, Ben. (2014, December 9). Weather to Buy or Not: How Temperature Affects Retail Sales(link is external). Retrieved September 13, 2016.
- Tobin, Mitch. (2013, December 3). Snow Jobs: America's $12 Billion Winter Sports Economy and Climate Change(link is external). Retrieved September 13, 2016.
- Bagley, Katherine. (2015, December 24). As Climate Change Imperils Winter, the Ski Industry Frets(link is external). Retrieved September 13, 2016.
- Scott, D., Dawson, J. & Jones, B. Mitig. (2008). Climate Change Vulnerability of the US Northeast Winter Recreation-Tourism Sector(link is external). Retrieved September 13, 2016.
- Vidal, John. (2010, November 26). A Climate Journey: From the Peaks of the Andes to the Amazon's Oilfields(link is external). Retrieved September 13, 2016.
- Gordon, Lewis, and Rogers. (2014, June). A Climate Risk Assessment for the United States(link is external). Retrieved September 13, 2016.
- Kluger, Jeffrey. Time and Space(link is external). Retrieved September 13, 2016.
- Schmith, Torben. (2012, November). Statistical Analysis of Global Surface Temperature and Sea Level Using Cointegration Methods(link is external). Retrieved September 13, 2016.
- Iwasaki, Sasaki, and Matsuura. (2008, April 1). Past Evaluation and Future Projection of Sea Level Rise Related to Climate Change Around Japan(link is external). Retrieved September 13, 2016.
Lesson Objectives
- Remove outliers from a dataset and prepare for a linear regression analysis.
- Break down the steps of a linear regression analysis and describe the process.
- Perform a linear regression in R, plot results, and interpret.
