Using the LIWC2015 Dataset
Generate basic statistics about the dataset using functions like "summary()", execute "mean()" and "sd()" on the Wordcount and WPS columns
Find the columns with the least/greatest correlation
Generate linear regression plots for the top 3 correlations you find
Describe 2 predictions you might be able to make about the use of language in psychology textbooks over the next decade
Create an R-Markdown of all your work, generate it to an HTML, PDF or MS Word document and submit the output
Remember to include information ABOUT your configuration using functions like "sessionInfo()" as part of your R-Markdown
Find and include definitions for Residuals vs Fitted, Normal Q-Q, Scale-Location, Residuals vs Leverage as part of your markdown before the plots.
Are there any multiply correlated columns in the dataset - where 3 or even 4 columns track together with a strong correlation? Read the "help(lm)" file in RStudio for help and insight into how to accomplish this task in particular right after the "Details" section
If R-Markdown is not working for you an acceptable alternative is for you to create a PowerPoint using screen-shots however this is a fallback option, NOT the preferred solution.