The final project for the course is a technical blog

 

The final project for the course is a technical blog post related to a data analysis project you will work on piecemeal over the course of the semester. 

The project is very open ended. The objective is to demonstrate your skill in asking meaningful questions of your data and answering them with results of the data analysis using R / Rmarkdown, and that your proficiency in interpreting and presenting the results.  The goal is not to conduct an exhaustive data analysis. The data analysis part should meet the following criteria:

1. Perform exploratory data analysis summarizing your data using descriptive statistics / summary statistics and visualizations relevant to your questions or ones that highlight some interesting insight.

2. Demonstrate at least two of the following techniques we have learned in class and that helps answer your question: PCA, hypothesis testing / confidence interval, regression analysis (linear /logistic) 

Proposal

The first task is to identify the dataset, understand the data and write questions you are planning to answer using that dataset. You may pick a data set from one of the resources mentioned on this webpage (Links to an external site.).  The proposal should meet the following criteria:

1. Perform checks to determine quality of the data (missing values, outliers, etc.)

2. Proposal on what questions you are interested in answering from the data

3. Initial visualizations and if required transform to get the data ready 

A good reference for ideas on questions and EDA in general: https://r4ds.had.co.nz/exploratory-data-analysis.html#questions

More information on the format:

It should be about 2+ pages in length, not exceeding 10 with appendix. It should include roughly the following sections:

1. Background or the context of data selected – sources, description of how it was collected, time period it represents, context in it was collected if available, perhaps why you selected it

2. Description of the data – how big is it (number of observations, variables), how many numeric variables, how many categorical variables, description of the variables

3. Goal – What questions you plan to understand from the data. 

3. Analysis – Descriptive statistics and visualization of key variables 

4. Summary of findings from the analysis and further questions for future analysis

5. References – link to data or analysis sources you have referenced for the report

6. Appendix – all the visualization that does not support your questions directly can go here

Final Write-up

The project should include

1. Introduction: What is your research question? Why do you care? Why should others care? If you know of any other related work done by others, please include a brief description.

2. Data: Include context about the data covering:

a. Data source: Include the citation for your data, and provide link to the source.

b. Data collection: Context on how the data was collected?

c. Cases: What are the cases (units of observation or experiment)? What do the rows represent in your dataset?

d. Variables: What are the variables you will be studying?

e. Type of study: was it an observational study or an experiment?

f. Data clean-up: (Optional) If you had to do any data clean up (missing values, outliers, transformation), include a very brief description of your steps.

3. Exploratory Data Analysis: summarize your data using descriptive statistics / summary statistics and visualizations relevant to your questions or ones that highlight some interesting insight. Additional plots not relevant to your research question can be included in the appendix.

4. Data Analysis: Pick and perform two of the following techniques we have learned in class and that helps answer your question about the dataset: PCA, hypothesis testing / confidence interval, regression analysis (linear /logistic) 

5. Conclusion: Summarize your findings and include a discussion of what you have learned about your data through this project. You may also want to include limitations of your approach and include ideas for possible future work. 

6. References: Include links that you have referenced for this project.

Share This Post

Email
WhatsApp
Facebook
Twitter
LinkedIn
Pinterest
Reddit

Order a Similar Paper and get 15% Discount on your First Order

Related Questions

The renegotiation between the US and Canada was contentious.Discuss

Please write your paper on one of the following topics. 1. In November 2018, Canada, US, and Mexico signed the United States—Mexico—Canada agreement (USMCA). Once ratified by all the three countries, the USMCA will replace the North America Free Trade Agreement (NAFTA) which came into effect in January 1994. The

For this Discussion Board, please complete the following: Risks are

 For this Discussion Board, please complete the following: Risks are the negative or positive events that may occur and impact a project. Risks can impact all areas of project elements, such as the schedule, cost, budget, quality, and deliverable. Based on your knowledge or assigned readings, respond to the following

After reading, the instructor comments and the peer review of

  After reading, the instructor comments and the peer review of your research paper answer the following questions with at least six to seven sentences for each of the prompts below. What changes would you make to improve your paper? Be specific and give some examples. How, did the instructor’s

Dashboard Benchmark Evaluation

Length of report: 2–4 double-spaced, typed pages. Your report should be succinct yet substantive. Please include a title page and a reference list in addition to your 2–4 page report. Number of references: Cite a minimum of 2–4 sources of scholarly, professional, or policy evidence to support your analysis and

Identify where the brand is in the product/brand life cycle

Refer back to the brand chosen for the Week 1 assignment. Identify where the brand is in the product/brand life cycle (introduction, growth, maturity, decline) and support your position with specific examples and research. Create a 700- to 1,050-word marketing strategy based on this position in the life-cycle: Identify appropriate

Discuss the cultural changes in regard to social media and

  Discuss the cultural changes in regard to social media and the growing online technology that has significantly lowered face to face/ human interactions with new generations.  Do the technological benefits outweigh its effects on socialization? You must support your response with scholarly sources in APA format.  Simply stating your

What advice or encouragement do you have for your classmate?

  What advice or encouragement do you have for your classmate? What did you learn about the topic from the visual aid?  Do you have any suggestions for how your classmate could revise their visual?    Although there are several visual aids that can be used during a  presentation, photographs and

kindly find attached the document.  Rasmussen University – NUR2063: Essentials of Pathophysiology Title of

kindly find attached the document.  Rasmussen University – NUR2063: Essentials of Pathophysiology Title of Assignment: Module 01 Written Assignment – Electrolyte Imbalance Purpose of Assignment: Identifying abnormal findings for electrolyte imbalances is an expected function of a healthcare team. Course Competency(s): · Determine the cellular functions required to regulate homeostasis.

Instructions Describe trends in the costs of care for treating

  Instructions Describe trends in the costs of care for treating at least three diseases or conditions. Examine the burden of health care costs on businesses and governments and the extent to which Americans can afford needed care. Analyze trends in the efficiency of care delivery and the competitiveness of

Given the growth in telecommuting and other mobile work arrangements,

 Given the growth in telecommuting and other mobile work arrangements, how might offices physically change in the coming years? Will offices as we think of them today exist in the next ten years? Why or why not?  At least one scholarly source should be used in the initial discussion thread.

Before beginning work on this week’s discussion post, review the

Before beginning work on this week’s discussion post, review the following resources: From the below list, select one topic for which you will lead the discussion in the forum this week. Early in the week, reserve your selected topic by posting your response (reservation post) to the Discussion Area, identifying

The Small Business Administration (SBA) summarizes government resources to assist

The Small Business Administration (SBA) summarizes government resources to assist the small business owner.  When starting a new business, an entrepreneur is faced with numerous business decisions. As a result, planning becomes crucial for business success.  The SBA website provides information about how to write a successful business plan. Go

You are now aware of the importance of building a

You are now aware of the importance of building a diversified portfolio to include foreign securities. One aspect of your international portfolio should be cash. Let’s assume you want to invest in a foreign country for 1 year. Go to Bloomberg (Links to an external site.) (http://www.bloomberg.com) and select the “Markets”

Academic Nurse Educator Role Analysis Essay Nursing Assignment Help

Expert Solution Preview Introduction: As a medical professor responsible for creating assignments and assessing student performance in a medical college, my role involves developing lecture materials, evaluating student understanding through examinations, and providing feedback for improvement. Through these tasks, I aim to facilitate the growth and development of future medical

The research proposal is requirement for this course and will

 The research proposal is requirement for this course and will require students to include an introduction to the problem, a review of literature that provides the theoretical and/or conceptual framework for the assignment, the research questions/hypotheses, data collection methods and limitations of the study.  A minimum of ten peer-reviewed sources

Timmco Case Study Timmco, Inc. is a publicly traded corporation

Timmco Case Study  Timmco, Inc. is a publicly traded corporation located in Denton, Texas that makes and sells high pressure industrial spraying equipment used in all sorts of commercial liquid spraying applications. It prides itself on top quality and promotes its products as “100% made in the USA”.  Sales have