Week 3 Assignment: Getting Started With Your Data
Overview: Throughout this course, you will be working toward your final assignment—an archival data research project. What does that mean? It means you will be using real data, collected as part of the General Social Survey (GSS), to answer a real question.
To help you complete this final assignment, these earlier assignments will walk you through the steps. In fact, to provide added assistance (and maybe even a sense of fun), you will have three fictional research assistants.
Meet your research assistants:
Juanita: She is junior in college with the intention of being a counselor. She’s more interested in the results of the study than in the process to get the results.
Duante: Duante is a senior in college who loves algebra but is skeptical about statistics. However, he wants a job where he can do research and is very interested in what you are doing.
Amanda: Amanda is a first-year student in college. She’s friendly and likes to talk a lot (and ask plenty of questions). She hasn’t been through her statistics course yet, but she is great at staying organized and loves using Excel.
Directions: Complete all six parts of this worksheet.
PART 1: GETTING STARTED
The General Social Survey (GSS) has been studying America, specifically American society, for 50 years. All of their data are available to the public (and to researchers). This makes GSS a great source for archival data projects. The surveys are long, providing a number of potential variables. For this project, certain variables have been chosen for you and the data extracted from the GSS site ahead of time. If you’re curious and want to learn more about GSS, feel free to check out the About the GSS web page.
Directions: Complete the steps below. These steps will prepare you for later portions of this assignment as well as for your final assignment in Week 9.
Step 1: Choose your variables, one from each column and fill in the white blocks of the table below.
List A (Choose ONE variable from this list) |
List B (Choose ONE variable from this list) |
RACLIVE. Have other race living in neighborhood. This is a Yes/No question asking respondents if they live in a neighborhood with people of another race. |
HAPPY. General happiness. This asks the respondents to rate how happy they are (small Likert scale). |
NEWS. How often does respondent read newspaper. This question asks respondents to rate how often they read the newspaper (Likert scale). |
LIFE. If life is exciting or dull. This question asks respondents to rate their life as exciting, routine, or dull (small Likert scale). |
WWWHR. Internet hours per week. This question asks respondents to share how many hours in a week they use the Internet for non-email activities. |
MNTLHLTH. Days of poor mental health past 30 days. This question asks respondents how many days of poor mental health they’ve had in the past 30 days. |
DEPRESS. Told have depression. This is a Yes/No question asking respondents if they have been told they have depression. |
|
My Variable (from List A): |
My Variable (from List B): |
Note: Once you’ve selected your variables, use them throughout the course. Changing variables partway through will require you to re-do work you’ve already done.
What’s your preliminary research question? Note: The survey method used to collect these data was not designed to allow you to determine causation or the effect of one variable on the other. |
Step 2: Download data. Go to Week 3 Prepare: For This Week’s Assignment activity. Download the “GSS 2018 Data” Excel file. Make sure you save this somewhere on your computer (where you can find it again).
Step 3: Clean the data. In this case, Amanda helped you out and, if you look at the Excel file you downloaded, the data have been cleaned for you. She spent time going through the data to make sure there were results for both variables you will use and remove those participants who did not respond to both questions. She also made sure that the data were in a form that JASP could read accurately (right labels, right file format, et cetera). She’s a bit of an overachiever, so she cleaned everyone’s data. On the Excel spreadsheet, you will want to make sure you are on the right tab for your project. To understand what answer the numbers correspond to, look at the Codes tab.
Amanda’s Notes on Naming:
|
Step 4: Save your .csv file. Duante didn’t want to be outdone by Amanda, so he created .csv files of everyone’s data project. To find them, open the files in the Week 3 Prepare: For This Week’s Assignment activity and locate the data for your project. He used the same naming convention as Amanda did above. Make sure you save these files somewhere on your computer (where you can find them again).
PART 2: UNDERSTANDNG MEASURES OF CENTRAL TENDENCY
Amanda is confused by the measures of central tendency and has come to you asking questions.
Directions: Answer all questions in the table.
Scoring Criterion: Describe key statistical concepts.
What types of variables can you compute a useful mean for? Select as many options as apply. |
☐Nominal ☐Ordinal ☐Interval ☐Ratio |
When would a mean be useful? |
|
Now what about the mode – what types of variables can you find a mode for? Select as many options as apply. |
☐Nominal ☐Ordinal ☐Interval ☐Ratio |
When is a mode useful? |
|
Then the median – what types of variables can you find the median for? Select as many options as apply. |
☐Nominal ☐Ordinal ☐Interval ☐Ratio |
And when would use the median (and not the mean or mode)? |
|
What does the variance tell us? |
|
Why is the variance important? |
|
Consider the demographic data on our participants. Where might you use measures of central tendency? |
|
PART 3: YOUR DATA
You’ve sent Amanda off to think more about measures and central tendency. Working with Duante, you start to consider your variables. He wants to know what the data type for each available variable is.
Directions: Fill in the table below.
Scoring Criterion: Determine data type.
What data type is our data (for JASP)? Note: JASP offers you three options: Nominal, Ordinal, Continuous. So, if the data is ratio or interval, call it continuous for the purpose of this table.
Variable |
Type |
Variable |
Type |
Variable |
Type |
AGE |
|
RACE |
|
SEX |
|
RACLIV |
|
NEWS |
|
WWWHR |
|
HAPPY |
|
LIFE |
|
MNTLHLTH |
|
DEPRESS |
|
NOTES:
Age: Respondents entered their exact age
Race: Respondents had a choice of: White, Black, or Other (The survey uses limited options, which is a limitation for modern studies but useful when comparing modern data with historical data.)
Sex: Respondents had a choice of: Male, Female (Based on biological sex at birth; this is not a variable looking at gender.)
Racliv: Respondents were given a Yes/No question on whether they lived in a neighborhood with other races.
News: Respondents were given a version of a Likert scale.
Wwwhr: Respondents entered the number of hours they are on the Internet each week.
Happy: Respondents were given a version of a Likert scale.
Life: Respondents were given a version of a Likert scale.
Mntlhlth: Respondents entered the numbers of days of poor mental health.
Depress: Respondents were given a Yes/No question on whether they had been told they had depression.
In JASP, select the three blue bars, then select open, then select the location you saved the .csv file.
JASP tries to guess at it and will use the symbols below.
You and Duante look at JASP’s guesses.
- If JASP got it wrong, correct it.
- Click on the appropriate variable data icon in the column title to change it to the correct format.
Paste a screenshot of your data column in JASP below.
PART 4: DATA ON YOUR PARTICIPANTS
Continuing to work with Duante, you decide to learn more about your participants.
Duante has done some research into GSS and provides you with the following information:
The General Social Survey uses random sampling to get a representative sample of adults across the United States (NORC, 2019).
Reference:
National Opinion Research Center (NORC). (2019). Appendix A: Sampling design and weighting. In General Social Surveys 1972–2018: Cumulative Codebook (pp. 3171–3189). https://gss.norc.org/documents/codebook/GSS_Codebook_AppendixA.pdf
Before you start looking at your participants, Juanita has a few questions for you.
Directions: Answer the questions below.
Scoring Criterion: Explain the use of a mean with different types of variables.
Can you use a mean for the variable Sex? Why or why not? |
|
Can you use a mean for the variable Race? Why or why not? |
|
Can you use a mean for the variable Age? Why or why not? |
|
Now that Juanita has a better understanding of the mean, the two of you start looking at your demographic data.
- Open JASP.
- Then open your .csv data file for your project. You can do this by clicking the three blue bars, selecting Open, then select Computer, then choose your file from wherever you saved it on your computer.
- In JASP, in your data file, select Descriptives.
Select age, then select the
Copy and paste the resulting table below.
PART 5: GRAPHIC DISPLAY OF YOUR PARTICIPANTS
Juanita likes to process information visually and asks if you can create graphs or charts of the data.
Directions: Create a bar graph, pie chart, and frequency table by following the directions below.
Scoring Criterion: Communicate statistical data in graphs and tables.
In JASP, in your data file, select Descriptives.
Select sex, then select the
Select race then select the
- Click Basic Plots, then put a check next to Distribution plots.
Copy and paste or take a screenshot of the Bar Graph for Race. Place it below.
- Next, Click Basic Plots, then put a check next to Pie Charts.
Copy and paste or take a screenshot of the Pie Chart for Sex. Place it below.
- Lastly, click Tables, then put a check next to Frequency Tables.
Copy and paste or take a screenshot of the Frequency Chart for Race. Place it below.
PART 6: LOOKING TO THE FUTURE
As you look over the participants, Amanda, Juanita, and Duante discuss what careers they want in the future. They ask you what your thoughts are and whether you’d ever consider a job that involves data analysis.
Directions: Answer all of the questions below.
Scoring Criterion: Discover career contingencies based on accurate self-assessment of abilities, achievement, motivation, and work habits.
Step 1: Statistics and data analysis are marketable job skills. Search the Internet for jobs you could apply for with a bachelor’s degree that require the use of statistics. Some good, key search terms: psychology research assistance or survey data analysis.
Step 2: Answer the following questions.
What is the job title? |
|
Provide a link to the job ad. |
|
What are the educational requirements? |
|
What other job requirements stand out to you? |
|
Does the job sound interesting to you? Why or why not? |