Variables, hypotheses, and research questions

Empirical and normative social research

  • Studies what is. It describes and explains the social world as it exists using verifiable evidence.

    • "Does the death penalty deter crime rates?" (Measured by statistics)

      • We collect data or use existing data in empirical social research.

Normative social research (values):

  • Studies what it should be. It evaluates the social world based on values, beliefs, and ethical standards.

    • "Is it morally right for the state to execute criminals?" (Debated by ethics)

      • We do not need any kind of data in normative social research.

Empirical social research and data

Empirical social research uses data to examine potential relationships between social concepts.

  • To study the potential relationships between variables, we must have data that specifically includes those social concepts. It’s because we want to find a justification from the real-world data.

  • To be able to use a social concept (variable) in empirical social research, we must have a data that includes those variables:

    • Primary data (self-collected – surveying and interviewing):

      • If you interview or survey people for a project examining “How frequency of social media use affects attention span,” you must directly ask questions related to “frequency of social media use” and “attention span.”

        • You gather this through interviews (qualitative) or surveys (quantitative) using questions you prepare.

          • Here you go out and ask questions to people.

    • Secondary data (collected or created by someone else):

      • If you analyze existing textual data from social media or survey data collected by other researchers for a project examining “How frequency of social media use affects attention span,” that data must actually include information related to “frequency of social media use” and “attention span.”

        • You obtain this through existing surveys, data archives, government records, or posts from social media platforms (e.g., Reddit, YouTube).

          • This is a desk-based job, and you do not ask question to people.

Variables

  • Empirical social research uses data to examine potential relationships between social concepts such as;

    • How experienced discrimination affects a sense of belonging.

      • In research, these social concepts are called “variables.”

  • A variable is any characteristics, number, or quantity that can be measured, observed, or counted. Any piece of information we know about our subjects (e.g., individuals).

    • Dependent variables: Dependent variable is the main topic that we investigate. It is the outcome: What is being affected? What’s being changed? You SHOULD have only ONE dependent variable. Because you have one main topic. Such as;

      • Dependent variable: Return migration

    • Independent variables: Independent variables are the factors through which we attempt to explain and understand our dependent variable (main topic). You SHOULD have more than ONE independent variables. Because you have one main topic; We can’t explain a social issue through only one factor. Such as;

      • Independent variable 1: Ethnic identification

      • Independent variable 2: Contact level with natives

      • Independent variable 3: Discrimination

Hypotheses

Hypothesis: Tentative predictions we have about what we are going to discover before we begin the research. Educated guesses by following the directions of the literature. The proposed links between independent variables and the dependent variable, such as;

  • Hypothesis 1: I expect to find that ethnic identification (Independent variable 1) increases return migration (Dependent variable)

  • Hypothesis 2: I expect to find that contact level with natives (Independent variable 2) decreases return migration (Dependent variable)

  • Hypothesis 3: I expect to find that discrimination (Independent variable 3) increases return migration (Dependent variable)

As there are three independent variables, there are three hypotheses.

  • So, we develop a hypothesis for the proposed relationship between each independent variable and the dependent variable.

  • Hypotheses MUST include:

    • An independent variable and the dependent variable and

    • How the independent variable influences or changes the dependent variable (in a positive or negative direction). It cannot be both “increases” and “decreases” or “depends.” It CANNOT be both “positively affects” and “negatively affects”. You need to choose one.

Hypotheses should be in this format:

  • I expect to find that independentvariable1 (choose one, delete the rest: increases OR decreases OR positively affects OR negatively affects) dependentvariable

  • I expect to find that independentvariable2 (choose one, delete the rest: increases OR decreases OR positively affects OR negatively affects) dependentvariable

  • I expect to find that independentvariable3 (choose one, delete the rest: increases OR decreases OR positively affects OR negatively affects) dependentvariable

    • Hypothesis 1: I expect to find that ethnic identification (Independent variable 1) increases return migration (Dependent variable)

    • Hypothesis 2: I expect to find that contact level with natives (Independent variable 2) decreases return migration (Dependent variable)

    • Hypothesis 3: I expect to find that discrimination (Independent variable 3) increases return migration (Dependent variable)

Research question

Research question: An answerable inquiry into a specific concern or issue. Research question is what the research attempts to answer.

  • Research question: To what extent do ethnic identification (Independent variable 1), contact level with natives (Independent variable 2), and discrimination (Independent variable 3) influence return migration (Dependent variable)?

Phrasing variables

  • Variables are the main elements of hypotheses where we propose causal relationships;

    • A increases B, A decreases B, A positive affects B, or A negatively affects B.

  • Therefore, phrase your variables so they;

    • (1) are specific and anyone can understand exactly what you mean,

    • (2) have high and low values - all variables must be able to increase or decrease, or be positively or negatively affected by other variables.

    • (3) show causality - variables should make sense in a hypothesis. Independent variables must be able to increase (positively affect) or decrease (negatively affect) the dependent variable so it shows causality.

      • Hypothesis 1: I expect to find that family involvement positively affects academic performance.

The three-step test

  • Step 1: The “What about it?” test (Check for specificity):

    • A variable is not a topic; it is a specific attribute of that topic.

    • The Test: Look at your variable. Does it force you to ask, “What about it?”

      • Example: You write “Social media.”

      • Question: “What about social media? Do you mean the time spent? The platform? The addiction?"

      • Fix: If you have to explain it, rewrite it.

        • For example, “Frequency of Social Media Use.”

  • Step 2: The “Higher or lower?” test (Check for measurement):

    • A variable must be able to vary (increase, decrease, positively affect(ed), negatively affect(ed)).

    • The Test: Can you meaningfully ask, “Is this variable higher or lower for this person/group?”

      • Example: You write “Social media.”

      • Question: “Is social media higher for youth?”

      • Fix: If it doesn’t make sense, rewrite it.

        • For example, “Level of social media addiction.”

  • Step 3: The “Is there a causality?” test (Hypothesis fit):

    • Variables must be phrased so it can logically “increase,” “decrease,” “positively affect(ed),” or “negatively affected”

      • (e.g., “I expect to find that X increases Y”).

    • The Test: Put your variables into the hypothesis template and read, “Is there a meaningful causality here?”

      • Hypothesis template: “I expect to find that independentvariable1 (choose one, delete the rest: increases OR decreases OR positively affects OR negatively affects) dependentvariable”

        • Example: You write “I expect to find that social media decreases attention span.”

      • Question: “Is there a meaningful causality here?”

      • Fix: If it doesn’t make sense, rewrite it.

        • For example, “I expect to find that using short video-based social media platforms decreases attention span.”

Bad variables, good variables

#
Bad variable
Why?
Good variable

1

“Political attitudes”

The nature/direction of the variable is not specified. Doesn’t increase (positively affect(ed)) or decrease (negatively affect(ed)).

Progressive political attitudes” OR

“Conservative political attitudes”

2

”Culture” OR “Globalization”

Too vague. What about it? Culture (of globalization) affects everything and is being affected by everything. Imagine you ask questions to respondents, or you read a social media post and you’ll say, “this is culture (or globalization).”

“Collectivistic culture” OR

“individualistic culture” OR

“Support for free trade”

3

“Race” OR

“Gender” OR

“Marital status” OR

“Employment status”

These are categories. You cannot say “Race (or Gender) increases X.” You must specify if you mean the identification or mean something else.

Perhaps, you mean “presence of a condition,” such as “being employed.” While the following doesn’t make sense: “Employment status increases financial security,” this makes sense: “Being employed increases financial security.”

"Strength of racial identification" OR

“Racism” OR

“Sexism” OR

“Gender wage gap” OR

“Being woman” OR

“Being Black” OR

“Being married” OR

“Being employed”

4

"Resources" OR

“Life conditions”

Too broad. Are we talking about money, friends, time, or information? It is impossible to measure “Resources” or “Life conditions” in general. Do everyone understand the same thing when we say “Resources” or “Life conditions”?

“Level of household income” (Financial) OR

“Size of social support network” (Social) OR

“Quality of housing conditions” OR

“Perception of neighborhood safety”

5

"Social relationships"

No specific attribute. Everyone has relationships. The variable must measure a specific attribute of those relationships (how good they are or how often they happen).

“Quality of social relationships” OR

“The number of friends” OR

“Frequency of social interaction”

6

“Quality of social relationships and the number of friends”

Double-barreled. There are two separate variables here. You need to focus on only one.

“Quality of social relationships” OR

“The number of friends”

7

“Education”

It is an institution, not a variable. Do you mean the time spent there, the degree earned, or the quality of the school?

“Years of formal schooling” OR

“Level of educational attainment”

8

“Climate change”

This is a global issue, not a variable. You usually measure people’s reactions to it or their knowledge about it.

“Concern about climate change” OR

“Support for environmental regulations”

9

“Crime”

Ambiguous. Does this refer to committing a crime, fearing crime, or being a victim? "Crime" is just a topic.

“Fear of crime” OR

“Frequency of criminal behavior” OR “Risk of victimization”

Last updated