The ethics police
Right or wrong, we test along
Valid or Void?
Measure Up!
Reliability Rumble
Watch and React
Trust Your Gut (Or Not)
Chp 7-1
Chp 7 -2
100

The Belmont Report outlines three main ethical principles. What are they?

Respect for persons, beneficence, and justice.

100

The Tuskegee Study involved 400 Black men who were denied treatment and not told the true purpose of the study. What ethical principles were violated in this case?

Lack of informed consent (Respect for Persons) and violation of Justice

100

A researcher is conducting a study on life satisfaction. They ask participants to fill out a survey where they rate their life satisfaction from 1 to 7 on various statements. What type of measurement is the researcher using?

Self-report measure.

100

Which type of variable would be used to categorize individuals into male, female, or non-binary in a psychological study?

Categorical (nominal) variable.

100

A new assessment claims to measure leadership skills, but many participants feel that the test items, which focus on basic math problems, don’t seem to match what they expect leadership to involve. What kind of validity is likely to be questioned based on this perception?

Face validity.

100

A survey asks, "How often do you use social media?" but doesn’t clearly define what "often" means. How might this affect the survey's results?

It may harm construct validity, as respondents might interpret "often" differently, leading to unclear or inconsistent data.

100

In a study on academic habits, students are asked how many hours they study each week. Knowing their responses might influence future school policies, many students report studying much more than they actually do. What kind of measurement is being used here, and what’s a major concern with using it in this context?

This is a self-report measure. A major concern is social desirability bias, where students give answers they think are more acceptable or impressive rather than truthful ones, which can lead to inaccurate data.

100

A researcher conducts a study on road rage using a sample of drivers from a small town. What question would you ask to assess if the results can apply to drivers across the entire country?

"Does this sample of drivers adequately represent the population of American drivers?" This addresses generalizability and external validity.

100

A researcher surveys 1,000 voters about an election. Why might we be able to predict the election results from this survey, even though we didn’t ask everyone?

Because the sample might be representative of the larger voting population, allowing for accurate generalization.

200

A researcher conducts a study where participants are not given a full explanation of the risks involved. After participating in a research study, Jamie learns the study involved exposure to a mild health risk that was never mentioned beforehand. This violates this ethical principle from the Belmont Report.

Respect for persons (informed consent was not properly obtained).

200

A study involves minimal risk, such as participants completing an anonymous online survey. Should the IRB meet in person to review this study, and why or why not?

No, the IRB typically does not meet in person for studies with little to no risk, as long as the study does not involve vulnerable populations.

200

A hospital is studying how often nurses follow handwashing protocols. During the first week, a researcher in a white lab coat stands near the sink with a clipboard. The number of nurses washing their hands increases dramatically. However, once the researcher leaves and cameras are turned off, handwashing rates drop again.

What concept explains why the nurses’ behavior changed while they were being watched?

Reactivity — the nurses altered their behavior because they were aware they were being observed, which temporarily improved their compliance.

200

scatterplot displays data points that are tightly clustered along a straight line. What does this indicate about the relationship between the two variables?

The relationship is very strong.

200

A psychologist develops a new test to measure creativity by assessing how many different colors a person can name in 30 seconds. Although the test provides consistent results, many experts argue that color-naming doesn’t accurately reflect creative thinking. What type of validity is in question?

Construct validity.

200

A political poll asks, "Do you support Candidate A or Candidate B?" without allowing for a neutral option. What kind of question format is this?

This is a forced-choice question, which limits respondents to two options and doesn’t allow for neutrality.

200

During a study, a manager is told that a certain group of employees are "high performers." As a result, the manager unknowingly gives them more positive feedback, and their performance improves. What is this called?

This is an example of the observer effect (also known as the expectancy effect), where the observer’s expectations influence the participants' behavior.

200

A study randomly selects students using every 5th person from a list of all students at a school. What is this sampling technique called?

Systematic sampling, where participants are chosen at regular intervals from a larger population.

200

If a pollster only surveys people who are easy to reach, like students on campus or people in their neighborhood, what type of sampling is this, and why is it problematic?

This is convenience sampling, which can lead to biased results because the sample may not be representative of the entire population.

300

A pharmaceutical company is testing a new medication that might help people with chronic pain, but the full risks haven’t been studied yet. What ethical principle should they consider to make sure the potential benefits outweigh the harms?

Beneficence (they need to ensure the benefits outweigh the risks and protect participants from harm).

300

In a study, participants are told they are completing a task to improve memory, but the true purpose of the experiment is to test their reactions to stress. What ethical standard is being applied when participants are not fully informed about the study's real purpose?

Deception (Standard 8.07), but the researcher must ensure the deception does not cause harm and should debrief participants afterward.

300

Researchers collect two types of data on smiling: participants record how often they smile during the day, and researchers also count the number of smiles from video recordings of social interactions. What types of measures are being used, and how does using both improve the study?

The study uses self-report and observational measures. Self-report captures internal experiences and how participants interpret their own behavior, while observational measures provide objective data on what actually happened. Using both improves validity by offering different perspectives and helping researchers cross-check the accuracy of each.

300

A researcher uses a scale to measure subjective well-being, with items worded differently but all designed to measure the same concept. What type of reliability is the researcher assessing?

Internal reliability.

300

A gym uses a new fitness assessment to predict how well clients will perform in an upcoming marathon. Later, they compare the assessment scores to the actual marathon finishing times and find that higher fitness scores didn’t correspond with faster times. What type of validity is lacking?

Criterion validity.

300

A survey asks, "Do you agree that the company should improve work conditions and increase pay?" Why is this a poor question, and how could it be improved?

It’s a double-barreled question because it asks about two different things (work conditions and pay). It should be split into two separate questions.

300

A researcher asks participants to recall their experience of a natural disaster that happened five years ago. Many provide specific details that differ from their earlier accounts given right after the event. What does this tell us about self-reporting on memories?

Self-reporting on memories can be unreliable, as memories can change over time, influenced by emotions, media, or other people’s accounts.

300

A researcher selects only people who responded to an email survey. Why might this sample not be generalizable to the entire population?

 This creates a self-selection bias, where only those who are willing and available respond, making the sample unrepresentative of the entire population.

300

A researcher wants to understand how college students across the U.S. cope with academic stress. They collect data by surveying students at a single elite private university on the East Coast.

What is the population in this study, and what is a potential problem with how the data was collected?

The population is all college students in the U.S.
The issue is that the sample may not be representative of the broader population — students at an elite private university might have different experiences than those at public schools, community colleges, or in different regions, which limits how well the results can be generalized.

400

Researchers are studying a new medical treatment, but they only test it on individuals from a low-income neighborhood. Meanwhile, the treatment is intended to benefit people from all backgrounds. Which principle of research ethics is being ignored, and how?

Justice (the burden of risk is unfairly placed on one group while others benefit).

400

A professor publishes a study claiming to have collected data from hundreds of participants, but in reality, no participants were involved, and the data was entirely invented. What term describes this unethical behavior?

Data fabrication, where the researcher creates data that was never collected.

400

Researchers measure body temperature in degrees Celsius. What type of scale is this?

Interval scale.

400

In a study, participants take an IQ test twice, one month apart. What type of reliability is being assessed by comparing the consistency of their scores over time?

 Test-retest reliability.

400

A new scale measuring job satisfaction is compared to a measure of physical health. The researcher finds a very low correlation between the two measures, supporting what type of validity for the job satisfaction scale?

Discriminant validity.

400

A respondent answers "Strongly agree" to every question on a life satisfaction survey, regardless of the content. What bias is this an example of, and how does it affect the survey’s results?

This is an example of acquiescence bias (yea-saying), where respondents agree with everything. It reduces the construct validity of the survey by failing to measure their true opinions.

400

A research team is studying identity and well-being. Participants are asked to describe their gender identity, report their household income, and rate how satisfied they feel with their lives. What type of measurement is being used in this study, and why is it the most appropriate choice?

Self-report measures are being used. This is appropriate because the variables (identity, income, satisfaction) are personal and subjective — they’re best captured through the participants’ own responses.

400

A university randomly selects 100 freshmen, 100 sophomores, 100 juniors, and 100 seniors to participate in a study on academic stress. What kind of sampling method is this?

This is stratified random sampling, where participants are divided into subgroups (strata), and random samples are taken from each group.

400

A researcher asks participants with Crohn’s disease to invite other people they know with the condition to join the study. What sampling method is this, and what is a key limitation?

This is snowball sampling, which may lead to biased results because the sample is not random and may not be representative of the larger population.

500

A research team wants to study how video games affect decision-making in children under 12. To attract more participants, they offer children $500 in exchange for joining the study, which is a large sum for the children. Why might this be considered unethical?

Offering a large sum of money could unduly influence minors and violate respect for persons, particularly vulnerable populations like children.

500

A psychologist plans to conduct a study on rats to test a new drug but hasn’t yet proven that using animals is necessary. What ethical standard must the researcher meet before proceeding with the study?

The researcher must justify why animals are necessary by following the ethical standard known as the Three Rs:

  • Replacement – Use alternatives to animals whenever possible (e.g., computer models, cell cultures)

  • Reduction – Use the fewest number of animals necessary to achieve valid results

  • Refinement – Modify procedures to minimize pain, distress, and suffering for the animals involved

500

In a survey, participants are asked to rank their satisfaction from "very satisfied" to "not satisfied at all." What type of measurement scale does this represent?

Ordinal scale.

500

What is considered an acceptable value for Cronbach’s alpha to indicate a reliable scale?

A score of .70 or higher is typically considered acceptable for internal reliability.

A negative Cronbach’s alpha is always a bad sign because it suggests the items on the scale are inversely related or measuring completely different constructs, meaning the scale lacks internal consistency altogether.

500

A researcher designs a stress survey and administers it to two groups: a group of fire-fighters just after a long shift and a group of students on a calm vacation. The fire-fighters score higher on the survey. What type of validity is demonstrated by this comparison?

Known-groups evidence for criterion validity.

500

A professor is rated on a scale from "Profs get F's too" (1) to "A real gem" (5). What type of scale is being used here?

This is a semantic differential scale, where respondents rate an object along a continuum between two opposing adjectives.

500

A researcher wants to observe the natural behavior of employees in a workplace but notices that productivity spikes whenever the employees know they are being watched. What can the researcher do to reduce reactivity?

The researcher could use unobtrusive observation methods, such as hidden cameras or long-term observation, where participants get used to being observed and return to their normal behavior.

500

In a study, researchers randomly puts participants to different treatment groups. What is this process called, and how does it differ from random sampling?

This is random assignment, which enhances internal validity by ensuring treatment groups are comparable. It differs from random sampling, which enhances external validity by making the sample representative of the population.

500

In a study, the researcher sets a target to survey 80 Asian Americans, 80 African Americans, and 80 Latinx participants but does not select them randomly. What type of sampling is being used?

This is quota sampling, where specific subgroups are targeted but selected nonrandomly.

600

A study is recruiting participants from a local care home, where many elderly people have cognitive impairments. The researchers do not take special care to explain the study in a way these individuals can understand. Which group from the Belmont Report’s principles is at risk in this scenario, and what should the researchers have done?

Vulnerable populations (the researchers should provide extra protection and ensure informed consent is obtained in a way participants can understand).

600

In one study, data is collected anonymously, and in another, it is collected confidentially.
What is the key difference between these two methods of collecting participant information?

Anonymous data collection means no identifying information is collected at all — the researcher cannot trace responses back to participants.


Confidential data collection means identifying information is collected, but the researcher protects it and does not share it with anyone outside the study.

600

What statistical measure combines the average inter-item correlation and the number of items in a scale to evaluate internal reliability? What is generally considered an acceptable score for this measure?

Cronbach’s alpha. It assesses how consistently items in a scale measure the same construct. 

600

A bathroom scale gives the same reading every day for a person, but it's always 10 pounds off from their actual weight. What does this example illustrate about the scale's reliability and validity?

High reliability, but low validity.

600

A researcher develops a pedometer that consistently provides the same reading every day, but it doesn’t match the actual number of steps walked. Which is the pedometer showing: reliability or validity?

Reliability (but not validity).

600

A researcher gives respondents the option to answer "neutral" on every question in a survey about environmental policies. Why might this lead to inaccurate data, and how could the researcher prevent this behavior?

Fence sitting might lead to inaccurate data by not capturing true opinions. The researcher could remove the neutral option or use forced-choice questions to prevent this.

600

A researcher expects that a new teaching method will improve student performance, and this expectation leads the teacher to give the students more attention during the study. What is the difference between observer bias and observer effects in this scenario?

Observer bias occurs when the researcher interprets the data to fit their expectations, while observer effects occur when the participants' behavior changes due to the researcher’s presence or actions, like giving more attention to students.

600

Researchers ensure that a smaller ethnic group makes up at least 20% of their sample, even though this group is less than 10% of the population. After the sample is collected, participants are split into treatment and control groups. What combination of techniques might they be using?

Oversampling and random assignment.

600

A researcher wants to test a new stress-reduction program on U.S. high school students. They start by selecting 50 high schools from across the country. Then, from each of those schools, they choose 20 students to take part in the study. After selecting the students, the researcher uses an appropriate procedure to avoid bias when deciding who receives the program and who does not.

List and explain the key steps in the sampling and assignment process used in this study.

  • Defined population: All U.S. high school students

  • Multistage sampling: Selected schools, then students within schools

  • Random selection: Ensures a representative sample from the population

  • Random assignment: The “appropriate procedure to avoid bias” refers to assigning students to groups in a way that avoids preexisting differences — this increases internal validity