Missing data refers to the absence of data entries in a dataset where values are expected but not recorded. They’re the blank cells in your data sheet. Missing values for specific variables or participants can occur for many reasons, including incomplete data entry, equipment failures, or lost files. When data are missing, it’s a problem. However, the issues go beyond merely reducing the sample size. In some cases, they can skew your results. [Read more…] about Missing Data Overview: Types, Implications & Handling
Blog
Data Aggregation: Strengths & Weaknesses of Aggregated Data
What is Data Aggregation?
Data aggregation is a crucial process that involves collecting data and summarizing it in a concise form. This method transforms atomic data rows—sourced from diverse origins—into comprehensive totals or summary statistics. Aggregated data, typically housed in data warehouses, enhances analytical capabilities and significantly speeds up querying large datasets. [Read more…] about Data Aggregation: Strengths & Weaknesses of Aggregated Data
Prospect Theory Overview & Examples
What is Prospect Theory?
Prospect Theory states that individuals place greater weight on losses than gains while making decisions. It is a descriptive model of how individuals make decisions involving risk and uncertainty proposed by Daniel Kahneman and Amos Tversky in 1979. Prospect theory describes how people evaluate and choose between different options. [Read more…] about Prospect Theory Overview & Examples
Gage R&R Overview & Example
What is Gage R&R?
Gage R&R assesses the amount and sources of measurement variation in a measurement system. It evaluates a measurement system’s precision and helps you target improvement efforts where they’re most needed. It does not assess accuracy or bias. [Read more…] about Gage R&R Overview & Example
Regression to the Mean: Definition & Examples
What is Regression to the Mean?
Regression to the mean is the statistical tendency for an extreme sample or observed value to be followed by a more average one. It is also known as reverting to the mean, highlighting the propensity for a later observation to move closer to the mean after an extreme value. The concept applies only to random variation in a process or system and does not pertain to interventions or events that affect the outcome. [Read more…] about Regression to the Mean: Definition & Examples
Self Selection Bias Overview & Examples
What is Self Selection Bias?
Self selection bias can occur when individuals choose to participate in a study, survey, or experiment. The bias exists when volunteers have different characteristics than those who do not participate. It is a form of sampling bias stemming from using a nonprobability sampling method, such as volunteer or convenience sampling. [Read more…] about Self Selection Bias Overview & Examples
Attrition Bias: Definition & Examples
What is Attrition Bias?
Attrition bias in research occurs when study participants who drop out have characteristics that differ significantly from those who remain. This selective dropout can lead to skewed results and misinterpretations if the researchers don’t adequately address it. This threat is higher for longitudinal studies and those with relatively high attrition rates. [Read more…] about Attrition Bias: Definition & Examples
Conjunction Fallacy: Definition & Example
What is the Conjunction Fallacy?
The conjunction fallacy is a cognitive bias that occurs when someone mistakenly believes that two events occurring together are more likely than either of the two events alone. In other words, it’s the mistaken belief that a precisely detailed, multifaced outcome is more likely to occur than a more generalized version of that outcome. [Read more…] about Conjunction Fallacy: Definition & Example
Base Rate Fallacy Overview & Examples
What is Base Rate Fallacy?
Base rate fallacy is a cognitive bias that occurs when a person misjudges an outcome by giving too much weight to case-specific details and overlooks crucial probability information that applies to all cases in a population. That vital probability is the outcome’s base rate of occurrence in the population. [Read more…] about Base Rate Fallacy Overview & Examples
Quasi Experimental Design Overview & Examples
What is a Quasi Experimental Design?
A quasi experimental design is a method for identifying causal relationships that does not randomly assign participants to the experimental groups. Instead, researchers use a non-random process. For example, they might use an eligibility cutoff score or preexisting groups to determine who receives the treatment. [Read more…] about Quasi Experimental Design Overview & Examples
Residual Sum of Squares (RSS) Explained
The residual sum of squares (RSS) measures the difference between your observed data and the model’s predictions. It is the portion of variability your regression model does not explain, also known as the model’s error. Use RSS to evaluate how well your model fits the data. [Read more…] about Residual Sum of Squares (RSS) Explained
Covariance vs Correlation: Understanding the Differences
Covariance vs correlation both evaluate the linear relationship between two continuous variables. While this description makes them sound similar, there are stark differences in how to interpret them.
Although these statistics are closely related, they are distinct concepts. How are they different?
In this post, learn about the differences between covariance vs correlation and what you can learn from each. [Read more…] about Covariance vs Correlation: Understanding the Differences
Risk Calculations: Relative vs Absolute & Risk Reduction
What’s the risk? People discuss risk frequently, but it’s not always clearly understood. It is your exposure to danger or adverse outcomes. Statistically, we define risk as the probability of a negative outcome occurring, and there are several ways to calculate it. [Read more…] about Risk Calculations: Relative vs Absolute & Risk Reduction
Omitted Variable Bias: Definition, Avoiding & Example
What is Omitted Variable Bias?
Omitted variable bias (OVB) occurs when a regression model excludes a relevant variable. The absence of these critical variables can skew the estimated relationships between variables in the model, potentially leading to erroneous interpretations. This bias can exaggerate, mask, or entirely flip the direction of the estimated relationship between an independent and dependent variable. [Read more…] about Omitted Variable Bias: Definition, Avoiding & Example
What is a Case Study? Definition & Examples
Case Study Definition
A case study is an in-depth investigation of a single person, group, event, or community. This research method involves intensively analyzing a subject to understand its complexity and context. The richness of a case study comes from its ability to capture detailed, qualitative data that can offer insights into a process or subject matter that other research methods might miss. [Read more…] about What is a Case Study? Definition & Examples
Sample Mean vs Population Mean: Symbol & Formulas
In statistics, the symbols and formulas for basic concepts such as the mean provide a foundational understanding of data analysis. Understanding the mean involves more than just knowing how to calculate an average; it’s about recognizing the nuances that differentiate a population mean from a sample mean. This distinction is crucial in statistical analysis, as the approach and symbol used for each vary (mu vs. x bar). [Read more…] about Sample Mean vs Population Mean: Symbol & Formulas
Correlational Study Overview & Examples
What is a Correlational Study?
A correlational study is an experimental design that evaluates only the correlation between variables. The researchers record measurements but do not control or manipulate the variables. Correlational research is a form of observational study. [Read more…] about Correlational Study Overview & Examples
Type 2 Error Overview & Example
What is a Type 2 Error?
A type 2 error (AKA Type II error) occurs when you fail to reject a false null hypothesis in a hypothesis test. In other words, a statistically non-significant test result indicates that a population effect does not exist when it actually does. A type 2 error is a false negative because the effect exists in the population, but the test doesn’t detect it in the sample. [Read more…] about Type 2 Error Overview & Example
Type 1 Error Overview & Example
What is a Type 1 Error?
A type 1 error (AKA Type I error) occurs when you reject a true null hypothesis in a hypothesis test. In other words, a statistically significant test result indicates that a population effect exists when it does not. A type 1 error is a false positive because the test detects an effect in the sample that doesn’t exist in the population. [Read more…] about Type 1 Error Overview & Example
Cross Sectional Study: Overview, Examples & Benefits
What is a Cross Sectional Study?
A cross-sectional study is an experimental design that analyzes data from a representative sample at a specific point in time. Researchers usually evaluate multiple attributes at once when using this design. Unlike longitudinal studies, these studies don’t track changes over time. [Read more…] about Cross Sectional Study: Overview, Examples & Benefits