+ - 0:00:00
Notes for current slide
Notes for next slide

Hypothesis and Study Design

Aly Lamuri
Indonesia Medical Education and Research Institute

1 / 30

Overview

  • Variables: independent and dependent
  • Formulating a hypothesis
  • Types of study design
  • Assessing hypothesis in published papers
1 / 30

Research in a nutshell

1 / 30

What are...?

  • Cases
    • Observable subjects
    • Can be anything
1 / 30

What are...?

  • Cases
    • Observable subjects
    • Can be anything
  • Variables
    • Characteristics in observed cases
    • Quantifiable
1 / 30

What are...?

  • Cases
    • Observable subjects
    • Can be anything
  • Variables
    • Characteristics in observed cases
    • Quantifiable

Example 1:

Ten schizophrenic patients participated in a study on brain activity.

1 / 30

What are...?

  • Cases
    • Observable subjects
    • Can be anything
  • Variables
    • Characteristics in observed cases
    • Quantifiable

Example 1:

Ten schizophrenic patients participated in a study on brain activity. They underwent computer-based tests on psychotic symptoms and fMRI examination to observe the functional connectivity.

1 / 30

What are...?

  • Cases
    • Observable subjects
    • Can be anything
  • Variables
    • Characteristics in observed cases
    • Quantifiable

Example 1:

Ten schizophrenic patients participated in a study on brain activity. They underwent computer-based tests on psychotic symptoms and fMRI examination to observe the functional connectivity.

Example 2:

A group of botanists observed different species of molds under conditioned environments.

1 / 30

What are...?

  • Cases
    • Observable subjects
    • Can be anything
  • Variables
    • Characteristics in observed cases
    • Quantifiable

Example 1:

Ten schizophrenic patients participated in a study on brain activity. They underwent computer-based tests on psychotic symptoms and fMRI examination to observe the functional connectivity.

Example 2:

A group of botanists observed different species of molds under conditioned environments. They set up different humidity levels and temperature to observe cellular viability.

1 / 30

What are...?

  • Cases
    • Observable subjects
    • Can be anything
  • Variables
    • Characteristics in observed cases
    • Quantifiable

Example 1:

Ten schizophrenic patients participated in a study on brain activity. They underwent computer-based tests on psychotic symptoms and fMRI examination to observe the functional connectivity.

Example 2:

A group of botanists observed different species of molds under conditioned environments. They set up different humidity levels and temperature to observe cellular viability.

Example 3:

The Ministry of Finance investigated an urban population. They considered age and ethnicity as a predictor of socioeconomic mobility.

1 / 30

Independent variable

  • Unaffected by other variables
  • In experiment: under your direct control
  • Design-wise: A presumed cause
2 / 30

Independent variable

  • Unaffected by other variables
  • In experiment: under your direct control
  • Design-wise: A presumed cause

Dependent variable

  • Affected by other variables
  • In experiment: under your indirect control
  • Design-wise: A presumed effect
2 / 30

Grouping the cases

  • Use categorical independent variable
  • Separate each observation
  • Differentiate within and between subjects
3 / 30

Grouping the cases

  • Use categorical independent variable
  • Separate each observation
  • Differentiate within and between subjects

Within?

  • Comparing the same subject
  • When we test the subject multiple times
  • Temporal differences of t0 and t1
  • Using different measures
3 / 30

Grouping the cases

  • Use categorical independent variable
  • Separate each observation
  • Differentiate within and between subjects

Within?

  • Comparing the same subject
  • When we test the subject multiple times
  • Temporal differences of t0 and t1
  • Using different measures

Between?

  • Comparing different subjects
  • When we use a test on multiple groups
3 / 30

That's neat!

3 / 30

That's neat! But why?

3 / 30

That's neat! But why?

  • Enables inferential statistics
  • Describes an association
  • Significance?
3 / 30

That's neat! But why?

  • Enables inferential statistics
  • Describes an association
  • Significance?

Any catch?

  • We need to control confounding variables
  • Reduce bias
  • Minimize random error
3 / 30

Confounding variable

  • Any variable: categoric or numeric
  • Causal association with either in- / dependent variables
  • Reduces internal validity!
3 / 30

Confounding variable

  • Any variable: categoric or numeric
  • Causal association with either in- / dependent variables
  • Reduces internal validity!
confounding Independent Independent Dependent Dependent Independent->Dependent Confounding Confounding Confounding->Independent Confounding->Dependent
3 / 30

Cognitive bias partiality

  • Selection
  • Information
  • Detection
  • Performance
  • Attrition
  • Reporting

Reduces internal validity!

4 / 30

Bias is a systematic error in research

  • Selection bias: the proportion of selected subjects does not represent the population
    • Undercoverage
    • Overcoverage
  • Information bias: the information does not represent the truth:
    • Recall bias
    • Interviewer bias
    • Response bias (vaguely interpreted questions)
    • Confirmation bias (only confirm what conforms)
  • Detection bias: only detecting outcome in a group with certain characteristics
  • Performance bias: a tendency to give different treatment to favour an outcome
  • Attrition bias: due to drop out and lost to follow-up
  • Reporting bias: only report significant findings and neglect insignificant ones

Handling bias

  • Random sampling
  • Random assignment
  • Blinding:
    • Participant
    • Investigator
    • Mediating parties
4 / 30

Systematic error

  • Both confounding variables and bias contribute to systematic errors
  • We have a hunch how they will affect our data
  • So we can prevent them
  • Often have a mathematical solution
  • Example: offset and scale factor errors
5 / 30

Systematic error

  • Both confounding variables and bias contribute to systematic errors
  • We have a hunch how they will affect our data
  • So we can prevent them
  • Often have a mathematical solution
  • Example: offset and scale factor errors

Is there any error we cannot completely handle?

5 / 30

Systematic error

  • Both confounding variables and bias contribute to systematic errors
  • We have a hunch how they will affect our data
  • So we can prevent them
  • Often have a mathematical solution
  • Example: offset and scale factor errors

Is there any error we cannot completely handle?

Yes.

5 / 30

Random error

  • A.K.A: unsystematic error, system noise, random variation
  • Unpredictable and unreplicable
  • Thus, no mathematical solution
5 / 30

Random error

  • A.K.A: unsystematic error, system noise, random variation
  • Unpredictable and unreplicable
  • Thus, no mathematical solution

Handling random error

  • Take the average measurement value
  • Increase sample size
5 / 30

Overview

  • Variables: independent and dependent
  • Formulating a hypothesis
  • Types of study design
  • Assessing hypothesis in published papers
5 / 30

Hypothesis: definition

  • Assumptions about the population parameter
  • Stated as the null H0 and alternative Ha hypothesis
  • Requires a formal procedure to reject H0
6 / 30

Hypothesis: definition

  • Assumptions about the population parameter
  • Stated as the null H0 and alternative Ha hypothesis
  • Requires a formal procedure to reject H0

Null hypothesis

Observed results happened purely from chance

6 / 30

Hypothesis: definition

  • Assumptions about the population parameter
  • Stated as the null H0 and alternative Ha hypothesis
  • Requires a formal procedure to reject H0

Null hypothesis

Observed results happened purely from chance

Alternative hypothesis

Observed results happened due to some non-random causes

6 / 30

Hypothesis: example

Remember when we flipped the coin last times?

set.seed(1)
coin <- sample(c("H", "T"), 10, replace=TRUE, prob=rep(1/2, 2)) %T>% print()
## [1] "T" "T" "H" "H" "T" "H" "H" "H" "H" "T"
7 / 30

Hypothesis: example

Remember when we flipped the coin last times?

set.seed(1)
coin <- sample(c("H", "T"), 10, replace=TRUE, prob=rep(1/2, 2)) %T>% print()
## [1] "T" "T" "H" "H" "T" "H" "H" "H" "H" "T"

Let's state a hypothesis on our coin flip!

H0: The probability of having the head is P=0.5
Ha: The probability of having the head is P0.5

7 / 30

Hypothesis: example

Remember when we flipped the coin last times?

set.seed(1)
coin <- sample(c("H", "T"), 10, replace=TRUE, prob=rep(1/2, 2)) %T>% print()
## [1] "T" "T" "H" "H" "T" "H" "H" "H" "H" "T"

Let's state a hypothesis on our coin flip!

H0: The probability of having the head is P=0.5
Ha: The probability of having the head is P0.5

coineeing the proportion being 0.6, can we reject the H0?

7 / 30

Hypothesis: example

Not quite! ;) We need a formal process to reject H0

7 / 30

Hypothesis: example

Not quite! ;) We need a formal process to reject H0

binom.test(x={sum(coin=="H")}, n=length(coin), p=0.5) %>%
broom::tidy() %>% knitr::kable() %>% kable_minimal()
estimate statistic p.value parameter conf.low conf.high method alternative
0.6 6 0.754 10 0.262 0.878 Exact binomial test two.sided
7 / 30

Hypothesis: example

Not quite! ;) We need a formal process to reject H0

binom.test(x={sum(coin=="H")}, n=length(coin), p=0.5) %>%
broom::tidy() %>% knitr::kable() %>% kable_minimal()
estimate statistic p.value parameter conf.low conf.high method alternative
0.6 6 0.754 10 0.262 0.878 Exact binomial test two.sided

Seeing the p-value > 0.05, we can formally state the failure on rejecting H0

7 / 30

7 / 30

Hypothesis: formal test

  • Does our data correspond to the H0?
  • Sample data are consistent with H0 do not reject H0
  • Sample data are inconsistent with H0 reject H0
8 / 30

Hypothesis: formal test

  • Does our data correspond to the H0?
  • Sample data are consistent with H0 do not reject H0
  • Sample data are inconsistent with H0 reject H0

Type of test

  • Two-tailed: H0Ha
  • Right-tailed: H0>Ha
  • Left-tailed: H0<Ha
8 / 30

Hypothesis: formal test

  • Does our data correspond to the H0?
  • Sample data are consistent with H0 do not reject H0
  • Sample data are inconsistent with H0 reject H0

Rejecting hypothesis

  • Region of acceptance with critical value (older)
  • P-value (newer)
8 / 30

Hypothesis: interpretation

  • We start from the assumption that H0 is true
  • Collect evidence to show otherwise
  • We then reject or fail to reject the H0 based on p-value
  • When testing the hypothesis we may happen to see statistical errors
9 / 30

Hypothesis: interpretation

  • We start from the assumption that H0 is true
  • Collect evidence to show otherwise
  • We then reject or fail to reject the H0 based on p-value
  • When testing the hypothesis we may happen to see statistical errors

Statistical error

  • Type I: False Reject Rate α false positive
  • Type II: False Accept Rate β false negative
9 / 30

9 / 30

9 / 30

Hypothesis: p-value

  • Reflect the probability of making a type I error
  • Lower p-value lower chance of H0 occurred by chance
  • Lower p-value higher significance
  • Common cut-off: 0.05
10 / 30

Overview

  • Variables: independent and dependent
  • Formulating a hypothesis
  • Types of study design
  • Assessing hypothesis in published papers
10 / 30

Study design

  • Cross-sectional
  • Cohort
  • Case-control
  • Randomized controlled trial
  • Systematic review
11 / 30

Cross-sectional

  • A snapshot of outcome and associated characteristics
  • No intervention, inferencing existing differences
  • Relatively inexpensive and quick to conduct
  • Static, reveal no temporal context
  • Unable to establish causality
  • Different time frame may lead to different result
12 / 30

Cohort

  • Longitudinal study to see an effect overtime
  • Prospective / retrospective: observing the future or looking back in time
  • Starting from a potential cause
  • Involving at least two groups of interest
  • Cannot completely control the confounding variables
  • No randomization lower external validity
  • Long completion period
13 / 30

Case-control

  • Longitudinal study to see an effect overtime
  • Retrospective in nature
  • Starting from an outcome of interest
  • Involving at least two groups
  • Usually employed to investigate rare conditions
  • Assessing multiple risk factors
  • Difficult to find a suitable control group
  • Inadequate to establish a diagnostic study
  • Should carefully address confounding variables
14 / 30

Randomized controlled trial

  • Randomization to control bias higher external validity
  • Akin to an experimental study, can effectively employ blinding methods
  • Statistically efficient
  • Clearly identified population
  • Expensive to conduct
  • Risk to have a volunteer bias
  • Loss to follow up attrition bias
15 / 30

Systematic review

  • Concatenate findings from multiple studies
  • Critical appraisal removes redundancies and addresses inconsistencies
  • Delineate where knowledge is lacking to guide future research
  • Does not establish novelty
  • Variation among published articles is a challenge to overcome
  • Potential bias for including unreviewed articles
16 / 30

Systematic review

  • Concatenate findings from multiple studies
  • Critical appraisal removes redundancies and addresses inconsistencies
  • Delineate where knowledge is lacking to guide future research
  • Does not establish novelty
  • Variation among published articles is a challenge to overcome
  • Potential bias for including unreviewed articles

Meta-analysis

  • Takes systematic review to another level :)
  • Analyzing previous analysis (thus the name: meta-analysis)
16 / 30

Overview

  • Variables: independent and dependent
  • Formulating a hypothesis
  • Types of study design
  • Assessing hypothesis in published papers
16 / 30

Reading scientific articles

  • Look for its underlying structure
  • Abstract
    • I: Introduction
    • M: Methods
    • R: Results
    • D: Discussion
  • Introduction
    • S: Situation
    • P: Problem
    • Q: Question
    • R: Resolution
18 / 30
  • By knowing the structure, you can extract information more efficiently
  • When reading the abstract, focus on introduction and methods first
  • Then shift into conclusions, since it contains the summary
  • Finally, you may want to read the results and discussion as well
  • Introduction usually elaborates the background and hypothesis
  • You can look for the study design in methods
  • Discussion part will help you understand current limitation and further suggestions

Abstract - Introduction

Current studies of depression among people living with HIV focus on describing its point prevalence. Given the fluctuating nature of depression and its profound impacts on clinical and quality-of-life outcomes, this study aimed to examine the prevalence, recurrence and incidence of current depressive symptoms and its underlying catalysts longitudinally and systematically among these individuals.

19 / 30

Abstract - Methods

We conducted a prospective cohort study between October 1, 2007 and December 31, 2012 using longitudinal linked data sources. Current depressive symptoms was identified using the Centre for Epidemiologic Studies Depression Scale or the Kessler Psychological Distress Scale, first at baseline and again during follow-up interviews. Multivariable regressions were used to characterize the three outcomes.

20 / 30

Abstract - Conclusion

Depressive symptoms are prevalent and likely to recur among people living with HIV. Our results support the direction of Ontario’s HIV/AIDS Strategy to 2026, which addresses medical concerns associated with HIV (such as depression) and the social drivers of health in order to enhance the overall well-being of people living with or at risk of HIV. Our findings reinforce the importance of providing effective mental health care and demonstrate the need for long-term support and routine management of depression, particularly for individuals at high risk.

21 / 30

Abstract - Results

Of the 3,816 HIV-positive participants, the point prevalence of depressive symptoms was estimated at 28%. Of the 957 participants who were identified with depressive symptoms at baseline and who had at least two years of follow-up, 43% had a recurrent episode. The cumulative incidence among 1,745 previously depressive symptoms free participants (at or prior to baseline) was 14%. During the five-year follow-up, our multivariable models showed that participants with greater risk of recurrent cases were more likely to feel worried about their housing situation. Participants at risk of developing incident cases were also likely to be younger, gay or bisexual, and unable to afford housing-related expenses.

22 / 30

Introduction - Situation

(par. 1)

Depression affects up to half of people living with HIV, a prevalence that is two to four times higher than that found in the general population [1]. Over 50% of people living with HIV and depression do not receive treatment for their depression [2–9], and this failure to treat contributes to significant negative clinical and quality-of-life outcomes [10–14].

23 / 30

Introduction - Situation

(par. 2)

Growing evidence supports a bi-directional relationship between HIV and depression involving a number of biological, psychosocial and social factors [1,14–16]. The persistent viral presence in the central nervous system may release toxic viral proteins that induce depression-like symptoms [17,18]; people living with HIV may possess a negative self-image or experience stigma [1,15,19–21]; and people living with HIV are more likely to struggle with stressors such as financial insecurity and unstable housing [22–25]. Recent reviews also suggest that people who suffer from severe mental illnesses (including depression) and/or co-occurring substance use disorder are more likely to engage in risky sexual behaviour, thereby elevating their risk of HIV acquisition [26–34].

24 / 30

Introduction - Problem

(par. 3)

To date, most studies about the prevalence of depression among people living with HIV have used cross-sectional designs [1,15]. Six studies have documented the incidence [35–38] and persistence (or recurrence) [39,40] of depression over time among people living with HIV. In Canada, information describing the epidemiology of depression among people living with HIV is scarce. In Canada, information describing the epidemiology of depression among people living with HIV is scarce. There have been two small convenience sample studies describing the prevalence of depression among people living with HIV. Williams et al. (2005), employing a small convenience sample of 297 individuals, described the prevalence of depressive symptoms at 54% among people living with HIV based on a self-report screening instrument [41]. Logie, James, Tharao, and Loutfy (2013), employing a sample of 173 Africa, Caribbean, and Black women, described the prevalence of depressive symptoms as 64% [42]. Thus, the epidemiology of this condition is not yet well documented in Canada.

25 / 30

Introduction - Question

(par. 4)

Given the fluctuating nature of depression over the life span and its profound impacts on clinical and quality-of-life outcomes, our study aimed to examine the prevalence, recurrence, and incidence of current depressive symptoms longitudinally and systematically among people living with HIV. We also characterized these three outcomes by HIV-positive participants’ socio-demographic characteristics, housing and neighbourhood conditions, substance-use behaviours and health status over a five-year follow-up period. Understanding change in the burden of depressive symptoms and the underlying catalysts of the condition from a longitudinal perspective would be important to program planners, policy-makers, and health care providers when planning and implementing effective mental-health programs and interventions for people living with HIV.

26 / 30

Introduction - Resolution

(par. 4)

Given the fluctuating nature of depression over the life span and its profound impacts on clinical and quality-of-life outcomes, our study aimed to examine the prevalence, recurrence, and incidence of current depressive symptoms longitudinally and systematically among people living with HIV. We also characterized these three outcomes by HIV-positive participants’ socio-demographic characteristics, housing and neighbourhood conditions, substance-use behaviours and health status over a five-year follow-up period. Understanding change in the burden of depressive symptoms and the underlying catalysts of the condition from a longitudinal perspective would be important to program planners, policy-makers, and health care providers when planning and implementing effective mental-health programs and interventions for people living with HIV.

27 / 30

Extracted information

  • Case: people living with HIV
  • Variables
    • Independent: depressive symptoms prevalence, recurrence, incidence
    • Dependent: younger, gay or bisexual, unable to afford expenses
    • Potential confounding: worrying their household situation
  • Study design: cohort prospective
  • Hypothesis
    • H0: Depressive symptoms are not likely to recur
    • Ha: Depressive symptoms are likely to recur
28 / 30

Query?

29 / 30

Quick assignment

In completing this assignment, you will:

  • Read two articles we briefly discussed in the first lecture
  • Identify their underlying structures (abstract and introduction)
  • Succintly describe the case, variables and study design
  • Determine the hypotheses and formulate them as H0 and Ha
  • Make a presentation on the abstract, introduction and extracted information
30 / 30

Quick assignment

In completing this assignment, you will:

  • Read two articles we briefly discussed in the first lecture
  • Identify their underlying structures (abstract and introduction)
  • Succintly describe the case, variables and study design
  • Determine the hypotheses and formulate them as H0 and Ha
  • Make a presentation on the abstract, introduction and extracted information
30 / 30

Overview

  • Variables: independent and dependent
  • Formulating a hypothesis
  • Types of study design
  • Assessing hypothesis in published papers
1 / 30
Paused

Help

Keyboard shortcuts

, , Pg Up, k Go to previous slide
, , Pg Dn, Space, j Go to next slide
Home Go to first slide
End Go to last slide
Number + Return Go to specific slide
b / m / f Toggle blackout / mirrored / fullscreen mode
c Clone slideshow
p Toggle presenter mode
t Restart the presentation timer
?, h Toggle this help
Esc Back to slideshow