Socratizer

Online learning platform

Final project

Goal

Produce a fully reproducible statistical report, in R, that demonstrates your mastery of the techniques we have covered during the classes.

You are free to choose any empirical dataset or study as long as the data allow you to apply (and justify) at least two advanced methods from the course.

This final project is worth 40% of your final course grade (see the Syllabus page for course-wide assessment details).

How to Choose Your Dataset

What to look for:

Datasets with more than 2 levels (e.g., students nested in schools), repeated measures, or multiple variables suitable for a complex linear model.

Submission format (single-file Quarto)

Use the provided template: final_project_template.qmd.

Submission rules: see How to Submit Assignments - Quarto (.qmd) Basics (Introduction section).

Data access (important)

Your project must be reproducible from the .qmd:

Minimum statistical content (syllabus-aligned)

Your project must include at least 2 advanced components from this list (you may include more):

  1. Diagnostics & robustness
    • model diagnostics (residuals, influence) and at least one robustness step (e.g., log/Box-Cox, bootstrap CI, sensitivity check)
  2. Generalized linear model (GLM)
    • logistic regression (binary) and/or Poisson regression (counts), interpretation on the response scale
  3. Mixed model (LMM/GLMM)
    • random intercepts and/or random slopes, justification of the random-effects structure, convergence/singularity checks
  4. SEM / CFA (lavaan)
    • CFA with fit indices + interpretation of loadings, and/or a simple SEM/path model
  5. Meta-analysis (metafor)
    • compute effect sizes and run a small meta-analysis (forest plot + at least one bias/sensitivity check)

At least one of your components must be from (2)–(5) (i.e., beyond “standard regression/ANOVA only”).

Report Structure

  1. Introduction
    • Background & theory
    • Clear specification of the research question(s)
    • Clear specification of dependent/independent variables and the structure of the dataset (levels, repeated measures, items)
    • Clear hypotheses that will be tested (it does not matter whether exploratory or confirmatory)
  2. Methods
    • Pre-processing steps (handling of missing data, transformations)
    • Model specification choices (coding, random effects, links, etc.) and why
  3. Results
    • At least 2 tables (descriptives + model results)
    • At least 2 figures (model-based plot(s), diagnostics, forest plot, etc.)
    • Precise reporting of statistics (estimates, CI, p, effect sizes where applicable)
  4. Discussion
    • Interpretation in light of hypotheses
    • Discussion of the statistical methods that were used
    • Limitations and what you would do next

Week 15 presentation requirements

You will present your final project in Week 15.

Bring:

Format (default; may be adjusted depending on class size):

Your presentation should answer (in order):

1) What is the research question and what is the dataset?
2) What model(s) did you fit and why are they appropriate?
3) What is the main result (with one key figure/table)?
4) What did you check (diagnostics/robustness/convergence/bias) and what did you conclude?

Grading rubric (40% of course grade)

This rubric is only for the final project (separate from homework rubrics).

Graded on a 0–10 scale (then weighted as 40% of the course grade):

Component Points (max) What “full points” means
Research question + design clarity 1.5 Clear question, variables, and data structure; appropriate scope
Data prep + transparency 1.5 Clear preprocessing; missing data handled; transformations justified
Methods + justification 3.0 At least 2 advanced components; correct implementation; choices justified
Diagnostics / robustness / bias checks 1.5 Appropriate checks for the methods used; conclusions reflect checks
Reporting + interpretation 2.0 Clear tables/figures; correct interpretation in plain language; limitations noted
Reproducibility 0.5 Renders cleanly; no absolute paths; data access is reproducible

Retake rule (if failed)

If the project is failed, you must submit a revised version (or a new analysis) during the retake period.

Completion of all homework assignments is required to submit the final project.

Max grade

10

Additional files

final_project_template.qmd
Quarto template (.qmd)

Send an assignment

Currently sent files:

You are not logged in. Log in to upload your assignment.