Exploring the Impact of Finite Sample Size and Missing Data on Statistical Modeling Decisions

Faculty of Health

School of Public Health Sciences

The side of a red brick building with writing on it that reads, "Lyle S. Hallman Institute for health promotion"

Research project description

Statistical models play a critical role in data applications for explanatory and predictive purposes. The model-building process involves use of various statistical tools, some of which make certain assumptions to yield good statistical properties like consistency. Such properties enable researchers to make reliable statistical inferences. However, when samples are small to moderate in size, issues arise when applying common model-building tools due to deviations from underlying assumptions. Such deviations can lead to unreliable parameter estimates, reduced statistical power, thereby affecting data-driven decisions. The process of model-building and conducting statistical tests is further impeded when data are incomplete due to missing values.

One arm of this research project investigates the impact of finite sample size on data-driven decisions. By understanding this impact, new modeling tools can be developed to improve data-driven decisions in cases where sample sizes are small to moderate. The second arm of this research project studies the impact of missing data on statistical inference, which is followed by developing methods to mitigate this issue. In data applications, traditional methods for dealing with missing data can be biased, inefficient, (mathematically) incongruent, and computationally intensive. This arm of research will develop new statistical tests and modeling methods for data with missing values. Methods developed in this research project will be applied in real health-related datasets.

Fields of research

  • Statistical Modeling
  • Asymptotic Theory
  • Statistical Consistency/Efficiency
  • Missing Data

Qualifications and ideal student profile

Prospective graduate student researchers must meet or exceed the minimum admission requirements for the programs connected to this opportunity. Visit the program pages using the links on this page to learn more about minimum admission requirements. In addition to minimum requirements, the research supervisor is looking for the following qualifications and student profile.

  • BSc/MSc in Statistics, Mathematics, or Computer Science
  • (Multidimensional) Calculus, Linear Algebra, Statistical Theory, Mathematical Statistics, Asymptotic Theory, Applied Statistics
  • Experience in Statistical modeling using R, Python, and/or SAS 
  • Experience in conducting simulation studies using R, Python, and/or SAS

Important dates

Exploring the Impact of Finite Sample Size and Missing Data on Statistical Modeling Decisions is an open and ongoing research opportunity. Expressions of interest can be submitted for any term.

Express interest in Exploring the Impact of Finite Sample Size and Missing Data on Statistical Modeling Decisions

Citizenship
Please ensure your CV file is a .PDF
One file only.
100 MB limit.
Allowed types: pdf.
Tell the supervisor why you're a fit for this research opportunity?
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.