Acknowledgments and resources

We are genuinely grateful to many people within the Statistics, Machine Learning, Epidemiology and R programming communities, who shared their valuable work, open source software, and training resources.

Licensing and use of the workshop materials

The workshop materials are licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. All borrowed external materials (images, worked examples, etc.), are credited with proper “Source” statements and governed by their own licenses. To our knowledge, all the consulted materials were published under “open access” or “creative commons” frameworks. If this were not the case for any content piece displayed here, please let us know and it will be removed.

Selected resources for self-guided learning

Below is a curated list of great resources (most of which free and openly accessible) you can peruse on your own.

Statistics, Biostatistics, Epidemiology with R examples

R packages & tools

Sources of practice datasets

  • Vanderbilt Department of Biostatistics (2023, September 17). Vanderbilt Biostatistics Datasets [Dataset]. https://hbiostat.org/data/
  • Chicco, D., & Jurman, G. (2020). Machine learning can predict survival of patients with heart failure from serum creatinine and ejection fraction alone [Dataset]. BMC Medical Informatics and Decision Making, 20(1), 16. https://doi.org/10.1186/s12911-020-1023-5
  • Ahmad, T., Munir, A., Bhatti, S. H., Aftab, M., & Raza, M. A. (2017). Survival analysis of heart failure patients: A case study [Dataset]. PLOS ONE, 12(7), e0181001. https://doi.org/10.1371/journal.pone.0181001
  • Joosten, H., Van Eersel, M. E. A., Gansevoort, R. T., Bilo, H. J. G., Slaets, J. P. J., & Izaks, G. J. (2013). Cardiovascular Risk Profile and Cognitive Function in Young, Middle-Aged, and Elderly Subjects [Dataset]. Stroke, 44(6), 1543–1549. https://doi.org/10.1161/STROKEAHA.111.000496