Stat 694: Applied Research in Statistics and Biostatistics

Department of Statistics and Biostatistics, CSU East Bay

Fall 2023:


Week 15:


Week 14:


Week 13:

Excellent References:

Machine Learning with R:

Machine Learning with Python:

Big Picture:

LLMs


Week 12:


Week 11:


Week 10:


Week 9:


Week 8:


Week 7:


Week 6:


Week 5:


Week 4:


Week 3:


Week 2:


Week 1:


Spring 2023:


Week 7:


Week 6:


Week 5:


Week 4:

  1. How many rows are there (n, the number of observations)?

  2. How many columns are there (the number of variables)?

  3. How many variables are numeric, and how many are categorical?

  4. How much data is missing (NA)?

  5. What are the summary statistics for each numeric variable?

  6. What are the summary statistics for each categorical variable?

  7. Visualize the numeric data. Is each variable symmetric or skewed? Is it approximately normally distributed?

  8. Visualize the categorical data.

  9. Which numeric variables are correlated? Make scatterplots.

  10. Are there any outliers in the variables?

  11. Do the variable names follow good practice (for example, no spaces)?
    In R, clean_names() from the janitor package standardizes them.
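
The non-graphical items in the checklist (1 through 6 and 11) can be sketched in a few lines. The example below is a minimal illustration in Python using only the standard library, not a prescribed workflow. The toy data set, the column names, and the helper functions clean_name(), is_numeric(), and describe() are all hypothetical. clean_name() is only a rough stand-in for the idea behind janitor's clean_names() and does not reproduce its exact rules.

```python
import re
import statistics
from collections import Counter

# Hypothetical toy data set; None marks a missing value (NA).
rows = [
    {"Patient ID": 1, "Age (years)": 34, "Blood Type": "A", "Weight": 70.5},
    {"Patient ID": 2, "Age (years)": 51, "Blood Type": "O", "Weight": None},
    {"Patient ID": 3, "Age (years)": 29, "Blood Type": "A", "Weight": 82.0},
    {"Patient ID": 4, "Age (years)": 46, "Blood Type": None, "Weight": 64.2},
]


def clean_name(name):
    """Lower-case a name and replace runs of non-alphanumerics with '_'."""
    return re.sub(r"[^0-9a-z]+", "_", name.lower()).strip("_")


def is_numeric(values):
    """Treat a column as numeric if every non-missing value is an int or float."""
    present = [v for v in values if v is not None]
    return bool(present) and all(
        isinstance(v, (int, float)) and not isinstance(v, bool) for v in present
    )


def describe(rows):
    """Return row and column counts plus a per-column summary."""
    names = list(rows[0].keys())
    summary = {"n_rows": len(rows), "n_cols": len(names), "columns": {}}
    for name in names:
        values = [row[name] for row in rows]
        present = [v for v in values if v is not None]
        info = {"n_missing": len(values) - len(present)}
        if is_numeric(values):
            # Items 3 and 5: numeric variables get a five-number-style summary.
            info["type"] = "numeric"
            info["min"] = min(present)
            info["median"] = statistics.median(present)
            info["mean"] = statistics.mean(present)
            info["max"] = max(present)
        else:
            # Items 3 and 6: categorical variables get a frequency table.
            info["type"] = "categorical"
            info["counts"] = dict(Counter(present))
        # Item 11: store the summary under a cleaned variable name.
        summary["columns"][clean_name(name)] = info
    return summary


summary = describe(rows)
print(summary["n_rows"], "rows,", summary["n_cols"], "columns")
for name, info in summary["columns"].items():
    print(name, info)
```

Note that an identifier column such as the hypothetical "Patient ID" is stored as numbers, so a type check alone classifies it as numeric; deciding that it is really a label is part of the judgment the checklist asks for.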


Week 3:


Weeks 1 & 2:


Fall 2022:


Week 16:


Week 15:

Excellent References:

Machine Learning with R:

Machine Learning with Python:

Big Picture:


Week 14:


Week 13:


Week 12:


Week 11:


Week 10:


Week 9:


Week 8:


Week 7:


Week 6:


Week 5:


Week 4:


Week 3:


Week 2:


Week 1:


Week 0: