Last updated: Thu Feb 15 06:56:51 PM 2024
The first exam primarily revolves around the basics of data: what it is, how we quantify it, and how we can present it to others.
Statistics is concerned with studying populations and their parameters by collecting samples and computing statistics.
An observation is the smallest unit of study within a population. Attributes of observations are called variables.
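A minimal sketch of the parameter vs. statistic distinction. The population here is simulated and entirely hypothetical (heights with mean 170 and SD 10); in practice the population is usually too large to measure, which is why we sample.

```python
import random
import statistics

random.seed(0)  # fixed seed only so the sketch is repeatable

# Hypothetical population: 10,000 simulated heights (cm).
population = [random.gauss(170, 10) for _ in range(10_000)]

# Parameter: a number describing the whole population.
parameter = statistics.mean(population)

# Statistic: the same kind of number, computed from a sample.
sample = random.sample(population, 50)
statistic = statistics.mean(sample)

print(f"population mean (parameter): {parameter:.2f}")
print(f"sample mean (statistic):     {statistic:.2f}")
```

The statistic will usually be close to the parameter but not equal to it, and it changes from sample to sample.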
Variables:
In a data frame, rows are observations and columns are variables. The column associated with an ID or a name is known as an identifier.
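One way to picture this layout, sketched with plain Python instead of a real data frame library. The column names and values are made up for illustration.

```python
# Hypothetical toy data: each dict is one observation (row),
# each key is a variable (column); "student_id" is the identifier.
rows = [
    {"student_id": "S01", "height_cm": 170.0, "major": "Biology"},
    {"student_id": "S02", "height_cm": 182.5, "major": "Statistics"},
    {"student_id": "S03", "height_cm": 165.0, "major": "Biology"},
]

n_observations = len(rows)

# The identifier labels a row; it is not a variable we analyze.
variables = [name for name in rows[0] if name != "student_id"]


def column(rows, name):
    """Return all values of one variable, one per observation."""
    return [row[name] for row in rows]


heights = column(rows, "height_cm")
print(n_observations, variables, heights)
```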
A distribution describes what values a variable takes and how frequently those values occur.
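A small sketch of a distribution for a categorical variable: the values it takes, with counts and relative frequencies. The data are hypothetical.

```python
from collections import Counter

# Hypothetical values of one categorical variable.
values = ["Biology", "Statistics", "Biology", "Math", "Biology"]

# Counts: how often each value occurs.
counts = Counter(values)

# Relative frequencies: counts divided by the number of observations.
n = len(values)
relative_freq = {value: count / n for value, count in counts.items()}

for value, count in counts.most_common():
    print(f"{value}: count={count}, proportion={relative_freq[value]:.2f}")
```

The relative frequencies always sum to 1, which is a quick check on any frequency table.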
Visualization:
What if I have:
Numerical Summaries:
Two variables are associated if the value of one tells us something about the value of the other.
Contingency tables, odds, and odds ratios
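A sketch of odds and an odds ratio from a 2x2 contingency table. The group labels and counts are invented for illustration; odds are defined here as (count with outcome) / (count without outcome).

```python
# Hypothetical 2x2 contingency table:
#              outcome   no outcome
# exposed         30         70
# unexposed       10         90
table = {
    "exposed": {"outcome": 30, "no_outcome": 70},
    "unexposed": {"outcome": 10, "no_outcome": 90},
}


def odds(row):
    """Odds of the outcome within one group: successes / failures."""
    return row["outcome"] / row["no_outcome"]


odds_exposed = odds(table["exposed"])      # 30 / 70
odds_unexposed = odds(table["unexposed"])  # 10 / 90

# Odds ratio: how many times larger the odds are in one group.
odds_ratio = odds_exposed / odds_unexposed

print(f"odds (exposed):   {odds_exposed:.3f}")
print(f"odds (unexposed): {odds_unexposed:.3f}")
print(f"odds ratio:       {odds_ratio:.3f}")
```

An odds ratio of 1 means the odds are the same in both groups, which is what we would expect if the two variables were not associated.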
Standardized variables: what are they? z-scores: what must be true about the z-scores for the correlation to be positive?
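A sketch for working through these questions, assuming the common textbook definitions: z = (value - mean) / SD using the sample SD, and correlation r = the sum of the products of paired z-scores divided by (n - 1). The data are hypothetical.

```python
import statistics


def z_scores(values):
    """Standardize: subtract the mean, divide by the sample SD."""
    mean = statistics.mean(values)
    sd = statistics.stdev(values)  # sample SD (divides by n - 1)
    return [(v - mean) / sd for v in values]


# Hypothetical paired data.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

zx = z_scores(x)
zy = z_scores(y)

# Each product is positive when the two z-scores share a sign,
# i.e. both values are above their means or both are below.
products = [a * b for a, b in zip(zx, zy)]

r = sum(products) / (len(x) - 1)
print(f"r = {r:.4f}")
```

Under these definitions, r is positive when the products of z-scores are positive on balance: observations above the mean in one variable tend to be above the mean in the other, and likewise below.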