Linear regression is one of the simplest learning algorithms. This is a supervised learning algorithm because the data is labeled.
Ex: Predict the price of garden hoses
Training -> learning algo -> hypothesis func.
The hypothesis is where (affine function). This assumes that we have one variable .
If we have two variables (note that complex phenomena generally require many variables to be accurately predicted, e.g. thousands of pixles), . We can see that this nomenclature pattern matches with , and doesn't exist because is the affine parameter.
Our parameters can be stored in the vector .
Our features can be stored in .