/GTWeb

CS229 Lect. 2

Linear Regression

Linear regression is one of the simplest learning algorithms. This is a supervised learning algorithm because the data is labeled.

Ex: Predict the price of garden hoses

Training -> learning algo -> hypothesis func. $h$

The hypothesis is $h : X \to Y$ where $h (x \in X) = θ_{0} + θ_{1} x$ (affine function). This assumes that we have one variable $x$ .

If we have two variables (note that complex phenomena generally require many variables to be accurately predicted, e.g. thousands of pixles), $h (x_{1}, x_{2}) = θ_{0} + θ_{1} x_{1} + θ_{2} x_{2}$ . We can see that this nomenclature pattern matches $θ_{k}$ with $x_{k}$ , and $x_{0}$ doesn't exist because $θ_{0}$ is the affine parameter.

Our parameters can be stored in the vector $θ = θ_{1} θ_{2} θ_{3}$ .

Our features can be stored in $x = x_{1} x_{2} x_{3}$ .