Machine learning fundamentals

1 Introduction

A typical machine learning pipeline: Data collection \(\rightarrow\) Feature extraction \(\rightarrow\) Model training.

Consider a machine learning problem as a system with input and output.

Input \(\rightarrow\) Machine learning \(\rightarrow\) Output.

Problems can be categorized by whether the output is continuous (regression), discrete (classification), or a structured object (structured learning).

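Problems can also be categorized by whether the output is known for the training data (supervised learning) or not (unsupervised learning).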

Linear models vs. Nonlinear models.

Parametric models take a presumed functional form and are completely determined by a fixed set of model parameters.
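As a minimal sketch of this idea, the block below assumes a one-dimensional linear form \(f(x) = wx + b\), so the model is completely determined by the two parameters \(w\) and \(b\). The names `LinearModel` and `fit_least_squares` are hypothetical, chosen only for illustration, and least squares is just one possible way to estimate the parameters.

```python
from dataclasses import dataclass


@dataclass
class LinearModel:
    """Parametric model f(x) = w * x + b, fully specified by (w, b)."""
    w: float
    b: float

    def predict(self, x: float) -> float:
        return self.w * x + self.b


def fit_least_squares(xs, ys) -> LinearModel:
    """Closed-form least-squares estimate of slope and intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    w = sxy / sxx
    return LinearModel(w=w, b=mean_y - w * mean_x)


# Noise-free toy data lying exactly on the line y = 2x + 1 (illustrative values).
model = fit_least_squares([0.0, 1.0, 2.0, 3.0], [1.0, 3.0, 5.0, 7.0])
print(model)
print(model.predict(10.0))
```

Once `w` and `b` are estimated, the training data is no longer needed to make predictions; the two numbers carry everything the model has learned.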

data = signal + noise.

Underfitting occurs when the learning performance is not satisfactory even on the training data.

Overfitting occurs when the model performs nearly perfectly on the training data but fairly poorly on unseen evaluation data.
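Both failure modes can be reproduced on synthetic data. The sketch below is an illustration under stated assumptions, not a method taken from these notes: the signal is assumed to be \(\sin(2\pi x)\), the noise is Gaussian with standard deviation 0.3, and the model is a k-nearest-neighbor regressor, chosen because a single knob `k` controls its flexibility. With `k = 1` the model memorizes the training set; with `k` equal to the training set size it predicts the global mean everywhere.

```python
import math
import random


def signal(x):
    """Assumed underlying regularity (the 'signal')."""
    return math.sin(2 * math.pi * x)


def make_data(n, noise_std, rng):
    """Draw n points following data = signal + noise."""
    xs = [rng.random() for _ in range(n)]
    ys = [signal(x) + rng.gauss(0.0, noise_std) for x in xs]
    return xs, ys


def knn_predict(x, train_xs, train_ys, k):
    """Average the targets of the k training points closest to x."""
    nearest = sorted(zip(train_xs, train_ys), key=lambda p: abs(p[0] - x))[:k]
    return sum(y for _, y in nearest) / k


def mse(xs, ys, train_xs, train_ys, k):
    """Mean squared error of the k-NN predictor on (xs, ys)."""
    errors = [(knn_predict(x, train_xs, train_ys, k) - y) ** 2
              for x, y in zip(xs, ys)]
    return sum(errors) / len(errors)


rng = random.Random(0)
train_xs, train_ys = make_data(50, 0.3, rng)
test_xs, test_ys = make_data(200, 0.3, rng)

results = {}
for k in (1, 5, 50):
    train_err = mse(train_xs, train_ys, train_xs, train_ys, k)
    test_err = mse(test_xs, test_ys, train_xs, train_ys, k)
    results[k] = (train_err, test_err)
    print(f"k={k:2d}  train MSE={train_err:.3f}  test MSE={test_err:.3f}")
```

With `k = 1` the training error is zero because each training point is its own nearest neighbor, yet the test error is not, which is the overfitting pattern. With `k = 50` the error stays high even on the training data, which is the underfitting pattern. An intermediate value such as `k = 5` would typically sit between the two extremes.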

High bias corresponds to underfitting.

High variance corresponds to overfitting.

\(\text{learning error} = \text{bias}^2 + \text{variance}\).
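The decomposition can be checked numerically. The sketch below assumes that the learning error is measured against the noise-free signal at a single query point `x0`, and that the expectation is taken over repeatedly drawn training sets. The signal \(\sin(2\pi x)\), the noise level, and the k-nearest-neighbor model are illustrative assumptions, as is the helper name `bias_variance_at`.

```python
import math
import random


def signal(x):
    """Assumed underlying regularity (the 'signal')."""
    return math.sin(2 * math.pi * x)


def knn_predict(x, xs, ys, k):
    """Average the targets of the k training points closest to x."""
    nearest = sorted(zip(xs, ys), key=lambda p: abs(p[0] - x))[:k]
    return sum(y for _, y in nearest) / k


def bias_variance_at(x0, k, n_train=30, noise_std=0.3, n_trials=300, seed=0):
    """Estimate bias^2, variance, and error at x0 over many training sets."""
    rng = random.Random(seed)
    preds = []
    for _ in range(n_trials):
        # Each trial draws a fresh training set: data = signal + noise.
        xs = [rng.random() for _ in range(n_train)]
        ys = [signal(x) + rng.gauss(0.0, noise_std) for x in xs]
        preds.append(knn_predict(x0, xs, ys, k))
    mean_pred = sum(preds) / n_trials
    bias_sq = (mean_pred - signal(x0)) ** 2
    variance = sum((p - mean_pred) ** 2 for p in preds) / n_trials
    error = sum((p - signal(x0)) ** 2 for p in preds) / n_trials
    return bias_sq, variance, error


# k = 1 is the most flexible model; k = 30 (all points) is the most rigid.
estimates = {k: bias_variance_at(0.25, k) for k in (1, 30)}
for k, (bias_sq, variance, error) in estimates.items():
    print(f"k={k:2d}  bias^2={bias_sq:.4f}  variance={variance:.4f}  "
          f"error={error:.4f}  bias^2+variance={bias_sq + variance:.4f}")
```

In this setup the rigid model (`k = 30`) is dominated by bias, while the flexible model (`k = 1`) is dominated by variance, and in both cases the measured error matches the sum of the two terms up to floating-point rounding.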