Training vs. Testing
- Training set:
  - Used for inducing the function.
- Testing set:
  - Used for evaluating the quality of the induced function.
  - Does the function "generalize" to previously unseen inputs?
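
As a rough illustration of the split described above, the sketch below holds out part of a toy dataset for testing and measures accuracy only on that held-out part. Everything in it is an assumption made for illustration: the helper names `split`, `induce`, and `accuracy`, the 25% test fraction, the toy data, and the stand-in learner (which simply predicts the most common training label) do not come from the slides.

```python
import random
from collections import Counter

def split(examples, test_fraction=0.25, seed=0):
    """Shuffle a copy of the examples and split it into (training, testing) lists."""
    rng = random.Random(seed)
    shuffled = list(examples)
    rng.shuffle(shuffled)
    n_test = max(1, int(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]

def induce(training_set):
    """Hypothetical stand-in learner: always predict the most common training label."""
    majority = Counter(label for _, label in training_set).most_common(1)[0][0]
    return lambda x: majority

def accuracy(f, examples):
    """Fraction of examples on which the induced function f matches the true label."""
    return sum(f(x) == y for x, y in examples) / len(examples)

# Toy data (assumed): inputs 0..19, labeled True when the input is at least 5.
examples = [(x, x >= 5) for x in range(20)]
training_set, testing_set = split(examples)

f = induce(training_set)                    # induced from the training set only
test_accuracy = accuracy(f, testing_set)    # evaluated on unseen inputs only
```

The point of the design is that `induce` never sees `testing_set`, so `test_accuracy` estimates how well the function generalizes rather than how well it remembers.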
Inductive Bias
- What assumption is made in order to generalize?
- Example algorithm:
  - Training:
    - Create a hash table with one entry (a linked list) per label.
    - For each training example:
      - Add it to the list corresponding to its label.
  - Deployment:
    - If the presented data is contained in one of the lists, return that list's label.
    - Otherwise...
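
The example algorithm above can be sketched in a few lines. This is a minimal sketch, not a definitive implementation: a Python `dict` stands in for the hash table, a Python `list` stands in for each linked list, and the names `train` and `predict` are hypothetical. The slides leave the "Otherwise" case open, so the sketch returns `None` there purely as a placeholder, not as the intended answer.

```python
from collections import defaultdict

def train(examples):
    """Build a table with one list per label, holding the inputs seen with that label."""
    table = defaultdict(list)
    for x, label in examples:
        table[label].append(x)
    return table

def predict(table, x):
    """Return the label whose list contains x, or None if x was never seen."""
    for label, members in table.items():
        if x in members:
            return label
    return None  # placeholder (assumption): the source leaves this case open

# Toy training set (assumed): pairs of (input, label).
training_set = [((1, 0), "a"), ((0, 1), "b"), ((1, 1), "a")]
table = train(training_set)

seen = predict(table, (1, 0))    # this input appeared in training
unseen = predict(table, (0, 0))  # this input did not appear in training
```

Here `seen` is `"a"` and `unseen` is `None`: the learner is perfect on its training examples but has nothing to say about new inputs, because it makes no assumption that would let it generalize.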