|
- Linger uses a hybrid approach to classification.
- A genetic algorithm selects the features (words) that best
distinguish between mail types.
- Genetic algorithms can be used to efficiently search through
a large space of possible parameters.
- In Linger's case, this allows a near-optimal feature selection to be
found quickly, based on the variance of each word across all mailboxes.
- A neural network then classifies unknown mail based on which
features it contains and which words are important to your
mail folders.
- Neural networks adapt well to their input values
and can generalise to unseen data.
- This ability to generalise is very important in email classification,
where the subject of a text can vary wildly.
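The feature-selection step described above can be illustrated with a small sketch. This is not Linger's actual code: the mailbox data, the names (`MAILBOX_WORD_FREQ`, `fitness`, `select_features`, `feature_vector`), the per-word penalty and all GA settings are assumptions chosen only to show the idea. A bit mask marks which words are kept, the fitness rewards words whose frequency varies strongly across mailboxes, and the selected words then become the inputs that a neural network would see (the network itself is not shown).

```python
import random
from statistics import pvariance

# Hypothetical toy data: relative frequency of each word per mailbox.
MAILBOX_WORD_FREQ = {
    "work":    {"meeting": 0.9, "invoice": 0.7, "the": 0.5, "party": 0.0, "hello": 0.3},
    "friends": {"meeting": 0.1, "invoice": 0.0, "the": 0.5, "party": 0.8, "hello": 0.3},
    "lists":   {"meeting": 0.0, "invoice": 0.1, "the": 0.5, "party": 0.1, "hello": 0.4},
}
WORDS = sorted(next(iter(MAILBOX_WORD_FREQ.values())))

# Variance of each word's frequency across all mailboxes.
VARIANCE = {
    w: pvariance([freqs[w] for freqs in MAILBOX_WORD_FREQ.values()])
    for w in WORDS
}

def fitness(mask, penalty=0.02):
    """Sum of variances of the selected words, minus an assumed cost per word."""
    return sum(VARIANCE[w] - penalty for w, bit in zip(WORDS, mask) if bit)

def select_features(pop_size=20, generations=30, mutation_rate=0.1, seed=0):
    """Evolve bit masks over WORDS and return the words kept by the best mask."""
    rng = random.Random(seed)
    n = len(WORDS)
    population = [[rng.randint(0, 1) for _ in range(n)] for _ in range(pop_size)]
    for _ in range(generations):
        # Keep the fitter half unchanged (elitism) and breed the rest from it.
        population.sort(key=fitness, reverse=True)
        parents = population[: pop_size // 2]
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n)          # single-point crossover
            child = a[:cut] + b[cut:]
            # Flip each bit with probability mutation_rate.
            child = [bit ^ (rng.random() < mutation_rate) for bit in child]
            children.append(child)
        population = parents + children
    best = max(population, key=fitness)
    return [w for w, bit in zip(WORDS, best) if bit]

def feature_vector(message, selected):
    """Binary input vector: 1 if the selected word occurs in the message."""
    tokens = set(message.lower().split())
    return [1 if w in tokens else 0 for w in selected]

if __name__ == "__main__":
    selected = select_features()
    print(selected)
    print(feature_vector("Invoice for the meeting", selected))
```

In this toy data a word such as "the" appears equally often in every mailbox, so its variance is zero and the penalty makes it unattractive to keep, while words like "meeting" or "party" score highly because they separate the folders.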
http://neko.freeshell.org/linger
|