Described are techniques used automatic generation of classification rules used in machine learning. A single rule is formed of one or more logical expressions and an associated target. Using a set of training data, rules are formed one logical expression at a time using special data structures that require each feature to be sorted only once per rule formation. The FOIL gain metric is used in determining optimal splits for categorical features. Rule formation ceases with the production of five bad rules in which a bad rule is one in which there are more negative than positive examples in the training data set.

 
Web www.patentalert.com

> Method, system and program product for automated testing of changes to externalized rules

~ 00395