Improving the Performance Stability of Inductive Expert Systems Under Input Noise.

Author List: Mookerjee, Vijay S.; Gilson, Robert; Mannino, Michael V.;

Information Systems Research, 1995, Volume 6, Issue 4, Page 328-356.

Inductive expert systems typically operate with imperfect or noisy input attributes. We study design differences in inductive expert systems arising from implicit versus explicit handling of input noise. Most previous approaches use an implicit approach wherein inductive expert systems are constructed using input data of quality comparable to problems the system will be called upon to solve. We develop an explicit algorithm (ID3<sub>ecp</sub>) that uses a clean (without input errors) training set and an explicit measure of the input noise level and compare it to a traditional implicit algorithm, ID3<sub>p</sub> (the ID3 algorithm with the pessimistic pruning procedure). The novel feature of the explicit algorithm is that it injects noise in a controlled rather than random manner in order to reduce the performance variance due to noise. We show analytically that the implicit algorithm has the same expected partitioning behavior as the explicit algorithm. In contrast, however, the partitioning behavior of the explicit algorithm is shown to be more stable (i.e., lower variance) than the implicit algorithm. To extend the analysis to the predictive performance of the algorithms, a set of simulation experiments is described in which the average performance and coefficient of variation of performance of both algorithms are studied on real and artificial data sets. The experimental results confirm the analytical results and demonstrate substantial differences in stability of performance between the algorithms especially as the noise level increases.

Keywords: inductive expert systems;input data noise;performance stability;variance reduction;controlled scrambling

Algorithm:

List of Topics

#97	0.349	set approach algorithm optimal used develop results use simulation experiments algorithms demonstrate proposed optimization present analytical distribution selection number existing
#147	0.180	process problem method technique experts using formation identification implicit analysis common proactive input improvements identify traditional stages identifying explicit setting
#93	0.148	performance results study impact research influence effects data higher efficiency effect significantly findings impacts empirical significant suggest outcomes better positive
#129	0.136	expert systems knowledge knowledge-based human intelligent experts paper problem acquisition base used expertise intelligence domain inductive rules machine artificial task