Author List: Balachandran, Krishnamohan; Buzydlowski, Jan; Dworman, Garett; Kimbrough, Steven O.; Shafer, Tate; Vachula, William J.;
Journal of Management Information Systems, 1999, Volume 16, Issue 1, Page 17-36.
This paper reports on conceptual development in the areas of database mining and knowledge discovery in databases (KDD). The authors' efforts have also led to a prototype implementation, called MOTC, for exploring hypothesis space in large and complex data sets. Their KDD conceptual development rests on two main principles. First, they use the crosstab representation for working with qualitative data. This is by now standard in on-line analytical processing (OLAP) applications, and the authors reaffirm it with additional reasons. Second, and innovatively, they use prediction analysis as a measure of goodness for hypotheses. Prediction analysis is an established statistical technique for analysis of associations among qualitative variables. It generalizes and subsumes a large number of other such measures of association, depending on specific assumptions the user is willing to make. As such, it provides a very useful framework for exploring hypothesis space in a KDD context. The paper illustrates these points with an extensive discussion of MOTC.
Keywords: data mining; data visualization; hypotheses exploration; knowledge discovery in databases; OLAP; prediction analysis
Algorithm:

List of Topics

#37 0.239 intelligence business discovery framework text knowledge new existing visualization based analyzing mining genetic algorithms related techniques large proposed novel artificial
#82 0.160 case study studies paper use research analysis interpretive identify qualitative approach understanding critical development managerial elements exploring points positivist presents
#215 0.149 data classification statistical regression mining models neural methods using analysis techniques performance predictive networks accuracy method variables prediction problem measure
#281 0.114 database language query databases natural data queries relational processing paper using request views access use matching automated semantic based languages
#263 0.082 instrument measurement factor analysis measuring measures dimensions validity based instruments construct measure conceptualization sample reliability development develop responses assess use
#77 0.079 information systems paper use design case important used context provide presented authors concepts order number various underlying implementation framework nature