Author List: Saar-Tsechansky, Maytal; Provost, Foster;
Information Systems Research, 2007, Volume 18, Issue 1, Page 4/22/2017.
It can be expensive to acquire the data required for businesses to employ data-driven predictive modeling--for example, to model consumer preferences to optimize targeting. Prior research has introduced "active-learning" policies for identifying data that are particularly useful for model induction, with the goal of decreasing the statistical error for a given acquisition cost (error-centric approaches). However, predictive models are used as part of a decision-making process, and costly improvements in model accuracy do not always result in better decisions. This paper introduces a new approach for active data acquisition that specifically targets decision making. The new decision-centric approach departs from traditional active learning by placing emphasis on acquisitions that are more likely to affect decision making. We describe two different types of decision-centric techniques. Next, using direct-marketing data, we compare various data-acquisition techniques. We demonstrate that strategies for reducing statistical error can be wasteful in a decision-making context, and show that one decision-centric technique in particular can improve targeting decisions significantly. We also show that this method is robust in the face of decreasing quality of utility estimations, eventually converging to uniform random sampling, and that it can be extended to situations where different data acquisitions have different costs. The results suggest that businesses should consider modifying their strategies for acquiring information through normal business transactions. For example, a firm such as Amazon.com that models consumer preferences for customized marketing may accelerate learning by proactively offering recommendations--not merely to induce immediate sales, but for improving recommendations in the future.
Keywords: active learning; classifier induction; decision making; decision-support systems; predictive modeling
Algorithm:

List of Topics

#215 0.207 data classification statistical regression mining models neural methods using analysis techniques performance predictive networks accuracy method variables prediction problem measure
#8 0.148 decision making decisions decision-making makers use quality improve performance managers process better results time managerial task significantly help indicate maker
#44 0.106 approach analysis application approaches new used paper methodology simulation traditional techniques systems process based using proposed method present provides various
#5 0.081 consumer consumers model optimal welfare price market pricing equilibrium surplus different higher results strategy quality cost lower competition firm paper
#10 0.069 strategies strategy based effort paper different findings approach suggest useful choice specific attributes explain effective affect employ particular online control
#130 0.065 online users active paper using increasingly informational user data internet overall little various understanding empirical despite lead cascades help availability
#151 0.052 costs cost switching reduce transaction increase benefits time economic production transactions savings reduction impact services reduced affect expected optimal associated