Abstract
Increasing the efficiency of rule discovery is currently a major focus of research interest in data mining. Strategies available to the data miner include data sampling, knowledge-guided discovery, attribute reduction, parallelisation of the discovery process, and focusing on the discovery of a restricted class of rules, or those which appear most promising according to some measure of rule interest. This paper presents a new approach which combines the strategies of focusing on rules which appear most interesting, exploiting structural features of the data set when possible, and decomposition of the discovery process into sub-tasks which can be executed independently on parallel processsors.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Cendrowska, J.: PRISM: an algorithm for inducing modular rules. International Journal of Man-Machine Studies 27 (1987) 349–370
Frawley, W.J., Piatetsky-Shapiro, G., Matheus, C.J.: Knowledge Discovery in Databases: an Overview. In Piatetsky-Shapiro, G., Frawley, W.J. (eds.) Knowledge Discovery in Databases (AAAI Press, Menlo Park, CA, 1991) 1–27
McSherry, D.: An algorithm for the discovery of characteristic rules. Digest No. 96/198 (Institution of Electrical Engineers, London, 1996) 4/1–3
McSherry, D.: Qualitative assessment of rule interest in data mining. Proceedings of the Sixteenth Annual Technical Conference of the BCS Specialist Group on Expert Systems, Cambridge, December 1996, 204–215
Murphy, P.M., Aha, D.W.: UCI Repository of Machine Learning Databases. http://www.ics.uci.edu/~mlearn/MLRepository.html (1995)
Nelson, C.: Improving customer retention with knowledge guided data mining. BCS Specialist Group on Expert Systems Newsletter, No. 33 (1995) 15–20
Piatetsky-Shapiro, G.: Discovery, analysis and presentation of strong rules. In Piatetsky-Shapiro, G., Frawley, W.J. (eds.) Knowledge Discovery in Databases (AAAI Press, Menlo Park, CA, 1991) 229–248
Quinlan, J.R.: Induction of decision trees. Machine Learning 1 (1986) 81–106
Shortland, R.J., Scarfe, R.T.: Data mining applications in BT. BT Technology Journal 12 (1994) 17–22
Simoudis, E., John, G., Kerber, R., Livezey, B., Miller, P.: Developing customer vulnerability models using data mining techniques. Proceedings of IDA-95, Baden-Baden, August 1995, 181–185
Smyth, P., Goodman, R.M.: Rule induction using information theory. In Piatetsky-Shapiro, G., Frawley, W.J. (eds.) Knowledge Discovery in Databases (AAAI Press, Menlo Park, CA, 1991) 159–176
Thompson, S., Bramer, M.A.: Parallel knowledge discovery: a review of existing techniques. Digest No. 96/198 (Institution of Electrical Engineers, London, 1996) 5/1–5
Ziarko, W.: Discovery, analysis, and representation of data dependencies in databases. In Piatetsky-Shapiro, G., Prawley, W.J. (eds.) Knowledge Discovery in Databases (AAAI Press, Menlo Park, CA, 1991) 195–209
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag
About this paper
Cite this paper
McSherry, D. (1997). A strategy for increasing the efficiency of rule discovery in data mining. In: Liu, X., Cohen, P., Berthold, M. (eds) Advances in Intelligent Data Analysis Reasoning about Data. IDA 1997. Lecture Notes in Computer Science, vol 1280. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0052857
Download citation
DOI: https://doi.org/10.1007/BFb0052857
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63346-4
Online ISBN: 978-3-540-69520-2
eBook Packages: Springer Book Archive