A strategy for increasing the efficiency of rule discovery in data mining

David McSherry¹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1280))

Included in the following conference series:

International Symposium on Intelligent Data Analysis

716 Accesses

Abstract

Increasing the efficiency of rule discovery is currently a major focus of research interest in data mining. Strategies available to the data miner include data sampling, knowledge-guided discovery, attribute reduction, parallelisation of the discovery process, and focusing on the discovery of a restricted class of rules, or those which appear most promising according to some measure of rule interest. This paper presents a new approach which combines the strategies of focusing on rules which appear most interesting, exploiting structural features of the data set when possible, and decomposition of the discovery process into sub-tasks which can be executed independently on parallel processsors.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Cendrowska, J.: PRISM: an algorithm for inducing modular rules. International Journal of Man-Machine Studies 27 (1987) 349–370
Article MATH Google Scholar
Frawley, W.J., Piatetsky-Shapiro, G., Matheus, C.J.: Knowledge Discovery in Databases: an Overview. In Piatetsky-Shapiro, G., Frawley, W.J. (eds.) Knowledge Discovery in Databases (AAAI Press, Menlo Park, CA, 1991) 1–27
Google Scholar
McSherry, D.: An algorithm for the discovery of characteristic rules. Digest No. 96/198 (Institution of Electrical Engineers, London, 1996) 4/1–3
Google Scholar
McSherry, D.: Qualitative assessment of rule interest in data mining. Proceedings of the Sixteenth Annual Technical Conference of the BCS Specialist Group on Expert Systems, Cambridge, December 1996, 204–215
Google Scholar
Murphy, P.M., Aha, D.W.: UCI Repository of Machine Learning Databases. http://www.ics.uci.edu/~mlearn/MLRepository.html (1995)
Google Scholar
Nelson, C.: Improving customer retention with knowledge guided data mining. BCS Specialist Group on Expert Systems Newsletter, No. 33 (1995) 15–20
Google Scholar
Piatetsky-Shapiro, G.: Discovery, analysis and presentation of strong rules. In Piatetsky-Shapiro, G., Frawley, W.J. (eds.) Knowledge Discovery in Databases (AAAI Press, Menlo Park, CA, 1991) 229–248
Google Scholar
Quinlan, J.R.: Induction of decision trees. Machine Learning 1 (1986) 81–106
Google Scholar
Shortland, R.J., Scarfe, R.T.: Data mining applications in BT. BT Technology Journal 12 (1994) 17–22
Google Scholar
Simoudis, E., John, G., Kerber, R., Livezey, B., Miller, P.: Developing customer vulnerability models using data mining techniques. Proceedings of IDA-95, Baden-Baden, August 1995, 181–185
Google Scholar
Smyth, P., Goodman, R.M.: Rule induction using information theory. In Piatetsky-Shapiro, G., Frawley, W.J. (eds.) Knowledge Discovery in Databases (AAAI Press, Menlo Park, CA, 1991) 159–176
Google Scholar
Thompson, S., Bramer, M.A.: Parallel knowledge discovery: a review of existing techniques. Digest No. 96/198 (Institution of Electrical Engineers, London, 1996) 5/1–5
Google Scholar
Ziarko, W.: Discovery, analysis, and representation of data dependencies in databases. In Piatetsky-Shapiro, G., Prawley, W.J. (eds.) Knowledge Discovery in Databases (AAAI Press, Menlo Park, CA, 1991) 195–209
Google Scholar

Download references

Author information

Authors and Affiliations

School of Information and Software Engineering, University of Ulster, Coleraine BT52 1SA, Northern Ireland
David McSherry

Authors

David McSherry
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Xiaohui Liu Paul Cohen Michael Berthold

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

McSherry, D. (1997). A strategy for increasing the efficiency of rule discovery in data mining. In: Liu, X., Cohen, P., Berthold, M. (eds) Advances in Intelligent Data Analysis Reasoning about Data. IDA 1997. Lecture Notes in Computer Science, vol 1280. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0052857

Download citation

DOI: https://doi.org/10.1007/BFb0052857
Published: 19 May 2006
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63346-4
Online ISBN: 978-3-540-69520-2
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics