Skip to main content

Fairness in Predicting Recidivism Score

  • Conference paper
  • First Online:
Smart Data Intelligence (ICSMDI 2024)

Abstract

The growing use of machine learning models in many domains has made human activities much simpler. One of the key domains in which machine learning models are applied is criminology and criminal justice. The prediction of recidivism is the main topic of this paper. Recidivism is defined as an individual reoffending a crime after being released on bail, sentence, or parole. Predicting the likelihood of an individual reoffending after involved in criminal justice is called recidivism score. Several algorithms exhibit bias related to age, gender, and ethnicity, among other factors that affect fairness in score prediction. To address this, bias migration and dimensionality reduction are used during the pre-processing stage of a dataset. The Correctional Offender Management Profiling for Alternative Sanctions (COMPAS) dataset (https://www.kaggle.com/datasets/danofer/compass/data [1]; Larson et al. in How we analyzed the COMPAS recidivism algorithm. ProPublica, p 9, 2016 [2]), collected by ProPublica and obtained via Kaggle, is used. Several models are used to train the dataset and compare how this model find the hidden patterns and reduce bias in predicting, and they are support vector machine (SVM), decision tree, and ensemble learning. The accuracy of the models is SVM 67.85%, decision tree 66.55%, and ensemble learning 68.01%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
eBook
USD 259.00
Price excludes VAT (USA)
Hardcover Book
USD 249.99
Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. https://www.kaggle.com/datasets/danofer/compass/data

  2. Larson, J., Mattu, S., Kirchner, L., Angwin, J.: How we analyzed the COMPAS recidivism algorithm, p. 9. ProPublica (2016, May)

    Google Scholar 

  3. Travaini, G.V., Pacchioni, F., Bellumore, S., Bosia, M., De Micco, F.: Machine learning and criminal justice: a systematic review of advanced methodology for recidivism risk prediction. Int. J. Environ. Res. Public Health 19(17), 10594 (2022)

    Article  Google Scholar 

  4. McKay, C.: Predicting risk in criminal procedure: actuarial tools, algorithms, AI and judicial decision-making. Curr. Issues Crim. Just. 32(1), 22–39 (2020)

    Article  Google Scholar 

  5. Biddle, J.B.: On predicting recidivism: epistemic risk, tradeoffs, and values in machine learning. Can. J. Philos. 52(3), 321–341 (2022)

    Article  Google Scholar 

  6. Tollenaar, N., van der Heijden, P.G.: Which method predicts recidivism best? A comparison of statistical, machine learning and data mining predictive models. J. R. Stat. Soc. Ser. A Stat. Soc. 176(2), 565–584 (2013)

    Article  MathSciNet  Google Scholar 

  7. Skeem, J.L., Lowenkamp, C.T.: Risk, race, and recidivism: predictive bias and disparate impact. Criminology 54(4), 680–712 (2016)

    Article  Google Scholar 

  8. Karimi-Haghighi, M., Castillo, C.: Enhancing a recidivism prediction tool with machine learning: effectiveness and algorithmic fairness. In: Proceedings of the Eighteenth International Conference on Artificial Intelligence and Law, pp. 210–214 (2021, June)

    Google Scholar 

  9. Ozkan, T.: Predicting recidivism through machine learning. Doctoral dissertation (2017)

    Google Scholar 

  10. Wang, C., Han, B., Patel, B., Rudin, C.: In pursuit of interpretable, fair and accurate machine learning for criminal recidivism prediction. J. Quant. Criminol. 39(2), 519–581 (2023)

    Article  Google Scholar 

  11. Ingram, E., Gursoy, F., Kakadiaris, I.A.: Accuracy, fairness, and interpretability of machine learning criminal recidivism models. In: 2022 IEEE/ACM International Conference on Big Data Computing, Applications and Technologies (BDCAT), pp. 233–241. IEEE (2022, Dec)

    Google Scholar 

  12. Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995). https://doi.org/10.1007/bf00994018

    Article  Google Scholar 

  13. Quinlan, J.R.: Induction of decision trees. Mach. Learn. 1(1), 81–106 (1986). https://doi.org/10.1007/bf00116251

    Article  Google Scholar 

  14. Opitz, D., Maclin, R.: Popular ensemble methods: an empirical study. J. Artif. Intell. Res. 11, 169–198 (1999)

    Article  Google Scholar 

  15. https://en.wikipedia.org/wiki/COMPAS_(software)

  16. https://www.kaggle.com/code/ajaswanthkiran/predicting-recidivism-score?kernelSessionId=158592555a

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jaswanth Kiran Athota .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Athota, J.K., Parimi, K.K., Teja, M.K., Bhavani, M.A., Yamuna Devi, M.M. (2024). Fairness in Predicting Recidivism Score. In: Asokan, R., Ruiz, D.P., Piramuthu, S. (eds) Smart Data Intelligence. ICSMDI 2024. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-97-3191-6_18

Download citation

Publish with us

Policies and ethics