Document Zbl 07542347

Bayesian networks for the test score prediction: a case study on a math graduation exam. (English) Zbl 07542347

Vejnarová, Jiřina (ed.) et al., Symbolic and quantitative approaches to reasoning with uncertainty. 16th European conference, ECSQARU 2021, Prague, Czech Republic, September 21–24, 2021. Proceedings. Cham: Springer. Lect. Notes Comput. Sci. 12897, 255-267 (2021).

Summary: In this paper we study the problem of student knowledge level estimation. We use probabilistic models learned from collected data to model the tested students. We propose and compare experimentally several different Bayesian network models for the score prediction of student’s knowledge. The proposed scoring algorithm provides not only the expected value of the total score but the whole probability distribution of the total score. This means that confidence intervals of predicted total score can be provided along the expected value. The key that enabled efficient computations with the studied models is a newly proposed inference algorithm based on the CP tensor decomposition, which is used for the computation of the score distribution. The proposed algorithm is two orders of magnitude faster than a state of the art method. We report results of experimental comparisons on a large dataset from the Czech National Graduation Exam in Mathematics. In this evaluation the best performing model is an IRT model with one continuous normally distributed skill variable related to all items by the graded response models. The second best is a multidimensional IRT model with an expert structure of items-skills relations and a covariance matrix for the skills. This model has a higher improvement with larger training sets and seems to be the model of choice if a sufficiently large training dataset is available.
For the entire collection see [Zbl 1487.68022].

MSC:

68T37

Reasoning under uncertainty in the context of artificial intelligence

Keywords:

Bayesian networks; educational testing; score prediction; efficient probabilistic inference; multidimensional IRT; CP tensor decomposition

Software:

rhalton; mirt; Algorithm 247

Cite Review PDF

Full Text: DOI

References:

[1]	Almond, RG; Mislevy, RJ; Steinberg, LS; Yan, D.; Williamson, DM, The future of Bayesian networks in educational assessment, Bayesian Networks in Educational Assessment, 583-599 (2015), New York: Springer, New York · doi:10.1007/978-1-4939-2125-6_16
[2]	Almond, RG; Mislevy, RJ, Graphical models and computerized adaptive testing, Appl. Psychol. Meas., 23, 3, 223-237 (1999) · doi:10.1177/01466219922031347
[3]	Carroll, JD; Chang, JJ, Analysis of individual differences in multidimensional scaling via an n-way generalization of Eckart-Young decomposition, Psychometrika, 35, 283-319 (1970) · Zbl 0202.19101 · doi:10.1007/BF02310791
[4]	Chalmers, R.P.: mirt: a multidimensional item response theory package for the R environment. J. Statist. Softw. 48(6), 1-29 (2012). doi:10.18637/jss.v048.i06
[5]	van der Gaag, L.C., Bodlaender, H.L., Feelders, A.J.: Monotonicity in Bayesian networks. In: Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence, pp. 569-576. AUAI Press (2004)
[6]	Halton, J.H.: Algorithm 247: radical-inverse quasi-random point sequence. Commun. ACM 7(12), 701-702 (1964). doi:10.1145/355588.365104
[7]	Harshman, R.A.: Foundations of the PARAFAC procedure: models and conditions for an “explanatory” multi-mode factor analysis. UCLA Working Pap. Phonetics 16, 1-84 (1970)
[8]	Hartig, J., Höhler, J.: Multidimensional IRT models for the assessment of competencies. Stud. Educ. Eval. 35(2), 57-63 (2009). doi:10.1016/j.stueduc.2009.10.002
[9]	Jensen, FV, Bayesian Networks and Decision Graphs (2001), New York: Springer, New York · Zbl 0973.62005 · doi:10.1007/978-1-4757-3502-4
[10]	Lauritzen, SL; Spiegelhalter, DJ, Local computations with probabilities on graphical structures and their application to expert systems (with discussion), J. Roy. Stat. Soc. B, 50, 157-224 (1988) · Zbl 0684.68106
[11]	Jeevan, M.; Dhingra, A.; Hanmandlu, M.; Panigrahi, BK; Lobiyal, DK; Mohapatra, DP; Nagar, A.; Sahoo, MN, Robust speaker verification using GFCC based i-vectors, Proceedings of the International Conference on Signal, Networks, Computing, and Systems, 85-91 (2017), New Delhi: Springer, New Delhi · doi:10.1007/978-81-322-3592-7_9
[12]	Masegosa, AR; Feelders, AJ; van der Gaag, LC, Learning from incomplete data in Bayesian networks with qualitative influences, Int. J. Approximate Reasoning, 69, 18-34 (2016) · Zbl 1344.68190 · doi:10.1016/j.ijar.2015.11.004
[13]	Olesen, K.G., et al.: A Munin network for the median nerve - a case study on loops. Appl. Artif. Intell. 3(2-3), 385-403 (1989). doi:10.1080/08839518908949933
[14]	Pearl, J., Probabilistic reasoning in intelligent systems: networks of plausible inference (1988), San Francisco: Morgan Kaufmann Publishers Inc., San Francisco · Zbl 0746.68089
[15]	Plajner, M., Vomlel, J.: Learning bipartite Bayesian networks under monotonicity restrictions. Int. J. Gen. Syst. 49(1), 88-111 (2020). doi:10.1080/03081079.2019.1692004
[16]	Plajner, M., Vomlel, J.: Monotonicity in practice of adaptive testing (2020). https://arxiv.org/abs/2009.06981 · Zbl 1491.68169
[17]	Samejima, F.: Estimation of latent ability using a response pattern of graded scores. Psychometrika 34, 1-97 (1969). doi:10.1007/BF03372160
[18]	Savicky, P.; Vomlel, J., Exploiting tensor rank-one decomposition in probabilistic inference, Kybernetika, 43, 5, 747-764 (2007) · Zbl 1148.68539
[19]	Vomlel, J., Bayesian networks in educational testing, Int. J. Uncertainty Fuzziness Knowl.-Based Syst., 12, supp01, 83-100 (2004) · Zbl 1101.68847 · doi:10.1142/S021848850400259X
[20]	Wainer, H., Dorans, N.J.: Computerized Adaptive Testing: A Primer. Routledge (1990)

This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.