{"id":117,"date":"2021-11-29T17:12:04","date_gmt":"2021-11-29T17:12:04","guid":{"rendered":"http:\/\/192.168.64.4\/in-en\/?post_type=ai&#038;p=117"},"modified":"2023-06-12T09:56:28","modified_gmt":"2023-06-12T09:56:28","slug":"predicting-student-scores-in-standardized-tests-with-1pl-item-response-theory","status":"publish","type":"ai","link":"https:\/\/www.embibe.com\/in-en\/artificial-intelligence-ai-in-education\/predicting-student-scores-in-standardized-tests-with-1pl-item-response-theory\/","title":{"rendered":"Predicting Student Scores in Standardized Tests with 1pl Item Response Theory"},"content":{"rendered":"<p>At Embibe, we help students improve their scores in standardized examinations by incorporating insights and models from learning theory and education research through item response theory models.<\/p>\n<p>One widely used model named Item Response Theory[1, 2] predicts a student\u2019s likelihood of answering a question correctly by estimating the student\u2019s skill or ability level and the difficulty level of the question being attempted. It was first proposed in the 1960s, and many variants of this exist today, such as the 1PL model[2, 3] and the 2PL model[2].<\/p>\n<h2><b>1PL Model of Item Response Theory<\/b><\/h2>\n<p>The 1PL or 1 parameter Item Response Theory model, also known as the Rasch model[3] is described as follows.<\/p>\n<p><span style=\"font-weight: 400;\">Let i be a learner or student; j be a question. Let \u03b8i be the learner&#8217;s ability and \u03b2j the question&#8217;s difficulty level. Then as per the 1PL model, the probability Pij of the ith user correctly answering the jth question is given as <\/span><span style=\"font-weight: 400;\">Pij = 1 \/ (1 + exp(\u03b8i &#8211; \u03b2j)) The term \u03b2j &#8211; \u03b8i represents the difference between the difficulty of the question (\u03b2j) and the ability of the learner (\u03b8i). The probability of a correct response is modelled using the logistic function, which maps the difference to a probability value between 0 and 1.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The logit function, which is the inverse of the logistic function, is used to transform the probability back to a log-odds scale. The logit function is defined as follows:<\/span><\/p>\n<ul>\n<li><span style=\"font-weight: 400;\">logit(x) = log(x \/ (1 &#8211; x))<\/span><\/li>\n<\/ul>\n<p><span style=\"font-weight: 400;\">Using the 1PL Item Response Theory parameter estimation techniques, we can predict the ability level \u03b8i of a learner, given the data about the learner\u2019s response to each attempted question.<\/span><\/p>\n<h3><b>A Deep Learning Architecture for 1PL Item Response Theory<\/b><\/h3>\n<p><span style=\"font-weight: 400;\">The 1PL item response theory model is indeed logistic regression with a domain-specific parametrization. Consequently, we can realize such a model using any deep learning framework. The deep learning architecture for the 1PL model is shown in Figure 1.<\/span><\/p>\n<p><img decoding=\"async\" class=\"aligncenter size-full wp-image-481777\" src=\"https:\/\/exams-assets.embibe.com\/exams\/wp-content\/uploads\/2021\/11\/03203420\/ai51.png\" alt=\"\" width=\"419\" height=\"238\" \/><\/p>\n<p><em><strong>Figure 1: Neural Network Architecture for Estimating the 1PL IRT Model Parameters<\/strong><\/em><\/p>\n<p><span style=\"font-weight: 400;\">Our model is implemented in Keras<\/span><span style=\"font-weight: 400;\">[4]<\/span><span style=\"font-weight: 400;\"> as a deep neural network. The advantages of modelling the problem as a neural network are:<\/span><\/p>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Ability to handle missing values in the input \u2014 not every user needs to attempt every question,<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Ability to scale to a large number of users and items,<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Ability to extend the framework to 2PL, 3PL and other Item Response Theory models with more parameters.<\/span><\/li>\n<\/ul>\n<p><span style=\"font-weight: 400;\">We refer to this model as the 1PL Deep Item Response Theory model.<\/span><\/p>\n<h3><b>Validation<\/b><\/h3>\n<p><span style=\"font-weight: 400;\">In order to benchmark and validate the modelling strategy, we generate simulated data as follows:<\/span><\/p>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>i N(0,1):<\/b><span style=\"font-weight: 400;\"> learner ability is generated using a normal distribution with mean 0 and standard deviation 1,<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>j U(-1,1):<\/b><span style=\"font-weight: 400;\"> question difficulty values are uniformly generated between -1 and 1,<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>Pij= i \u2013 j: <\/b><span style=\"font-weight: 400;\">the probability of correct responses is calculated using user ability and item difficulty (using the 1PL Item Response Theory equation),<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b style=\"font-size: revert; color: initial;\">yijk Bern(Pij):<\/b><span style=\"font-weight: 400;\"> binary responses (correct, not-correct) are sampled from the Bernoulli distribution with success probability Pij, where the number of responses per item per learner is configurable.<\/span><\/li>\n<\/ul>\n<p><span style=\"font-weight: 400;\">We have simulated 100 questions, 100 learners, and one response per learner per question using applications of item response theory to practical testing problems.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">We fit the 1PL Deep Item Response Theory model to the simulated dataset. Inputs to the neural network are the user vector (one-hot encoded) and the question vector (also one-hot encoded), and outputs are the parameters of the Item Response Theory model, including item difficulty, learner ability and prediction of whether the learner will answer correctly or not. The neural network is fully connected. It has two input layers, intermediate layers for difficulty and ability, and one output layer for prediction.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">We compare the 1PL Deep Item Response Theory output from the neural network with the true outputs from the simulated data.<\/span><\/p>\n<h3><b>Implementation<\/b><\/h3>\n<p><b>Model: <\/b><span style=\"font-weight: 400;\">The architecture of 1PL Item Response Theory is defined by harnessing the compositionality of NNs, utilizing Keras functional APIs. The overall model is structured by stacking Dense layers \u2013 here, 2 Dense layers for the 1PL model, each representative of a User or Item parameters pivotal in driving the likelihood (Pij) of a user (i) responding to an Item (j).<\/span><\/p>\n<p><b>Hyper Parameters: <\/b><span style=\"font-weight: 400;\">The following default settings are used in each dense layer<\/span><\/p>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Kernel &amp; Bias initializers: Normal (0,1)<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">l1\/l2 regularizations: l_1=0, l_2=0<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Activity regularizers: l_1=0, l_2=0<\/span><\/li>\n<\/ul>\n<p><span style=\"font-weight: 400;\">A developer can override the above settings, or the best configuration can be obtained by searching over a space of configurations for item response theory parameter estimation techniques. Such details will be in a forthcoming blog. The defined model is flexible enough to extend its usage for two or even three parameters, namely discrimination and guessability, and consequently, in retrospect, an extended model coupled with Neural Architecture Search capability can be constrained to perform as one or two PL models.<\/span><\/p>\n<h3><b>Experimental Results<\/b><\/h3>\n<p><span style=\"font-weight: 400;\">The below plots show the correlation between:<\/span><\/p>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Predicted difficulty versus true difficulty level, with Pearson correlation coefficient 0.9857.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Predicted ability versus true ability level, with Pearson correlation coefficient 0.9954.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Predicted probability of answering each question correctly versus true probability, with Pearson correlation coefficient 0.9926.<\/span><\/li>\n<\/ul>\n<p><span style=\"font-weight: 400;\">From the Deep Item Response Theory model trained on 1PL data, the log-likelihood of the 1PL DIRT model is 0.587.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">As we can see, we get a good correlation in all three cases, indicating that our 1PL Deep Item Response Theory model successfully predicts the difficulty, ability and test score with good accuracy.<\/span><\/p>\n<p><img decoding=\"async\" class=\"aligncenter size-full wp-image-481778\" src=\"https:\/\/exams-assets.embibe.com\/exams\/wp-content\/uploads\/2021\/11\/03203421\/ai61.png\" alt=\"\" width=\"512\" height=\"340\" \/> <img decoding=\"async\" class=\"aligncenter size-full wp-image-481779\" src=\"https:\/\/exams-assets.embibe.com\/exams\/wp-content\/uploads\/2021\/11\/03203424\/ai71.png\" alt=\"\" width=\"512\" height=\"370\" \/><\/p>\n<p><strong><em>Figure 2: Scatterplots of the True Versus Derived Keras Model Difficulty and Ability Parameters<\/em><\/strong><\/p>\n<p><img decoding=\"async\" class=\"aligncenter size-full wp-image-481780\" src=\"https:\/\/exams-assets.embibe.com\/exams\/wp-content\/uploads\/2021\/11\/03203426\/ai81.png\" alt=\"\" width=\"512\" height=\"353\" \/><\/p>\n<p><strong><em>Figure 3: Hexbin Plot of the True Probability of Answering the Questions Correctly Versus. The Probability Derived from our Trained Keras Model<\/em><\/strong><\/p>\n<h3><b>Conclusion<\/b><\/h3>\n<p><span style=\"font-weight: 400;\">We have shown that, based on simulations, the 1PL Item Response Theory model can be implemented via a Deep Learning model. Using the Item Response Theory parameters, we can get good estimates of learner ability and the difficulty level of the questions using our 1PL Item Response Theory-based model. These estimates can, in turn, be used in generating adaptive tests, goal setting, and other downstream problems.<\/span><\/p>\n<p><b>References<\/b><\/p>\n<ol>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Frank B. Baker. \u201cThe basics of item response theory.\u201d ERIC, USA, 2001<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Wikipedia. Item Response Theory https:\/\/en.wikipedia.org\/wiki\/Item_response_theory<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Georg Rasch. &#8220;Studies in mathematical psychology: I. Probabilistic models for some intelligence and attainment tests.&#8221; 1960.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Keras Deep Learning framework: Keras<\/span><\/li>\n<\/ol>\n","protected":false},"featured_media":0,"template":"","yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v21.1 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Predicting Student Scores in Standardized Tests with AI | Embibe<\/title>\n<meta name=\"description\" content=\"Explore Embibe&#039;s 1PL Item Response Theory for score prediction in standardized tests. Learn about the correlation between predicted and true difficulty, ability, and probability.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.embibe.com\/in-en\/artificial-intelligence-ai-in-education\/predicting-student-scores-in-standardized-tests-with-1pl-item-response-theory\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Predicting Student Scores in Standardized Tests with AI | Embibe\" \/>\n<meta property=\"og:description\" content=\"Explore Embibe&#039;s 1PL Item Response Theory for score prediction in standardized tests. Learn about the correlation between predicted and true difficulty, ability, and probability.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.embibe.com\/in-en\/artificial-intelligence-ai-in-education\/predicting-student-scores-in-standardized-tests-with-1pl-item-response-theory\/\" \/>\n<meta property=\"og:site_name\" content=\"EMBIBE - The most powerful AI-powered learning platform\" \/>\n<meta property=\"article:modified_time\" content=\"2023-06-12T09:56:28+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/exams-assets.embibe.com\/exams\/wp-content\/uploads\/2021\/11\/03203420\/ai51.png\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"6 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.embibe.com\/in-en\/artificial-intelligence-ai-in-education\/predicting-student-scores-in-standardized-tests-with-1pl-item-response-theory\/\",\"url\":\"https:\/\/www.embibe.com\/in-en\/artificial-intelligence-ai-in-education\/predicting-student-scores-in-standardized-tests-with-1pl-item-response-theory\/\",\"name\":\"Predicting Student Scores in Standardized Tests with AI | Embibe\",\"isPartOf\":{\"@id\":\"https:\/\/www.embibe.com\/in-en\/#website\"},\"datePublished\":\"2021-11-29T17:12:04+00:00\",\"dateModified\":\"2023-06-12T09:56:28+00:00\",\"description\":\"Explore Embibe's 1PL Item Response Theory for score prediction in standardized tests. Learn about the correlation between predicted and true difficulty, ability, and probability.\",\"breadcrumb\":{\"@id\":\"https:\/\/www.embibe.com\/in-en\/artificial-intelligence-ai-in-education\/predicting-student-scores-in-standardized-tests-with-1pl-item-response-theory\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.embibe.com\/in-en\/artificial-intelligence-ai-in-education\/predicting-student-scores-in-standardized-tests-with-1pl-item-response-theory\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.embibe.com\/in-en\/artificial-intelligence-ai-in-education\/predicting-student-scores-in-standardized-tests-with-1pl-item-response-theory\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.embibe.com\/in-en\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Predicting Student Scores in Standardized Tests with 1pl Item Response Theory\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.embibe.com\/in-en\/#website\",\"url\":\"https:\/\/www.embibe.com\/in-en\/\",\"name\":\"EMBIBE - The most powerful AI-powered learning platform\",\"description\":\"Just another WordPress site\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.embibe.com\/in-en\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Predicting Student Scores in Standardized Tests with AI | Embibe","description":"Explore Embibe's 1PL Item Response Theory for score prediction in standardized tests. Learn about the correlation between predicted and true difficulty, ability, and probability.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.embibe.com\/in-en\/artificial-intelligence-ai-in-education\/predicting-student-scores-in-standardized-tests-with-1pl-item-response-theory\/","og_locale":"en_US","og_type":"article","og_title":"Predicting Student Scores in Standardized Tests with AI | Embibe","og_description":"Explore Embibe's 1PL Item Response Theory for score prediction in standardized tests. Learn about the correlation between predicted and true difficulty, ability, and probability.","og_url":"https:\/\/www.embibe.com\/in-en\/artificial-intelligence-ai-in-education\/predicting-student-scores-in-standardized-tests-with-1pl-item-response-theory\/","og_site_name":"EMBIBE - The most powerful AI-powered learning platform","article_modified_time":"2023-06-12T09:56:28+00:00","og_image":[{"url":"https:\/\/exams-assets.embibe.com\/exams\/wp-content\/uploads\/2021\/11\/03203420\/ai51.png"}],"twitter_card":"summary_large_image","twitter_misc":{"Est. reading time":"6 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.embibe.com\/in-en\/artificial-intelligence-ai-in-education\/predicting-student-scores-in-standardized-tests-with-1pl-item-response-theory\/","url":"https:\/\/www.embibe.com\/in-en\/artificial-intelligence-ai-in-education\/predicting-student-scores-in-standardized-tests-with-1pl-item-response-theory\/","name":"Predicting Student Scores in Standardized Tests with AI | Embibe","isPartOf":{"@id":"https:\/\/www.embibe.com\/in-en\/#website"},"datePublished":"2021-11-29T17:12:04+00:00","dateModified":"2023-06-12T09:56:28+00:00","description":"Explore Embibe's 1PL Item Response Theory for score prediction in standardized tests. Learn about the correlation between predicted and true difficulty, ability, and probability.","breadcrumb":{"@id":"https:\/\/www.embibe.com\/in-en\/artificial-intelligence-ai-in-education\/predicting-student-scores-in-standardized-tests-with-1pl-item-response-theory\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.embibe.com\/in-en\/artificial-intelligence-ai-in-education\/predicting-student-scores-in-standardized-tests-with-1pl-item-response-theory\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.embibe.com\/in-en\/artificial-intelligence-ai-in-education\/predicting-student-scores-in-standardized-tests-with-1pl-item-response-theory\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.embibe.com\/in-en\/"},{"@type":"ListItem","position":2,"name":"Predicting Student Scores in Standardized Tests with 1pl Item Response Theory"}]},{"@type":"WebSite","@id":"https:\/\/www.embibe.com\/in-en\/#website","url":"https:\/\/www.embibe.com\/in-en\/","name":"EMBIBE - The most powerful AI-powered learning platform","description":"Just another WordPress site","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.embibe.com\/in-en\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"}]}},"_links":{"self":[{"href":"https:\/\/www.embibe.com\/in-en\/wp-json\/wp\/v2\/ai\/117"}],"collection":[{"href":"https:\/\/www.embibe.com\/in-en\/wp-json\/wp\/v2\/ai"}],"about":[{"href":"https:\/\/www.embibe.com\/in-en\/wp-json\/wp\/v2\/types\/ai"}],"wp:attachment":[{"href":"https:\/\/www.embibe.com\/in-en\/wp-json\/wp\/v2\/media?parent=117"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}