Endnotes - Shamaa

AU - حسن، ياسر عبد الله حفني AB - هدفت الدراسة إلى بحث أثر اختلاف طريقة المعادلة (المتوسط/المتوسط، المتوسط / الانحراف المعياري) وطرق تقدير الدرجات (التقليدية، التجريبية، وطريقة الاحتمال المقترح للإجابة الصحيحة) وقواعد صياغة فقرات الاختبار (المحكم، المخالف) على دقة تقدير معالم الفقرات وقدرات الأفراد في ضوء القياس الكلاسيكي والنموذج اللوجستي ثلاثي البارامتر، وتكونت عينة الدراسة من 1500 طالبا وطالبة تراوحت أعمارهم بين (20.4 – 7,21) سنة، من طلاب كلية التربية جامعة أم القرى بمكة المكرمة، تم اختيارهم بالطريقة العشوائية الطبقية، ولتحقيق أهداف الدراسة والإجابة عن تساؤلاتها قام الباحث بإعداد نموذجي اختبار لمقرر الاختبارات والمقاييس من نوع الاختيار من متعدد ذو الأربعة بدائل، وتم معالجة النتائج وتحليلها باستخدام البرامج الإحصائية SPSS(22) - XCalibre (4.1.7) - IRTEQ، وتوصل الباحث إلى النتائج التالية: اختلاف التقديرات لكل من النظرية الكلاسيكية والنموذج اللوجستي ثلاثي البارامتر، فمن منظور القياس الكلاسيكي: كان متوسط الصعوبة والتمييز لفقرات الاختبار المحكم البناء أعلى من متوسط صعوبة وتمييز فقرات الاختبار المخالف لقواعد الصياغة، ومن منظور النموذج اللوجستي ثلاثي البارامتر: أظهرت النتائج أن الاختبار المحكم أكثر كفاءة وفاعلية من الاختبار المخالف عند مستويات القدرة المختلفة، وأن فقرات الاختبار المحكم كانت أكثر دقة في تقدير قدرة الأفراد من الاختبار المخالف، وأن تحليل الفقرة في ضوء نظرية الاستجابة للفقرة كان أكثر دقة من النظرية الكلاسيكية في تقدير معلمة الصعوبة والتمييز والتخمين، وكانت أكثر طرق تقدير الدرجات الكلاسيكية ارتباطا بالنموذج اللوجستي ثلاثي البارامتر في تقدير قدرات الطلاب وصعوبة وتمييز الفقرات، الطريقة التقليدية ثم الطريقة التجريبية ثم طريقة الاحتمال المقترح للإجابة الصحيحة، وأشارت النتائج إلى أن قيم التحيز وجذر متوسط مربع الخطأ، تقل مع ازدياد حجم العينة وطول الاختبار، فكلما زاد حجم العينة، وطول الاختبار زادت دقة المعادلة، وفي ضوء محكي التحيز وجذر متوسط مربع الخطأ، تعتبر طريقة (المتوسط/المتوسط) أكثر دقة في معادلة درجات الاختبارات من طريقة (المتوسط/الانحراف المعياري) وفق النموذج اللوجستي ثلاثي البارامتر. (الملخص المنشور) http://search.shamaa.org/abstract_ar.gif AB - The study aimed at investigating the effect of different functioning method (mean & mean method, mean & sigma method), methods of scoring (the conventional method, the experimental method and the method of probability assigned to the correct answer), and the rules of crafting items (the well-structured test, the ill structured test) on the accuracy of estimating the parameters of items and the abilities of individuals in the light of classical measurement and the three-parameter logistic model. The sample of the study consisted of (1500) male and female students aging from (20.4-21.7) years, from the Faculty of Education, at Umm Al-Qura University, who have been chosen stratified randomly. In order to achieve the aims of this study and to answer its questions, the researcher prepared two test modules for the course of tests and measurements of multiple choices type with four alternatives. Data were analyzed through using (SPSS 22, XCalibre 4.1.7, IRTEQ). The results indicated differences between classical theory and three parameters logistic model. The classical perspective: the difficulty and discrimination mean of well-structured test items was higher than the difficulty and discrimination mean of ill structured test items. Three parameters logistic model perspective: the well-structured test is more efficient and effective than the violated test at different ability levels. The well-structured test is more accurate in estimating the parameters of individuals than the violated test, and the item analysis in the light of item response theory is better than classical theory of the test regarding parameter difficulty, discrimination and guessing. The conventional method was the most related method to the three parameters logistic model among the other classical methods in estimating the abilities of the students, the difficulty and discrimination of items, then the experimental method, followed by the method of proposed probability of the answer. The results showed that bias values and root mean square errors decreased with the increase of sample size and test duration. The bigger sample size and the longer test, the more accurate the equation becomes. For the effect of three parameter model, in light of bias simulation and root mean square error, mean & mean method is considered better than mean & sigma method in equating test score. (Published abstract) http://search.shamaa.org/abstract_en.gif OP - ص ص. 352-434 T1 - أثر اختلاف طريقة المعادلة وطرق تقدير الدرجات وقواعد صياغة الفقرات على دقة تقدير معالم الفقرات وقدرات الأفراد في ضوء القياس الكلاسيكي والنموذج اللوجستي ثلاثي البارامتر [مقال] UL - http://search.shamaa.org/PDF/Articles/EGJfeau/JfeauVol35No7Y2019/jfeau_2019-v35-n7_352-434.pdf النص الكامل (PDF) 1 http://search.shamaa.org/fulltext.gif