Mehdi Molaei Yasavoli; Ali Delavar; Mohammad Asgari; Jalil Younesi; Vahid Rezaei Tabar
Abstract
The efficiency and bias of parameter estimation are among the most important psychometric issues in behavioral science measurement. Choosing among estimation algorithms such as MHRM, and applying them to tests with missing data, is one of the challenges in item response theory (IRT) modeling. The purpose of this study was to investigate the estimation risk of the MHRM algorithm in multidimensional IRT models for polytomous data, taking into account the mechanism and the amount of missing data. The research method was experimental, using a multi-group post-test design. The study sample was generated by simulation under 27 conditions of the independent variables, with 100 replications per condition. The model used was a multidimensional graded response model, and the parameters studied were the item slopes and thresholds. R statistical software was used to generate and analyze the data. The results showed that the MHRM algorithm has lower estimation risk than the EM and MCEM algorithms. There was a significant difference in the risk of the slope and threshold parameters across the three missing-data mechanisms, but no significant difference was observed for the amount of missing data. There was also a significant interaction between the type of algorithm and the missingness mechanism, which indicated the better performance of the MHRM algorithm: when this algorithm is used, the mean and variance of the MSE of the slope and threshold parameters decrease and converge across all three missingness mechanisms. As a result, the MHRM algorithm is particularly well suited to data with a high proportion of missing values and varied missingness mechanisms, and researchers are advised to use it when analyzing data with such complex structure.
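The abstract above summarizes estimation quality through the MSE of slope and threshold estimates across replications. As a minimal sketch of that bookkeeping (not the authors' code, which was written in R), the following computes bias and MSE for one parameter. The function name `bias_and_mse`, the replicated estimates, and the true value of 1.0 are all hypothetical.

```python
from statistics import mean


def bias_and_mse(estimates, true_value):
    """Return (bias, MSE) of replicated estimates of a single parameter."""
    errors = [est - true_value for est in estimates]
    bias = mean(errors)                    # average signed error
    mse = mean(e * e for e in errors)      # average squared error
    return bias, mse


# Hypothetical slope estimates from four replications; the true slope is assumed to be 1.0.
bias, mse = bias_and_mse([1.25, 0.75, 1.5, 0.5], true_value=1.0)
print(bias, mse)
```

In a simulation of this kind, one such pair would be computed per item parameter and per condition, then compared across algorithms and missingness mechanisms.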
Zahra Jahanbakhsh; Noorali Farrokhi; Jalil Younesi
Abstract
To increase the precision of measurement, survey studies use scales to evaluate latent traits. This study examined differential item functioning (DIF) in the items of the Academic Alienation scale among students, within the item response theory (IRT) framework. DIF analysis tests the hypothesis that item parameters are invariant across groups of identical ability. DIF analyses by gender and education duration were performed on a sample of 1100 psychology and educational sciences students in the city of Tehran. The Academic Alienation items (Burbach, 1972) were 4-point Likert-type items. The questionnaire had three subscales: isolation, meaninglessness, and powerlessness. Analyses were conducted using the graded response model with IRT likelihood ratio (IRT-LR) tests of DIF. The analyses used 4 anchor items, and after Bonferroni adjustment only 3 items by gender and 2 items by education duration showed meaningful DIF. The detected DIF can be interpreted in light of the cultural context of universities. Gender inequality, teachers' attitudes, university management policies, and ultimately educational injustice may be reasons for the sense of academic alienation among students.
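The Bonferroni adjustment mentioned above divides the significance level by the number of tests before flagging an item. A minimal sketch, assuming a family-wise level of 0.05 and hypothetical p-values (the abstract reports neither); `bonferroni_flags` is an illustrative name, not a function from any DIF package.

```python
def bonferroni_flags(p_values, alpha=0.05):
    """Flag the tests that remain significant after Bonferroni adjustment."""
    threshold = alpha / len(p_values)      # per-test significance level
    return [p < threshold for p in p_values]


# Hypothetical p-values from four item-level DIF tests.
flags = bonferroni_flags([0.001, 0.02, 0.004, 0.3])
print(flags)
```

With four tests the per-test threshold is 0.0125, so an item with p = 0.02 is no longer flagged even though it would pass an unadjusted 0.05 criterion.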
Jalil Younesi
Abstract
To keep up with increasing organizational complexity, best practices in employee surveys have evolved. This is evidenced by the continuous listening strategy, which entails conducting surveys and developing action plans more frequently than a single static event, such as a biennial, annual, or short-period survey, in order to provide consistent feedback, data, and insights. The spread of continuous listening in organizations has steered the focus of psychometric specialists towards the continuous survey, or pulse survey, model. The objective of this research was to study the current and future status of employee surveys in public and private organizations in Iran from the perspective of their human resources specialists. For this purpose, a survey was conducted over two months between July and September 2022. The study targeted organizations in Iran that conducted official activities during 2022. Purposeful stratified sampling was used to collect data from organizations and companies of varying sizes and industry sectors. The findings demonstrate that advancements in all facets of the pulse survey cycle, from design to reporting and analysis to the provision of more timely and relevant data, have motivated organizations to move away from traditional surveying strategies and to adopt newer technologies through pulse surveys. Although surveys have evolved slowly among Iranian organizations and companies, they are making gradual advancements that align with the progress of scientific resources and popular media coverage.
Reyhane Rahimi; Jalil Younesi; Ali Moghadamzade; Mohammad Asgari
Abstract
Combining different methods and techniques to achieve the best results and create synergies between them can be helpful for many problems. Educational data mining is a relatively new field that can be used to solve educational problems, especially problems in measurement. Before these methods are used, however, researchers should become as familiar with them as possible and examine their problems, advantages, and disadvantages. This study investigates the technique of nonnegative matrix factorization and how to determine the number of clusters before fitting the model. The research method is descriptive, and the study population is all candidates who sat the entrance exam for mathematics and technical sciences in 1398 (Solar Hijri calendar), of whom 5,000 were randomly selected by the country's assessment and education organization and provided to the researcher. The research instruments were the mathematics and geometry questions of the entrance exam. The results showed that the methods differed in their estimates of the number of clusters for the mathematics questions, whereas for the geometry questions all methods gave the same result. Given the observed differences, it is suggested that future research examine this issue in more detail with the help of data simulation.
Pouria Rezasoltani; Ebrahim Khodaie; Jalil Younesi; Amin Mousavi; Ali Moghadamzade
Abstract
Person-fit assessment is useful in ensuring validity and fairness in the use and interpretation of test scores. This study applied the H^T person-fit statistic to examine response patterns on the TIMSS eighth-grade mathematics test in Australia, Iran, and the Republic of Korea. Because the data are hierarchically structured, hierarchical linear modeling was used to investigate the effect of contextual variables on students' person-fit statistics. Based on the intraclass correlation coefficient, 83.7% of the variance of the H^T person-fit statistic lies at the student level, and 16.3% lies at the school and country levels. In the final hierarchical linear model relating the H^T person-fit statistic to student, school, and country factors, the only significant factors were the countries' average student mathematics achievement, school emphasis on students' academic success, students' confidence in mathematics, and the estimate of student ability.
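The variance split reported above follows from dividing each level's variance component by the total. A minimal sketch of that calculation, using hypothetical variance components (the abstract reports only the resulting percentages); `variance_shares` is an illustrative name.

```python
def variance_shares(components):
    """Return each level's share of the total variance."""
    total = sum(components.values())
    return {level: var / total for level, var in components.items()}


# Hypothetical variance components from an unconditional multilevel model.
shares = variance_shares({"student": 4.0, "school_and_country": 1.0})
print(shares)
```

The share attributed to the higher levels is the intraclass correlation: the proportion of variance in the outcome that lies between groups instead of within them.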
Somayeh Kaveh; Ebrahim Khodaie; Amin Musavi; Ali Moghadamzadeh; Jalil Younesi
Abstract
To facilitate interpretation, raw scores are usually converted to scale scores. In some cases these conversions are nonlinear transformations that can affect the conditional standard error of measurement along the score scale. The purpose of this study was therefore to introduce methods for calculating the conditional standard error of measurement based on strong true score theory. The study also compared normalized and equipercentile nonlinear transformations of the raw academic achievement scores of mathematical sciences graduates in 2014, and their effect on the conditional standard error of measurement. To this end, we used a sample of 3943 students who graduated from high school in Mathematics and Physics in 2014 and took the national university entrance examination in 2015, randomly selected by the National Organization of Educational Testing. The conditional standard error of measurement under these transformations was estimated using the binomial procedure of Brennan and Lee (1999) and the method of Chang (2006), which is based on the beta-binomial distribution. The results indicated that the conditional standard errors from Chang's method were smoother than those from the binomial procedure, but with both methods the estimated errors were larger for middle score points and smaller for extreme score points. In addition, the conditional standard errors of measurement under the equipercentile transformation were always smaller than under the normalized transformation, so the equipercentile method was found to be better than the normalized transformation.
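The finding that errors are larger for middle scores and smaller at the extremes is a general property of binomial error models. The sketch below is not the Brennan and Lee (1999) scale-score procedure or Chang's method; it shows only the simpler raw-score binomial error formula (Lord's), sqrt(x(n - x)/(n - 1)), to illustrate that shape. The 20-item test length is an assumption.

```python
import math


def binomial_csem(raw_score, n_items):
    """Raw-score conditional SEM under the binomial error model (Lord's formula)."""
    return math.sqrt(raw_score * (n_items - raw_score) / (n_items - 1))


# Hypothetical 20-item test: the error is zero at the extremes and peaks mid-scale.
csem_by_score = {x: binomial_csem(x, 20) for x in (0, 5, 10, 15, 20)}
print(csem_by_score)
```

A nonlinear raw-to-scale conversion stretches or compresses this curve differently at each score point, which is why the choice of transformation changes the conditional errors on the reported scale.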
Jalil Younesi; Mehran Mokarami
Abstract
The purpose of this study was to determine the basic skills of reading comprehension, analyze the test questions, and identify the strengths and weaknesses of examinees by fitting cognitive diagnostic assessment models in a retrofit analysis. The data comprised responses from 3,000 participants in the graduate entrance examination for the English language field, randomly selected from among 16,044 examinees. First, with the help of experts, a Q-matrix was formed relating the 20 questions of the reading test to 7 basic skills, and the final analysis was then performed using the G-DINA model. Finally, using the expected a posteriori method within the framework of the G-DINA model, the probability that each person had mastered each skill was calculated with the CDM package in R, and, with a cut-off point of 0.6 for mastery, participants were divided into master and non-master groups on each skill. The results showed that on each skill most people had not reached mastery, although the situation for the SUM skill (analyzing and evaluating the relative importance of the information contained in the text by differentiating between the main idea and details) was better than for the other skills, with about half of the participants mastering it. Overall, the results suggest that participants' use of cognitive strategies to reach the correct answer is unsatisfactory and that these strategies should be included in the reading comprehension curriculum.
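The final classification step described above reduces to thresholding each person's posterior mastery probability at 0.6. A minimal sketch, assuming the cut-off is inclusive (the abstract does not say); apart from SUM, the skill names and all probabilities are hypothetical, and the real probabilities would come from the fitted G-DINA model.

```python
def classify_mastery(posteriors, cutoff=0.6):
    """Map each skill's posterior mastery probability to master / non-master."""
    return {skill: prob >= cutoff for skill, prob in posteriors.items()}


# Hypothetical posterior probabilities for one examinee.
examinee = {"SUM": 0.72, "SKILL_2": 0.41, "SKILL_3": 0.60}
mastery = classify_mastery(examinee)
print(mastery)
```

Counting the True values per skill across all examinees gives the proportion of masters that the abstract reports for each skill.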
Mojtaba Jahanifar; Ebrahim Khodaie; Jalil Younesi; Amin Musavi
Abstract
To make scores in test batteries easier to interpret and compare, the raw scores on each test are converted to a common scale, called a scale score. There are different linear and nonlinear methods for converting raw scores to scale scores. The conventional nonlinear methods are normalization and the arcsine method. This study, which aimed to compare the standard error of measurement under nonlinear conversion methods, used a random sample of 10,000 simulated records and a random sample of 10,000 real records from applicants to the Iranian university entrance exam. The conversion methods were compared using the conditional standard error of measurement (CSEM), frequency charts, and statistical indices such as moments. The results showed that the two methods have different features. Although scores from both methods have high reliability and accuracy, the arcsine method reduces the fluctuation of score error across score levels, and the mean standard error of measurement for arcsine scale scores was lower than for normalized scale scores.
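The abstract does not state which arcsine variant was used. The sketch below assumes the Freeman-Tukey form commonly used in the scaling literature, g(x) = 0.5 * [arcsin sqrt(x/(K+1)) + arcsin sqrt((x+1)/(K+1))] for a raw score x on a K-item test; the 20-item test length is hypothetical.

```python
import math


def arcsine_transform(raw_score, n_items):
    """Freeman-Tukey arcsine transformation of a raw score (assumed variant)."""
    k = n_items + 1
    return 0.5 * (math.asin(math.sqrt(raw_score / k))
                  + math.asin(math.sqrt((raw_score + 1) / k)))


# Transformed values for every raw score on a hypothetical 20-item test.
transformed = [arcsine_transform(x, 20) for x in range(21)]
print(transformed[0], transformed[10], transformed[20])
```

The transformation stretches the ends of the raw-score scale relative to the middle, which is what evens out the error across score levels; in practice the transformed values are then rescaled linearly to the reporting scale.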
Faranak Boroujerdi; Jalil Younesi; Mohammad Salehi; Elham Assadpour
Abstract
This study examined the causal relationships among the variables with the greatest impact on the academic achievement of Tehran students. Methodology: The statistical population included all Tehran students, from whom 400 individuals in different fields were selected by convenience sampling. The research used path analysis, and the instruments were Rotter's Locus of Control Test, Abedi's test of creativity, Delavar's academic self-concept test, and Herman's Achievement Motivation Test. The results indicated that the model fit the data. All the variables had significant direct effects on academic achievement, and the indirect effects of creativity on academic achievement were also significant. Academic motivation, creativity, academic self-concept, and locus of control were among the factors affecting academic achievement. Academic achievement increases with academic motivation, creativity, and academic self-concept, and an internal locus of control leads to higher levels of academic achievement. Considering these characteristics is therefore recommended when assessing students' educational achievement.
Mitra Ghadimi Nouran; Jalil Younesi
Abstract
Context: One way to prevent mental and behavioral problems is to improve psychological capital, which is composed of four constructs: hope, optimism, resilience, and self-efficacy. Aim: This study aims to construct and validate a scale of psychological capital for use in public domains and to examine its relation to psychological well-being. Method: The research method is correlational. The statistical population comprised all married people in Tehran, of whom 520 individuals were selected by multistage sampling. The scales employed were the researcher-developed 66-item scale, the Luthans 24-item Scale of Psychological Capital (2007), and the Ryff Psychological Well-being 18-Item Short Form (1995). Data were analyzed using IRT and first- and second-order confirmatory factor analysis. Before the IRT analysis, the assumptions of unidimensionality and local independence were tested by means of VSS, MAP, and exploratory factor analysis. Five factors, namely hopefulness, self-efficacy, positive optimism, negative optimism, and resiliency, were then extracted by means of first-order confirmatory factor analysis and model fit. Second-order factor analysis was used to investigate whether these five factors derive from a larger factor, i.e. psychological capital. Reliability was measured as internal consistency, using Cronbach's alpha and ordinal theta. Results: As a result of these analyses, the researcher-developed 66-item questionnaire was reduced to a 44-item scale. The scale had high, acceptable reliability coefficients (Cronbach's alpha = 0.945 and theta = 0.952). In terms of convergent validity, the questionnaire had a positive, significant relation to the Luthans scale, and a positive, significant relation was also found with the psychological well-being scale and its six corresponding factors.
Conclusion: The results suggest that the researcher-developed scale is adequately reliable and valid for the purpose of measuring this construct in various social areas and fields.
Jalil Younesi
Abstract
Objective: This research aimed to explore, describe, and explain the attitudes of PhD applicants and faculty members towards the methods of selecting PhD applicants (semi-focused and non-focused). Method: A mixed-methods design was used, and the data were collected in two stages from academics and experts in different fields. In the qualitative stage, about 70 professors and students in different majors were interviewed individually and in focus groups; open coding was used to analyze the qualitative data, and the concerns and mental stresses of the stakeholders in this area were identified. In the quantitative stage, 500 academics from universities in Tehran answered researcher-made questionnaires, which were then analyzed. In carrying out the study, priority was given to the qualitative research and the analysis of its findings. Discussion: The discussion was based on the findings of the qualitative analysis. In each part, the findings of the quantitative analysis were presented together with the qualitative findings in order to compare, confirm, or identify conflicting cases. Results: The results showed a significant difference between professors' and students' attitudes towards the selection of PhD applicants in the semi-focused method: the professors wanted a larger share in the process of selecting PhD applicants, while the applicants favored allocating a larger share to the National Organization of Educational Testing (NOET).
Jalil Younesi
Abstract
The purpose of the present study was to assess the validity and reliability of a scale of satisfaction with the quality of air travel services across the country. The statistical population was all passengers of domestic airlines nationwide. Using multistage cluster sampling among the country's airports, four airports (Mehrabad in Tehran, Isfahan, Shiraz, and Mashhad) were purposively selected, along with 20 city pairs chosen on the basis of flight frequency and the volume of passengers carried. The preliminary and final administrations took place in the summer and early autumn of 1389 (Solar Hijri calendar), and 2194 valid questionnaires were completed. The initial 60-item questionnaire was developed on the basis of the five factors of the SERVQUAL model (physical factors, responsiveness, empathy, assurance, and reliability). After the preliminary administration, the data were analyzed using item analysis software (Multilog and Statistica) within the IRT approach, together with the results of first- and second-order confirmatory factor analysis, and the instrument for measuring satisfaction with the quality of air travel services was validated.
Jalil Younesi; Farzad Eskandari; Ali Delavar; Mohammad Reza Falsafinezhad; Noor Ali Farokhi
Volume 5, Issue 15, April 2014, Pages 166-186
Abstract
Background: The validity of multilevel analyses, with a focus on differences between measurement theories (both the classical approach to measurement and the newer approach, IRT), has recently been studied using various data. Aim: This study aimed to determine the extent to which schools affect the scores of students who took the TIMSS Advanced 2008 test. Methodology: To achieve the main goal of the study, the researchers used data from the administration of the TIMSS Advanced 2008 physics test, which assesses the teaching of advanced physics to high school seniors (pre-university students). The population and sample of this study comprise the pre-university mathematics and physics candidates who took the TIMSS Advanced physics test administered in the academic year 2007-2008. The sample of Iranian students involved in this assessment numbered 2556. Findings: The results suggest, first, that multilevel IRT (MLIRT) analyses are more powerful than multilevel true score (MLTS) analyses in clarifying school differences. Second, the level of difference among the schools involved in advanced TIMSS math test, as reflected in the intraclass correlation (ICC), is high in the MLIRT analyses. Discussion and Conclusion: First, accounting for the measurement error of each item within a Bayesian framework, by means of Gibbs sampling, can remarkably improve the power of multilevel analyses and lead to a significant rise in the proportion of explained variance. Second, there is considerable educational difference and inequality among schools, which is largely due to school-level variables (such as teacher-related or school-related variables).