Shoeayb Qasemi; ali delavar; noorali farrokhi; Farzad Eskandari
Abstract
This study was conducted with the aim of examining the gender differential item functioning of the English language items of the master psychology series exam using the methods available in the context of cognitive diagnostic models. The research sample included 2,455 female applicants and 919 male applicants ...
Read More
This study was conducted with the aim of examining the gender differential item functioning of the English language items of the master psychology series exam using the methods available in the context of cognitive diagnostic models. The research sample included 2,455 female applicants and 919 male applicants who attended the Master of Psychology series exam in 1396. The G-DINA cognitive diagnostic model was fitted with data. DIF of the questions were examined with three methods: the Wald test, the likelihood ratio test and the revised likelihood ratio test. The results of these three methods had a moderate agreement on DIF detection. Based on the results 16 questions had DIF. Most of the questions had uniform DIF in favor of men. Questions that had non-uniform DIF, for the dominant applicants were in favor of men and for non-dominated applicants were in favor of women. It is recommended that additional studies be conducted to investigate the existence of the bias and the cause of the DIF.
Somayeh Bahmanabadi; Mohammad reza Falsafinejad; noorali farrokhi; asghar minaei
Abstract
Identification of the consequences of violations of the assumptions of measurement models is a major concern in the field of psychometrics. The purpose of the present study was to investigating the role of test dimensionality violation in equating errors of IRT and classical theory models. Research method ...
Read More
Identification of the consequences of violations of the assumptions of measurement models is a major concern in the field of psychometrics. The purpose of the present study was to investigating the role of test dimensionality violation in equating errors of IRT and classical theory models. Research method was experimental and a 3 × 3 factorial design was used. The study population consisted of all the Mathematical and Technical Sciences Entrance Examiners in 2017 and 2018. The sample group consisted of 5000 examiners who were selected by random sampling. Mathematic test (55 items) was used for equating that three different types of data structure including one-dimensional, two- dimensional and three-dimensional data was generated from it. Data was equated with three equating methods including equipercentile method, true score and observed score equating method. Equating standard errors, bias and root mean square error were used to evaluate the effects of independent variables. To determine the error statistics, the equating results in all three datasets and the three equating methods were repeated 20 times in different samples. Data analysis showed that violation of dimensionality increases equating standard error, bias of equating results and the root mean square error. The bias of equating results in classical methods is more than the IRT methods. The effect of dimensionality violation on equating errors was not different between the classical theory and IRT models.
Hassan Moshtaghian Abarghouei; Mohammad Reza Flasafi Nejad; Ali Delavar; Noor Ali Farrokhi
Abstract
Identifying distractors as sources of Differential Item Functioning(DIF) in polyotomous items has great importance to designers and analysts. Although DIF is one of the common methods for examining the measurement invariance, It is accompanied by challenges and limitations, especially in multiple choice ...
Read More
Identifying distractors as sources of Differential Item Functioning(DIF) in polyotomous items has great importance to designers and analysts. Although DIF is one of the common methods for examining the measurement invariance, It is accompanied by challenges and limitations, especially in multiple choice items. The purpose of this study was to assess the performance of Nested logit Model(NLM) for detecting Differential Distractor Functioning(DDF) by using experimental (simulated data) and descriptive-analytical (real data) methods. Six items were simulated under different conditions of difficulty and slope, ability distribution, presence or absence of DIF/DDF, and DIF/DDF magnitude, with a sample size of 2000 and 50 replicates. The data of Math Entrance Exam (D-form,2018), with a random sample of 2000 men and women constituted the real data. Based on the results of the simulation analysis: The NLM revealed 88% of DIF and 97% of DDF, on average. the Type I error rates is very close to the theoretical expected values, although it showed some inflation in unequal distribution conditions. according to the findings, the detection rate was influenced by the item parameters(difficulty and slope) and the DIF or DDF levels. Based on real data analysis, 2 items represented both DIF(Large and Medium) and DDF (Partial to Moderate) simultaneously, whereas in the NRM approach, 11 items detected as DIF/DDF; so, as expected the approaches based on “divided by distractor” strategy, fewer items were detected as DIF/DDF. The NLM approach, while separating the DDF from the DIF test, allows for a clear evaluation of whether the distractor may be responsible for DIF. Since high-stakes tests have a special role in selection and DIF and DDF analyzes have a special place in determining the validity and measurement invariance of these exam items, it is recommended to screen the bias items, DIF/DDF comprehensive analyzes based on NLM be used.
ALI TAYARANI RAD; mohamad reza falsafinejad; noorali farrokhi; ahmad borjali
Abstract
The main purpose of this study was to identify the optimum approach to analysis of epistemic data in measuring personality traits. For this purpose, MPLUS software and lavan software package simulate different formats of response scales, and two traditional scoring approaches and Thurston's IRT method ...
Read More
The main purpose of this study was to identify the optimum approach to analysis of epistemic data in measuring personality traits. For this purpose, MPLUS software and lavan software package simulate different formats of response scales, and two traditional scoring approaches and Thurston's IRT method were compared in some of the most important psychometric outcomes. Overall the findings of the study indicated that the Thurstonin IRT approach performs better than the traditional scoring approach in accurately estimating actual scores, criterion validity, locating a person on the attribute continuum. Also, the research findings showed that the average correlation between estimated scores and actual scores for all three correlation coefficients (0, 0.25 and 0.50) in Thurstonian Item Response Theory is higher than the scores obtained from the classical method; True for all three correlation coefficients, in Thurstonian Item Response Theory more than classical method scores and mean experimental reliability scores for correlation coefficients 0.25 and 0.5 with classical method are higher than scores obtained by Thurstonian IRT method, but for coefficient The mean correlation of the mean scores of the Turestonian IRT method is higher than the classical scores. The findings also showed that the difference between the two tests, both Classical and Trestonium IRT, was significant for both the 48 and 96 questions, and the correlation between estimated scores and actual scores in the Thursetonium IRT method was more than the classical method.
Faramarz Sohrabi; peyman mamsharifi; Noorali Farrokhi
Abstract
Improving participation in voluntary activities in Iran is important because increasing productivity among young people, given the country's young population, contributes to community sustainability. Therefore, a better understanding of volunteer motivation in Iranian youth is needed. The Volunteer Functions ...
Read More
Improving participation in voluntary activities in Iran is important because increasing productivity among young people, given the country's young population, contributes to community sustainability. Therefore, a better understanding of volunteer motivation in Iranian youth is needed. The Volunteer Functions Inventory (VFI) for assessing volunteer motivations has good psychometric properties and is adapted to several languages, but no validated Iranian translation yet exists. The purpose of this study was to investigate the psychometric Characteristics of voluntary functions inventory in members of Iranian Red Crescent Society. Sample size was 595 members of Youth Organization of Iranian Red Crescent Society from 31 provinces and 175 cities of Iran that were selected by multi-stage cluster sampling method and responded to voluntary functions inventory. Data were collected using a demographic sample and voluntary functions inventory (VFI). Confirmatory factor analysis using principal components method was used for data analysis. The results of the present study showed that the voluntary functions inventory had validity and reliability. Also, the factor structure showed that 29 items and 6 factors well assess people's attitudes to volunteering, and the structure of this inventory was well-fitted and confirmed all goodness of fit models. The present study provides the use of the Iranian translation of the Voluntary Functions Inventory (6 scales and 29 items) to assess volunteer motivation among young Iranian volunteers.
alireza khoshgooyanfard; Mohammadreza Falsafinejad; noorali farrokhi
Abstract
Nonresponse is an inevitable challenge to large-scale studies and can result in wasting money, time and human resource involved in data collection and can also prevent the studies from obtaining their objects especially scores distribution. Imputation methods have thus been invented to estimate item ...
Read More
Nonresponse is an inevitable challenge to large-scale studies and can result in wasting money, time and human resource involved in data collection and can also prevent the studies from obtaining their objects especially scores distribution. Imputation methods have thus been invented to estimate item nonresponses in order to make inference from a completed data set. Using a simulation study on a real data set in the form of a multivariate experimental design, this paper evaluates the accuracy of three models including cumulative logit model, graded response model and explanatory item response model. The results show that the imputed values of all three models are acceptable under random nonresponse mechanism although the imputed values of the explanatory item response model are always more accurate than those of the other models. If nonrandom nonresponses are occurred, explanatory item response model has acceptable imputed values only at 5% nonresponse rate and the other models are not accurate at all. The results also show that it is more accurate to impute individual item nonresponses and then compute the total score instead of directly imputing the total score.
Tayebe Dehghan nayeri; Ali Delavar; Noorali Farrokhi; Ahmad Borjali
Abstract
The major aim of this research was to investigate the precision of the up parametric statistics to detect common response styles in polytomouse data. In the present study, five-choices data was simulated with 25 replication according to a fully-crossed design based on Partial Cradit Rasch model. The ...
Read More
The major aim of this research was to investigate the precision of the up parametric statistics to detect common response styles in polytomouse data. In the present study, five-choices data was simulated with 25 replication according to a fully-crossed design based on Partial Cradit Rasch model. The design included foure types of response styles ;Extreme responding positive, negative, acquiescence and midpoint responding, in four -different situations ; Sample size, test length, percentages of aberrant-responding examinees and percentages of aberrant items was investigated. precision of the Up statistic was high to detect the response styles in all situations respectively; ;Extreme responding negative, positive, acquiescence and it was possible to distinguish between the compatible and aberrant response patterns with the highest precision in these response styles,except to detect the midpoint responding was less precision, in some situations. The Up parametric statistics has high sensitivity to detect common response styles in non-cognitive multiple choice data and it is suggested that be used this statistics to detect of the response styles of extreme responding negative, positive, acquiescence .
Noor Ali Farokhi
Abstract
Back ground: Nowadays use of teams and work groups to perform different tasks is increasing Social loafing point to the important thing that people may show less conscious effort in group than when working individually . Aim: The present study aimed to Construction and Standardization of adolescents ...
Read More
Back ground: Nowadays use of teams and work groups to perform different tasks is increasing Social loafing point to the important thing that people may show less conscious effort in group than when working individually . Aim: The present study aimed to Construction and Standardization of adolescents social loafing questionnaire Method: This research is a descriptive study in which 27 high school students of Kermanshah city (first and second secondary) participated A 34-item questionnaire, developed based on research literature and was completed by students for data collection. To determine the number of factor s of questionnaire, exploratory and confirmatory factor analysis by LISREL 8.7 and SPSS 16.0 software were used. Also awareness function of questions calculated by using software MULTI-LOG. Results: The research showed that the Cronbach's alpha coefficient of questionnaire that indicates the inner reliability and parallelism is high.( α =0.923) and validity through exploratory factor analysis by varimax's method produced four factors that are: Job characteristics, individual perception, group characteristics, inferiority and negative perceptions. Also by using of first and second order confirmatory factor analysis these four factors examined and results showed that validity of questionnaire is acceptable. Discussion and conclusions: Research findings indicate that adolescents social loafing questionnaire have acceptable psychometric characteristics. and it can be use as valid and reliable tool for research in Sociology and psychologically
meysam sadeghi; mohammadreza falsafinejad; ali delavar; noorali farrokhi; ehsan jamali
Abstract
The purpose of the present study was to identify the weight of each of the courses of academic record and scoring based on the psychometric approach of the IRT continuous model and approaches based on the viewpoint of the specialists of Topsis and AHP. The method of this study was a combination. In order ...
Read More
The purpose of the present study was to identify the weight of each of the courses of academic record and scoring based on the psychometric approach of the IRT continuous model and approaches based on the viewpoint of the specialists of Topsis and AHP. The method of this study was a combination. In order to conduct research in the first part, 11 items (courses) of academic background were taken from the expert group and weighed and prioritized using Tapis and AHP techniques. The final third year grades of secondary school students were received from the assessment organization and accepted for psychology and counseling in the humanities group. Theta tests were used to estimate the grades. In order to compare different weighting and grading models, the sample group was first ranked based on scores derived from all three approaches, and compared in terms of dispersion indices and the degree of difference between rankings. The results of the comparison of the three approaches indicated that the most variance is related to the IRT continuous modeling method. The results of Wilcoxon's statistical analysis to compare the average rankings showed that in the small sample size, the weighting method in all three methods produced a significant difference in the rankings compared to the same weight method, and therefore the weighting model in the ranking of the volunteers In small sample size affects. But there are no significant differences between the three weighting models in the candidate ranking. Also, the results showed that in the high sample size (2000 persons), the weighting method in all three methods compared to the same weights makes a significant difference in the rankings. IRT method scores are more reliable than expert-based approaches.However, the weighting method does not seem to have a significant effect on the reliability of the scores.
sara yadollahi; mohammadreza falsafinezhad; Ahmad Borjali; noorali farokhi
Abstract
Background: The ability to understand and being able to profoundly and deeply understand the within-person differences entail profile analysis which is considered to be a serious challenge in psychometrics area. Aim: This study was aimed at applying Profile Analysis via Multidimensional Scaling (PAMS) ...
Read More
Background: The ability to understand and being able to profoundly and deeply understand the within-person differences entail profile analysis which is considered to be a serious challenge in psychometrics area. Aim: This study was aimed at applying Profile Analysis via Multidimensional Scaling (PAMS) procedure in order to extract prototypical profiles underlying the Wechsler Intelligence Scale for Children- Fourth Edition (WISC-IV). Method: The design of the study is correlational and the population of the study consisted of all of the students studying at 6th distinct schools of Tehran. Using random cluster sampling method 200 students were selected. The sample were administered full subscales of WISC-IV. Results: Data analysis using the PAMS procedure resulted in extracting 3 cumulative profiles. Statistically meaningfulness of the subtests within profiles were determined and they were labeled according to their dimensionality and underlying meaning (processing speed vs. working memory, lexical knowledge vs. visual alertness and perceptual reasoning vs. verbal comprehension). Conclusions: using the PAMS procedure in interpreting WISC-IV results could be beneficiary. Relying on individual parameters further clinical and applied implications are discussed.
somayeh pourehsan; Mohammadreza Falsafinejad; Ali Delavar; Noorali Farokhi; Ahmad Borjali
Abstract
The main goal of this research was to apply the multi-level in determining the role of metacognitive components associated with drug abuse among adolescents of Kerman Province. The method used in this research was correlation type. In order to conduct this research 1000 school students of Kerman Province ...
Read More
The main goal of this research was to apply the multi-level in determining the role of metacognitive components associated with drug abuse among adolescents of Kerman Province. The method used in this research was correlation type. In order to conduct this research 1000 school students of Kerman Province have been selected by using multistage cluster sampling method. For data collection in this research, ZARGAR drug addiction preparation Questionnaire and Wels’s Metacognitive Questionnaire (MCQ30) have been used. The results of multi-level approach using One-way random effects Anova showed that the four components (positive beliefs about worries, negative beliefs about worries, low cognitive trust and cognitive self-awareness) explained about 49% of the variance of addiction tendency at school level. Moreover, the random intercept model explained 12% of the student-level variance of drug abuse tendency. Here, the significance of qi-square with the value of 37.983 at the significant level (P <0.0001) reflects the significant change of the null model and model 1. In response to the question that how much the relationship between the tendency to drug abuse and student-level components is similar among schools, a comprehensive model with both student-level and school-level variables (model 2) was used. In model 2, the results showed that at the school level, the mean of positive beliefs and cognitive self-awareness about drug abuse tendency have a significant relationship, but the mean of cognitive trust and negative beliefs has not a significant relationship with the tendency to drug abuse. At the students level, there is a significant relationship between cognitive trust and negative beliefs with drug abuse tendency (P <0.01). In total, it can be said that for variables related to samples with a nesting nature, it is necessary to use all levels in data analysis.
nushravan mohamadi; ali dalavar; noorali farrokhi; asghar minaei
Abstract
The aim of the research was identification of Wechsler intelligence scale for children (WISC-IV) basic attributes based on Cattle- Horn- Carrol (CHC) narrow abilities by using generalized DINA (G-DINA) cognitive diagnostic model. The population of the research was all of Iranian primary school students ...
Read More
The aim of the research was identification of Wechsler intelligence scale for children (WISC-IV) basic attributes based on Cattle- Horn- Carrol (CHC) narrow abilities by using generalized DINA (G-DINA) cognitive diagnostic model. The population of the research was all of Iranian primary school students which randomly 1222 students selected. We use Wechsler intelligence scale for children (WISC-IV) for gathering data. Results show that from the nearly 70 CHC narrow abilities, 9 narrow ability (visualization, induction, working memory, general verbal information, lexical knowledge, flexibility of closure, memory span, language development, general sequential reasoning) Identified as basic attribute of WISC-IV. Also from the constrained models, the LLM model based on Wald statistics was chosen. Additionally, most of student mastered in lexical knowledge and the lowest rate of mastering belongs to flexibility of closure. finally, the probability of guessing parameter in easy items was high and probability of slipping in difficult items was high.
RogheaAsadi Roghea; Ali Dlavar
Abstract
The purpose of this study was to investigate the relationships between personality traits and subjective well-being in predicting spiritual intelligence in order to develop structural model. For this aim 220 students of the Amir Kabir University at the under graduate, master’s and doctoral degrees ...
Read More
The purpose of this study was to investigate the relationships between personality traits and subjective well-being in predicting spiritual intelligence in order to develop structural model. For this aim 220 students of the Amir Kabir University at the under graduate, master’s and doctoral degrees using systematic random sampling were selected. They completed Spiritual Intelligence Scale of King (2007), Keyes comprehensive scale of well-being (1998) and Big Five Factor personality Inventory (NEO-FFI). In order to assess the relationship between latent and measured variables in the conceptual model, the structural equation modeling was used. The results showed that high scores in the spiritual intelligence predicted through low scores in neuroticism and high scores in extraversion and conscientiousness. In this model, well-being was mediator, and neuroticism, extraversion and conscientiousness with the mediating mental wellbeing could predict the variance of spiritual intelligence, regression weights of direct effects were not significant but the weights of indirect and total effects were significant, and the predictor variables explained 25% variances of spiritual intelligence. The results of the present study emphasize the role of personality traits on well-being and spiritual intelligence.
Noor-Ali Farroukhi; laila bahrami
Abstract
Background: Recognizing multiple sources of measurement error and estimates each source separately, distinguishes between relative and absolute decisions, distinguishes between fixed and random facets and also the capability of dealing with different D study designs can be mentioned as the strength points ...
Read More
Background: Recognizing multiple sources of measurement error and estimates each source separately, distinguishes between relative and absolute decisions, distinguishes between fixed and random facets and also the capability of dealing with different D study designs can be mentioned as the strength points of generalizability theory which have no corresponding statuses in classical test theory. Generalizability theory is unknown for our researchers and there are rare researches in this area. Objective: The Purpose of this article was introduction of generalizability theory and presentation the practical applicability of this theory in assessing the reliability of measurements. Results: In addition to comparison between classical test theory and generalizability theory, conceptual framework of generalizability theory was explained easily. Also, in this article the process of design, analysis and interpretation of a measurement study in shape of an example with relevant calculations and equations explained in detail in 15 steps to guide researchers and test developers who aimed to assessing reliability. Conclusion: This article shows that utility of generalizability theory in reliability estimation especially in complicated measurement situations is more than classical test theory. Generalizability theory enables researchers to decrease errors in plan of measurement through optimization proceedings which will increase accuracy in generalization of results.
Mohammadreza Falsafinejad; Noorali Farroukhi; laila bahrami
Abstract
Background: High school final exams are one of the most decisive tools for scientific assessment of students. Given the importance of this examinations, carried out systematic research on the quality and functionality of their questions is necessary to separate volunteers. Aim: The aim of this study ...
Read More
Background: High school final exams are one of the most decisive tools for scientific assessment of students. Given the importance of this examinations, carried out systematic research on the quality and functionality of their questions is necessary to separate volunteers. Aim: The aim of this study is to determine the psychometric properties and capabilities of the final exam questions in the field of experimental biology and Persian literature in the selection of candidates for admission to undergraduate courses. Methodology: The population, all of the questions of the third year high school final exams string in June 2011. For determining the psychometric properties of these questions, performance of 600 students, selected randomly, in aforementioned lessons in the school districts of the city of Khorramabad were used. Findings: Estimated reliability coefficient in biology and Persian literature was determined by Cronbach alpha, (0.97, 0.96). According to CTT, the average coefficient of difficulty and discrimination in Persian literature and biology test were (0.65, 0.57) and (0.50, 0.65) respectively. In analyzing the IRT; two-parameter model fitted to the data revealed more. Also, in IRT models, the average difficulty and discrimination of questions for Persian literature and biology courses were (-0.69, 1.03) and (-0.09, 0.96) respectively. The most informant rate for two tests of Persian literature and biology, belongs to levels of ability which are (-0.7, 0.1) respectively, the agreement between the CTT and IRT in terms of discrimination parameter the two courses, were (%98.36, %93.59) respectively. Conclusion: Due to the important decisions based on high stakes tests, implications of final examinations in the selection of candidates were discussed.
Zahra Karami Baghtifooni; Noor Ali Farokhi
Volume 5, Issue 16 , July 2014, , Pages 83-116
Abstract
AbstractBackground: Creativity is an important construct in the psychology of individual differences, and Compared to the similar areas such as intelligence, it has received little attentions of researchers. Seventy-year history of theory making and research on creativity, still there is not a consistent ...
Read More
AbstractBackground: Creativity is an important construct in the psychology of individual differences, and Compared to the similar areas such as intelligence, it has received little attentions of researchers. Seventy-year history of theory making and research on creativity, still there is not a consistent pattern for understanding creativity. Objectives: The aim of Current study was to examine the relationship between fluid and crystallized intelligence with creativity and Explanation of this relationship based on higher order latent factors such as information processing speed and openness to experiences was taken place using structural equation modeling. Methods: All Tehran Humanistic universities were selected as population and 481 students from Shahid beheshti, Allameh Tabatabayi and Tehran universities were chosen as sample study. The sub-scales of Wechsler Adult Intelligence Scale- Revised (WAIS-R) were used to measure intelligence, Divergent thinking tests and questionnaires on creativity in everyday life, creative achievement and creative self- concept were also used to measure creativity and in order to measure processing speed, ZVT, and KDT paper-pencil tests were used and finally to measure openness to experience, NEO-FFI inventory was used. Structural equation models were postulated to investigate inter-relationship of constructs based on literature. Covariance matrix of observed variables was used to investigate postulated models, then, using AMSO-18 All statistical computations were analyzed. Results: Results showed that openness to experience, in all three cases (the first case, when all markers of self-assessment of creativity and divergent thinking are used simultaneously; the second case, when divergent thinking is used as the dependent variable and the third case, when self-assessment creativity is used as the dependent variable) will lead to successful and powerful prediction of creativity, Low impact processing speed has little but significant impact on creativity. Also, the fluid and crystallized intelligence had different effects on this structure depending on the type of creativity measurement. Conclusion: In general, the research results indicate that the personality has a more important role in being creative and producing innovative products; and intelligence and the information processing speed are in the next stage of having positive impact on the type of creative works.
jalil Younesi; farzad Eskandari; Ali Delavar; Mohammad Reza Falsafinezhad; Noor Ali Farokhi
Volume 5, Issue 15 , April 2014, , Pages 166-186
Abstract
Background: Validity of the multilevel analyses with a focus on differences in learning theories (with both classis approach toward measurement and new approach toward measurement (IRT)) by means of various data has recently been studied. Aim: This study is aimed at determining the level of impact of ...
Read More
Background: Validity of the multilevel analyses with a focus on differences in learning theories (with both classis approach toward measurement and new approach toward measurement (IRT)) by means of various data has recently been studied. Aim: This study is aimed at determining the level of impact of schools on the scores gained by students involved in advanced TIMSS’s 2008 test. Methodology: In order to achieve the chief goal of the study, the researchers adopted the data relating to administration of advanced TIMSS’s 2008 physics test, which assesses the course of teaching advanced physics to the high school seniors (pre-university students). The population and sample group of this study comprise the pre-university candidates of mathematics and physics taking advanced TIMSS physics test administered in the academic year 2007-2008. The sample size of Iranian students involved in this measurement equaled 2556 subjects. Findings: Results of the analyses generally suggest that firstly multilevel IRT (MLIRT) analyses are more powerful than multilevel true scores (MLTS) analyses in clarifying school differences. Secondly, the level of difference in the schools involved in advanced TIMSS math test reflected in intraclass correlation (ICC) is high in MLIRT analyses. Discussion and Conclusion: First, considering measurement error of each item in analyses within Bayesian framework and by means of Gibbs sampling can remarkably improve the power of multilevel analyses and lead to a significant rise in the ratio of the explicated variance. Second, there is too much educational difference and discrimination among schools which is largely due to school-level variables (such as those relating to teacher or school-related variables).