Introduction

Overweight and obesity are chronic conditions [1] that affect over 600 million people worldwide [2,3,4,5]—including the elderly population [6,7,8,9].

Besides these alarming rates, several studies showed that the prevalence of weight stigmatization has rapidly increased during the last 20 years [10,11,12,13,14]. Indeed, people with overweight and obesity are regularly exposed to weight-related negative bias and discrimination in everyday life [12, 15] such as workplace, educational, and health care settings, in the media as well as in interpersonal relationships [16]. As a consequence of this continuous exposure to weight stigma, people with overweight and obesity tend to internalize these biases [17,18,19].

The internalization of weight sigma (IWS) leads individuals with overweight and/or obesity to self-stigmatize themselves, thus confirming main negative weight stereotypes [13, 18]—such as failure to keep self-control [20], blame, incompetence [21], and lack of self-discipline—further increasing their risk of being socially devaluated and/or rejected [12, 15, 16]. Moreover, IWS plays a fundamental role in non-adherence to weight loss programs [22]—which in turn contributes to increasing stigma and further promotes weight gain—as is was also observed in patients undergoing bariatric surgery [23]. IWS is also associated with adverse health outcomes including cardiovascular diseases, hypertension, osteoarthritis, and diabetes [24, 25], and it is also strongly related to poor quality of life as well as psychological/psychiatric disorders, such as depression, anxiety, distress, somatization, and eating problems [17, 18, 26,27,28,29].

Despite multidisciplinary interventions have been developed to improve weight loss and to reduce IWS [30,31,32,33,34], the extent to which weight self-stigma affects health outcomes is still unclear [26, 27]. To fill the knowledge gaps in the particular field, the Weight Self-Stigma Questionnaire (WSSQ), was shown to be a reliable and valid tool to assess IWS and to detect the impact of effective interventions targeting weight-related stigma [17].

The WSSQ was first developed and validated in two American samples (n = 169) of participants with overweight/obesity (non-treatment-seeking and treatment-seeking). The resulting two main factors (“self-devaluation” and “fear of enacted stigma”) described by the WSSQ showed good reliability and validity indices as well as an adequate sensitivity to change in subjects seeking treatment for IWS reduction, as well as a good sensitivity to change and temporal stability. Good psychometric proprieties of the WSSQ—such as convergent validity, reliability, and predictive values for psychological variables—were confirmed in a later study [13] Moreover, the scale showed associations with clinical and/or subclinical conditions such as anxiety, depression, Quality of Life (QoL) and self-esteem [17].

The WSSQ was adapted in German [19], Arabic [35], and French [36] language. The German version [19] was validated in a sample of 94 adult outpatients with severe obesity demonstrating good psychometrical proprieties. Hain and colleagues also showed the impact of feelings of guilt (related to body image), BMI (≥ 50), and dissociative symptoms in the prediction of IWS [19]. The Arabic version [35] was validated on a sample of 170 Iranian women with overweight/obesity and showed good validity and internal consistency indices. Moreover, the authors evaluated the association between IWS, QoL, and psychological distress. Results confirmed the negative impact of IWS on both psychological distress and QoL, suggesting that IWS might represent a barrier for subjects with obesity to undertake health-promoting attitudes and behaviors. However, neither for the German study nor for the Arabic validation of the WSSQ any factor analysis (exploratory or confirmatory) was conducted to assess the factorial structure of the questionnaire. Instead, a confirmatory factor analysis (CFA) approach was employed to validate the French version of WSSQ [36]. The authors tested the factorial structure of the questionnaire in a sample of 156 adolescents with overweight/obesity, and findings revealed a good of model fit indices and reliability. Maïano and colleagues also assessed the correlation (convergent validity) between IWS, self‑esteem, anxiety, depression, fear of appearance evaluation, and tendency to eating‑related pathology. Results revealed an association between IWS and all the above psychological constructs. In addition, the authors tested the measurement invariance of the tool across gender using a MIMIC-based (Multiple Indicators Multiple Causes) approach, which revealed an influence of participant gender on WSSQ response. Notably, in line with the literature, none of the above validation studies showed associations between IWS and BMI, suggesting that IWS is complex phenomenon that goes beyond the mere individuals’ BMI [18]. Given that the IWS is a well-known risk-factor among those negatively influencing the individuals’ psychological health [37] in individuals with overweight and obesity, it is important to assess it both in the clinical and research fields.

Since there is no standardized validated version of the WSSQ available for the Italian population, the present study aimed to explore the factorial structure of the Italian version of the WSSQ (Study1), and to assess its structural validity, internal consistency, test–retest reliability, and measurement invariance across main socio-demographics characteristics (Study2).

Study1

Methods

Translation and cross-cultural adaptation

The WSSQ was independently translated by two Italian experts in the field and back-translated into English by an independent translator—to guarantee conceptual equivalency [38]. The questionnaire was also administered to 10 subjects with overweight/obesity—who did not enter the analysis—to assess whether the items were understandable by the target population. No adjustments were needed.

Participants and procedure

According to the most commons rule-of-thumb for Exploratory Factor Analysis (EFA) [39], the sample was composed by 150 adult inpatients [48 males (32%) and 102 females (68%)] with overweight/obesity—aged from 18 to 78 years (mean = 50.34, SD = 14.457). Participants were recruited from the IRCCS Istituto Auxologico Italiano, San Giuseppe Hospital, Verbania (Italy), during their first week of a 1-month hospital-based program for weight reduction and rehabilitation. Inclusion criteria were: (A) being a native Italian speaker; (B) being over 18 years; (C) signing written and informed consent. Exclusion criteria were: (D) illiteracy; (E) inability to complete the assessment due to vision and/or cognitive impairments. All participants completed a self-report demographic and anthropometric measures form and the Italian WSSQ. Sample descriptive statistics are displayed in Table 1.

Table 1 Study1 and Study2: sample descriptive statistics

Measures

Demographic and anthropometric measures

Participants were asked for demographic information (Table 1) including gender, civil status, educational level, work status, and the occurrence of previous hospitalization(s) for obesity and weight reduction, as well as to provide anthropometric measures, such as age, height, and weight (used to calculate their BMI).

Weight Self‑Stigma Questionnaire

Participants completed the Italian version of the Weight Self-Stigma Questionnaire (WSSQ) [17]. It consists of 12 items rated on a five-point Likert scale (from 1 = strongly disagree to 5 = strongly agree) and provides scores in two domains (self-devaluation and fear of enacted domains). The first six items measure self-devaluation (Self-D), which is the continuous tendency to self-depreciating by attributing negative stigmatizing characteristics to his/her own [17]. An example of a representative item is: “I became overweight because I’m a weak person” (item#4) and “I don’t have enough self-control to maintain a healthy weight” (item#6). The following six items measure the fear of enacted stigma (FES), which refers to self-stigmatized thoughts [17, 40]. Representative items are: “People discriminate against me because I’ve had weight problems” (item#8) and “Others will think I lack self-control because of my weight problems” (item#10). Coefficient alphas of the original WSSQ were: Self-D = 0.812; FES = 0.869; total score = 0.878.

Statistical analysis

Statistical analyses were performed with the R statistical software [41] with the following packages: psych [42] and MplusAutomation [43].

Item analysis was performed to check normality. Due to acceptable deviation from normality, Maximum Likelihood estimator (ML) has been used to perform the EFA [39, 44,45,46], with a Direct Oblimin rotation due to an expected correlation between factors. Factors were retained according to the 95th percentile of Parallel Analysis (PA) with 10,000 random correlation matrices—used to compute random data eigenvalues. The size of these random data eigenvalues was compared with the ones from the original observations. The best factor solution was determined by the number of eigenvalues that exceeded the corresponding values from the random dataset [47,48,49]. The factorability of the correlation matrix was evaluated with Bartlett’s Test of Sphericity and with Kaiser–Meyer–Olkin (KMO) measure of sampling adequacy. For the EFA to be considered appropriate, the above tests had to show significant results (p < 0.05) and values higher than 0.7–—respectively [39]. Lastly, Cronbach’s alpha was computed as measure of internal consistency with values higher than 0.7 deemed acceptable [39].

Results

Exploratory factor analysis of WSSQ

Item analysis reveals a univariate almost-normal distribution of all the indicators (Table 2). Absolute skewness ranged from |0.007| for item#11 to |1.033| for item#2: meansk = 0.03; SDsk = 0.52. Absolute kurtosis ranged from |0.025| for item#2 to |1.313| for item#11: meank = − 0.96; SDk = 0.38.

Table 2 Study1: item translation (English/Italian) of the WSSQ, item descriptive statistics and the exploratory factor analysis (EFA)

Sample correlation matrix reveals two eigenvalues higher than 1 (i.e., I = 4.659; II = 1.945; III = 0.956) that were compared with PA’s random data eigenvalues (i.e., I = 1.622; II = 1.443; III = 1.326) allowing the extraction of two factors that explained the 55.04% of the overall variance. The first factor accounted for 38.83% of the variance, while the second one explained 16.21% of the variance. In line with the Kaiser–Meyer–Olkin’s test [KMO = 0.797], the Bartlett’s Test of Sphericity reached statistical significance: [χ2 (66) = 733.4, p < 0.001]—thus supporting the factorability of the correlation matrix.

As depicted in Table 2, the first six items—reflecting the original Self-D factor—loaded significantly on the first factor (p < 0.001): meanloadings = 0.635; SDloadings = 0.161, ranging from 0.389 (item#1) to 0.784 (item#3). However, item#2 and item#4 showed significant cross-loadings (respectively: λ = − 0.241; λ = − 0.156), albeit lower than 0.3. The last six items—reproducing the original FES—loaded significantly on the second factor (p < 0.001): meanloadings = 0.631; SDloadings = 0.07; ranging from 0.556 (item#10) to 0.746 (item#9). Still, four items (item#7, item#9, item#10 and item#11) showed significant cross-loadings (respectively: λ = 0.276; λ = − 0.170; λ = 0.233; λ = 0.213), although lower than 0.3. Furthermore, EFA revealed a significant correlation between the two factors: r = 0.444; p < 0.001. Cronbach’s alpha exhibited an acceptable internal consistency: Self-D: α = 0.815; FES: α = 0.827; total score: α = 0.852.

Study2

Methods

Participants and procedure

According to (A) Monte Carlo study simulation [50] and (B) the ratio of subjects-to-variables rule-of-thumb [45, 46], the sample was composed by 446 adult inpatients with overweight and obesity (BMI ≥ 35) [170 males (38.1%) and 276 females (61.9%)] aged from 18 to 81 years (mean = 51.69, SD = 13.09). Patients were enrolled at the IRCCS Istituto Auxologico Italiano, San Giuseppe Hospital, Verbania (Italy) and tested within 3 days from their admission into the 1-month hospital-based program for weight reduction and rehabilitation. Inclusion/exclusion criteria described in Study1 were applied. Complete descriptive statistics are presented in Table 1. Participants completed: (A) the demographic and anthropometric measures form—described in Study1; (B) the WSSQ; (C) the Weight Bias Internalization Scale (WBIS); (D) the Body Uneasiness Test (BUT); and (E) the Center for Epidemiologic Studies Depression Scale (CES-D). To assess test–retest reliability, a random subsample of 40 subjects was also retested at discharge from the hospital (after ± 28 days)—to ensure no memory effect [51]. During the hospitalization period, none of the patients received psychological interventions focused on the reduction of IWS.

Measures

Weight Bias Internalization Scale (WBIS)

The WBIS [18, 52] is an 11-item—single-factor—self-report scale. Developed for people with overweight/obesity, the WBIS measures the agreement with self-directed negative statements about negative weight-related stereotypes. Higher scores are indicative of a higher weight-related bias own perception. In this study, the WBIS Cronbach’s alpha was 0.840.

Body Uneasiness Test (BUT)

The BUT [53] is a 34-item self-report questionnaire evaluating body uneasiness and concerns for physical appearance. Not designed for a specific target population, the BUT comprises a Global Severity Index (GSI) and 5 subscales: (I) fear regarding the body’s weight (Weight Phobia: WP), (II) over-concern related to physical appearance (Body Image Concerns: BICons), (III) body image-related avoidance behavior (Avoidance: A), (IV) compulsive checking of physical appearance (Compulsive Self-Monitoring: CSM), (V) estrangement feelings toward his/her own body (Depersonalization: Dep). In the present study, the BUT Cronbach’s alphas were: GSI = 0.960; WP = 0.849; BICons = 0.911; A = 0.873; CSM = 0.723; Dep = 0.867.

Center for Epidemiologic Studies Depression Scale (CES-D)

The CES-D [54, 55] is a self-report scale that assesses psychological and behavioral symptoms of depression. Developed for the general population, the CES-D is composed of 20-item loading on a single factor. Higher scores reflect greater depressive symptomatology. In the current study, the CES-D Cronbach’s alpha was 0.903.

Statistical analysis

All statistical analyses were carried out with R statistical software [41] and the following packages: psych [42], MplusAutomation [43], lme4 [56], and irr [57].

Item analysis was performed to check normality. Due to a non-perfect normal distribution, Robust Maximum Likelihood estimator (MLR) was used to confirm the factorial structure of the Italian WSSQ. The MLR is a ‘robust’ variant of Maximum likelihood—which has many advantages over other estimators; i.e., computational efficiency [58]—providing robust standard errors and mean-adjusted χ2 statistics that are robust to non-normality and the violation of independence-of-observation assumption [45, 46, 58,59,60,61]. Model fit was tested using the χ2 statistic and the goodness-of-fit indices [45, 46, 62]: (A) the Root-Mean Square Error of Approximation (RMSEA) [63], (B) the Comparative Fit Index (CFI) [64] and (C) the Standard Root Mean square Residual (SRMR) [65]. A non-significant χ2 was desirable as indicating a better model fit [45, 46, 62]. The RMSEA expresses fit per degrees of freedom of the model, and values lower than 0.08 suggest an acceptable model fit [66, 67]. The CFI designates the amount of variance and covariance accounted by the model compared with a baseline, with values higher than 0.90 considered adequate [66, 67]. The SRMR represents the average discrepancy between the correlations observed in the input matrix and those predicted by the model [45, 65], and values lower than 0.08 are considered good [46, 66].

Multi-group CFAs were performed to test MI across (1) gender (male vs. female), (2) BMI (Class II obesity class vs. Class III obesity), (3) age (younger vs. older: grouped by the median split technique), and (4) previous hospitalization for obesity (occurrence vs. non-occurrence). MI entailed a series of hierarchical (nested) CFAs in which models parameters were consecutively constraint to be equals (invariants) across groups in a logically ordered and increasingly restrictive mode [45, 46, 67,68,69,70]. Nested model comparisons were conducted using (I) the Log-likelihood rescaled difference test (− 2ΔLL), (II) the test differences in absolute goodness-of-fit indices: CFI (ΔCFI > 0.01), RMSEA (ΔRMSEA > 0.015), and SRMR (ΔSRMR > 0.03) [71, 72]. The first step was to test the goodness-of-fit in the two groups, separately. The second step was to test the equivalence of the models (configural invariance). The third step (metric invariance) constrained the factor loadings to be equal across groups. The fourth step (scalar invariance) forced the equivalence of factor loadings and item’s intercepts across groups [45, 46, 67,68,69,70].

Moreover, items must provide information regarding the respondents: in other words, the item should be able to discriminate subjects with or without IWS. Therefore, following recommended guidelines [73], a series of independent sample t-tests were computed, and the Cohen’s d coefficient [74] was calculated to determine item discriminating power (IDP). These statistics refer to the magnitude to which the item discriminates the respondent’s level of the measured trait (low vs. high). A highly discriminating item divides clearly the respondents (d ≥ 0.80), a moderate discriminant item reveals regions of ambiguity (0.21 ≤ d ≤ 0.79) and a poorly discriminating item shows very small magnitude (d ≤ 0.20).

Moreover, since none of the participants received a psychological intervention to reduce IWS, temporal stability was assessed using a mixed-model repeated measure regression analysis accounting for random variability within individuals: a non-significant p value was expected.

Test–retest reliability was estimated using the two-way mixed intraclass correlation coefficient (ICCconsistency) [39, 51, 75,76,77]

Pearson’s correlation was also computed to assess the convergent validity between internalized self-stigma (WSSQ), weight bias (WBIS), depressive symptomatology (CES-D), body uneasiness (BUT), and the individuals’ BMI.

Results

Model’s fit and psychometric proprieties: structural validity

Item analysis revealed a non-perfect univariate normal distribution of some indicators (Table 3). Absolute skewness ranged from |0.004| for item5 to |1.410| for item12: meansk = 0.29; SDsk = 0.61. Absolute kurtosis ranged from |0.017| for item9 to |1.343| for item11: meank = -0.74; SDk = 0.74.

Table 3 Study2: item descriptive statistics, confirmatory factor analysis (CFA), item discriminant power (IDP) and item temporal stability (ITS)

Results from the CFA suggested a non-adequate two-related-factors solution for the Italian WSSQ: the χ2 was statistically significant [SBχ2 (53) = 231.219; p < 0.001] and both the RMSEA and the CFI did not meet the recommended threshold suggesting a non-ideal fit: RMSEA = 0.087; CFI = 0.887, despite an acceptable SRMR (0.066).

Exploration of modification indexes [45, 46, 49, 69] also revealed that the model could be improved by correlating residuals of item#10 and item#11 (Δχ2 = 41.345). More in detail, a deep examination of these items suggested a possible non-independence among these two indicators. After correlating these items’ residuals [45, 46, 49, 69], the model provided a better fit to the data (ritem#10-#11 = 0.396). Despite the Chi-square resulted statistically significant [SBχ2 (52) = 189.874; p < 0.001], the remaining fit indices overcame the threshold for good model fit: RMSEA = 0.077 [90% CI: 0.065–0.089], CFI = 0.913 and SRMR = 0.066.

As reported in Table 3, each item loaded significantly on its associated factor: meanloadings = 0.64; SDloadings = 0.15; ranging from 0.338 (item#1) to 0.856 (item#4). In addition, a moderate correlation between the two factors was found: r = 0.513, p < 0.001. Lastly, item’s explained variance (R2) ranged between 0.114 (item#1) and 0.733 (item#4) with a meanloadings of 0.43 and a SDloadings equal to 0.18.

Measurement invariance

Configural invariance. A configural invariance model was specified: the same two-factor model was estimated simultaneously within each group. A good model fit was found for: (A) gender, (B) BMI status, (C) age, and (D) previous hospitalization for obesity (Table 4). Achieved configural invariance suggests that the factor structure was similar across groups.

Table 4 Study2: Goodness-of-fit indices of the WSSQ model and measurement invariance across gender, BMI-status, age and previous hospitalization for obesity

Metric invariance. Equality of item factor loadings across groups was examined in a metric invariance model: all factor loadings were constrained to be equal across conditions. The metric invariance model well fit the data (Table 4). Compared to the configural model, no significant decrease in fit indices was found for (A) gender [− 2ΔLL (10) = 12.604; p = 0.247; ΔRMSEA = 0.003; ΔCFI = 0.002; ΔSRMR = -0.005], (B) BMI [− 2ΔLL (10) = 13.530; p = 0.196; ΔRMSEA = 0.002; ΔCFI = 0.008; ΔSRMR = -0.006] and (D) previous hospitalization for obesity [− 2ΔLL (10) = 7.587; p = 0.669; ΔRMSEA = 0.004; ΔCFI = -0.001; ΔSRMR = -0.003]. Inversely, a significant loss of fit compared to the configural model was found for (C) age in the Log-likelihood rescaled difference test [− 2ΔLL (10) = 21.411; p = 0.018]. Achieved metric invariance indicated that the items were equivalently related to the latent factor in each group.

Scalar invariance. Equality of item intercepts was examined in a strict invariance model: all factor loadings and all item intercepts were constrained to be equal across groups. The model well fitted the data for each condition except for gender (CFI = 0.898). No decrease in fit indices compared to the metric invariance model was found for (B) BMI [− 2ΔLL (10) = 17.264; p = 0.069; ΔRMSEA = 0.001; ΔCFI = 0.005; ΔSRMR = − 0.002] and (D) previous hospitalization for obesity [− 2ΔLL (10) = 9.432; p = 0.492; ΔRMSEA = 0.002; ΔCFI = 0.001; ΔSRMR = − 0.002]. Conversely, a significant model decrease in fit was observed for (A) gender [− 2ΔLL (10) = 37.873; p < 0.001; ΔRMSEA = − 0.004; ΔCFI = 0.017; ΔSRMR = − 0.002]. As scalar invariance was achieved, it indicated that groups had the same expected item response at the same absolute level of the trait.

Psychometric properties

The analysis of IDP (reported in Table 3) revealed that each of the 12 items well discriminates between subjects without IWS from individuals with IWS—with a high magnitude (d) in all cases. More in detail, the absolute values of the discrimination parameter (t-test: ti) ranged from |8.57| (item#2) to |25.31| (item#7), with an associated effect size that ranged respectively from 1.00 (item#2) to 2.90 (item#7)—meant = -15.71; SDt = 5.21; meand = 1.82; SDd = 0.60. With no exception, all the discriminating values suggest a strong relationship between each of the WSSQ item and the underlying IWS construct.

Temporal stability and test–retest reliability provided good results: Self-D, ICC(T1,T2) = 0.883 (95% CI: from 0.779 to 0.938), p < 0.001; FES: ICC(T1,T2) = 0.852 (95% CI: from 0.721 to 0.922), p < 0.001; Total score: ICC(T1,T2) = 0.886 (95% CI: from 0.785 to 0.940), p < 0.001).

Moderate-to-large bivariate correlations indicated that both WSSQ factors and the WSSQ total score were positively and significantly associated with WBIS, CES-D, and BUT. A significant small correlation was also found between BMI and FES factor, whereas—in line with previous literature—no significant correlation was found between WSSQ and BMI for the Self-D subscale and the total score (Table 5).

Table 5 Study 2: correlations between measures

Discussion

The alarming rates of obesity have brought widespread attention to the medical consequences of this public health problem [12, 15, 16, 32, 78, 79]. Stigmatization and discrimination are still experienced in different conditions [80,81,82], and in individuals with overweight/obesity these biases might lead to serious health consequences [17, 18, 26,27,28,29]. Interventions aimed to specifically address the IWS are urgently required, and more research is needed to support and improve them [23].

Across studies, languages and cultures [17, 19, 35, 36], the WSSQ was shown to be a valid and reliable tool measuring weight self-stigma [17].

The present study is the first aimed to explore and assess the psychometrical proprieties of the Italian version of the WSSQ in a sample of adult inpatients with overweight/obese. In addition, it is unique in examining the WSSQ measurement invariance across several conditions.

In line with previous researches [83, 84], the first study was aimed to explore the structure of the WSSQ. In line with its original validation study [17], the exploration of the factorial structure of the WSSQ (Study1) indicated the presence of two factors (“Self-devaluation” and “Fear of enhanced stigma”) accounting for 55.04% of variance Moreover, items’ significant cross-loadings were lower than the threshold of 0.3—confirming this factorial solution.

In Study2, the two-factor structure of the WSSQ was further confirmed: results of the CFA revealed that—after correlating residuals for item#10 and item#11—the questionnaire shows a good fit to the data. Moreover, the two-factor solution of the WSSQ—Italian version was in line with previous studies validating the tool in other languages (English [17], German [19], Arabic [35], and French [36]). Each subscale of the Italian WSSQ showed reliability indices over the cutoff value for adequate internal consistency [39]—both in Study1 and Study2—in particular, the Self-D’s alpha was higher than those reported for the German and the Arabic versions of the WSSQ (0.74 and 0.75, respectively), but lower than in its original validation study (0.87). The FES’s alpha showed an almost perfect overlap with the German version (0.83) and it was higher than those of the original and the Arabic versions of the tool (0.81 and 0.78, respectively). The Italian WSSQ also demonstrates good test–retest reliability, with slightly different values from the work of Lillis and colleagues [17].

Moreover, in line with the existing literature, positive and statistically significant relationships were found between both factors of the WSSQ and the convergent measure of weight-related stigma (WBIS), the BUT-body image concerns subscales, and depressive symptomatology (CES-D). Although a non-significant association between Self-D and BMI was detected in the present as in previous research (i.e., German, Arabic, and French versions), it was not in its the WSSQ original validation study [17]. In addition, the correlation between FES and BMI was consistent with the study of Lillis and colleagues [17], but in contrast with other validation studies [19, 35, 36]. These results suggest a potential cross-cultural influence in the relationship between BMI and self-stigma interpretation.

To our knowledge, this is the first study examining the WSSQ measurement invariance across gender, BMI, age, and the occurrence of previous hospitalization(s) for obesity. Age was shown to affect item responses; indeed, only configural invariance was achieved. These results suggest that the factor structure of the WSSQ is the same for both age group, but also that they interpret the items differently and at a different absolute level of the trait. In line with the results of Maïano and colleagues [36], measurement invariance suggested a possible effect of gender on the WSSQ responses: only metric invariance was held, indicating that the same latent factors were measured in each group, but there was a non-equivalence of the trait’s level measured (items intercept).The WSSQ also demonstrated scalar invariance across BMI and the occurrence of previous hospitalization(s) for obesity, indicating the equivalence of the factor structure model, item factor loadings, and item intercepts across these groups. These results showed that—within these two variables—the observed differences in item means (between groups) are due to factor mean differences only.

Among, the limitations of the study we consider the self-report nature of selected measures—whose answers might have been influenced by social desirability and items’ comprehensibility. Moreover, a convenience sample of adult inpatients with overweight and obesity was recruited for this study—and future research should be conducted on outpatients or children/adolescents to confirm the reliability of this findings. Despite a WSSQ post-treatment measure, the small sample size (n = 40) prevented from assessing longitudinal MI—the collection of more data at the end of hospitalization could solve this issue.

Despite these limits, the present study is the first consistently increasing knowledge in the field since—compared to previous WSSQ validation studies—the invariance of the questionnaire was tested across several conditions, and the Italian WSSQ can, therefore, be considered a valid and reliable instrument for assessing weight self-stigma in adults with overweight and obesity.

Regarding the implications for the clinical and research field, the WSSQ represents a useful tool that—given its brief and agile administration—it is suitable both for hospital-based [85,86,87] and at-distance technology-based interventions [88,89,90] aimed at monitoring and promoting the individuals psychological health over time [91,92,93].

Conclusion

The Italian WSSQ is a solid tool for the measurement of IWS and can extend the research concerning stigma in obese people. This is of great importance for improving treatment outcomes in clinical practice, as individuals with high IWS are at greater risk for psychosocial impairments [17, 19, 35, 36, 94] and failure to weight loss/maintenance [23, 95]. The WSSQ can be used to properly identifying those subjects who would benefit from specialized interventions aimed at reducing IWG, also allowing the evaluation of the impact of such treatments.

What is already known on this subject?

Weight stigma is a widespread complex problem among individuals with overweight and obesity.

The Weight Self-Stigma Questionnaire (WSSQ), is a solid, valid and reliable tool for the measurement of weight-related bias in overweight and obese people.

The WSSQ was successfully translated and/or validated in several languages such as English, German Arabic, and French.

What does this study add?

The present study aimed to validate the Italian version of the WSSQ in a sample of inpatients with overweight and obesity.

This study for the first time aimed to test—psychometric properties of the WSSQ such as measurement invariance across gender, body mass index (BMI), age, and occurrence of previous hospitalization(s) for obesity.

The factorial structure of the WSSQ was successfully replicated across two independent studies (Study1 and Study2).