Introduction

Psychotic spectrum disorders (PSDs), such as schizophrenia and psychotic mood disorders, affect around 3.5% of the global population [1] and are considered to be among the top 25 contributors to disability worldwide [2]. They are also among the mental illnesses associated with the highest economic costs for health care services, partially due to repeated hospitalisations [3, 4]. Internationally, as much as two-thirds of the current psychiatric inpatient population are experiencing psychosis [5], also being the group most frequently subject to involuntary admissions [6].

During acute crises, patients with PSDs can pose high risks to themselves and others, requiring treatment in acute psychiatric inpatient wards (also known as secure, locked or acute wards) [7]. In contrast to open wards, where inpatients are treated after their most severe symptoms have subsided, acute psychiatric inpatient wards often focus primarily on psychopharmacological treatment rather than psychological interventions, resulting in on-going patient dissatisfaction [8]. The lack of psychotherapeutic activity moreover contrasts with treatment guidelines, which recommend psychological interventions for psychosis already in the acute treatment stage [9, 10] to improve patients' functioning and support recovery [11, 12]. Interestingly, recent systematic reviews and meta-analyses report heterogeneous findings for guideline-recommended cognitive behavioural therapy for psychosis (CBTp) in acute psychiatric inpatient settings [13,14,15]. However, promising evidence supports the efficacy of third-wave therapies like Acceptance and Commitment Therapy (ACT) and CBT approaches integrating third-wave components, such as Metacognitive Training (MCT) [13,14,15].

Disorder-specific CBTp protocols aim to change the appearance and nature of psychotic symptoms [16]. In contrast, third-wave therapies focus on how individuals process and manage experiences while encouraging a mindful and accepting attitude towards them [17]. They also often directly focus on targeting transdiagnostic change mechanisms that are thought to positively impact treatment outcomes [18]. In this context, change mechanisms rely on psychological processes found to be responsible for the onset and maintenance of disorders [17]. In the case of psychosis, third-wave approaches have a particular interest in various aspects of impaired metacognitive processes and associated metacognitive change mechanisms [19, 20]. More precisely, approaches try to enhance patients’ critical awareness of own thoughts (“thinking about thinking”) [21] in order to change immediate thought-related reactions [19]. MCT, for example, aims to promote patients’ cognitive insight via raising metacognitive awareness and knowledge for cognitive biases [22] and has demonstrated significant effectiveness in reducing positive symptoms [16, 23, 24]. ACT on the contrary, although not categorized specifically as a metacognitive therapy, also incorporates several metacognitive elements. Key ACT concepts such as mindfulness, acceptance, cognitive defusion (ACT term for cognitive distancing), and value commitment [25], are associated with metacognitive awareness and functional metacognitive goals and strategies [26,27,28]. With regard to acute inpatients with PSDs, ACT-based interventions have been shown to reduce general psychopathology and rehospitalisation rates [11, 29, 30].

While altering cognitive responses to experiences instead of directly challenging them seems to be especially helpful in treating acute psychotic symptoms [31], existing evidence has to be approached with caution [13,14,15]. Apart from the current small evidence base and methodological shortcomings, most of the metacognitive interventions for psychosis that have been studied were originally developed for outpatients [32,33,34,35] or for inpatients with mild to moderate symptoms [22] and were not tailored to fit the unique characteristics of acute psychiatric settings and inpatients [13, 36]. These include restrictive environments, high economic pressure, brief admissions, and acutely unwell patients likely to pose high risks, have multiple disorders, cognitive difficulties and low motivation for treatment [7]. Given the urgent need to improve acute inpatient care, yet a remaining substantial research gap, studies are needed to investigate the feasibility and effectiveness of adapted interventions [37].

Therefore, the present research aimed to examine the feasibility and acceptability of a novel modularized and mechanism-based treatment, while evaluating preliminary clinical outcomes and alterations in potential change mechanisms. More precisely, the current study extended our previous work [36] on designing an adapted metacognitive treatment using Intervention Mapping [38] as suggested by best practice guidelines on complex intervention development [39]. Specifically, the novelty of the intervention (see Supplementary Material and our previous work for details) [36] is that it (1) focuses directly on underlying transdiagnostic metacognitive change mechanisms (cognitive insight and cognitive defusion) rather than on specific symptom content, thus following a current paradigm shift towards mechanism-based psychotherapeutic treatments [18, 40, 41], (2) combines and integrates different existing evidence-based therapeutic approaches in a hybrid and modularized approach allowing for tailored treatments and greater flexibility [42, 43], (3) is delivered in a group format to take advantage of social support and optimal resource use [44, 45], and (4) adapts all therapeutic elements to be brief, flexible and low-key to meet the needs of acute inpatients with PSDs [37].

We hypothesised that (1) feasibility and acceptability measures would exceed the 80% benchmark necessary to proceed to a fully powered effectiveness randomized controlled trial (RCT) [46]. Furthermore, we assumed that (2) participants would show significant improvements (compared to baseline) on general psychopathology, positive and negative symptoms, symptom distress, symptom severity, and functioning, and that (3) targeting metacognitive treatment mechanisms would lead to positive changes, as evidenced by increased cognitive insight and decrease in cognitive fusion (i.e. greater cognitive defusion from internal experiences).

Materials and methods

Procedure and participants

Between May 2021 and February 2022, we recruited a total of N = 37 participants from the acute psychiatric inpatient ward of the Max Planck Institute of Psychiatry in Munich, Germany for the study. Within this period, nine group therapy cycles were conducted. After a standardized screening process, eligible participants were briefed about the study’s procedures and written informed consent was obtained. Enrolment into the group therapy was possible at the beginning of each module. The screening procedure and all rater-based assessments were either conducted by a clinical psychologist or psychiatrist in training. Inclusion criteria were: (1) aged between 18 and 70; (2) diagnosed with a PSD (ICD-10 codes F20-39); and (3) able to give informed consent. Exclusion criteria were: (1) severe neurological or internal concomitant diseases; (2) IQ < 80, severe learning disability, brain damage or pervasive developmental disorder; and (3) missing eligibility for psychotherapy because of missing language skills, hostile or uncooperative behaviour. Our sample size of N = 37 participants exceeded the suggested benchmark of N = 20 participants required to evaluate the feasibility, acceptability and preliminary effectiveness of a group therapy intervention [47], also for studies with PSDs [48,49,50]. Following guidelines on conducting feasibility studies, we employed a non-randomised exploratory pre-post design closest to a Phase II early clinical trial [51, 52] (see Fig. 1), suitable for assessing and maximizing the intervention’s potential effectiveness for future research [39]. Outcome measures were taken at baseline (timepoint T0), before and after each therapy module (timepoints T1, T2, T3, T4, T5) and post intervention (timepoint T6). Rehospitalisation data was examined up to 12 months after completion (timepoint T7). Our study received approval from the ethics committee of the Medical Faculty at Ludwig Maximilian University Munich (PNO-21-0025) and was pre-registered in ClinicalTrails.gov (TRN04874974-2021.04.26).

Fig. 1
figure 1

The study and intervention design

Modularized metacognitive group intervention

We designed the metacognition-focused and modularized group therapy as an experimental group in addition to the already existing mechanism-based therapy concept of the acute psychiatric inpatient ward (see Supplementary Methods 1 and Supplementary Fig. 1 for an overview) [36]. The five-week group intervention consisted of nine stand-alone sessions (two per week) divided into three modules targeting various metacognitive and social change mechanisms, with the overall goal of enhancing cognitive flexibility (see Supplementary Fig. 2 for underlying therapy model). Modules I and II aimed to enhance attentiveness to internal experiences by promoting metacognitive awareness and knowledge and hence cognitive insight for cognitive distortions. Module III focused on reducing distress and automatic relational responses through cognitive defusion and therewith strengthen metacognitive goals and strategies. Module I contains mainly psychoeduactional material and exercises on metacognition (cognitive biases and dysfunctional coping strategies), adapted in a transdiagnostic way from the Metacognitive Training for depression [53]. Therapy contents for Module II were adapted from the “acute version” of the Metacognitive Training for psychosis by Moritz and Woodward [22, 54]. Module III includes adapted exercises from the Metacognitive Therapy by Wells and Matthews [35] and the Acceptance and Commitment Therapy by Hayes [34]. A description of sessions’ contents can be found in Supplementary Table 1. To address the specific characteristics of acute inpatients with psychotic symptoms, such as low illness insight, treatment resistance [55], severe cognitive deficits [56], and comorbid diagnoses [57], we designed the contents to be transdiagnostic, experiential, and easy-to-comprehend. Information was presented on simple PowerPoint slides, group sizes were kept small with no more than seven participants, and each session lasted a maximum of 60 min. Sessions were carried out by a psychotherapist trained in CBT who adopted an empowering and self-disclosing therapeutic attitude [58]. Due to the naturalistic study design, participants were allowed to participate in one other group therapy, received weekly individual psychotherapy sessions and additional routine care (described in Supplementary Methods 2) within the acute inpatient setting. Any other interventions participants were involved in were documented. Risk assessments and evaluations were conducted regularly during group sessions and team meetings with medical staff. Pre-specified adverse events included: symptom aggravation, new symptoms, treatment misuse, increased suicidality, and negative impact on work or social network. The assessments were documented using standardized checklists proposed by Linden [59]. In case of a serious adverse event (attempted suicide) related to the intervention, the termination of the study was determined.

Outcome measures

Primary outcomes consisted of measures operationalized to assess the feasibility and acceptability of the intervention and study evaluation design. Secondary outcomes included multiple clinical measures that were used to evaluate the preliminary effectiveness of the intervention. Demographic information was collected at baseline via a self-reported questionnaire, supplemented by the clinical record. Baseline medication and any changes during the course of the study were recorded using participant’s medical records. Table 1 presents an overview of all study instruments and the sequence of their administration at each of the timepoints.

Table 1 Measurements across each timepoint 

Primary outcome measures

Using the CONSORT extension to pilot and feasibility studies [60] feasibility data included: (1) eligibility rate, (2) consent rate, (3) trial entry rate, (4) completion and missing data rate, (5) retention rate, (6) dropout rate, (7) patient engagement, and (8) adverse events. Acceptability, subjective effectiveness and participants’ treatment satisfaction with each module and the whole intervention was measured with a five-point Likert scale self-report questionnaire (see Supplementary Methods 3) adapted from Moritz and Woodward [61]. Additionally, all participants were invited to give general feedback on the group therapy and study conditions in semi-structured interviews conducted at study completion (see Supplementary Methods 4). Following guidelines on evaluating pilot studies [46, 62], feasibility and acceptability criteria were benchmarked a priori with a traffic light system on recruitment, retention and attendance rate as well as patients’ overall treatment satisfaction: red (not feasible < 60%), yellow (modify intervention and protocol ≥ 60% < 80%), and green (continue without modifications > 80%) [63,64,65].

Secondary clinical outcome measures

General psychopathology as well as negative and positive symptoms were rated with the Positive and Negative Syndrome Scale (PANSS), a clinician-administered 30-item semi-structured interview [66]. On the three different scales (positive, negative and global symptom scale), items are scored on a seven-point Likert scale between 1 (not present) and 7 (severe). The PANSS demonstrates strong internal consistency, indicated by a Cronbach's α = 0.73 and a high inter-rater reliability (between 0.83 and 0.87) [67].

Symptom distress was measured with the Psychotic Symptom Rating Scale (PSYRATS), a 17-item clinician-administered semi-structured interview. On two different subscales (auditory hallucinations and delusions), different dimensions (e.g. controllability, severity and intensity of distress and disruption) of hallucinations and delusions are rated between 0 (not present) and 4 (highest possible distress). The PSYRATS is reported to have a good internal consistency with a high inter-rater reliability (between 0.79 and 1.00) [68].

The level of functioning was assessed using the Global Assessment of Functioning (GAF), a clinician-administered rating scale. The GAF scale considers both symptoms and functionality, and its scores range from 1 (indicating a risk of self-harm or harm to others) to 100 (suggesting the absence or minimal presence of symptoms). It demonstrates a good internal consistency with Cronbach’s α = 0.70 [69, 70], but has been criticised for its weak inter-rater reliability [71].

Symptom severity and treatment response to the intervention was rated on the Clinical Global Impression (CGI) rating scales, a one-item clinician-administered assessment [72]. On the severity scale (CGI-S), the severity of an individual’s illness is evaluated relative to the clinician’s past experience on a seven-point Likert scale from 1 (not at all ill) to 7 (among the most extremely ill patients). The improvement scale (CGI-I) quantifies the individual’s improvement or worsening since the start of the intervention from 1 (very much improved) to 7 (very much worse) [73]. The CGI is one of the most widely used rating scales in mental health trials and several studies demonstrated its validity by linkage to rating scales such as the PANSS [74].

Disability and functional impairment were estimated using the World Health Organization Disability Assessment Schedule 2.0 (WHODAS-2.0), a 12-item self-report questionnaire [75]. The six disability dimensions (social, cognitive, society, self-care, household, and mobility) of the International Classification of Functioning (ICF) [76] serve as subscales in the questionnaire. These are rated using a five-point Likert scale (1 = no disability to 5 = very strong disability). The WHODAS shows good reliability (Cronbach’s α = 0.89) [77, 78]. As suggested in the literature, inpatients with psychosis tend to overestimate their functioning [79], so we introduced an additional rater-corrected WHODAS score when a participant lacked the insight to answer the questions objectively. Following the approach of Gspandl et al. [80] and the DSM-5’s WHODAS-2.0 Clinician Administration guide [81], we used information from proxy respondents such as family members and carers, as well as clinical judgement, to record a question-by-question “corrected” score alongside the participant's self-reported “raw” score.

The hypothesised metacognitive change mechanism of cognitive insight was determined using Beck’s Cognitive Insight Scale (BCIS), a 15-item self-report questionnaire. The BCIS contains two subscales, self-reflection and self-certainty regarding one’s thoughts and experiences, which are rated using a four-point Likert scale from 0 (do not at all agree) to 3 (agree completely). It presents acceptable internal consistency with Cronbach's α = 0.60–0.68 [82].

To assess the potential change mechanism of cognitive defusion, the Cognitive Fusion Questionnaire (CFQ) was used. The seven-item self-report questionnaire measures the extent to which an individual's behaviour is influenced by thoughts (cognitive fusion), using a seven-point Likert scale ranging from 1 (never true) to 7 (always true). Previous studies have demonstrated its high internal consistency (Cronbach’s α = 0.89–0.93) [83, 84].

Rehospitalisation rates (to the same unit or psychiatric hospital) during the follow-up period were monitored exploratory using internal patient chart records.

Data analysis

In line with the CONSORT guidelines on reporting pilot and feasibility studies [60], we focused the analysis on descriptive statistics for feasibility and acceptability measures using frequencies and percentages. Thematic analysis [85], a systematic approach to organize, encode, and analyse patterns (themes) within qualitative data, was employed for the semi-structured interviews. Changes in dosages of psychotropic medication from baseline to post-intervention were compared by computing dose equivalents [86] and conducting parametric (paired t-tests) or non-parametric (Wilcoxon’s signed ranks) tests depending on the data’s distribution.

Intraclass Correlation Coefficients (ICC) for all secondary outcomes (0.25–0.67) provided evidence for a nested data structure [87, 88], so we used linear mixed models (LMMs, for details see e.g. [89]) via the maximum likelihood method to estimate participants’ changes on secondary clinical measures (i.e., post–pre treatment comparison) [90]. In all our LMMs, the measurement occasions of the outcomes were represented as a binary-coded time variable with 0 (i.e., baseline measure before treatment) and 1 (i.e., post-intervention measure). The time variable was added as a fixed effect on the within-participant level, while participants’ ID was treated as a random effect [91, 92]. All our LMMs controlled for potential confounders by including the covariates sex, age, psychotherapeutic treatment dosage (group and total), and medication change scores (antipsychotic and antidepressant), that we selected based on previous research findings [93].

For investigating clinically significant changes over treatment time, we referred to the recommended criteria of 25% and 50% of improvement indicated by percentage of PANSS total scores reduction from baseline and to the CGI-improvement scale cut-offs [94, 95]. Finally, for exploratory rehospitalisation rates, we calculated the proportion of participants readmitted to the same unit or hospital within the follow-up period. All statistical analyses were conducted using R Software, version 4.1.2 [96].

Results

Baseline demographic and clinical characteristics and changes in the participants’ medication regime are shown in Tables 2 and 3. There were no significant differences in the antipsychotic medication dosages between baseline and post-intervention. However, we found significant changes for antidepressants and benzodiazepines.

Table 2 Demographic and clinical characteristics of participants (N = 37)
Table 3 Participants’ medication regime at baseline and post-intervention

Feasibility and acceptability

The study’s CONSORT chart is illustrated in Fig. 2. In terms of feasibility, the eligibility and consent rates were 75.8% and 78.7% respectively, while the trial entry rate was 100%. The completion rate for all clinical assessments and between-module feedback questionnaires was high at 99.4%. All participants attended at least one module, resulting in a dropout rate of 0%. 33 of the 37 participants completed all three modules leading to an overall retention rate of 89.2%. Session attendance was consistently high with 86.5% of participants attending at least six sessions, i.e. two thirds of the total intervention. Five participants experienced a total of seven adverse events over the course of the study. These included one negative impact on work, one appearance of new symptoms and five symptom deteriorations. None was related to the intervention.

Fig. 2
figure 2

CONSORT flow diagram of the recruitment, assessment and treatment process. Feasibility measures were defined as: 1) eligibility rate (proportion of those eligible to participate as a percentage of those screened); 2) consent rate (proportion of those who signed the informed consent as a percentage of those who were approached to participate); 3) trial entry rate (proportion of those who consented and completed baseline measures); 4) completion and missing data rate (proportion of assessments completed at each time point including screening, baseline, intervention and final meeting and reasons for missing data); 5) retention rate (proportion of those who began the treatment and completed all three modules); 6) dropout rate (patients you entered the trial, attended at least one therapy session and dropped out before completing at least one module); 7) patient engagement (proportion of those attending at least two thirds of the intervention, i.e. six sessions, as well as the reasons for non-attendance); and 8) adverse events (any unwanted events related to the intervention)

Participants’ acceptability and satisfaction with the group intervention was high (see Table 4), with 85.2%, 91.9%, 91.4% and 80% of the participants rating their treatment satisfaction for Modules I, II, III, and the overall treatment respectively with the highest possible rating (applies to a great extent or applies exactly). Illustrative open-ended feedback quotes (see Table 4) on each module and on the group therapy as a whole further support participants' satisfaction with and positive insights gained from the group therapy. Greater details concerning attendance data, complete presentation of the qualitative feedback on the questionnaires, participation in supplementary treatments and therapy content of individual therapies can be found in Supplementary Tables 2–5.

Table 4 Participants’ quantitative and qualitative feedback on each module and the overall treatment

25 of the 37 participants agreed to participate in the voluntary semi-structured feedback interview following study completion. Regarding positive group aspects, topics included helpful therapy contents, e.g. defusion techniques, and supporting environment, e.g. positive group atmosphere (see Supplementary Fig. 3). Themes identified for insights through therapy were gains in metacognitive abilities, e.g. thought awareness and recontextualisation (see Supplementary Fig. 4). Themes related to intervention deficiencies included e.g. tight session schedules and too few practical exercises (see Supplementary Fig. 5). Lastly, themes concerning the study and group setup comprised e.g. shortening session duration (see Supplementary Fig. 6). Examples of participants’ quotes and identified codes that support themes can be found in Supplementary Table 6 and 7.

Secondary clinical outcomes

The results of the LMMs (see Table 5) revealed significant medium-to-large post-intervention reductions from baseline for all secondary clinical outcomes, except for the self-rated WHODAS measuring disabilities and functional impairments. More precisely, we found reduced general psychopathology (b = − 17.03, 95% CI: − 23.78, − 10.27, d = -0.93), positive (b = − 6.59, 95% CI: − 8.64, − 4.53, d = − 1.24) and negative symptoms (b = -3.05, 95% CI: − 5.02, − 1.08, d = − 0.53), symptom distress (b = -12.07, 95% CI: − 16.88, − 7.26, d = − 0.99), symptom severity (b = − 1.04, 95% CI: − 1.56, − 0.53, d = − 0.97) and increased levels of global functioning (b = 19.72, 95% CI: 14.89, 24.56, d = 1.58). We also found a post-treatment reduction for the adjusted WHODAS-score (b = − 5.26, 95% CI: − 7.94; − 2.57, d = − 0.67). Regarding hypothesised change mechanisms, we found a significant post-module reduction in self-certainty after Module II (b = − 1.64, 95% CI: − 2.84, − 0.45, d = − 0.45) and in cognitive fusion after Module III (b = − 4.52, 95% CI: − 8.24, − 0.81, d = − 0.43). Time effects on secondary clinical outcomes were not alternatively explained by differences in sex, age, psychotherapeutic treatment dosage, or medication change since we controlled for these covariates in our LMMs.

Table 5 Effect of time on secondary outcome measures using linear mixed models

Analyses of clinically significant change in means of relative changes in PANSS total scores from baseline are shown in Table 6 [94]. At post-intervention, 75% of the refractory and 36% of the non-refractory participants fulfilled the response criteria. According to responder cut-off definitions on the CGI-improvement scale (at least minimally better) [94], 91.9% of the participants responded to the treatment. At 12-month follow-up, 16.2% of the participants were readmitted to our hospital one or more times (up to three times).

Table 6 Percentage changes from baseline in PANSS total scores as responder rates

Discussion

Given the significant individual and economic burden associated with exacerbations of psychotic disorders and hospitalisation, improving inpatient treatment is a critical concern for healthcare services [37]. An important contribution in this respect is the development of interventions targeting mechanisms of therapeutic change [97, 98] that are moreover adapted to the specific needs of acute inpatients [99]. The present study is the first exploratory study conducted within an acute psychiatric inpatient ward that investigates the feasibility, acceptability, and clinical outcomes of a mechanism-based and modularized group intervention targeting metacognitive change mechanisms in acute psychosis.

Results from the trial suggest that our group intervention was both feasible and acceptable, meeting the desired criteria for feasibility trials as outlined in guidelines [46, 100]. Despite COVID-19 pandemic-related challenges such as temporary closed wards and group format limitations, we recruited 37 participants within nine months, exceeding our pre-set recruitment target of 20 patients. Retention and attendance rates were both above the 80% benchmark, with overall satisfaction ratings exactly reaching the 80% acceptability target. The low dropout and missing data rates, and participants’ positive feedback in the questionnaires and interviews, further indicate high commitment and satisfaction with the treatment. Despite high symptom burden among participants with PANSS total scores comparable to average inpatients with acute psychosis [101, 102], there were no related adverse events, indicating the intervention's safety. Overall, our study results on feasibility and acceptability align with previous research, indicating that group psychological interventions are feasible, safe, and acceptable for inpatients with PSDs in acute care settings [16, 63, 103, 104]. This adds to the growing evidence contradicting the idea that psychotherapy is neither feasible nor helpful for this specific patient population [63].

Our LMMs moreover revealed promising results with medium-to-large effect sizes supporting hypothesised improvements on all secondary clinical measures. The decrease in negative symptoms is particularly noteworthy, as they greatly impair the functioning of those affected and have been reported to be resistant to pharmacotherapy and psychosocial treatments [105]. Participants in our study had significantly lower rehospitalisation rates compared to the average readmission rate of 50% within a year [106]. However, it's important to note that this interpretation is limited, as we only had access to readmission data from our hospital and not from other hospitals where patients may have been admitted during the follow-up period. The response rates in terms of PANSS reduction and CGI improvement moreover exceeded those of sole antipsychotic drug trials [102, 107], further supporting the potential clinical benefit of our mechanism-based intervention and meriting exploration in a larger scale study. Our findings are also consistent with above mentioned studies, which, next to demonstrating positive feasibility and acceptance, likewise presented preliminary encouraging results on clinical outcomes such as PANSS and WHODAS [16, 63, 103, 104].

Furthermore, our findings on assumed change mechanisms add support to the proof-of-concept of our underlying metacognitive treatment model. The post-Module-II improvements on cognitive insight measured with BCIS thereby are consistent with previous studies reporting immediate small post-intervention effects on self-certainty scores, with positive effects on self-reflectiveness showing only at the six-month follow-up [108, 109]. This suggests a previously discussed “sleeper” effect of MCT [110], that needs further exploration in future research studying long-term effects of cognitive insight [82, 108]. Significant post-Module-III reductions of cognitive fusion on the other hand are comparable to previous research reporting medium effect size changes in CFQ scores after four weeks of mindfulness-based group therapy for inpatients with PSDs [63]. Literature moreover discusses the mediating role of cognitive defusion in increasing psychological flexibility and thus fostering effective coping necessary for reducing symptom believability, subjective symptom severity, and psychosis-related distress in acute inpatients [111,112,113]. In summary, findings on potential change mechanisms underlying the respective modules were promising, but further exploration through mediation analyses in a randomized controlled trial is necessary before making viable statements [40, 114, 115].

Strengths and limitations

The major strengths in our study included the adherence to a pre-registered trial protocol, pre-set feasibility benchmarks, the use of well-validated qualitative and quantitative assessments (rater and self-report), the detailed assessment of psychotropic medication, and use of complementary treatment elements to control for potential confounding variables. Moreover, the broad inclusion criteria (e.g. no restriction on substance abuse or ECT) allowed capturing a diverse range of patients that were actively involved in the intervention refinement through codesign activities during the whole study period [116]. In addition, the use of a contextualized, flexible (modularized) and targeted (change mechanisms) treatment approach allowed for individualized and tailored interventions, increasing the potential for positive treatment outcomes in acute inpatients with psychosis [36]. Finally, our LMM analyses captured the nested structure of our data and delivered more valid standard error estimates than common analysis of variance. In addition, we controlled for several confounders in our LMMs making our results on time effects on the outcome variables more reliable  and unbiased, despite the small sample size.

As an exploratory phase II study, there are several methodological limitations to consider. Firstly, the lack of a control group and the absence of restrictions on additional treatment modalities make it difficult to reliably estimate the intervention’s effectiveness. Despite controlling for covariates, preliminary clinical outcomes need to be viewed with caution since the intervention's effectiveness cannot be conclusively determined yet. Secondly, the assessments and therapy were mainly carried out by the same researchers. While assessments were strictly conducted according to protocol, this could have led to biases. Nevertheless, there was consistency in the effects observed between rater-based and self-report measures. Thirdly, the small sample size limits the statistical power of our LMMs, although it can be considered sufficient to answer the question of feasibility and acceptability. Fourthly, no follow-up measurements were included to test lasting treatment effects on secondary clinical outcomes and change mechanisms. Fifthly, the overall positive feedback given in the open-ended sections in the modules’ feedback questionnaires and the semi-structured interviews may be the result of a selection bias, as only patients who were already specifically “motivated” may have chosen to answer and/or to participate. Lastly, participants’ personal therapy goals (see Supplementary Table 5) did not always match group contents. However, personal topics were discussed in individual sessions and treatment personalization will be subject to further research.

Future research should adjust therapy contents and the study’s framework according to participants’ feedback and feasibility measures, including bigger sample sizes, blinded assessments, randomization, and an active control condition not focusing on the targeted change mechanisms to explore the treatment’s internal validity [115, 117] (see Supplementary Table 8 for planned adjustments). To provide further proof-of-concept for the metacognitive-based treatment model, additional mechanism measures should be added, such as direct measures of cognitive biases e.g. jumping to conclusion (JTC) bias [118] and theory of mind (ToM) impairments [119], along with mediation analyses and follow-up timepoints (also including information on readmissions to other hospitals) to examine the effects of change mechanisms [114, 115, 117, 120, 121]. The ultimate goal is to identify moderators of outcome to ensure the intervention is matched to the patient’s need and personal therapy goals, hence providing personalized treatment [42, 122] (for further details see Supplementary Table 8).

Conclusion

Overall, the current results indicate that it is feasible and acceptable to conduct a mechanism-based and modularized group intervention focusing on metacognitive change mechanisms in acute psychiatric settings. The encouraging preliminary outcomes on clinical measures and change mechanisms moreover support the metacognitive treatment model. Further evaluation of the intervention and change mechanisms is warranted.