Introduction

Ultrasound (US) assessment of small joints is routinely used for the diagnosis of peripheral inflammatory arthritis (IA) and helps guiding therapeutic decisions in patients with established rheumatoid arthritis (RA) [1]. The access to musculoskeletal US services varies among hospitals and rheumatology services. The usefulness of the US examination for the diagnosis and management of RA depends on the level of expertise of examiners and quality of the US machines [2]. Even if a considerable proportion of patients with established RA continue to have subclinical inflammation, despite evidence of clinical remission, it is not cost-effective to screen them all. Extensive US examination of peripheral joints in RA had a good predictive value for disease outcome, as established by an 18-month longitudinal study [3]. As the US examination of multiple joints can be time consuming, several US scoring systems have been developed, aiming to assess a smaller number of joints without compromising on the quality of data collected [4].

There are no guidelines to help us decide which RA patients should have an US scan of their joints or how often, despite the constant effort to generate recommendations regarding the use of imaging techniques in the management of patients with RA [5]. There is no straightforward indicator of the risk of continuing with active joint inflammation, despite the use of several US prediction factors for disease progression and damage [68].

The OMERACT (Outcome Measures in RA Clinical Trials) initiative defined the US abnormalities characteristic for RA as synovial hypertrophy (SH), with or without power Doppler (PD) signal, tenosynovitis and erosions [4, 9].

The available US scoring systems use different quantitative or semiquantitative measures (such as grades of SH or PD) or a binary scoring system (such as presence/absence of erosions) to express US findings [4]. The principal aim of the international and European US expert groups is to develop a standardised US scoring system which will capture the patient’s global disease activity and which can be employed to guide therapeutic decisions [10].

As several outcome measures are required to establish if patients with RA have active disease or not, we identified the need to integrate clinical and laboratory parameters in a prediction model that could improve the quality of care we provide to our patients, by enabling the identification of those at risk of having positive PD signal in their joints. In an ideal situation, patients with a previous diagnosis of RA or with the suspicion of having developed IA are offered an US examination of their joints to increase the chance of correct diagnosis and optimise disease control. In reality, because of limited resources, patients are referred to US clinics in a selective manner, based on their clinician expertise and need, and availability of US resources. However, it is widely recognised that subtle joint inflammation is often missed by the clinical examination [11]. It was proposed that RA treatment should target the control of sub-clinical inflammation (as assessed by US or MRI), instead of being exclusively guided by clinical examination and laboratory measures [12], and that remission criteria for patients with RA should also include joint US examination [13, 14].

The rationale of our research project was to generate useful information for clinicians that can be easily applied in real life and can help identify patients at risk of having active inflammation in their joints. Our statistical modelling framework is based on data routinely collected from a heterogeneous RA population.

The aim of our study was to build a statistical model to assess the influence of several outcome measures on the presence of PD signal. The selected outcomes were as follows: the number of tender joints (TJC) and swollen joints (SJC) out of 28, global health state as assessed using a visual analogue scale (GVAS), high-sensitivity C-reactive protein (hsCRP), erythrocyte sedimentation rate (ESR), presence of rheumatoid factor (RF) and anti-citrullinated cyclic peptides antibodies (ACPA) and anti-nuclear antibodies (ANA), disease duration and medication. In order to generate this prediction model, we collected similar data from the two groups of patients (with suspicion of inflammatory arthritis (S-IA) and RA) and assessed the influence of every parameter on the patients’ risk of having active PD in their joints.

Methods

Subjects

We conducted a real-life, retrospective cohort study of patients seen to our US outpatient clinics in the order of their referral, between May 2013 and September 2013. We used an established protocol of US examination of hands comprising flexor tendons and 22 joint assessments (dorsal longitudinal and transverse views), which is the standard of practice for our US clinics. All the patients were referred to this clinic for inflammatory symptoms in their hands, and their clinical examination was equivocal. No inclusion or exclusion criteria were applied for the selection of patients, to ensure the general applicability of the prediction model we propose. Patients on oral steroids or NSAIDs, or who had steroid injections less than 4 weeks before the US scan, were excluded from the final analysis. We report data on the two largest groups, the patients with established RA and patients with S-IA, using descriptive statistics (SPSS, version 22). We also stratified patients based on their disease activity assessed using the disease activity score assessing 28 joints (DAS-28). Patient groups with different DAS scores were compared using ANOVA/chi-squared test depending on the variables analysed (p < 0.05 was considered significant). We also performed a cluster analysis of patients with similar DAS-28 scores using the aforementioned outcome measures.

Ultrasound examination

US examination was performed using a Logiq S8 US machine (GE Medical Systems Ultrasound and Primary Care Diagnostics, Wauwatosa, WI, USA), equipped with a multi-frequency linear matrix array transducer (6–15 MHz). B-mode and PD machine setting are optimised and standardised for all our patients’ US examinations. The settings used were as follows: B-mode frequency 11–15 MHz and Doppler frequency 7.5–15 depending on the depth of the anatomical area, Doppler gain around 18 dB, low wall filters and pulse repetition frequency 800 Hz. Because of the small number of joints with PD signal (only patients with equivocal clinical examination were referred for an US examination of their hands), we did not report separately the grades of PD signal.

Statistical analysis for the prediction model

We proposed a regression model to assess the contribution of every outcome measure to the risk of having active joint inflammation as well as predict PD signal. We excluded patients with PD signal present in more than 10/22 joints to ensure homogeneity in the data. We conducted a real-life study including 276 patients referred for the suspicion of active joint inflammation (new referrals for S-IA, RA patients and patients with other inflammatory rheumatic conditions). We assessed 22 hand joints in every patient, irrespective of their hand symptoms, using the OMERACT scoring system for US examination. The proposed regression model was based on a beta-binomial distribution (with the response ranging from 0 to 10) for the PD score variable and a mix of main interaction effects for the outcome measures stated earlier. Negative interaction effects show that the respective outcome is associated with a lower number of joints with PD signal.

Results

We collected data from 276 consecutive patients referred to our rheumatologist-led US clinic to have a scan of their hand joints aiming to answer the clinician question about the presence of active inflammation in their joints. There were 108 patients with established RA, 93 were referred for the clinical suspicion of IA, 29 were previously diagnosed with psoriatic arthritis (PsA) and 46 patients had other diagnoses, including crystal arthropathies, sarcoidosis, osteoarthritis and chronic pain. We analysed in parallel the characteristics of the two main groups of patients: the group of established RA (n = 108) and the group of patients referred with S-IA (n = 93) (Table 1). The assessment of the follow-up clinical letters of S-IA patients revealed that 50.5 % (n = 47) were ultimately diagnosed with a form of IA, out of which 76.5 % (n = 36) were diagnosed with RA, 12.7 % (n = 6) with undifferentiated IA and 10.6 % (n = 5) with psoriatic arthritis. The rest of the patients initially referred with the suspicion of IA (49.4 %) were subsequently diagnosed with non-specific arthralgia, chronic pain or hand osteoarthritis.

Table 1 Comparison between patients with S-IA and established RA

The patients in the RA group were treated with methotrexate (MTX) alone or in combination with other synthetic disease-modifying anti-rheumatic drugs (DMARDs) in 79.6 % (86/108), SSZ alone or in combination in 48.1 % (52/108) and HCQ alone or in combination in 58.3 % (58/108). In terms of biologic treatments, 24.07 % (26/108) were on etanercept, 12 % (13/108) on adalimumab, 0.018 % on certolizumab (2/108), 0.09 % on rituximab (10/108) and 0.055 % on tocilizumab (6/108). Out of 108 patients with established RA, 89 patients (82.4 %) were on combination therapy. The RA patients were on stable conventional and biologic DMARD medication for at least 1 month prior to the scan. A minority of patients (21/108 in the RA group and 9/93 in the S-IA group) were treated with additional NSAIDs, and one S-IA had oral steroids (Table 1). These patients were excluded from the further analysis that stratified them based on their disease activity.

The comparison between the RA and S-IA groups with regard of the demographic, clinical, laboratory and US features is summarised in Table 1. The median number of joints with PD activity was significantly higher in RA patients (0, interquartile range (IQR) = 2.25 vs. 0, IQR = 1, p = 0.03), as was the number of joints with erosions (p = 0.004) and the number of joints with joint effusions (1.5 vs. 0, p = 0.013). However, the percentage of patients having at least one joint with active inflammation, as assessed by the presence of PD signal, was not significantly different between the two groups (52/93, 55.9 % vs. 45/108, 41.6 %, p = 0.56). Tendon abnormalities (such as effusion and tendon sheet thickening) were more frequently observed in patients with RA vs. S-IA (14/108 vs. 3/93, p = 0.042). The number of clinically assessed swollen joints correlated poorly with the US-detected inflammation (as assessed by PD signal) in both groups of RA and S-IA patients (r = 0.27, p = 0.54 and r = 0.39, p = 0.72, respectively).

The majority of patients with S-IA and positive PD signal in at least one joint (n = 52) were subsequently diagnosed with a type of inflammatory arthritis (47/52, 90.3 %), out of which, 36 patients were diagnosed with RA (69.2 %). In five cases, the presence of PD was associated with osteophytes and they have been diagnosed with inflammatory OA. Out of 25.8 % of patients with S-IA who had positive RF at the time of the US scan (n = 24/93), 23 patients were subsequently diagnosed with RA (23/36, 63.3 %); similarly, a proportion of ACPA-positive S-IA patients were ulterior diagnosed with RA (18/36, 50 %).

Further analysis compared RA patients stratified based on the DAS-28 activity scores (Table 2), and similarly, Table 3 comprises data on the S-IA group (Table 3). As mentioned earlier, the patients on oral steroids and NSAIDs were excluded from this analysis, which reported data on 87 RA and 82 S-IA patients. As expected, the RA patients stratified based on their DAS-28 scores had statistically different ESR, TJC, SJC, GVAS and pain scores; however, they had similar US parameters assessing for active and chronic inflammatory changes (Table 2). In contrast, the analysis of patients with S-IA revealed that DAS-28 score stratification identified patients with different PD and SH grade 3 scores, CRP and ESR levels, as well as patients with different TJC, SJC, GVAS and pain scores (Table 3). In addition, the S-IA groups with low, moderate and active diseases had a larger proportion of patients with true active inflammation (based on PD signal) than the group classified as being in clinical remission (p = 0.01) (Table 3). The comparison between the patient groups with a similar disease activity revealed that patients with RA and moderate and high disease activities (DAS-28 = 3.2–5.1 and >5.1, respectively) were significantly different from S-IA patients with similar DAS-28 scores, as far as the disease duration, ACPA positivity and number of joints with erosions were concerned (Table 4). RA patients in remission (DAS-28 < 2.6) had significantly higher TJC, number of joints with active inflammation and erosions, and higher number of patients with subclinical inflammation than the S-IA patients classified as being in clinical remission (Table 4). We also assessed for correlations between PD signal and erosions in RA patients in all the disease activity groups, and there were no positive correlations (p = 0.06, p = 0.26, p = 0.49, p = 0.12 for the different DAS-28 score groups, and p = 0.38 for the whole RA group, respectively). Out of all clinical outcomes, only SJC correlated with the PD score and only in patients with RA (p = 0.03). The presence of erosions also correlated with the patient’s age and disease duration (p = 0.04 and p = 0.03, respectively).

Table 2 Comparison between RA patients with different levels of disease activity
Table 3 Comparison between S-IA patients with different levels of disease activity
Table 4 Comparison between RA and S-IA patients in clinical remission, with low, moderate or high disease activity based on the DAS-28 cutoffs

Most importantly, the DAS-28 scores did not correlate significantly with the PD scores in the RA patients (p = 0.25, p = 0.87, p = 0.13, p = 0.22, respectively, for the remission, low, moderate and active diseases, respectively). However, in the S-IA group, DAS-28 scores above 5.1 correlated significantly with the PD score (p = 0.048).

The binominal model analysis evaluated the impact of several variables on the risk of having active joint inflammation assessed by the presence of PD. The most negative values suggested a lower risk for positive PD signal in correlation with a certain parameter. Table 5 summarises the marginal effects of different variables on the number of joints with PD at the US examination in S-IA, RA patients, and patients with other arthropathies (labelled as non S-IA and non-RA). For instance, the average PD scores of S-IA patients increased by 0.004856 units in our prediction model as a consequence of having a hsCRP level above rather than below 5 mg/l. Similarly, the average PD score of a patient diagnosed with RA and treated with tocilizumab decreased by 1.81 units as compared to an RA patient who is not treated with tocilizumab (in the context of similar clinical picture and laboratory results). The most striking finding was that RA patients treated with tocilizumab tended had a lower PD score on US examination than patients treated with other biologics or DMARDs, despite similar clinical and laboratory findings (Table 5). The presence of RF increased slightly the risk of positive PD score only in patients with S-IA, and ACPA was associated with higher risk of active disease in patients with established RA, but had less influence on the S-IA group. Out of all three clinical outcomes, TJC, SJC and GVAS, only the SJC correlated with an increased PD score and only in patients with RA. The effect of different variables on the risk for a higher PD score was close to a zero value in the case of treatment with MTX and etanercept. For example, our prediction model suggested that in this group, the RA patients treated with MTX or etanercept monotherapy had a similar risk of having active joint inflammation on US. In this particular case, all the outcome measures taken into consideration in the prediction model are more relevant than being treated with MTX rather than etanercept.

Table 5 Marginal effects of different variables on the number of joints with PD signal in patients with other arthropathies vs. S-IA vs. RA

We also attempted a cluster analysis of patients with similar DAS-28 scores using the following variables: age at scan; duration of symptoms; CRP; ESR; SH grades 1, 2 and 3; PD; presence of osteophytes; erosions; TJC; and SJC and GVAS. Although there was no obvious clustering of patients based on their diagnosis, the analysis revealed five big clusters with a large jump in the levels of two consecutive nodes (Fig. 1). One large cluster, including a significant proportion of S-IA patients with DAS-28 >5.1 exhibited the smallest amount of dissimilarity, suggesting that further analysis of larger patient groups might help identify which parameters are best to predict patients’ tendency to cluster in a certain disease group. Figure 1 represents the dendogram.

Fig. 1
figure 1

Dendogram showing the clustering of the patients with high disease activity score (DAS-28 >5.1) 336 × 204 mm (96 × 96 DPI)

Discussion

The results of our comparative analysis between the two groups of patients reflected their selection: patients with RA with no obvious clinical signs of active disease, but possible subclinical synovitis, and patients with S-IA, who did not fulfil the clinical and laboratory criteria for diagnosis of an inflammatory arthropathy and were referred for an US scan. As expected, the number of joints with active inflammation and clinical swelling was low. Patients with established RA had longer disease duration and more obvious chronic inflammatory changes (defined as SH and tendon inflammatory changes). Chronic joint changes characteristic for RA are known to pose clinical difficulty in assessing for the presence of active inflammation. As frequently encountered in clinical practice, patients with chronic conditions probably have better coping strategies, and this can account for the reported difference in the GVAS assessment between the two groups, despite similar pain scores.

A large proportion of our study patients with positive PD signal in at least one of their hand joints was diagnosed with a form of IA following the US scan (91.4 %, 43/47), which reiterates the role of US examination in the early diagnosis of IA. Also, our study showed that about one in two patients with inflammatory joint pains has active arthritis based on US examination, even if they have not been previously diagnosed with any joint disease. In addition, the DAS-28 score underestimated the presence of active inflammation in 50 % patients with RA classified as being in clinical remission; also these RA patients had more active disease, more erosions and joint tenderness that subjects with S-IA included in the same disease activity group.

The binominal regression analysis model found that certain parameters can have different impacts on the PD signal risk prediction. It is difficult to appreciate the magnitude of the effect of different parameters taken into consideration in this prediction model because of the limited number of patients included in our study. Tocilizumab is considered one of the most potent inhibitors of synovial inflammation and was expected to be associated with the lower risk for PD signal, even in the context of joint pains. The weak impact of SJC and TJC on the PD presence is a common finding of our prediction model and previous US studies. Our study also emphasised the limited role of the inflammatory markers in predicting joint inflammation, as we found that almost half of our RA patients with hsCRP below 5 mg/l had at least one joint with positive PD signal.

This is the first study to propose a statistical prediction model integrating clinical, laboratory and US parameters aiming to predict which characteristics are associated with an increased risk of having active joint inflammation at the US examination in two different groups of patients. The role of US scans in early detection of RA is well established [15], and the probability of developing clinically apparent arthritis in the context of positive PD signal was estimated at 94 % [16]. The main diagnosis difficulties arise when a patient does not fulfil criteria for a diagnosis of IA, and the availability of US scans is limited. Despite the effort to establish guidelines for early diagnosis of RA integrating US data [16], there are no algorithms to help with the exclusion of IA in the context of inflammatory hand pains. Ideally, all patients with inflammatory hand pain should have an US examination to exclude joint inflammation. The US examination was proven effective in altering treatment decision and had increased sensitivity compared to clinical examination [17].

Our real-life observational study confirmed that there were differences between patients with established RA and patients referred with suspicion of IA, in terms of parameters associated with the presence of PD in their joints and patients’ characteristics. As the patients were referred to have a scan because of the difficulty to appreciate if their joints had active inflammation or not, rather than being screened for US abnormalities in the context of arthritis, the two groups of patients are not representative for the general group of patients with IA and established RA.

It is interesting to note that the presence of SJ in patients with suspicion of IA did not correlate with the presence of PD at the US examination and that DAS-28 score is not able to discriminate between RA patients with more or less active joint inflammation on US.

It is recognised that patients with established RA have chronic inflammatory changes, and the presence of swelling is not always indicative of presence of active inflammation, whereas patients with no previous diagnosis of arthritis might get swellings if their joints become inflamed. However, previous studies found a disparity between the clinical assessment of joint swelling and tenderness and the presence of PD signal in patients with established RA [18]. In our study, the presence of PD did not correlate with the erosion scores in RA patients, probably because the two phenomena are temporarily distinct. Previous studies have been focused on patients with early RA, establishing the importance of US for early diagnosis [15], disease progression assessment [19] and prediction of risk to develop arthritis [3, 20]. The emphasis of our prediction model is rather on finding parameters that suggest the need for organising an US examination in certain categories of patients at risk, rather than establishing the patients’ risk of developing IA based on prospective serial US examinations, as the majority of studies in the literature.

Our study has some important limitations: the number of patients was too small for enabling definite conclusions in relation to the risk prediction model and no practical validation of this model was pursued as this was beyond the scope of this study. This prediction risk model, if validated, will ensure patient access to US scans based on the stratification of their risk of having sub-clinical joint inflammation to minimise the risk of under-diagnosing active RA in the context of limited NHS clinical resources. The US score used included only hand joints as the patients referred to our US clinics had hand joint inflammatory pains. We do not suggest the extrapolation of our study findings for other joint areas or for other inflammatory arthropathies. The distal interphalangeal joints were not examined on purpose, as they are not included in the RA OMERACT scores.

The performance of this prediction model improves with the expansion of the database and also by taking into consideration ‘missing US outcomes’. To address this, we plan to collect data in parallel from two RA patient groups (who had and had not a recent US scan) to optimise our model ability to predict sub-clinical inflammation in the absence of recorded US outcomes. A pilot study is also planned for the external validation of our risk model, in which we will compare the predicted model outcomes with the US data to assess our model performance.