Introduction

The real-time detection and the broadband spectral analysis of microwave (MW) signals is key for a broad range of technologies such as communication, medicine and navigation. Cognitive radio networks, electromagnetic compatibility (EMC) analysis, radars, wireless communications, etc. are some of the many applications that require a real-time spectral detection over a broad frequency band (tens of GHz) with 100% probability of intercept (POI).

At present, the principal and most common solution for the real-time spectral analysis of MW signals is the fast Fourier transform (FFT) electronic spectrum analyzer1. The MW signal is digitalized by an analog-to-digital converter and then processed by Fourier analysis. However, the sampling rate and the power consumption of the analog-to-digital converter limit the real-time bandwidth to several hundreds of MHz. Typically, for a very large instantaneous bandwidth of 500 MHz, the system presents a frequency resolution of some hundreds of kHz, a 100% POI time resolution of some μs and a dynamic range greater than 70 dB2. Increasing the analysis bandwidth can be achieved by analog solutions based on photonic approaches3, where the MW signal is transposed in the optical domain and then processed to retrieve the spectral information. One promising technique is the spectral hole burning in ion-doped crystals at cryogenic temperature4, with an instantaneous real-time bandwidth of several tens of GHz, a frequency resolution of hundreds of kHz and a large dynamic range (>50 dB)5. However, although cooling technique has made impressive progress with closed cryocoolers now available, there is a need for complementary solutions that are compact, can be operated at room temperature and require low power consumption, in particular for on-board components.

An alternative technique is to use the quantum spin properties of electrons to directly detect MW signals with neither analog-to-digital conversions nor optical processing modules. The nitrogen-vacancy (NV) center in diamond is very attractive for this purpose since it is a solid-state atom-like system that can be utilized at room temperature. Its simplicity of operation makes it an appealing platform for building measurement devices, such as magnetometers6, gyroscopes7, thermometers8, high-pressure sensors9, electrometers10, etc., that are compact, consist of simple optical components and that require low power consumption.

Here, we propose a quantum diamond spectrum analyzer (Q-DiSA) for MW signals exploiting the spin-dependent optical properties of an ensemble of NV centers employed as “quantum” tuning forks whose resonance frequency is spatially encoded by means of an external static magnetic field gradient. In ref. 11, we have demonstrated the proof of principle of NV-based MW spectral analysis over a frequency range of some hundreds of MHz. This limited range results from the chosen magnetic architecture that does not allow the alignment of the static magnetic field along the NV centers axis in the diamond crystal, which is mandatory to cover a large frequency range without losing the NV center spin-dependent photophysical properties due to the transverse magnetic field12. In this work, we implement an original architecture to solve this issue. We combine a diamond crystal with a specific cut together with a simplified magnet arrangement. This allows tuning the resonance frequency over a broad range while keeping the alignment of the magnetic field with the NV center axis. Using this system, we demonstrate the real-time spectral analysis over a large tunable frequency range (25 GHz), a wide instantaneous bandwidth (up to 4 GHz), a MHz frequency resolution (down to 1 MHz), a ms temporal resolution and a large dynamic range (40 dB).

The nitrogen-vacancy center

The NV center is a color center in diamond13 consisting of a nitrogen atom and a carbon vacancy in two adjacent positions of the diamond lattice (Fig. 1a). The negatively charged state (NV, now-on simply denoted as NV) with two unpaired electrons has an electron spin equal to 1. Owing to the C3V symmetry of this defect, the N-to-V crystallographic direction forms an intrinsic spin quantization axis. According to the tetrahedral structure of the diamond lattice, four different N-to-V axis orientations exist in the crystal.

Fig. 1: Nitrogen-vacancy center schematic and its application to the spectral analysis.
figure 1

a NV center defect into the diamond lattice. Black spheres represent carbon atoms, the blue sphere represents the nitrogen (N) atom and the white sphere represents the carbon vacancy (V). b NV center energy levels. The system is optically excited by a non-resonant 532 nm laser (green arrows). The main de-excitation process is a spin-conserving radiative transition (red arrows). A non-radiative process (black dashed arrows) also occurs and allows for intersystem crossing (ISC). Both the excitation process and the radiative process can be phonon-assisted (white dashed arrows). When no static magnetic field (B) is applied to the NV center, the \(\left|\pm 1\right\rangle\) states are degenerate. A static magnetic field induces a Zeeman shift of these levels and removes the degeneracy. Being D the ground state zero-field splitting, the resonance frequencies of the \(\left|0\right\rangle \to \left|+1\right\rangle\) (ν+) and the \(\left|0\right\rangle \to \left|-1\right\rangle\) (ν) transitions are given by ν± = D ± γBNV, where γ is the NV center gyromagnetic ratio and BNV the magnetic field component along the NV center axis. c NV center resonance frequencies induced by the Zeeman effect. d General application of NV centers to the spectral analysis of MW signals. At the center, the schematic of the set-up. At the top, the ODMR spectra of NV centers placed at different distances from the magnet. At the bottom, the output of the camera when a MW signal is detected. A drop of photoluminescence is detected on the pixels that image the NV centers resonant with the applied signal.

Owing to its remarkable spin-dependent optical properties, the NV center is an enabling tool for numerous quantum sensing experiments14. NV energy levels are depicted in Fig. 1b. Both ground state and excited state are spin triplets (\(\left|{m}_{{{{{{{{\rm{s}}}}}}}}}=0,\pm \!1\right\rangle\)). The ground state zero-field splitting (ZFS) between the \(\left|0\right\rangle\) state and the two degenerate \(\left|\pm \!1\right\rangle\) states is equal to 2.87 GHz. At room temperature, these three spin states are equally populated due to thermal excitation. Under optical pumping, the main de-excitation process consists in spin-conserving radiative transitions characterized by a photoluminescence (PL) emission spectrum with a zero phonon line (ZPL) at 637 nm and a broad electron-phonon band (up to 800 nm). An additional non-radiative intersystem crossing (ISC) connects the \(\left|\pm \!1\right\rangle\) excited state sublevels to the \(\left|0\right\rangle\) ground state sublevel through a metastable singlet state12, leading to an efficient polarization of the spin in the \(\left|0\right\rangle\) state after a few optical pumping cycles, and a reduction of the PL rate when the \(\left|\pm \!1\right\rangle\) sublevels are populated. Applying an external static magnetic field to the NV center leads to a Zeeman shift of the \(\left|0\right\rangle \to \left|+1\right\rangle\) and \(\left|0\right\rangle \to \left|-1\right\rangle\) radio-frequency (RF) transitions. When the field is applied along the NV center axis, the two resonance frequencies are (Fig. 1c):

$${\nu }_{\pm }=| D\pm \gamma {B}_{{{{{{{{\rm{NV}}}}}}}}}|$$
(1)

where D is the zero-field splitting equal to 2.87 GHz, BNV is the magnetic field component parallel to the NV axis and γ, equal to 28 GHz  T-1, is the NV center gyromagnetic ratio. In particular, for a magnetic field of 102 mT, the \(\left|-1\right\rangle\) level crosses the \(\left|0\right\rangle\) level.

These two magnetic resonances can be optically detected (ODMR) by sweeping the frequency of a RF signal while monitoring the PL level (Fig. 1d). In this case, the NV center works as a magnetometer15.

The Q-DiSA technique consists in the reciprocal scheme. It relies on the spatial encoding of the NV center resonance frequencies by means of a controlled magnetic field gradient. The schematic is reported in Fig. 1d. NV centers located at different positions into the diamond undergo a different Zeeman shift and thus resonate at different MW frequencies. The MW signal to be analyzed is sent on the diamond through an antenna while a camera collects the PL emitted by the NV centers. The decrease in the PL intensity on some pixels reveals the presence of spectral components of the MW signal that are resonant with the NV centers imaged by those pixels. The measurement is performed in a continuous wave (CW) regime, namely the laser pumps continuously the NV centers, allowing, at the same time, a continuous repolarization of the spin and an instantaneous spin read out, without any dead time for the signal detection, except the refreshing time of the camera. Other techniques based on pulsed methods could be considered. However, they require prior knowledge of the microwave field amplitude in order to efficiently manipulate the spins. Here, the microwave field itself is the quantity to be measured. Since its amplitude is unknown, using such methods is not straightforward. On the contrary, although not as efficient as pulsed methods, the CW method allows to measure the spectral components of the signal instantaneously, simultaneously and without signal processing, by simply recording the image of the PL emitted by the diamond sample11.

Experimental set-up

Using NV centers for microwave detection over a large frequency range requires both a strong static magnetic field and a magnetic field gradient. Neodymium magnets provide a compact and room temperature solution to fulfill those requirements. We used a magnetic sphere of 1.3 cm diameter (Supermagnet K-13-C), which provides a magnetic field in the range of 1 T. By adjusting the magnet-diamond distance, the \(\left|0\right\rangle \to \left|-1\right\rangle\) transition frequency can be tuned from 10 MHz to 21 GHz and the \(\left|0\right\rangle \to \left|+1\right\rangle\) transition frequency from 2.87 to 27 GHz (Fig. 2a) (Supplementary Note 1). In order to achieve such a high Zeeman shift and preserve the optical spin properties of the NV centers, the magnetic field has to be aligned carefully with the NV center axis12 (Supplementary Note 2). The magnetization axis of the magnet is fixed in the horizontal plane and it is chosen as the reference axis of the system. The magnet is therefore mounted on a three-axis translation stage, with one horizontal axis parallel to the magnetization axis. The core element of the system is a commercially available 4.5 × 4.5 × 0.5 mm {100} single-crystal optical grade chemical vapor deposition diamond plate (Element Six) doped with NV centers at a concentration of a few ppb and {110} lateral facets which contain two NV axis in their plane. The diamond sample is mounted with a {110} facet parallel to the horizontal plane, allowing us to align one of the two families along the magnetization axis of the magnet (see inset of Fig. 2a). Therefore, our system preserves the alignment of the magnetic field and the NV axis while adjusting the magnet-diamond distance (Fig. 2a).

Fig. 2: Q-DiSA set-up and calibration procedure.
figure 2

a Q-DiSA set-up. The inset shows a zoom of the {110} facet from which the PL (red color) is collected when the diamond is shined by the laser source (green color). The magnetic field B generated by the spherical magnet is aligned along one of the two NV center families laying on the {110} plane. b CW-ODMR of the \(\left|0\right\rangle \to \left|-1\right\rangle\) transition for a magnet-diamond distance of ~2 mm. The resonance frequencies of the imaged NV centers are in the range [8.5 GHz; 10 GHz]. c Normalized PL acquired by the camera in the presence of a monochromatic MW signal at 9.5 GHz (top) and 9 GHz (bottom). d Comparison between the spectrum obtained considering only one row of pixels (y = 45, blue curve), and a sum over 40 rows (y from 20 to 60, red curve) showing a strong SNR improvement resulting from the summing procedure. e SNR of the PL integrated over a different number of non-resonant pixels. The signal is defined as the temporal mean of the PL; the noise is its standard deviation. The SNR increases linearly when the PL is integrated over a small number of pixels; the non linear behavior of the SNR for integration areas bigger than 40 pixels is mainly due to the inhomogeneity of the laser beam.

The NV centers are optically excited using a cw laser at 532 nm wavelength. The laser is focused, with a full width at half maximum (FWHM) waist of 38 ± 3 μm, into the diamond plate through one of the four {110} facets. The PL is collected, through the top {110} facet, using an optical microscope objective (20×, 0.33 NA), spectrally filtered (FF01-697/75 nm band-pass filter, Semrock) and focused on a commercial CMOS camera (UI-5240CP-M, IDS) using a 75 mm lens. A polarizer, located after the objective, partially suppresses the PL emitted by the three NV centers families off-axis with the magnetic field in order to enhance the ODMR contrast of the on-axis family (Supplementary Note 3).

The NV electronic spin transitions are excited by the homogenous RF magnetic field produced by a 1 mm-diameter loop antenna connected to a MW signal generator (SMA100B, R&S).

The optical system (beam waist and objective magnification) and the size of the antenna define the active area on the diamond (530 × 50 μm2) and the spatial resolution (0.66 × 0.66 μm2) (see inset of Fig. 2a). We have chosen those parameters as the best compromise between the size of the PL image, giving the spectral bandwidth, and the spatial resolution (three times the diffraction limit), which defines the frequency resolution of the spectral analysis.

Calibration procedure

For a given magnet-diamond distance, the Q-DiSA calibration procedure consists in associating each pixel of the image to the corresponding NV resonance frequency. As illustrated on Fig. 2b, the ODMR spectra show that, for a given y position, the spectral information is spatially coded along the x-axis through the magnetic field gradient (see inset of Fig. 2a).

Instead of considering a single row of pixels, we can take advantage of the wide-field imaging mode. Considering all the pixels that resonate at the same frequency, corresponding to the iso-B of the static magnetic field (Fig. 2c), we create a one-to-one calibration map that links each frequency to a set of pixels.

In wide-field imaging systems the signal-to-noise ratio (SNR) is primarily affected by photon shot noise. We can therefore increase it (Fig. 2d, e) by summing over all the pixels which resonate at the same frequency. The SNR then reads as16:

$${{{{{{{\rm{SNR}}}}}}}}=\sqrt{n\cdot V\cdot {N}_{{{{{{{{\rm{p}}}}}}}}}\cdot {R}_{0}\cdot \zeta \cdot \Delta {{{{{{{\rm{t}}}}}}}}}\,C$$
(2)

where n is the NV center concentration of the diamond, V is the volume imaged by the pixel, Np is the number of pixel resonating at the same frequency, R0 is the NV center PL rate, ζ is the collection efficiency of the system, Δt is the integration time of the measurement and C is the ODMR contrast. Once the calibration is performed, the only signal processing needed to retrieve the spectral content of the RF signal of interest is a sum over a few pixels. This procedure is quick and independent of the frequency we want to detect.

Results

Key parameters for the spectral analysis of MW signals are the frequency range, the bandwidth, the frequency resolution, the dynamic range and the temporal resolution. We discuss the performances of the Q-DiSA architecture with respect to these parameters.

Frequency range and bandwidth

The Q-DiSA system works as a tunable MW frequency detector in a typical range from 10 MHz to 25 GHz (Fig. 3a). The central frequency is determined by the distance between the magnet and the diamond sample, while the bandwidth is determined by the magnetic field gradient. Owing to the choice of the single-magnet configuration, these two parameters are linked (Fig. 3a). The lower part of the spectrum is covered when the energy difference between the \(\left|0\right\rangle\) and \(\left|-1\right\rangle\) states becomes very small. In that case, even very weak couplings (such as parasitic strain in the diamond sample, hyperfine coupling, residual transverse magnetic field) can induce a mixing between the \(\left|0\right\rangle\) and \(\left|-1\right\rangle\) states. This mixing induces a ground state level anti-crossing (GSLAC) that blurs the correspondence between the frequency of the signal applied to the NV centers and the magnetic field gradient. This parasitic effect17,18 sets the lower detectable frequency of our sensor to typically 10 MHz (Fig. 3b). The upper limit that we have reached is 27 GHz and it is related to the maximum amplitude of the static magnetic field produced by the permanent magnet (Fig. 3c) (Supplementary Note 1). These parameters could be extended using a stronger magnetic field that can be produced by cryogenic magnets19,20.

Fig. 3: Frequency range and bandwidth.
figure 3

a Ground state NV center resonance frequencies (blue and red) and magnetic field gradient (dashed bluish green) at different distances from the surface of the spherical magnet. The magnetic field gradient is expressed in MHz per pixel. b ODMR of the \(\left|0\right\rangle \to \left|-1\right\rangle\) transition at the GSLAC. The field of view corresponds to a bandwidth between 10 MHz and 250 MHz. c ODMR of the \(\left|0\right\rangle \to \left|+1\right\rangle\) transition around 22 GHz. The field of view corresponds to a bandwidth between 20 and 24 GHz.

The relation between the bandwidth and the strength of the magnetic field (Fig. 3a) is given on Fig. 3b, c showing, for the same diamond area, a total bandwidth of 300 MHz at a central frequency of 150 MHz (Fig. 3b) and a bandwidth of 4 GHz at a central frequency of 22 GHz (Fig. 3c). Those values can be adapted by changing both the imaging system and the magnetic field architecture in order to tune the amplitude and the gradient of the magnetic field as two independent parameters.

Frequency resolution

The frequency resolution of the Q-DiSA architecture is related to the resonance linewidth of the NV centers ensemble, which is affected by three main contributions: the intrinsic linewidth, the power broadening induced by both the optical and the MW excitation, and the inhomogeneous broadening induced by the magnetic field gradient21.

In our optical grade diamond sample, the intrinsic linewidth, which can be attributed to the spin decoherence induced by the interaction between the NV centers and the 13C nuclear spin bath22, is of the order of 500 kHz.

The other two contributions to the line broadening are shown in Fig. 4a, b, where the ODMR spectra for the same frequency set-point but under different magnetic field configurations (10 mT and 20 kHz  μm−1 (Fig. 4a)—195 mT and 1.5 MHz  μm−1 (Fig. 4b)) are reported. In the presence of a low-magnetic field gradient and for low-power MW excitation, the NV center hyperfine structure is well resolved (Fig. 4a). It consists of three lorentzian peaks separated by 2.14 MHz23 resulting from the hyperfine interaction between the NV center electron spin and the nitrogen atom nuclear spin (14N, S = 1). The linewidth of each peak is of the order of 1 MHz, which sets the best frequency resolution we are able to achieve with our system. The deviation from the intrinsic linewidth is attributed to the optical power broadening induced by the laser (Supplementary Note 4). Increasing the MW power, the ODMR line shows the typical power broadening effect and the linewidth is proportional to the amplitude of the MW field, as illustrated in Fig. 4c. At low MW power and higher magnetic field gradient (Fig. 4b), the hyperfine structure cannot be resolved anymore since the magnetic field gradient induces a distribution of the NV centers resonance frequencies at the single pixel scale thus broadening the ODMR line (Supplementary Note 5). Therefore, as clearly shown in Fig. 4c, the frequency resolution of our system, at low power, is limited by the inhomogeneity of the static magnetic field and by some possible imperfections of the set-up that occur during the measurement time, such as mechanical vibrations and temperature fluctuations of the magnet24. Consequently, due to the frequency-dependent magnetic field gradient implemented in this single-magnet architecture, the frequency resolution is higher at low frequencies (FWHM of 1 MHz at 2.6 GHz), close to the ultimate resolution set by the NV center intrinsic linewidth, than at high frequencies (FWHM of 50 MHz at 21 GHz).

Fig. 4: Frequency resolution and dynamic range.
figure 4

Comparison of the ODMR spectrum in the range [2.63 GHz, 2.69 GHz] for two different magnetic fields and magnetic field gradients (10 mT and 20 kHz  μm−1 (a) –195 mT and 1.5 MHz  μm−1 (b)). The curves are fitted using a sum of three identical Lorentzians shifted by 2.14 MHz accounting for the hyperfine structure of the resonance line to retrieve the FWHM (c) and the contrast (d) from the spectra. The error bars are due to the error on the fit parameters considering a confidence interval of 95% (see Methods). e Spectrum of the non-linearity of the MW generator when it is set to a central frequency of 38 MHz and a nominal power of 25 dBm.

Amplitude and dynamic range

As shown in Fig. 4d, the contrast increases with respect to the MW power, exhibiting a linear behavior at low-power regime and a saturation for stronger MW field (Supplementary Note 4). This effect can be used to quantitatively measure the amplitude of the magnetic field associated with the MW signal, as reported in refs. 25,26. The power detection threshold (i.e., the smallest contrast that can be detected) is given by the SNR (Eq. (2)) of the measurement. The differences in the ODMR contrast between the spectra obtained for a magnetic field equal to 10 mT (Fig. 4a) and 195 mT (Fig. 4b) are caused by the gradient of the static magnetic field, for the same reasons previously explained regarding the ODMR linewidth. Note that the MW power level used in Fig. 4 is the nominal power level at the input port of the MW transmission line and not the MW power applied to the NV centers, which depends on the frequency-dependent transmission efficiency of the antenna. Therefore, for the same MW nominal power, the contrast is higher at low frequencies than at high frequencies.

As an example, the Q-DiSA is applied to characterize the non-linearity of a MW generator (SMA100A, R&S). The results are reported in Fig. 4e and compared to the spectrum measured using a commercial electronic spectrum analyzer. Considering the noise floor of the measurement, the Q-DiSA can detect a signal over a MW power range from 23 dBm to –17 dBm, which opens the way to devices with a dynamics up to 40 dB.

Time resolution

A key advantage of the CW wide-field imaging mode is that all the frequencies within the frequency range are simultaneously detectable without any dead time except the one of the camera. This spatial multiplex advantage makes possible the real-time frequency detection of complex microwave signals with 100% POI. To illustrate this point, a time varying multi-frequencies microwave field is generated by combining the signals of two microwave generators and by sweeping their frequencies over the entire bandwidth of the image (Fig. 5a), for two different configurations, around 7 GHz (Fig. 5b) and around 23 GHz (Fig. 5c). The results indicate that simultaneous detection is possible for a number of frequency channels determined by the ratio between the bandwidth and the frequency resolution.

Fig. 5: Simultaneous detection and temporal resolution.
figure 5

a Schematic of the multi-frequencies signals detection. b, c Multi-frequencies signals detection at 7 GHz (b) and 23 GHz (c). df Spectrogram for three different frequency ranges: d 1.8 GHz, time resolution of 2 ms; e 9 GHz, time resolution 20 ms, f 23 GHz, time resolution 600 ms obtained summing the PL of 9 images having a total exposure time of 600 ms, in order not to saturate the camera.

The temporal resolution (Δt) is the smallest time required to detect the variation of the PL signal induced by a resonant MW frequency component. Using Eq. (2), we evaluate the temporal resolution by decreasing the camera exposure time in order to reach values of the SNR close to 1, not considering the refreshing time of the camera (Fig. 5d–f). Using a nominal MW power of 25 dBm, we measured a temporal resolution of 2 ms at 1.8 GHz (C = 6%, SNR = 5), 20 ms at 9 GHz (C = 1%, SNR = 4) and 600 ms at 23 GHz (C = 0.1%, SNR = 1). This strong variation of the temporal resolution with frequency is a consequence of the contrast frequency dependence. It is directly related to the Q-DiSA architecture and not to the intrinsic NV center photodynamics, whose characteristic times are of the order of one microsecond12,27.

Frequency ambiguity

The Q-DiSA allows analyzing the frequency components of a MW signal over a wide range. However, it may present some frequency ambiguities, namely the presence of more than one resonance frequency associated with the same pixel. Two factors can induce such ambiguity: the presence of additional ODMR lines due to the three non-aligned NV families and the two intrinsic resonance frequencies of the NV center (\(\left|0\right\rangle \to \left|-1\right\rangle\) and the \(\left|0\right\rangle \to \left|+1\right\rangle\)). The first source of ambiguity can be eliminated using a diamond with NV centers preferentially orientated along one of the four diamond crystallographic axis28. The second source is intrinsic to the single NV center but can be suppressed by applying a MW filter so that the signal is resonant with only one of the two NV center transitions. This procedure restricts the available bandwidth to a maximum of 5.74 GHz (twice the zero-field splitting), corresponding to the frequency shift between the \(\left|0\right\rangle \to \left|-1\right\rangle\) and the \(\left|0\right\rangle \to \left|+1\right\rangle\) transition for magnetic fields stronger than 0.1 T (Fig. 1c). In this configuration, the frequency range can be chosen according to the best performances of the MW chain (for example corresponding to baseband communication), and any other frequency signal can be brought inside this frequency range using heterodyning techniques. Such mixing will then lead to a spectral analysis over a frequency range larger than 25 GHz, only limited by the heterodyning chain.

Conclusion

We demonstrated that an ensemble of NV centers in a single-crystal diamond can be used to detect a complex microwave signal and resolve its spectral components. The analysis bandwidth can be tuned from a few MHz up to 25 GHz. The Q-DiSA experimental platform is compact and is operated at room temperature with low power consumption. The performances achieved in this work, corresponding to MHz frequency resolution and ms temporal resolution, can be improved working both on the experimental architecture (e.g., using an improved optical collection, a more flexible magnetic field architecture, different pumping parameters, etc.) and on the diamond sample (e.g., using a higher concentration of NV centers29 or a crystal with NV centers having a preferential orientation). Moreover, the large frequency range of the Q-DiSA can be further extended using heterodyne techniques, opening the way to the real-time spectral analysis over a frequency range of several tens of GHz.

Methods

NV center optical excitation

The 532 nm laser is set to 300 mW. The effect of the laser power to the ODMR linewidth are discussed in the Supplementary Note 4. According to ref. 30, the NV center saturation intensity is: ISat = 1–3 GW  m−2. Considering a laser beam waist w0 ≈ 40 μm (as it is in our experimental architecture), the saturation power is achieved at:

$${P}_{{{{{{{{\rm{Sat}}}}}}}}}=\frac{\pi {w}_{0}^{2}}{2}{I}_{{{{{{{{\rm{Sat}}}}}}}}}\approx 2\,W$$
(3)

Therefore, using a laser power of 300 mW, the saturation parameter is given by:

$$s=P/{P}_{{{{{{{{\rm{Sat}}}}}}}}}=0.15$$
(4)

The loop antenna

The loop antenna employed to bring the RF field in proximity of the NV centers is realized short-circuiting a co-axial cable with a 50 μm diameter copper wire which is soldered to the co-axial cable. The microwave field generated inside the loop is almost homogenous and linearly polarized along the loop axis. Placing the antenna at the top of the diamond {110} facet, the RF field is perpendicular to the NV center family aligned to the reference axis, thus maximizing the coupling between the RF field and the electron spin.

The ODMR spectrum acquisition procedure

In this section we detail the acquisition procedure used to acquire the ODMR spectra reported in Figs. 2b, c, 3b, c and 4a, b. It consists in sweeping the frequency of the RF field while the camera captures, for each frequency, the PL emitted by the NV centers. During the acquisition, the laser and the RF field are always on. In order to speed up the acquisition procedure, the area of interest (AOI) of the camera is limited to the pixels that image NV centers pumped by the green laser.

The RF frequency f is scanned in a range \(\left[{f}_{\min },{f}_{\max }\right]\) with a frequency resolution of Δf, which results in a number of frequency steps (Nseq) given by:

$${N}_{{{{{{{{\rm{seq}}}}}}}}}=({f}_{\max }-{f}_{\min })/{{\Delta }}\,{{\mbox{f}}}$$
(5)

In order to increase the SNR of the measurement, the frequency ramp is repeated Ncycle times; for each frequency step, the signal acquired by the camera is added to the signal acquired during the previous ramps at the same frequency. The number of cycles over which the RF frequency is swept is chosen in order to obtain a satisfactory SNR. The exposure time of the camera is chosen in order to maximize the number of counts without saturating the detector.

During the acquisition, the data are stored in a three dimensional matrix of size [w; h; Nseq], where w and h are respectively the width and the height of the AOI. In order to retrieve the contrast of the measurement the data are normalized dividing the data matrix (DRaw) for the mean value of the PL in the first m and the last m steps of the frequency ramp, checking that all the NV centers in the AOI are out of resonance for those frequency values. The normalized data matrix (DNorm) is thus given by:

$${D}_{xyf}^{{{{{{{{\rm{Norm}}}}}}}}}=\frac{2m\cdot {D}_{xyf}^{{{{{{{{\rm{Raw}}}}}}}}}}{\mathop{\sum }\nolimits_{f = {f}_{\min }}^{f = {f}_{\min }+m\Delta {{{{{{{\rm{f}}}}}}}}}{D}_{xyf}^{{{{{{{{\rm{Raw}}}}}}}}}+\mathop{\sum }\nolimits_{f = {f}_{\max }-m\Delta {{{{{{{\rm{f}}}}}}}}}^{f = {f}_{\max }}{D}_{xyf}^{{{{{{{{\rm{Raw}}}}}}}}}}$$
(6)

and it is directly related to the ODMR contrast by the formula:

$${C}_{xyf}=1-{D}_{xyf}^{{{{{{{{\rm{Norm}}}}}}}}}.$$
(7)

The calibration procedure

The calibration procedure consists in defining, for each frequency in the frequency range, a mask of the same size of the camera AOI that is zero everywhere except for the pixels which resonate at the frequency associated with the mask. Once the set of calibration masks has been built, the spectrum is defined as:

$$S(\nu )=\mathop{\sum}\limits_{x,y}I(x,y)\times M(x,y,\nu )$$
(8)

where I(x, y) is the image acquired by the camera when the signal we want to analyze is sent in proximity of the NV center and M(x, y, ν) is the mask associated with the frequency ν. The image processing was realized off-line using a MATLAB script, which calculates, for each frequency, the spectral component of the RF signal using the previous formula. This processing, which is equivalent to a simple sum, can be easily implemented in real-time using a field-programmable gate array (FPGA) programmed to this task.

Fitting procedure

The fit curve used to fit Fig. 4a, b and thus to retrieve the ODMR contrast and ODMR linewidth plotted in Fig. 4c, d is given by:

$$f(\nu )=1-\left(\frac{{a}^{2}}{{a}^{2}+{(\nu -b)}^{2}}+\frac{{a}^{2}}{{a}^{2}+{(\nu -b+{\nu }_{{{{{\rm{hyp}}}}}})}^{2}}+\frac{{a}^{2}}{{a}^{2}+{(\nu -b-{\nu }_{{{{{{{{\rm{hyp}}}}}}}}})}^{2}}\right)c$$
(9)

where ν is the frequency expressed in MHz, νhyp = 2.14 MHz is the frequency difference between the successive peaks of the NV center hyperfine structure, a is a parameter related to the ODMR linewidth, b is a parameter related to the NV center resonance frequency, and c is a parameter related to the contrast. The FWHM is then defined as FWHM = 2a and the contrast as CODMR = c. The error bars of Fig. 4c, d are due to the error on the fit parameters considering a confidence interval of 95%.