To the Editor — The importance of providing open and transparent solutions to use, share, inspect and reproduce mass spectrometry (MS) data analysis is becoming more apparent. To address these challenges, we developed a collaborative interactive web application, the GNPS Dashboard (https://gnps-lcms.ucsd.edu), to enable remote and synchronous collaborative research in a common analysis environment for MS data analysis (similar to collaborative text editing in browser-based word processors, such as Google Docs).

The disruptions to office and laboratory workspaces resulting from the COVID-19 pandemic, including campus closures, telework and stay-at-home orders, increased the need for online collaborative approaches to perform scientific research. Most solutions for the analysis of MS data are geared for exploration on a local workstation that lacks synchronous collaborative data exploration, requiring specific software packages, knowledge of data storage locations, familiarity with file transfer protocols and often conversion of data into compatible formats1,2,3,4. Commercial data exploration solutions usually lack universality and are costly, creating additional hurdles to data accessibility, preventing collaborators, reviewers and readers of scientific publications from inspecting MS data to verify interpretations. However, commercial software offers deep access to proprietary vendor-specific data formats and unique data analysis capabilities, for example, creation of private databases. More broadly, as scientific data are becoming more complex, scientific reproducibility is a challenge. Therefore, there is a growing mandate by both funding bodies and journals to make mass spectrometry data (and other data) adherent to FAIR (findable, accessible, interoperable and reusable).

To empower remote collaborations or classroom teaching, the GNPS Dashboard includes leader–follower synchronization (real-time updates from one lead user) and fully collaborative synchronization (real-time updates from multiple users). Followers can disconnect the synchronization to continue the analysis from where the leader left off without needing to reload the data. The GNPS Dashboard leader–follower paradigm has already been used in classroom teaching by at least five institutions, including undergraduate institutions (Supplementary Note 1 and 2). The fully collaborative synchronization enables multi-user visualization and data exploration similar to online synchronous collaborative document editing. For example, users can initiate a collaborative session with two or more people on any web-accessible device (providing a dashboard collaboration start link, instructions, video tutorial and video application). A snapshot of the collaborative work is automatically created and can be shared with collaborators or included in publications. The final state of the analysis is saved together with every discrete action, enabling inspection of the evolution of data analysis.

The GNPS Dashboard includes tools that facilitate the exploration of liquid and gas chromatography–mass spectrometry data for collaborative examination, and hands-on teaching of MS concepts using private and publicly available MS data, including files stored in the MS data repositories GNPS/MassIVE5, MetaboLights6, ProteomeXchange7 and Metabolomics Workbench8 (Fig. 1; providing a link to instructions and video tutorial). All public data from compatible repositories can be selected using the GNPS dataset explorer (https://gnps-explorer.ucsd.edu/). Files not deposited in public MS repositories can be uploaded through a drag-and-drop option for file transfer. mzXML, mzML, CDF and raw formats are compatible with GNPS Dashboard. Through deep linking from the GNPS platform, the GNPS Dashboard serves as a data explorer and hub for further data analysis, including Molecular Networking5, gas chromatography–mass spectrometry deconvolution9, in silico annotation and Mass Spectrometry Search Tool (MASST10) (Supplementary Note 2).

Fig. 1: Overview of the GNPS Dashboard in the data analysis ecosystem.
figure 1

The GNPS Dashboard has been integrated into the web-accessible MS ecosystem. The top and left panels show how the GNPS Dashboard can ingress datasets from public metabolomics resources: MetaboLights, Metabolomics Workbench and GNPS/MassIVE; and from proteomics resources: PRIDE and MassIVE, private user uploads and private GNPS analysis tasks. The bottom panel shows that users can launch downstream analysis on their data directly out of the GNPS Dashboard. Finally, the right panel shows how the GNPS Dashboard and its visualizations can interact with the wider community with reproducible URL or QR code link outs from any analysis and teaching/synchronous collaboration modes for real-time interactivity.

The GNPS Dashboard’s visualization tool enables inspection of total ion chromatograms, retention time versus m/z heat map, extracted ion chromatograms, tandem mass spectra for inspection/visualization of individual compounds and quantitative comparison of the peak abundances of two groups as boxplots — all with publication-quality figures. The dashboard aids peer review of scientific manuscripts and inspection of public quantitative MS data to validate published results. Beyond visualization and analysis of MS data, the dashboard supports the development of other bioinformatics tools that do not have their own web-enabled user interfaces. Documentation and tutorials are available in Supplementary Note 1 and Supplementary Fig. 1.

The GNPS Dashboard encodes the visualization in a shareable URL, which empowers users to share the exact same visualization with colleagues, thus reducing miscommunication and improving data transparency, for example, during remote meetings. Every visualization and analysis result can be shared via a URL that will re-launch the original data visualization on their device, along with the history of the analysis (up to 1,000 steps per session). Users can share these links with collaborators and embed them in publications, presentations or social media posts (for example, Example Tweet). The final visualization can also be shared as a quick response (QR) code. Anyone with a link or QR code can build on the analysis and re-share their additions. Links and QR codes will remain valid for data that have been archived in a public repository.

Together, the capabilities of the GNPS Dashboard should help improve MS data accessibility and interoperability, lower barriers of entry for MS data analysis in the research environment and the classroom, encourage data transparency and sharing, and strengthen the reproducibility of MS data analysis.