Abstract
The Code Interpreter feature in ChatGPT has the potential to democratize data analysis for non-specialists. As bioinformaticians, we are impressed by its performance in data manipulation and visualization. However, bioinformatics tasks often require execution of third-party packages, access to annotation knowledgebase, and handling large datasets. Code Interpreter’s exclusive support for Python, no installation option for additional packages, inability to utilize external resources, and limited storage capacity could pose obstacles to its wide adoption in bioinformatics applications. To address these limitations, we advocated for the necessity of locally deployable, API-based systems for chatbot-aided bioinformatics applications.
Data availability
Prompts and scripts to support the conclusions are in Supplementary Files of the manuscript.
References
Hou, W., and Z. Ji. GeneTuring tests GPT models in genomics. BioRxiv. 2023. https://doi.org/10.1101/2023.03.11.532238.
Duong, D., and B. D. Solomon. Analysis of large-language model versus human performance for genetics questions. Eur. J. Hum. Genet. 2023. https://doi.org/10.1038/s41431-023-01396-8.
Kung, T. H., et al. Performance of ChatGPT on USMLE: potential for AI-assisted medical education using large language models. PLoS Digit. Health.2(2):e0000198, 2023.
Merow, C., et al. AI chatbots can boost scientific coding. Nat. Ecol. Evol. 7:960, 2023.
Perkel, J. M. Six tips for better coding with ChatGPT. Nature. 618(7964):422–423, 2023.
Shue, E., et al. Empowering beginners in bioinformatics with ChatGPT. Quant. Biol. 11(2):105–108, 2023.
Xu, D. ChatGPT opens a new door for bioinformatics. Quant. Biol. 11(2):204–206, 2023.
Dziadowicz, S., et al. Bone marrow stroma-induced transcriptome and regulome signatures of multiple myeloma. Cancers. 14(4):927, 2022.
Bernier, A., H. Liu, and B. M. Knoppers. Computational tools for genomic data de-identification: facilitating data protection law compliance. Nat. Commun. 12(1):6949, 2021.
Ge, S.X. RTutor, Chat with your data via AI. [cited 2023 07/11/2023]. https://RTutor.ai, 2023.
Acknowledgements
NIH-NIGMS grants P20 GM103434, U54 GM-104942, and 1P20 GM121322 (GH). NIH-NIGMS Grant R01HG010805 and P20GM135008 to XG. NIH-NLM Grant No. R01LM013438 to LL. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. The writing was polished by ChatGPT.
Author information
Authors and Affiliations
Contributions
GH contributed to conceptualization, formal analysis, and writing—original draft; LW contributed to formal analysis; and XG and LL contributed to formal analysis, writing—review and editing.
Corresponding author
Ethics declarations
Competing interests
The authors declared no competing interests.
Additional information
Associate Editor Stefan M. Duma oversaw the review of this article.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
10439_2023_3324_MOESM1_ESM.pdf
Supplementary file1 (PDF 22130 kb). Supplementary File 1: Screen shot of chat session for requesting several gene expression data analysis. Supplementary File 2: Screen shot of chat session for requesting multiple gene sequence alignment. Supplementary File 3: Screen shot of chat session for requesting gene ID conversion. Supplementary File 4: Screen shot of chat session for requesting alignment of short sequencing reads. Supplementary File 5: Screen shot of chat session for requesting DE gene analysis based on a count matrix. Supplementary File 6: Screen shot of chat session for requesting phylogeny inference. Supplementary File 7: Screen shot of chat session for requesting a list of pre-installed Python packages
Rights and permissions
About this article
Cite this article
Wang, L., Ge, X., Liu, L. et al. Code Interpreter for Bioinformatics: Are We There Yet?. Ann Biomed Eng 52, 754–756 (2024). https://doi.org/10.1007/s10439-023-03324-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10439-023-03324-9