DeepEBV: a deep learning model to predict Epstein-Barr virus (EBV) integration sites

Jiuxing Liang, Zifeng Cui, Canbiao Wu, Yao Yu, Rui Tian, Hongxian Xie, Zhuang Jin, Weiwen Fan, Weiling Xie, Zhaoyue Huang, Wei Xu, Jingjing Zhu, Zeshan You, Xiaofang Guo, Xiaofan Qiu, Jiahao Ye, Bin Lang, Mengyuan Li, Songwei Tan, Zheng Hu

Research output: Contribution to journalArticlepeer-review

6 Citations (Scopus)

Abstract

Motivation: Epstein-Barr virus (EBV) is one of the most prevalent DNA oncogenic viruses. The integration of EBV into the host genome has been reported to play an important role in cancer development. The preference of EBV integration showed strong dependence on the local genomic environment, which enables the prediction of EBV integration sites. Results: An attention-based deep learning model, DeepEBV, was developed to predict EBV integration sites by learning local genomic features automatically. First, DeepEBV was trained and tested using the data from the dsVIS database. The results showed that DeepEBV with EBV integration sequences plus Repeat peaks and 2-fold data augmentation performed the best on the training dataset. Furthermore, the performance of the model was validated in an independent dataset. In addition, the motifs of DNA-binding proteins could influence the selection preference of viral insertional mutagenesis. Furthermore, the results showed that DeepEBV can predict EBV integration hotspot genes accurately. In summary, DeepEBV is a robust, accurate and explainable deep learning model, providing novel insights into EBV integration preferences and mechanisms.

Original languageEnglish
Pages (from-to)3405-3411
Number of pages7
JournalBioinformatics
Volume37
Issue number20
DOIs
Publication statusPublished - 15 Oct 2021

Fingerprint

Dive into the research topics of 'DeepEBV: a deep learning model to predict Epstein-Barr virus (EBV) integration sites'. Together they form a unique fingerprint.

Cite this