1. Introduction
Drug-drug interactions (DDIs), a prominent aspect of adverse drug reactions, occur when the combined effects of multiple medications diverge from the expected results if these drugs were used separately [1]. These interactions are broadly categorized into pharmacokinetic and pharmacodynamic types. Pharmacokinetic interactions involve changes in a drug's absorption, distribution, metabolism, and excretion (ADME), thus altering its bioavailability. For instance, fluoxetine's inhibition of the enzyme CYP2D6 increases the effectiveness of metoprolol, a beta-blocker, by slowing its metabolism, while simultaneously decreasing the efficacy of codeine by preventing its conversion to morphine. On the other hand, pharmacodynamic interactions occur when drugs affect each other's actions through their mechanisms of action. An example is the way glucagon can counter the effects of beta-blockers via activation of Gs-coupled GPCR, or how ibuprofen can inhibit aspirin’s effect on platelet aggregation by blocking its access to the active site of COX-1 [2-4].
The relevance of DDIs in healthcare is emphasized by their high incidence rate and the wide range of their impacts, from beneficial synergistic effects to reduced drug effectiveness, and even potentially fatal side effects [2]. This challenge is particularly acute in hospitals, where patients often receive complex medication regimes involving multiple drugs. The likelihood of encountering DDIs in such settings is significantly higher due to the variety of medications prescribed and frequent changes in dosages and drug types. Hospitalized patients, typically more critically ill than those treated as outpatients, have a diminished capacity to cope with these pharmacological challenges. A study in a major university hospital in the Netherlands reported that 27.8% of 21,277 admissions were affected by at least one DDI [5]. While hospitals have implemented electronic DDI alert systems, these can lead to 'alert fatigue', where the high volume of alerts, many of which may be of low clinical relevance, increases the risk of overlooking critical alerts. Additionally, these systems are often limited to known DDIs from existing literature, offering limited guidance on newer drugs or untested drug combinations [5]. This highlights the need for continuous research and development of more advanced and comprehensive DDI detection and management tools in healthcare settings.
Graph Neural Network (GNN) is a form of machine learning algorithm widely used in biological and chemical problems due to the ability of graphs to represent information whose structure and connections cannot be described by simple ordering, such as the connectivity and spatial relationship of components within small molecules and macromolecules. According to Gligorijevic´ et al. (2021), they used multiple machine learning techniques for protein function prediction, such as CNN-based DeepGO, BLAST baseline, LSTM language model, and GCNs (Graph Convolutional Networks) in the form of DeepFRI, which are used for annotation transfer and to extract features from proteins, taking into account their graph-based structure of interconnected residues represented by contact maps [6].
The systematic and relatively predictable nature of drug interactions has led to the use of many machine learning algorithms, including GNNs, to tackle the problem of DDI prediction [2]. As all drug interactions lie in the binding and reaction of drug molecules with enzymes and receptors as well as that between enzymes and receptors, knowledge graphs incorporating these as nodes and their interactions as edges would be extremely valuable for DDI prediction. The use of graph neural networks to learn the topological neighborhood representation of drugs in the knowledge graph for DDI prediction was first pioneered by Lin et al (2020), who have achieved the extremely high F1 of 95.7% on binary classification of whether a given DDI exists [7]. Chen et al. combined 2D drug molecular graphs and large-scale knowledge graphs in their MUF- FIN model, and was able to achieve performance surpassing all previous models on both binary- class, multi-class and multi-label DDI prediction [8]. The use of language models to extract and incorporate textual information into DDI prediction by GNN is also a fruitful approach, as shown by the 3DGT-DDI architecture by He et al [9]. Their model represents the 3D structure of each drug as a 3D graph and uses SchNet to extract their 3D characteristics and combines this information with a SCIBERT-based text feature extraction model, giving an 84.48% macro F1 score in the DDI Extraction 2013 shared task dataset.
In this study, we introduce an innovative model that integrates graph attention neural networks with advanced pre-trained language models. This model is designed to predict drug-drug interactions (DDIs) effectively, utilizing the chemical structures of drugs, protein amino acid sequences, and a molecular biology knowledge graph. Our knowledge graph is a heterogeneous graph comprising two node types - drugs and proteins - and three edge types, including drug-drug, drug-protein, and protein-protein interactions, plus an edge representing similarity between neighboring nodes. For input, our model takes the chemical structure of drugs in SMILES format. It then employs edge prediction to determine the potential DDI edges originating from a given drug node, along with the likelihood of each edge's existence. We leverage the pre-trained language model CHEMBERTa to extract features from the SMILES representations, forming the drug node embeddings. For protein node embeddings, we utilize ESM-1b, which extracts features from protein amino acid sequences [10]. The similarity measure between feature vectors of two distinct drugs serves as a basis for the edge weight, indicating chemical similarity. Edges with weights below a predetermined threshold are not included in our graph.
Our model stands out due to its lightweight and dynamic nature. It allows for continual enhancement of DDI prediction accuracy as more drug-protein interaction information is fed into the system. This adaptability enables the model to be quickly retrained with new data. Such features make our model a valuable tool for augmenting current literature-based DDI alert systems in hospitals, especially for new and less-researched drugs and combinations. Its capability to evolve with emerging data ensures that our alert predictions remain accurate and up-to-date.
2. Methods and Materials
2.1. Datasets
2.1.1. DrugBank
In 2018, the DrugBank Multi-Typed DDI dataset was curated for the first time as the Gold Standard DDI dataset for training the DeepDDI model [11]. The dataset consists of 86 types associated with at least five drug pairs. It contains 192,284 DDIs from 191,878 drug pairs. It also clearly labels each drug with the DrugBank ID SMILES string expressions corresponding to it [12].
2.1.2. PrimeKG
PrimeKG is a comprehensive medical heterogeneous knowledge graph that connects over 100,000 nodes across ten types, such as drugs and proteins [13]. Each node has been labeled with clinical descriptors from trustworthy medical data sources, and it is connected by 29 types of edges containing over 4 million relationships.
The data extraction process involves identifying them with DrugBank IDs based on the protein sequence. Then, the ID of each drug should be mapped to the corresponding DrugBank ID in order to identify the same drugs in both datasets. By employing this method, it is possible to improve the accuracy and applicability of drug-drug interaction forecasts.
Figure 1. Meta path for Drug-Drug Interaction.
Drug-Protein Interaction and Protein-Protein Interactions, which illustrates that there may be docking happening between drug-protein interactions.
As illustrated in Figure 1, the utilization of drug-protein interactions and protein-protein interactions can improve the efficacy and accuracy of drug-drug interaction prediction. The establishment of a chain association has the potential to affect the molecular interactions.
2.2. Data Analysis
(a) 2D relations between three types of interactions. (b) 3D relations between three types of interactions.
Figure 2. Meta path in dataset
We used the DrugBank Multi-Typed DDI (TDcommons) dataset to predict potential drug-drug interactions. In addition, the Drug-Protein and Protein-Protein interactions data extracted from the PrimeKG database are used as a supplement for improved prediction. As portrayed in Figure 2a and Figure 2b, a brief illustration of DDIs, DPIs, and PPIs is provided. Brompheniramine, megestrol acetate, and methimazole were chosen to demonstrate drug and protein interactions. In the meta path, both Brompheniramine and megestrol acetate can interact with a specific protein. This implies that one drug has the potential to influence the other drug by altering the protein’s quantity.
The dataset contains a comprehensive list of 1,706 unique drug types and 1,544 specific protein types. Based on the calculations performed, the analysis reveals 191,808 distinct types of drug-drug interactions, 12,801 distinct types of drug-protein interactions, and 6,793 distinct types of protein-protein interactions.
Figure 3. The architecture of the heterogeneous graph for two nodes and four edges with Drug-Drug interactions, Drug- Protein Interactions and Protein-Protein interactions.
The heterogeneous graph is constructed as the general structure depicted in Figure 3, which is a potent knowledge representation that integrates three distinct datasets: DDIs, DTIs, and PPIs. Traditional methods for predicting DDI types rely heavily on analyzing of individual drug pairs. However, the heterogeneous graph permits us to model complex interactions and dependencies between drugs, targets, and proteins within a unified framework. GNNs can effectively capture these interconnected relationships, which leads to more accurate data representations.
Table 1. Six types of interaction types which are grouped based on their features
Numbers | Type |
1 | Absorption |
2 | Distribution |
3 | Metabolism |
4 | Excretion |
5 | Toxicity |
6 | Effects (not necessarily bad) |
Based on the analysis of a dataset containing 86 types of Drug-Drug Interactions, it has been determined that the interactions can be effectively categorized into six distinct groups based on their detailed information for their different impact on various human body systems. The types of interactions are shown in Table 1: absorption, distribution, metabolism, excretion, toxicity, and effects. Among the interaction types 46, 47, and 73, the type 6 interactions, which relate primarily to the effects of the drugs, occur most frequently. This emphasizes the importance of understanding the potential effects of drug combinations on patients, as these interactions can substantially impact their health outcomes. In addition, Figure 4 visually represents the distribution of Drug-Drug Interactions across these six groups. It reveals that type 6 interactions are the most prevalent among the various groups, highlighting the need to comprehend the impact of drug combinations on patients.
Figure 4. Distribution of Interaction Types of Drug-Drug Interactions.
Figure 4 shows the Type 49 interactions appear most frequently then others and the whole data distribution is certainly sparse.
2.3. Model Architecture
2.3.1. ChemBERTa
ChemBERTa, a BERT (Bidirectional Encoder Representations from Transformers) language model, has been pre-trained on 77M data to analyze and manipulate chemical and molecular data to forecast chemical properties and generate molecules [14]. It encodes SMILES strings to hold semantic information about drugs, which can provide more informative content. ChemBERTa is useful in drug discovery and other chemistry tasks because it can understand chemical language and context.
2.3.2. ESM-1b
ESM-1b, a general-purpose protein language model, can predict protein structure, function, and other properties from sequences [15]. It is capable of translating protein sequences into language, which provides a contextualized protein sequence representation that captures important features and patterns. Our model uses ESM-1b to create a vector that captures the protein’s unique properties. The format can be used for protein-protein interaction analysis and assist in predicting drug-drug interactions.
2.3.3. RDKits
RDKits is free molecular modeling and cheminformatics research software, which provides many molecular structure manipulation and analysis tools [12]. It can help us understand complex molecule interactions, which is useful in predicting drug-drug interactions, where molecules with information to targets are needed for Morgan Fingerprint which will be used to calculate the similarity of molecules in our approach.
2.3.4. Graph Neural Network (GNN)
Graph Neural Networks (GNNs) are a category of deep learning architectures designed specifically for analysing graph-structured data, such as molecular graphs [16]. GNNs facilitate the acquisition of non-local connections and higher-level structural information within graph structures, which allows a more comprehensive understanding of medication interactions.
The Graph Convolutional Network (GCN) and Graph Attention Network (GAT) models are applied in this paper. The GCN is a graph neural network that collects and integrates local neighborhood data from a node in a graph structure, which can graph medicine relationships for DDI prediction. Drugs are nodes, and their interactions are edges in this graph. It uses a predetermined graph convolution operation to iteratively improve node representations by integrating adjacent nodes [17]. The GAT uses attention mechanisms to capture adjacent significance during information aggregation, which facilitates nodes to assign different attention weights to their neighbors, enabling them to focus on more relevant and informative neighbors. The attention mechanism in GAT makes it possible for the model to capture the diverse significance of neighboring nodes [10].
2.3.5. Link Prediction
Link prediction is a machine learning and graph analysis specialization that forecasts the probability of a connection between two nodes within a given graph. In the context of DDI prediction, the link prediction technique can be used to identify potential drug-drug interactions [18]. It assists in developing new medicines by analyzing the complex relationships between numerous molecular attributes.
2.4. Overview of HGNN-DDI Model
Figure 5. Model Architecture of the HGNN-DDI for Drug-Drug Interactions Prediction
When building our model, we relied on the Deep Graph Library (DGL), a powerful framework tailored for graph neural networks, which enabled us to effectively tackle the complexity of DDIs [19]. The architecture of our model is shown in Figure 5, and the implementation of our model proceeds as follows:
1. Input Processing: The SMILES representations of drugs are put into the ChemBERTa model for preprocessing. The chemical structure of proteins is input into the ESM-1b model for feature extraction and processing.
2. Graph Construction: The preprocessed data is represented as a graph, with drugs and proteins as nodes and their interactions and similarity as edges. This graph-based representation generates a network that elegantly summarises the interactions between different drugs and proteins.
3. Graph Convolutional Network (GCN): A three-layer GCN is applied to incorporate this knowledge effectively. As the model goes through multiple graph convolution layers, it gradually refines and consolidates its understanding of DDI interactions, resulting in representations that are abundant in features.
4. Graph Attention Network (GAT): It allows each node to assign different attention weights to each neighbor, enabling the node to focus on more significant and instructive neighbors.
5. Link Prediction and Multi-Layer Perception (MLP): The GNN-derived enriched feature vectors are seamlessly incorporated into a Multi-Layer Perception (MLP), which is well-suited for transforming the features into a classification scheme. It can capture complex nonlinear relationships, making it suitable for accurately predicting interactions between various drugs.
3. Experiments and Results
Through the methods mentioned above, we conducted training and testing of the model on the meticulously curated dataset with DDI, DPI, and PPI interactions. Moreover, to improve the ability of predictions for unknown drugs, we supplied the model with similarity information calculated by SMILES. We established that edges representing the two nodes are similar for molecular pairs exhibiting a similarity of over 0.7.
\( {accuracy_{i}}=\frac{T{P_{i}}+T{N_{i}}}{T{P_{i}}+T{N_{i}}+F{P_{i}}+F{N_{i}}} \) (1)
\( {precision_{i}}=\frac{T{P_{i}}}{T{P_{i}}+F{P_{i}}} \) (2)
\( {recall_{i}}=\frac{T{P_{i}}}{T{P_{i}}+F{N_{i}}} \) (3)
\( {F1_{i}}=\frac{2 × precisioni × {recall_{i}}}{{precision_{i}}+{recall_{i}}} \) (4)
\( {W_{i}}=\frac{{N_{i}}}{\sum {N_{i}}} \) (5)
\( {accuracy_{weighted}}=\sum {W_{i}}∙{accuracy_{i}} \) (6)
\( {precision_{weighted}}=\sum {W_{i}}∙{precision_{i}} \) (7)
\( {recall_{weighted}}=\sum {W_{i}}∙{recall_{i}} \) (8)
\( {F1_{weighted}}=\sum {W_{i}}∙{F1_{i}} \) (9)
Figure 6. The f1 score, loss, precision and recall of HGNN-DDI model
During the testing phase, negative samples constituted 10% of the entire test set, obtained by randomly sampling negative edges from the complete test graph. Commonly used evaluation metrics such as accuracy, precision, recall, and f1 score are used to assess the performance of the model, which equations are shown above. Throughout the model training process, the parameter curves exhibited the trends shown in Figure 6.
As mentioned above, we have tried both HGAT-DDI and HGCN-DDI models, and we apply the machine learning evaluation metric of F1, Recall, Precision, and Accuracy to measure the performance of our model. By searching the relevant models that applied different models, we compare our model with them, for we are all based on the same dataset. The HGAT-DDI model is with the attention mechanisms, which means it can focus on more significant models. It is suggested to perform better that the HGCN-DDI model.
Table 2. The performance of different algorithms on the Drugbank dataset
Model | Method | F1 | Recall | Precision | Accuracy | |
GNN on DDI model | GAT | 84.72 | 85.07 | 84.36 | 84.65 | |
GCN | 85.02 | 86.29 | 83.78 | 84.79 | ||
GraphSAGE | 84.18 | 85.5 | 82.9 | 83.93 | ||
GE on DDI graph | Node2Vec | 79.63 | 78.35 | 80.95 | 88.73 | |
DeepWalk | 78.8 | 78.27 | 79.44 | 88.64 | ||
GNN on Heterogeneous graph | GAT | 85.68 | 86.28 | 85.99 | 86.06 | |
GCN | 88.28 | 89.01 | 85.49 | 89.68 | ||
GraphSAGE | 87.79 | 88.31 | 86.94 | 85.59 | ||
Our model based on Heterogeneous graph | HGCN | 6 classes | 96.91 | 94.23 | 97.01 | 96.86 |
86 classes | 90.16 | 88.45 | 90.88 | 90.47 | ||
HGAT | 86 classes | 90.35 | 90.12 | 89.68 | 90.21 |
We present two matrices: one displays the outcomes for a dataset categorized into eighty-six distinct groups, and the other shows the results for the dataset segmented into six different categories. The six categories are defined as follows: Class1 pertains to absorption processes, Class2 to distribution, Class3 to metabolism, Class4 to excretion, Class5 to toxicity, and Class6 to neutral effects, which don't necessarily indicate adverse outcomes. On the other hand, the eighty-six categories are based on varying degrees of severity in drug-drug interactions.
Our experimental findings indicate that the classification accuracy was markedly higher for the dataset when it was divided into the broader six classes, as opposed to the more detailed eighty-six class structure. In the context of the six-class system, the distinctions between each group are more pronounced, thus facilitating the model's ability to differentiate between the classes. However, in the 86-class scenario, the nuances between some of the categories are less distinct, posing a challenge for the model in achieving precise classification.
Contrary to our expectations, the HGAT model showed minimal improvement compared to the HGCN model. In the experiment, the Heterogeneous Graph Convolutional Neural Network model achieved a maximum F1 score of 0.969, with precision, recall, and accuracy all exceeding 90%. In the dataset with 10% negative samples, the HGAT model’s performance was nearly identical to HGCN, with all metrics hovering around 90%. As shown in Table 2 above, it compares performance metrics between our algorithm and other algorithms on the same dataset. It is obvious that our model is performing better than our models with the combination of Protein-Protein interactions and Drug-Protein Interactions as well as the Heterogeneous graph to link these complex relationships together and demonstrate it clearly.
Based on this result, the current bottleneck affecting the algorithm’s performance is believed to lie within the dataset. Additionally, using the simple MLP classifier without any tricks for classification using embedding vectors might also influence the algorithm’s performance, potentially masking the distinctive characteristics of the GNN (Graph Neural Network).
4. Conclusion and Discussion
In conclusion, we proposed a novel deep learning framework that combines large-scale pre-trained language models with graph neural networks to integrate information inherent to the structure of drugs and proteins and the topological information of their position and interaction in the knowledge graph for multi-class DDI predictions. We have demonstrated that the efficacy of our model is comparable to that of state-of-the-art models that combine both knowledge graphs and drug molecular graphs despite being somewhat simpler in architecture and requiring less training time. Moreover, the high performance of our simple model is evident from the negligible F1 improvement of the HGAT model over it. Our confusion matrix also shows that we have successfully overcome the problem of promiscuously interacting drugs, reducing model accuracy that plagues relatively well-performing traditional machine learning algorithms such as Decision Trees.
5. Future Goals
Currently, in order to improve the accuracy of drug-drug interaction prediction, heterogeneous graphs are being exploited by GNN to improve drug-drug interaction prediction. In addition, we have incorporated the examination of Protein-Protein and Protein-Drug interactions into our methodology to help the prediction of drug-drug interactions. After performing a preliminary analysis utilizing SMILES representations for drug-drug interactions (DDI) and sequence for protein-protein interactions (PPI), we acquire the characteristics of drugs and proteins. By including DPIs and PPIs, we achieve our goal of overcoming the limitations of the original dataset. Future research objectives include integrating data containing molecular 3D conformations. The addition of additional information has the potential to increase the comprehensiveness of molecular structure and interaction types, thereby enhancing the accuracy and interpretability of our models. The use of SMILES strings data only may contain limited information, which should be improved in the future, and additional research should be conducted. To increase the exhaustiveness of the experiment, future steps, including validation, hyperparameter tuning, and further evaluation, will be taken.
Author Contribution
Hongbo Liu: Set up training server, Implement training and network structure, Feature Extraction, Problem solving in coding and working environment.
Siyi Li: New model ideas, Literature Review, Visualize the dataset and model architecture, Model Construction of GAT, RDKit similarity Calculation.
Zheng Yu: Dataset Organization and collection, Developing toolkits of operating datasets, Molecular similarity Calculation, SNAP dataset Investigation.
Acknowledge
Prof. Kellis, Ash, Zhu Zhang (who are serving as TAs for this project) have offered their aid in advising our project and answering our questions. Their suggestions are useful for us to achieve our goals and adjust the wrong steps while facing difficulties.
Hongbo Liu, Siyi Li, and Zheng Yu contributed equally to this work and should be considered co-first authors.
References
[1]. W. Hu, W. Zhang, Y. Zhou, Y. Luo, X. Sun, H. Xu, S. Shi, T. Li, Y. Xu, Q. Yang, Y. Qiu, F. Zhu, H. Dai, Mecddi: Clarified drug–drug interaction mechanism facilitating rational drug use and potential drug–drug interaction pre- diction, Journal of Chemical Information and Modeling 63 (5) (2023) 1626–1636. doi:10.1021/acs.jcim. 2c01656.
[2]. I. Cascorbi, Drug interactions—principles, examples and clinical consequences, Deutsches A¨ rzteblatt International109 (2012) 546–556. doi:10.3238/arztebl.2012.0546.
[3]. L. A. Kondili, G. B. Gaeta, D. Ieluzzi, A. L. Zignego, M. Monti, A. Gori, A. Soria, G. Raimondo, R. Filomia, A. Di Leo, et al, Real-life data on potential drug-drug interactions in patients with chronic hepatitis c viral infection undergoing antiviral therapy with interferon-free daas in the piter cohort study, PloS One 12 (2) (2017). doi: 10.1371/journal.pone.0172159.
[4]. J. Niu, R. M. Straubinger, D. E. Mager, Pharmacodynamic drug–drug interactions, Clinical Pharmacology and Therapeutics 105 (6) (2019) 1395–1406. doi:10.1002/cpt.1434.
[5]. J. E. Zwart-van Rijkom, E. V. Uijtendaal, M. J. Ten Berg, W. W. Van Solinge, A. C. Egberts, Frequency and nature of drug–drug interactions in a dutch university hospital, British journal of clinical pharmacology 68 (2) (2009) 187–193.
[6]. V. Gligorijevic´, P. D. Renfrew, T. Kosciolek, J. K. Leman, D. Berenberg, T. Vatanen, C. Chandler, B. C. Tay- lor, I. M. Fisk, H. Vlamakis, R. J. Xavier, R. Knight, K. Cho, R. Bonneau, Structure-based protein function prediction using graph convolutional networks, Nature communications 12 (1) (2021) 3168. doi:10.1038/ s41467-021-23303-9.
[7]. X. Lin, Z. Quan, Z.-J. Wang, T. Ma, X. Zeng, Kgnn: Knowledge graph neural network for drug-drug interaction prediction, Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence (2020). doi: 10.24963/ijcai.2020/380.
[8]. Y. Chen, T. Ma, X. Yang, J. Wang, B. Song, X. Zeng, Muffin: multi-scale feature fusion for drug–drug interaction prediction, Bioinformatics 37 (17) (2021) 2651–2658. doi:10.1093/bioinformatics/btab169.
[9]. H. He, G. Chen, C. Yu-Chian Chen, 3dgt-ddi: 3d graph and text based neural network for drug-drug interaction prediction, Briefings in bioinformatics 23 (3) (2022). doi:10.1093/bib/bbac134.
[10]. J. Y. Ryu, H. U. Kim, S. Y. Lee, Deep learning improves prediction of drug–drug and drug–food interactions, Proceedings of the National Academy of Sciences 115 (18) (2018). doi:10.1073/pnas.1803294115.
[11]. M. Herrero-Zazo, I. Segura-Bedmar, P. Mart´ınez, R. Declerck, The ddi corpus: An annotated corpus with phar- macological substances and drug–drug interactions, Journal of Biomedical Informatics 46 (5) (2013) 914–920. doi:10.1016/j.jbi.2013.07.011.
[12]. M. Lovric´, J. M. Molero, R. Kern, Pyspark and rdkit: Moving towards big data in cheminformatics, Molecular Informatics 38 (6) (2019). doi:10.1002/minf.201800082.
[13]. P. Chandak, PrimeKG (2022). doi:10.7910/DVN/IXA7BM. https://doi.org/10.7910/DVN/IXA7BM
[14]. S. Chithrananda, G. Grand, B. Ramsundar, Chemberta: Large-scale self-supervised pretraining for molecular prop- erty prediction (2020). arXiv:2010.09885.
[15]. Z. Lin, H. Akin, R. Rao, B. Hie, Z. Zhu, et al, Evolutionary-scale prediction of atomic level protein structure with a language model, bioRxiv (2022). arXiv:https://www.biorxiv.org/content/early/2022/10/31/2022. 07.20.500902.full.pdf, doi:10.1101/2022.07.20.500902. https://www.biorxiv.org/content/early/2022/10/31/2022.07.20.500902
[16]. Y. Feng, S. Zhang, Prediction of drug-drug interaction using an attention-based graph neural network on drug molecular graphs, Molecules (Basel, Switzerland) 27 (9) (2022) 3004. doi:10.3390/molecules27093004.
[17]. K. Shao, Y. Zhang, Y. Wen, Z. Zhang, S. He, X. Bo, Dti-heta: Prediction of drug–target interactions based on gcn and gat on heterogeneous graph, Briefings in Bioinformatics 23 (3) (2022). doi:10.1093/bib/bbac109.
[18]. A. Rossi, D. Barbosa, D. Firmani, A. Matinata, P. Merialdo, Knowledge graph embedding for link prediction: A comparative analysis, ACM Transactions on Knowledge Discovery from Data 15 (2) (2021) 1–49. doi:10.1145/ 3424672.
[19]. B. Tanoori, M. Z. Jahromi, E. G. Mansoori, Drug-target continuous binding affinity prediction using multiple sources of information, Expert Systems with Applications 186 (2021) 115810. doi:10.1016/j.eswa.2021. 115810.
Cite this article
Liu,H.;Li,S.;Yu,Z. (2024). Predicting drug-drug interactions using heterogeneous graph neural networks: HGNN-DDI. Applied and Computational Engineering,79,77-89.
Data availability
The datasets used and/or analyzed during the current study will be available from the authors upon reasonable request.
Disclaimer/Publisher's Note
The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of EWA Publishing and/or the editor(s). EWA Publishing and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
About volume
Volume title: Proceedings of the 4th International Conference on Signal Processing and Machine Learning
© 2024 by the author(s). Licensee EWA Publishing, Oxford, UK. This article is an open access article distributed under the terms and
conditions of the Creative Commons Attribution (CC BY) license. Authors who
publish this series agree to the following terms:
1. Authors retain copyright and grant the series right of first publication with the work simultaneously licensed under a Creative Commons
Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this
series.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the series's published
version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial
publication in this series.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and
during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See
Open access policy for details).
References
[1]. W. Hu, W. Zhang, Y. Zhou, Y. Luo, X. Sun, H. Xu, S. Shi, T. Li, Y. Xu, Q. Yang, Y. Qiu, F. Zhu, H. Dai, Mecddi: Clarified drug–drug interaction mechanism facilitating rational drug use and potential drug–drug interaction pre- diction, Journal of Chemical Information and Modeling 63 (5) (2023) 1626–1636. doi:10.1021/acs.jcim. 2c01656.
[2]. I. Cascorbi, Drug interactions—principles, examples and clinical consequences, Deutsches A¨ rzteblatt International109 (2012) 546–556. doi:10.3238/arztebl.2012.0546.
[3]. L. A. Kondili, G. B. Gaeta, D. Ieluzzi, A. L. Zignego, M. Monti, A. Gori, A. Soria, G. Raimondo, R. Filomia, A. Di Leo, et al, Real-life data on potential drug-drug interactions in patients with chronic hepatitis c viral infection undergoing antiviral therapy with interferon-free daas in the piter cohort study, PloS One 12 (2) (2017). doi: 10.1371/journal.pone.0172159.
[4]. J. Niu, R. M. Straubinger, D. E. Mager, Pharmacodynamic drug–drug interactions, Clinical Pharmacology and Therapeutics 105 (6) (2019) 1395–1406. doi:10.1002/cpt.1434.
[5]. J. E. Zwart-van Rijkom, E. V. Uijtendaal, M. J. Ten Berg, W. W. Van Solinge, A. C. Egberts, Frequency and nature of drug–drug interactions in a dutch university hospital, British journal of clinical pharmacology 68 (2) (2009) 187–193.
[6]. V. Gligorijevic´, P. D. Renfrew, T. Kosciolek, J. K. Leman, D. Berenberg, T. Vatanen, C. Chandler, B. C. Tay- lor, I. M. Fisk, H. Vlamakis, R. J. Xavier, R. Knight, K. Cho, R. Bonneau, Structure-based protein function prediction using graph convolutional networks, Nature communications 12 (1) (2021) 3168. doi:10.1038/ s41467-021-23303-9.
[7]. X. Lin, Z. Quan, Z.-J. Wang, T. Ma, X. Zeng, Kgnn: Knowledge graph neural network for drug-drug interaction prediction, Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence (2020). doi: 10.24963/ijcai.2020/380.
[8]. Y. Chen, T. Ma, X. Yang, J. Wang, B. Song, X. Zeng, Muffin: multi-scale feature fusion for drug–drug interaction prediction, Bioinformatics 37 (17) (2021) 2651–2658. doi:10.1093/bioinformatics/btab169.
[9]. H. He, G. Chen, C. Yu-Chian Chen, 3dgt-ddi: 3d graph and text based neural network for drug-drug interaction prediction, Briefings in bioinformatics 23 (3) (2022). doi:10.1093/bib/bbac134.
[10]. J. Y. Ryu, H. U. Kim, S. Y. Lee, Deep learning improves prediction of drug–drug and drug–food interactions, Proceedings of the National Academy of Sciences 115 (18) (2018). doi:10.1073/pnas.1803294115.
[11]. M. Herrero-Zazo, I. Segura-Bedmar, P. Mart´ınez, R. Declerck, The ddi corpus: An annotated corpus with phar- macological substances and drug–drug interactions, Journal of Biomedical Informatics 46 (5) (2013) 914–920. doi:10.1016/j.jbi.2013.07.011.
[12]. M. Lovric´, J. M. Molero, R. Kern, Pyspark and rdkit: Moving towards big data in cheminformatics, Molecular Informatics 38 (6) (2019). doi:10.1002/minf.201800082.
[13]. P. Chandak, PrimeKG (2022). doi:10.7910/DVN/IXA7BM. https://doi.org/10.7910/DVN/IXA7BM
[14]. S. Chithrananda, G. Grand, B. Ramsundar, Chemberta: Large-scale self-supervised pretraining for molecular prop- erty prediction (2020). arXiv:2010.09885.
[15]. Z. Lin, H. Akin, R. Rao, B. Hie, Z. Zhu, et al, Evolutionary-scale prediction of atomic level protein structure with a language model, bioRxiv (2022). arXiv:https://www.biorxiv.org/content/early/2022/10/31/2022. 07.20.500902.full.pdf, doi:10.1101/2022.07.20.500902. https://www.biorxiv.org/content/early/2022/10/31/2022.07.20.500902
[16]. Y. Feng, S. Zhang, Prediction of drug-drug interaction using an attention-based graph neural network on drug molecular graphs, Molecules (Basel, Switzerland) 27 (9) (2022) 3004. doi:10.3390/molecules27093004.
[17]. K. Shao, Y. Zhang, Y. Wen, Z. Zhang, S. He, X. Bo, Dti-heta: Prediction of drug–target interactions based on gcn and gat on heterogeneous graph, Briefings in Bioinformatics 23 (3) (2022). doi:10.1093/bib/bbac109.
[18]. A. Rossi, D. Barbosa, D. Firmani, A. Matinata, P. Merialdo, Knowledge graph embedding for link prediction: A comparative analysis, ACM Transactions on Knowledge Discovery from Data 15 (2) (2021) 1–49. doi:10.1145/ 3424672.
[19]. B. Tanoori, M. Z. Jahromi, E. G. Mansoori, Drug-target continuous binding affinity prediction using multiple sources of information, Expert Systems with Applications 186 (2021) 115810. doi:10.1016/j.eswa.2021. 115810.