Research article Special Issues

GOF/LOF knowledge inference with tensor decomposition in support of high order link discovery for gene, mutation and disease

  • Received: 14 December 2019 Accepted: 22 January 2019 Published: 20 February 2019
  • For discovery of new usage of drugs, the function type of their target genes plays an important role, and the hypothesis of "Antagonist-GOF" and "Agonist-LOF" has laid a solid foundation for supporting drug repurposing. In this research, an active gene annotation corpus was used as training data to predict the gain-of-function or loss-of-function or unknown character of each human gene after variation events. Unlike the design of(entity, predicate, entity) triples in a traditional three way tensor, a four way and a five way tensor, GMFD-/GMAFD-tensor, were designed to represent higher order links among or among part of these entities: genes(G), mutations(M), functions(F), diseases(D) and annotation labels(A). A tensor decomposition algorithm, CP decomposition, was applied to the higher order tensor and to unveil the correlation among entities. Meanwhile, a state-of-the-art baseline tensor decomposition algorithm, RESCAL, was carried on the three way tensor as a comparing method. The result showed that CP decomposition on higher order tensor performed better than RESCAL on traditional three way tensor in recovering masked data and making predictions. In addition, The four way tensor was proved to be the best format for our issue. At the end, a case study reproducing two disease-gene-drug links(Myelodysplatic Syndromes-IL2RA-Aldesleukin, Lymphoma- IL2RA-Aldesleukin) presented the feasibility of our prediction model for drug repurposing.

    Citation: Kaiyin Zhou, YuxingWang, Sheng Zhang, Mina Gachloo, Jin-Dong Kim, Qi Luo, Kevin Bretonnel Cohen, Jingbo Xia. GOF/LOF knowledge inference with tensor decomposition in support of high order link discovery for gene, mutation and disease[J]. Mathematical Biosciences and Engineering, 2019, 16(3): 1376-1391. doi: 10.3934/mbe.2019067

    Related Papers:

    [1] Julien Arino, Fred Brauer, P. van den Driessche, James Watmough, Jianhong Wu . A final size relation for epidemic models. Mathematical Biosciences and Engineering, 2007, 4(2): 159-175. doi: 10.3934/mbe.2007.4.159
    [2] Fred Brauer, Zhisheng Shuai, P. van den Driessche . Dynamics of an age-of-infection cholera model. Mathematical Biosciences and Engineering, 2013, 10(5&6): 1335-1349. doi: 10.3934/mbe.2013.10.1335
    [3] Hai-Feng Huo, Qian Yang, Hong Xiang . Dynamics of an edge-based SEIR model for sexually transmitted diseases. Mathematical Biosciences and Engineering, 2020, 17(1): 669-699. doi: 10.3934/mbe.2020035
    [4] Jianquan Li, Yiqun Li, Yali Yang . Epidemic characteristics of two classic models and the dependence on the initial conditions. Mathematical Biosciences and Engineering, 2016, 13(5): 999-1010. doi: 10.3934/mbe.2016027
    [5] Christine K. Yang, Fred Brauer . Calculation of R0 for age-of-infection models. Mathematical Biosciences and Engineering, 2008, 5(3): 585-599. doi: 10.3934/mbe.2008.5.585
    [6] Julijana Gjorgjieva, Kelly Smith, Gerardo Chowell, Fabio Sánchez, Jessica Snyder, Carlos Castillo-Chavez . The Role of Vaccination in the Control of SARS. Mathematical Biosciences and Engineering, 2005, 2(4): 753-769. doi: 10.3934/mbe.2005.2.753
    [7] Zhiping Liu, Zhen Jin, Junyuan Yang, Juan Zhang . The backward bifurcation of an age-structured cholera transmission model with saturation incidence. Mathematical Biosciences and Engineering, 2022, 19(12): 12427-12447. doi: 10.3934/mbe.2022580
    [8] Yu Tsubouchi, Yasuhiro Takeuchi, Shinji Nakaoka . Calculation of final size for vector-transmitted epidemic model. Mathematical Biosciences and Engineering, 2019, 16(4): 2219-2232. doi: 10.3934/mbe.2019109
    [9] Yuri Amemiya, Tianwen Li, Hiroshi Nishiura . Age-dependent final size equation to anticipate mortality impact of COVID-19 in China. Mathematical Biosciences and Engineering, 2023, 20(6): 11353-11366. doi: 10.3934/mbe.2023503
    [10] Yicang Zhou, Zhien Ma . Global stability of a class of discrete age-structured SIS models with immigration. Mathematical Biosciences and Engineering, 2009, 6(2): 409-425. doi: 10.3934/mbe.2009.6.409
  • For discovery of new usage of drugs, the function type of their target genes plays an important role, and the hypothesis of "Antagonist-GOF" and "Agonist-LOF" has laid a solid foundation for supporting drug repurposing. In this research, an active gene annotation corpus was used as training data to predict the gain-of-function or loss-of-function or unknown character of each human gene after variation events. Unlike the design of(entity, predicate, entity) triples in a traditional three way tensor, a four way and a five way tensor, GMFD-/GMAFD-tensor, were designed to represent higher order links among or among part of these entities: genes(G), mutations(M), functions(F), diseases(D) and annotation labels(A). A tensor decomposition algorithm, CP decomposition, was applied to the higher order tensor and to unveil the correlation among entities. Meanwhile, a state-of-the-art baseline tensor decomposition algorithm, RESCAL, was carried on the three way tensor as a comparing method. The result showed that CP decomposition on higher order tensor performed better than RESCAL on traditional three way tensor in recovering masked data and making predictions. In addition, The four way tensor was proved to be the best format for our issue. At the end, a case study reproducing two disease-gene-drug links(Myelodysplatic Syndromes-IL2RA-Aldesleukin, Lymphoma- IL2RA-Aldesleukin) presented the feasibility of our prediction model for drug repurposing.




    [1] M. Simsek, B. Meijer, A. A. van Bodegraven, N. K de Boer, and J. J. M. Chris, Finding hidden treasures in old drugs: the challenges and importance of licensing generics, Drug Discov. Today, 23(2018), 17–21.
    [2] N C. Baker, S. Ekins, A. J. Williams and A. Tropsha, A bibliometric review of drug repurposing, Drug Discov. Today, (2018).
    [3] F. L. Hitchcock, The expression of a tensor or a polyadic as a sum of products, J. Math. Phys., 6 (1927), 164–189.
    [4] N. Maximilian, V. Tresp and H. P. Kriegel, A three-Way model for collective learning on multirelational data, ICML, 11 (2011), 809–816.
    [5] N. Madhav, M. Gupta and P. Talukdar, Higher-order relation schema induction using tensor factorization with back-o and aggregation, In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 1 (2018), 1575–1584.
    [6] N. Madhav, U. S. Saini and P. Talukdar, Relation schema induction using tensor factorization with side information, arXiv preprint, arXiv:1605.04227 (2016).
    [7] L. Timothée, N. Usunier and G. Obozinski, Canonical tensor decomposition for knowledge base completion, arXiv preprint, arXiv:1806.07297 (2018).
    [8] J. C. Ho, J. Ghosh, S. R. Steinhubl,W. F. Stewart, J. C. Denny, B. A. Malin and J Sun, Limestone: High-throughput candidate phenotype generation via tensor factorization, J. Biomed. Inform., 52 (2014), 199–211.
    [9] J. Fang and W. Jonathan, Tightly integrated genomic and epigenomic data mining using tensor decomposition, Bioinformatics, (2018), 1–7.
    [10] Y. H. Taguchi, Identification of candidate drugs using tensor-decomposition-based unsupervised feature extraction in integrated analysis of gene expression between diseases and DrugMatrix datasets, Sci. Rep., 7 (2017), 13733.
    [11] Y. Wang, X. Yao, K. Zhou, X. Qin, J. D. Kim, K. B Cohen and J. Xia, Guideline design of an active gene annotation corpus for the purpose of drug repurposing, 2018 11th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics(CISP-BMEI 2018), Oct, 2018, Beijing. (Accepted)
    [12] T. G. Kolda and W. B. Bader, Tensor decompositions and applications. SIAM Rev., 51 (2009), 455–500.
    [13] R. Stephan, O. Shchur and S. Günnemann, Introduction to tensor decompositions and their applications in machine learning, arXiv preprint, 1711(2017),10781.
    [14] L. Hao, S. Liang, J. Ye and Z. Xu, TensorD: A tensor decomposition library in TensorFlow, Neurocomputing, 318(2018), 196–200.
  • This article has been cited by:

    1. Menachem Lachiany, Yoram Louzoun, Effects of distribution of infection rate on epidemic models, 2016, 94, 2470-0045, 10.1103/PhysRevE.94.022409
    2. Nancy Hernandez-Ceron, Zhilan Feng, Carlos Castillo-Chavez, Discrete Epidemic Models with Arbitrary Stage Distributions and Applications to Disease Control, 2013, 75, 0092-8240, 1716, 10.1007/s11538-013-9866-x
    3. Pandemic influenza: Modelling and public health perspectives, 2011, 8, 1551-0018, 1, 10.3934/mbe.2011.8.1
    4. L.F. Mondolfo, 1976, 9780408709323, 385, 10.1016/B978-0-408-70932-3.50088-7
    5. Guy Katriel, The size of epidemics in populations with heterogeneous susceptibility, 2012, 65, 0303-6812, 237, 10.1007/s00285-011-0460-2
    6. Arun Solanki, Tarana Singh, 2021, Chapter 3, 978-3-030-60038-9, 57, 10.1007/978-3-030-60039-6_3
    7. Fan Bai, Uniqueness of Nash equilibrium in vaccination games, 2016, 10, 1751-3758, 395, 10.1080/17513758.2016.1213319
    8. Joel C. Miller, A Note on the Derivation of Epidemic Final Sizes, 2012, 74, 0092-8240, 2125, 10.1007/s11538-012-9749-6
    9. Baltazar Espinoza, Victor Moreno, Derdei Bichara, Carlos Castillo-Chavez, 2016, Chapter 9, 978-3-319-40411-0, 123, 10.1007/978-3-319-40413-4_9
    10. Fred Brauer, General compartmental epidemic models, 2010, 31, 0252-9599, 289, 10.1007/s11401-009-0454-1
    11. L.F. Mondolfo, 1976, 9780408709323, 384, 10.1016/B978-0-408-70932-3.50087-5
    12. L.F. Mondolfo, 1976, 9780408709323, 335, 10.1016/B978-0-408-70932-3.50058-9
    13. Fred Brauer, Carlos Castillo-Chavez, Zhilan Feng, 2019, Chapter 1, 978-1-4939-9826-5, 3, 10.1007/978-1-4939-9828-9_1
    14. P. van den Driessche, Zhisheng Shuai, Fred Brauer, Dynamics of an age-of-infection cholera model, 2013, 10, 1551-0018, 1335, 10.3934/mbe.2013.10.1335
    15. L.F. Mondolfo, 1976, 9780408709323, 380, 10.1016/B978-0-408-70932-3.50083-8
    16. Mahnaz Alavinejad, Jianhong Wu, Coupled Systems of Renewal Equations for Forces of Infection through a Contact Network, 2020, 63, 0008-4395, 624, 10.4153/S0008439519000705
    17. G. Katriel, R. Yaari, A. Huppert, U. Roll, L. Stone, Modelling the initial phase of an epidemic using incidence and infection network data: 2009 H1N1 pandemic in Israel as a case study, 2011, 8, 1742-5689, 856, 10.1098/rsif.2010.0515
    18. Jummy Funke David, Epidemic models with heterogeneous mixing and indirect transmission, 2018, 12, 1751-3758, 375, 10.1080/17513758.2018.1467506
    19. David J. D. Earn, Paul W. Andrews, Benjamin M. Bolker, Population-level effects of suppressing fever, 2014, 281, 0962-8452, 20132570, 10.1098/rspb.2013.2570
    20. Calistus N. Ngonghala, Enahoro Iboi, Steffen Eikenberry, Matthew Scotch, Chandini Raina MacIntyre, Matthew H. Bonds, Abba B. Gumel, Mathematical assessment of the impact of non-pharmaceutical interventions on curtailing the 2019 novel Coronavirus, 2020, 325, 00255564, 108364, 10.1016/j.mbs.2020.108364
    21. Hiroshi Nishiura, Gerardo Chowell, Carlos Castillo-Chavez, Alessandro Vespignani, Did Modeling Overestimate the Transmission Potential of Pandemic (H1N1-2009)? Sample Size Estimation for Post-Epidemic Seroepidemiological Studies, 2011, 6, 1932-6203, e17908, 10.1371/journal.pone.0017908
    22. Fred Brauer, Mathematical epidemiology: Past, present, and future, 2017, 2, 24680427, 113, 10.1016/j.idm.2017.02.001
    23. Bahman Davoudi, Joel C. Miller, Rafael Meza, Lauren Ancel Meyers, David J. D. Earn, Babak Pourbohloul, Early Real-Time Estimation of the Basic Reproduction Number of Emerging Infectious Diseases, 2012, 2, 2160-3308, 10.1103/PhysRevX.2.031005
    24. Fred Brauer, A simple model for behaviour change in epidemics, 2011, 11, 1471-2458, 10.1186/1471-2458-11-S1-S3
    25. Ping Yan, Zhilan Feng, Variability order of the latent and the infectious periods in a deterministic SEIR epidemic model and evaluation of control effectiveness, 2010, 224, 00255564, 43, 10.1016/j.mbs.2009.12.007
    26. GUY KATRIEL, STOCHASTIC DISCRETE-TIME AGE-OF-INFECTION EPIDEMIC MODELS, 2013, 06, 1793-5245, 1250066, 10.1142/S1793524512500660
    27. Karyn L. Sutton, Danielle Robbins, H.Thomas Banks, Theoretical foundations for traditional and generalized sensitivity functions for nonlinear delay differential equations, 2013, 10, 1551-0018, 1301, 10.3934/mbe.2013.10.1301
    28. G. Röst, Z. Vizi, I. Z. Kiss, Pairwise approximation for SIR -type network epidemics with non-Markovian recovery , 2018, 474, 1364-5021, 20170695, 10.1098/rspa.2017.0695
    29. L.F. Mondolfo, 1976, 9780408709323, 413, 10.1016/B978-0-408-70932-3.50098-X
    30. Eleonora Messina, Mario Pezzella, Antonia Vecchio, A non-standard numerical scheme for an age-of-infection epidemic model, 2022, 9, 2158-2491, 239, 10.3934/jcd.2021029
    31. Xiaohao Guo, Yichao Guo, Zeyu Zhao, Shiting Yang, Yanhua Su, Benhua Zhao, Tianmu Chen, Computing R0 of dynamic models by a definition-based method, 2022, 7, 24680427, 196, 10.1016/j.idm.2022.05.004
    32. Raphaël Forien, Guodong Pang, Étienne Pardoux, Epidemic Models with Varying Infectivity, 2021, 81, 0036-1399, 1893, 10.1137/20M1353976
    33. Raphaël Forien, Guodong Pang, Étienne Pardoux, Multi-patch multi-group epidemic model with varying infectivity, 2022, 7, 2095-9672, 333, 10.3934/puqr.2022019
    34. Scott Greenhalgh, Carly Rozins, A generalized differential equation compartmental model of infectious disease transmission, 2021, 6, 24680427, 1073, 10.1016/j.idm.2021.08.007
    35. Jingan Cui, Yucui Wu, Songbai Guo, Effect of Non-homogeneous Mixing and Asymptomatic Individuals on Final Epidemic Size and Basic Reproduction Number in a Meta-Population Model, 2022, 84, 0092-8240, 10.1007/s11538-022-00996-7
    36. Alison Adams, Quiyana M. Murphy, Owen P. Dougherty, Aubrey M. Sawyer, Fan Bai, Christina J. Edholm, Evan P. Williams, Linda J.S. Allen, Colleen B. Jonsson, Data-driven models for replication kinetics of Orthohantavirus infections, 2022, 349, 00255564, 108834, 10.1016/j.mbs.2022.108834
    37. Dmitry V. Boguslavsky, Natalia P. Sharova, Konstantin S. Sharov, Cryptocurrency as Epidemiologically Safe Means of Transactions: Diminishing Risk of SARS-CoV-2 Spread, 2021, 9, 2227-7390, 3263, 10.3390/math9243263
    38. Sarafa A. Iyaniwura, Musa Rabiu, Jummy F. David, Jude D. Kong, Assessing the impact of adherence to Non-pharmaceutical interventions and indirect transmission on the dynamics of COVID-19: a mathematical modelling study, 2021, 18, 1551-0018, 8905, 10.3934/mbe.2021439
    39. Eleonora Messina, Mario Pezzella, Antonia Vecchio, Positive Numerical Approximation of Integro-Differential Epidemic Model, 2022, 11, 2075-1680, 69, 10.3390/axioms11020069
    40. Guodong Pang, Étienne Pardoux, Functional central limit theorems for epidemic models with varying infectivity, 2022, 1744-2508, 1, 10.1080/17442508.2022.2124870
    41. Jummy F. David, Sarafa A. Iyaniwura, Effect of Human Mobility on the Spatial Spread of Airborne Diseases: An Epidemic Model with Indirect Transmission, 2022, 84, 0092-8240, 10.1007/s11538-022-01020-8
    42. Luis Almeida, Pierre-Alexandre Bliman, Grégoire Nadin, Benoît Perthame, Nicolas Vauchelet, Final size and convergence rate for an epidemic in heterogeneous populations, 2021, 31, 0218-2025, 1021, 10.1142/S0218202521500251
    43. Jummy F. David, Sarafa A. Iyaniwura, Michael J. Ward, Fred Brauer, A novel approach to modelling the spatial spread of airborne diseases: an epidemic model with indirect transmission, 2020, 17, 1551-0018, 3294, 10.3934/mbe.2020188
    44. Fan Bai, An age-of-infection model with both symptomatic and asymptomatic infections, 2023, 86, 0303-6812, 10.1007/s00285-023-01920-w
    45. Eleonora Messina, Mario Pezzella, Antonia Vecchio, A long-time behavior preserving numerical scheme for age-of-infection epidemic models with heterogeneous mixing, 2023, 01689274, 10.1016/j.apnum.2023.04.009
    46. Maximilian M. Nguyen, Ari S. Freedman, Sinan A. Ozbay, Simon A. Levin, Fundamental bound on epidemic overshoot in the SIR model, 2023, 20, 1742-5662, 10.1098/rsif.2023.0322
    47. Jummy David, Gabrielle Brankston, Idriss Sekkak, Sungju Moon, Xiaoyan Li, Sana Jahedi, Zahra Mohammadi, Ao Li, Martin Grunnil, Pengfei Song, Woldegebriel Assefa, Nicola Bragazzi, Jianhong Wu, 2023, Chapter 1, 978-3-031-40804-5, 1, 10.1007/978-3-031-40805-2_1
    48. Aditi Ghosh, Pradyuta Padmanabhan, Anuj Mubayi, Padmanabhan Seshaiyer, Influence of distinct social contexts of long-term care facilities on the dynamics of spread of COVID-19 under predefine epidemiological scenarios, 2023, 11, 2544-7297, 10.1515/cmb-2023-0102
    49. Donald S. Burke, Origins of the problematic E in SEIR epidemic models, 2024, 24680427, 10.1016/j.idm.2024.03.003
    50. Phyu Phyu Win, Zhigui Lin, Mengyun Zhang, The final size and critical times of an SIVR epidemic model, 2025, 2025, 2731-4235, 10.1186/s13662-025-03902-2
    51. Sani Musa, Salisu Usaini, Idris Ahmed, Chanakarn Kiataramkul, Jessada Tariboon, Dynamics of the Diphtheria Epidemic in Nigeria: Insights from the Kano State Outbreak Data, 2025, 13, 2227-7390, 1189, 10.3390/math13071189
  • Reader Comments
  • © 2019 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(6190) PDF downloads(668) Cited by(9)

Article outline

Figures and Tables

Figures(6)  /  Tables(5)

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog