Designing neural networks for modeling biological data: A statistical perspective

  • Received: 01 October 2012 Accepted: 29 June 2018 Published: 01 October 2013
  • MSC : Primary: 62G08, 62H15; Secondary: 62F40, 92B20.

  • In this paper, we propose a strategy for the selection of the hidden layer size in feedforward neural network models. The procedure herein presented is based on comparison of different models in terms of their out of sample predictive ability, for a specified loss function. To overcome the problem of data snooping, we extend the scheme based on the use of the reality check with modifications apt to compare nested models. Some applications of the proposed procedure to simulated and real data sets show that it allows to select parsimonious neural network models with the highest predictive accuracy.

    Citation: Michele La Rocca, Cira Perna. Designing neural networks for modeling biological data: A statistical perspective[J]. Mathematical Biosciences and Engineering, 2014, 11(2): 331-342. doi: 10.3934/mbe.2014.11.331

    Related Papers:

    [1] Saranya Muniyappan, Arockia Xavier Annie Rayan, Geetha Thekkumpurath Varrieth . DTiGNN: Learning drug-target embedding from a heterogeneous biological network based on a two-level attention-based graph neural network. Mathematical Biosciences and Engineering, 2023, 20(5): 9530-9571. doi: 10.3934/mbe.2023419
    [2] Hongqiang Zhu . A graph neural network-enhanced knowledge graph framework for intelligent analysis of policing cases. Mathematical Biosciences and Engineering, 2023, 20(7): 11585-11604. doi: 10.3934/mbe.2023514
    [3] Biyun Hong, Yang Zhang . Research on the influence of attention and emotion of tea drinkers based on artificial neural network. Mathematical Biosciences and Engineering, 2021, 18(4): 3423-3434. doi: 10.3934/mbe.2021171
    [4] Diego Fasoli, Stefano Panzeri . Mathematical studies of the dynamics of finite-size binary neural networks: A review of recent progress. Mathematical Biosciences and Engineering, 2019, 16(6): 8025-8059. doi: 10.3934/mbe.2019404
    [5] Haoyu Wang, Xihe Qiu, Jinghan Yang, Qiong Li, Xiaoyu Tan, Jingjing Huang . Neural-SEIR: A flexible data-driven framework for precise prediction of epidemic disease. Mathematical Biosciences and Engineering, 2023, 20(9): 16807-16823. doi: 10.3934/mbe.2023749
    [6] Yongquan Zhou, Yanbiao Niu, Qifang Luo, Ming Jiang . Teaching learning-based whale optimization algorithm for multi-layer perceptron neural network training. Mathematical Biosciences and Engineering, 2020, 17(5): 5987-6025. doi: 10.3934/mbe.2020319
    [7] Lu Lu, Jiyou Fei, Ling Yu, Yu Yuan . A rolling bearing fault detection method based on compressed sensing and a neural network. Mathematical Biosciences and Engineering, 2020, 17(5): 5864-5882. doi: 10.3934/mbe.2020313
    [8] Shuai Cao, Biao Song . Visual attentional-driven deep learning method for flower recognition. Mathematical Biosciences and Engineering, 2021, 18(3): 1981-1991. doi: 10.3934/mbe.2021103
    [9] Wenbo Yang, Wei Liu, Qun Gao . Prediction of dissolved oxygen concentration in aquaculture based on attention mechanism and combined neural network. Mathematical Biosciences and Engineering, 2023, 20(1): 998-1017. doi: 10.3934/mbe.2023046
    [10] Xiwen Qin, Dongmei Yin, Xiaogang Dong, Dongxue Chen, Shuang Zhang . Survival prediction model for right-censored data based on improved composite quantile regression neural network. Mathematical Biosciences and Engineering, 2022, 19(8): 7521-7542. doi: 10.3934/mbe.2022354
  • In this paper, we propose a strategy for the selection of the hidden layer size in feedforward neural network models. The procedure herein presented is based on comparison of different models in terms of their out of sample predictive ability, for a specified loss function. To overcome the problem of data snooping, we extend the scheme based on the use of the reality check with modifications apt to compare nested models. Some applications of the proposed procedure to simulated and real data sets show that it allows to select parsimonious neural network models with the highest predictive accuracy.


    [1] J. Chem. Inf. Comput. Sci., 42 (2002), 903-911.
    [2] IEEE Trans. Inform. Theory, 39 (1993), 930-945.
    [3] International Journal of Software Engineering and its Applications, 4 (2010), 23-33.
    [4] in Artificial Neural Networks: Methods and Applications (ed. D. J. Livingstone), Methods in Molecular Biology, Vol. 458, Humana Press, Totowa N.J., 2009, 1-13.
    [5] IEEE Trans. Inform. Theory, 45 (1999), 682-691.
    [6] Bioinformatics, 16 (2010), 1062-1072.
    [7] J. Bus. Econom. Statist., 30 (2012), 53-66.
    [8] J. Econometrics, 170 (2012), 1-14.
    [9] in Handbook of Economic Forecasting, Vol. 1 (eds. G. Elliott, C. W. J. Granger and A. Timmermann), North-Holland, 2006, 197-284.
    [10] Academic Press, London, 1996.
    [11] Technometrics, 40 (1998), 273-282.
    [12] Journal Econometrics, 122 (2004), 47-79.
    [13] Ann. Statist., 19 (1991), 1-141.
    [14] 2nd edition, Springer, 2008.
    [15] in Artificial Neural Networks: Methods and Applications (ed. D. J. Livingstone), Methods in Molecular Biology Series, Vol. 458, Humana Press, Totowa, NJ, 2009, 78-118.
    [16] Neural Computation, 6 (1994), 1262-1275.
    [17] J. Amer. Statist. Assoc., 92 (1997), 748-757.
    [18] Breast Cancer Research and Treatment, 120 (2010), 83-93.
    [19] Comput. Statist. Data Anal., 48 (2005), 415-429.
    [20] in Computational Intelligence and Bioinspired Systems (eds. J. Cabestany, A. Prieto and F. Sandoval), Lecture Notes in Computer Science, Vol. 3512, Springer, Berlin-Heidelberg, 2005, 200-207.
    [21] in Computational Methods in Financial Engineering: Essays in Honour of Manfred Gili, Part II (eds. E. Kontoghiorghes, B. Rustem and P. Winker), Springer, Berlin-Heidelberg, 2008, 163-189.
    [22] Journal of Applied Econometrics, 10 (1995), 347-364.
    [23] J. Approx. Theory, 85 (1996), 98-109.
    [24] Environment Protection Engineering, 36 (2010), 95-109.
    [25] in Proceedings of the 4th European Symposium on Artificial Neural Networks (ESANN96), Bruges, Belgium, April 24-26, 1996, 315-322.
    [26] European Journal of Operational Research, 132 (2001), 666-680.
    [27] Econometrica, 73 (2005), 1237-1282.
    [28] Ann. Statist., 6 (1978), 461-464.
    [29] Springer Series in Statistics, Springer-Verlag, New York, 1995.
    [30] Journal of Urology, 16 (1989), 1076-1083.
    [31] The Review of Economics and Statistics, 79 (1997), 540-550.
    [32] J. Chem. Comput. Sci., 36 (1996), 794-803.
    [33] Neural Computation, 8 (1996), 152-163.
    [34] IEEE Transactions on Biomedical Engineering, 57 (2010), 884-893.
    [35] Ann. Statist., 32 (2004), 494-499.
    [36] in Trends in Applied Intelligent Systems: 23rd International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2010, Cordoba, Spain, June 1-4, 2010, Proceedings, Part I, Lecture Notes in Computer Science, 6096, Springer-Verlag, Berlin-Heidelberg, 2010, 317-326.
    [37] Neural Computation, 1 (1989), 425-464.
    [38] Neural Networks, 3 (1990), 535-549.
    [39] Econometrica, 68 (2000), 1097-1126.
    [40] J. Chem. Inf. Comput. Sci., 41 (2001), 1218-1227.
  • This article has been cited by:

    1. Shapour Mohammadi, A new test for the significance of neural network inputs, 2018, 273, 09252312, 304, 10.1016/j.neucom.2017.08.007
    2. Jibin Wang, Ping Wang, Suping Wang, Automated detection of atrial fibrillation in ECG signals based on wavelet packet transform and correlation function of random process, 2020, 55, 17468094, 101662, 10.1016/j.bspc.2019.101662
    3. Weichao Xu, Fei Long, He Zhao, Yaobin Zhang, Dawei Liang, Luguang Wang, Keaton Larson Lesnik, Hongbin Cao, Yuxiu Zhang, Hong Liu, Performance prediction of ZVI-based anaerobic digestion reactor using machine learning algorithms, 2021, 121, 0956053X, 59, 10.1016/j.wasman.2020.12.003
    4. Michele La Rocca, Cira Perna, Opening the Black Box: Bootstrapping Sensitivity Measures in Neural Networks for Interpretable Machine Learning, 2022, 5, 2571-905X, 440, 10.3390/stats5020026
    5. Gahyun Baek, Changsoo Lee, Jinyoung Yoon, Machine learning approach for predicting anaerobic digestion performance and stability in direct interspecies electron transfer-stimulated environments, 2023, 193, 1369703X, 108840, 10.1016/j.bej.2023.108840
    6. Zehao Liu, Songxian Zeng, Xinglin Quan, Sandip K Mishra, Image Genetic Analysis and Application Research Based on QRFPR and Other Neural Network-Related SNP Loci, 2022, 2022, 2314-6141, 1, 10.1155/2022/5861928
    7. Haibin Yang, Zhidong Liu, Image recognition technology of crop diseases based on neural network model fusion, 2022, 32, 1017-9909, 10.1117/1.JEI.32.1.011202
    8. Chengxin Niu, Bin Li, Zhiwei Wang, Using artificial intelligence-based algorithms to identify critical fouling factors and predict fouling behavior in anaerobic membrane bioreactors, 2023, 687, 03767388, 122076, 10.1016/j.memsci.2023.122076
    9. Michele La Rocca, Cira Perna, 2023, 10.5772/intechopen.1002540
    10. Xiao Wang, Yanshi Zhang, Bo He, Jun Li, Tianci Yang, Haotian Sun, Qianqian Shao, Chunhua Xu, Deep learning algorithms in predicting Cr(VI) removal performance of S-ZVI: Models building and optimal parameters prediction, 2024, 330, 13835866, 125487, 10.1016/j.seppur.2023.125487
    11. Pu Zhang, Research on the Use of BIM Technology in Green Building Design Based on Neural Network Learning, 2024, 12, 2169-3536, 94784, 10.1109/ACCESS.2024.3421540
    12. Nasem Badreldin, Xiaodong Cheng, Ali Youssef, An Overview of Software Sensor Applications in Biosystem Monitoring and Control, 2024, 24, 1424-8220, 6738, 10.3390/s24206738
  • Reader Comments
  • © 2014 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(2325) PDF downloads(482) Cited by(12)

Article outline

Other Articles By Authors

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog