Review

Machine learning approach on healthcare big data: a review

  • Received: 30 June 2020 Accepted: 15 October 2020 Published: 29 October 2020
  • In the past few years, big data has flattering more dominant in healthcare, due to three major reasons, such as the huge amount of data available, expanding healthcare costs, and a target on personalized care. Big data processing in healthcare refers to generating, collecting, analyzing, and holding clinical data that is too vast or complex to be inferred by classical means of data processing methods. Big data sources for healthcare include, the Internet of Things (IoT), Electronic Medical Record/Electronic Health Record (EMR/EHR) contains patientos medical history, diagnoses, medications, treatment plans, allergies, laboratory and test results, genomic sequencing, Medical Imaging, Insurance Providers and other clinical data. This paper discusses different machine learning algorithms that were applied to various healthcare data. Also, the challenges of processing, handling big data, and their applications. The scope of the paper is to elaborate on the application of machine learning algorithms and the need for handling and utilizing big data from a different perspective.

    Citation: M Supriya, AJ Deepa. Machine learning approach on healthcare big data: a review[J]. Big Data and Information Analytics, 2020, 5(1): 58-75. doi: 10.3934/bdia.2020005

    Related Papers:

  • In the past few years, big data has flattering more dominant in healthcare, due to three major reasons, such as the huge amount of data available, expanding healthcare costs, and a target on personalized care. Big data processing in healthcare refers to generating, collecting, analyzing, and holding clinical data that is too vast or complex to be inferred by classical means of data processing methods. Big data sources for healthcare include, the Internet of Things (IoT), Electronic Medical Record/Electronic Health Record (EMR/EHR) contains patientos medical history, diagnoses, medications, treatment plans, allergies, laboratory and test results, genomic sequencing, Medical Imaging, Insurance Providers and other clinical data. This paper discusses different machine learning algorithms that were applied to various healthcare data. Also, the challenges of processing, handling big data, and their applications. The scope of the paper is to elaborate on the application of machine learning algorithms and the need for handling and utilizing big data from a different perspective.


    加载中


    [1] Manogaran G, Lopez D, Thota C, et al. (2017) Big data analytics in healthcare internet of things, In: Innovative Healthcare Systems for the 21st Century, Springer, Cham, 263-284.
    [2] Lopez D and Manogaran G, (2017) A survey of big data architectures and machine learning algorithms in healthcare. Int J Biomed Eng Technol 25: 182.
    [3] Khamlichi KY, Chaoui NEH and Khennou F, (2018) Improving the use of big data analytics within electronic health records: A case study based open HER. Procedia Compute Sci 127: 60-68.
    [4] Sharma M, Kaur P and Mittal M, (2018) Big data and machine learning-based secure healthcare framework. Procedia Compute Sci 132: 1049-1059.
    [5] Xu Y, Qiu J, Wu Q, et al. (2016) A survey of machine learning for big data processing. EURASIP J Adv Signal Proc 2016: 67.
    [6] Gerrero-Curieses A, Munoz-Romero S, Bote-Curiel L, et al. (2019) Deep learning and big data in healthcare: A double review for critical beginners. Appl Sci 9: 2331.
    [7] Hao Y, Hwang MCK, Wang L, et al. (2017) Disease prediction by machine learning over big data from healthcare communities. IEEE Access, 5: 8869-8879.
    [8] Zhang Y and Zheng T, (2017) A big data application of machine learning-based framework to identify type 2 diabetes through electronic health records, In: International Conference on Knowledge Management in Organizations, Springer, 451-458.
    [9] Luo G, (2016) Automatically explaining machine learning prediction results: a demonstration on type 2 diabetes risk prediction. Health Inf Sci Syst 4: 2.
    [10] You M, Yang G, Chen Y, et al. (2017) A machine learning-based framework to identify type 2 diabetes through electronic health records. Int J Med Inf 97: 120-127.
    [11] Sisodia DS and Sisodia D, (2018) Prediction of diabetes using classification algorithms. Procedia Comp Sci 132: 1578-1585.
    [12] Pereira N, Taslimitehrani V, Pathak J, et al. (2015) Using EHRs and machine learning for heart failure survival analysis. Stud Health Technol Inf 216: 40.
    [13] Stewart WF, Sun J, Choi E, et al. (2017) Using recurrent neural network models for early detection of heart failure onset. J Am Med Inf Assoc 24: 361-370.
    [14] Liu Z, Zhang S, Jin B, et al. (2018) Predicting the risk of heart failure with EHR sequential data modeling. IEEE Access 6: 9256-9261.
    [15] Calvert J, Hoffman J, Jay M, et al. (2016) Prediction of sepsis in the intensive care unit with minimal electronic health record data: A machine learning approach. JMIR Med Inf 4: e28.
    [16] Hall MK, Pare JR, Venkatesh AK, et al. (2015) Prediction of in hospital mortality in emergency department patients with sepsis: a local big data-driven, machine learning approach. Acad Emerg Med 23: 269-278.
    [17] Neapolitan R, Zexian SE, Roy A, et al. (2018) Using natural language processing and machine learning to identify breast cancer local recurrence. BMC Bioinfor 19: 65-74.
    [18] Garg S and Gupta P, (2020) Breast cancer prediction using varying parameters of machine learning models. Procedia Comput Sci 171: 593-601.
    [19] Alkhawaldeh RS, Al-Shami F and Al-Shargabi B, (2019) Enhancing multi-layer perception for breast cancer prediction. Int J Adv Sci Tech 130: 11-20.
    [20] Seenivasagam V and Suijitha R, (2020) Classification of lung cancer stages with machine learning over big data healthcare framework. J Ambient Intell Humanized Comput 2020.
    [21] Olugbara OO and Adetiba E, (2015) Lung cancer prediction using neural network ensemble with histogram of oriented gradient genomic features. Sci World J 2015: 786013.
    [22] Peter T, Delzell DAP, Smith M, et al. (2019) Machine learning and feature selection methods for disease classification with application to lung cancer screening image data. Front Oncol, 9: 1393.
    [23] Nartowt BJ, Hart GR, Deng J, et al. (2019) Stratifying ovarian cancer risk using personal health data. Front Big Data 2: 24.
    [24] Hassanat ABA, (2018) Furthest-pair-based binary search tree for speeding big data classification using K-nearest neighbors. Big Data 6: 225-235.
    [25] Hung JC, Lin KC, Zhang KY, et al. (2016) Feature selection based on an improved cat swarm optimization algorithm for big data classification. J Supercomput 72: 3210-3221.
    [26] Bei Y and Xing W, (2019) Medical health big data classification based on KNN classification algorithm. IEEE Access, 8: 28808-28819.
    [27] Li Y and Jiang C, (2019) Medical health big data classification based on KNN classification algorithm. IEEE Access, 7: 176782-176789.
    [28] Shah NH and Callahan A, (2017) Machine learning in healthcare, In: Key Advances in Clinical Informatics, Academic Press, 279-291.
  • Reader Comments
  • © 2020 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(10106) PDF downloads(1196) Cited by(17)

Article outline

Figures and Tables

Tables(3)

Other Articles By Authors

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog