
Citation: Titin Siswantining, Alhadi Bustamam, Devvi Sarwinda, Saskya Mary Soemartojo, Moh. Abdul Latief, Elke Annisa Octaria, Anggrainy Togi Marito Siregar, Oon Septa, Herley Shaori Al-Ash, Noval Saputra. Correction: Triclustering method for finding biomarkers in human immunodeficiency virus-1 gene expression data[J]. Mathematical Biosciences and Engineering, 2023, 20(4): 7298-7301. doi: 10.3934/mbe.2023316
[1] | Titin Siswantining, Alhadi Bustamam, Devvi Sarwinda, Saskya Mary Soemartojo, Moh. Abdul Latief, Elke Annisa Octaria, Anggrainy Togi Marito Siregar, Oon Septa, Herley Shaori Al-Ash, Noval Saputra . Triclustering method for finding biomarkers in human immunodeficiency virus-1 gene expression data. Mathematical Biosciences and Engineering, 2022, 19(7): 6743-6763. doi: 10.3934/mbe.2022318 |
[2] | Ahmed Hammad, Mohamed Elshaer, Xiuwen Tang . Identification of potential biomarkers with colorectal cancer based on bioinformatics analysis and machine learning. Mathematical Biosciences and Engineering, 2021, 18(6): 8997-9015. doi: 10.3934/mbe.2021443 |
[3] | Ming-Xi Zhu, Tian-Yang Zhao, Yan Li . Insight into the mechanism of DNA methylation and miRNA-mRNA regulatory network in ischemic stroke. Mathematical Biosciences and Engineering, 2023, 20(6): 10264-10283. doi: 10.3934/mbe.2023450 |
[4] | Miguel Ángel Rodríguez-Parra, Cruz Vargas-De-León, Flaviano Godinez-Jaimes, Celia Martinez-Lázaro . Bayesian estimation of parameters in viral dynamics models with antiviral effect of interferons in a cell culture. Mathematical Biosciences and Engineering, 2023, 20(6): 11033-11062. doi: 10.3934/mbe.2023488 |
[5] | Fang Niu, Zongwei Liu, Peidong Liu, Hongrui Pan, Jiaxue Bi, Peng Li, Guangze Luo, Yonghui Chen, Xiaoxing Zhang, Xiangchen Dai . Identification of novel genetic biomarkers and treatment targets for arteriosclerosis-related abdominal aortic aneurysm using bioinformatic tools. Mathematical Biosciences and Engineering, 2021, 18(6): 9761-9774. doi: 10.3934/mbe.2021478 |
[6] | Yonghua Xue, Yiqin Ge . Construction of lncRNA regulatory networks reveal the key lncRNAs associated with Pituitary adenomas progression. Mathematical Biosciences and Engineering, 2020, 17(3): 2138-2149. doi: 10.3934/mbe.2020113 |
[7] | Zekun Xin, Yang Li, Lingyin Meng, Lijun Dong, Jing Ren, Jianlong Men . Elevated expression of the MYB proto-oncogene like 2 (MYBL2)-encoding gene as a prognostic and predictive biomarker in human cancers. Mathematical Biosciences and Engineering, 2022, 19(2): 1825-1842. doi: 10.3934/mbe.2022085 |
[8] | Shuaiqun Wang, Tianshun Zhang, Wei Kong, Gen Wen, Yaling Yu . An improved MOPSO approach with adaptive strategy for identifying biomarkers from gene expression dataset. Mathematical Biosciences and Engineering, 2023, 20(2): 1580-1598. doi: 10.3934/mbe.2023072 |
[9] | Wei Niu, Lianping Jiang . A seven-gene prognostic model related to immune checkpoint PD-1 revealing overall survival in patients with lung adenocarcinoma. Mathematical Biosciences and Engineering, 2021, 18(5): 6136-6154. doi: 10.3934/mbe.2021307 |
[10] | Tingting Chen, Wei Hua, Bing Xu, Hui Chen, Minhao Xie, Xinchen Sun, Xiaolin Ge . Robust rank aggregation and cibersort algorithm applied to the identification of key genes in head and neck squamous cell cancer. Mathematical Biosciences and Engineering, 2021, 18(4): 4491-4507. doi: 10.3934/mbe.2021228 |
Triclustering method for finding biomarkers in human immunodeficiency virus-1 gene expression data by Titin Siswantining, Alhadi Bustamam, Devvi Sarwinda, Saskya Mary Soemartojo, Moh. Abdul Latief, Elke Annisa Octaria, Anggrainy Togi Marito Siregar, Oon Septa, Herley Shaori Al-Ash and Noval Saputra. Mathematical Biosciences and Engineering, 2022, 19(7):6743-6763. doi: 10.3934/mbe.2022318.
We have updated Subsection 2.2.2 especially about Multi Slope Measure (MSL) in pages 6747–6749. We've added citations for this section [1], rewritten the sentences, and changed some of the equation notation (Eqs (2.9)–(2.12)). Then, we change notation in Figure 2 and added source for Figures 2 and 3.
Understanding MSL requires a graphic representation of tricluster TRIxyz, where x, y, and z are one of gene G, condition C, and time or depth T so that element x on TRIxyz will be on the X-axis, and element y on TRIxyz will outline the panels that are elements z on TRIxyz as shown in Figure 2 [2]. MSL calculates the difference among the angles formed by every series traced on each of three representations, taking into account TRIgct, TRIgtc, and TRItgc on Figure 3. MSL accounts for the effect of adjacent points in time.
MSL measures the average difference between the angles formed by the probe ID gene in all rows and columns for each individual or candidate tricluster. To calculate the MSL measure of a tricluster, a multiangular ratio calculation is first performed. Define the FGmulti tricluster TRItgc as the average difference Δ from an angle vector avyz∈angset from all of the outlines y, for each panel p (Vmc), and the same for the rest of panels (Hmc), where Nmc is the number of differences that are formed. All the angular vectors of an outline y on panel z are defined as a set of angles formed by outline y, taking into account each data point on the X-axis. Each outline will have many (axis mark X–1) angles. The difference Δ between two vector angles avA and avB is defined as the average of MAX−MIN (MAX is the maximum, and MIN is the minimum of the two angles avA(i) and avB(i)) of any component or angle i from avA and avB.
FGmulti(TRIxyz)=Vmc+HmcNmc | (2.9) |
where:
angset={avy1z1,avy2z1,avy3z1,...,avy1z2,y1z3,...,avyUT,zAN}Vmc=∑angsetΔ(avyz,avnext(y)z)Hmc=∑angsetΔ(avyz,avynext(z))Nmc=|y|×|z|×(|y|+|z|−2)2avy={axi} where i=1,2,...,AX−1Δ(avA,avB)=∑i∈avA,avBMAX(avA(i),avB(i))−MIN(avA(i),avB(i))|avA,B| |
FGmulti is based on multiple operations with the avyz angle vector. These elements are obtained based on the concept of a series, so the series Syz from the outline y for each panel z is the set of value pairs from the X-axis (xi) and expression level (elj), which forms the outline. For each set Syz, the angle of alpha axi is the spin arctangent of the slope of the line formed by the points (xi,eli) and (xnext(i),elnext(i)). The spin operation from an angle is the positive equivalent of the angle if it is negative.
Syz={(x0,el0),...,(xAX,elL)} | (2.10) |
axi=spin(arctan(xnext(i)−xielnext(i)−eli)) | (2.11) |
spin(axi)=axi, if axi≥0 and spin(axi=axi+(2+π) if axi<0 |
To conclude, the MSL measure of a TRI tricluster as shown in Eq (2.12) is the mean of the angular comparisons of three graphical representations of a tricluster.
MSL(TRI)=13[FGmulti(TRIgct)+FGmulti(TRIgtc)+FGmulti(TRItgc)] | (2.12) |
In Section 2.5. Trigen Algorithm, we have rewritten the sentences for first and second paragraf, and changed some of the equations for pages 6753 and 6754. We also added citation for this section in [2].
Trigen algorithm is an algorithm based on the theory of evolution, genetic algorithm [1]. Genetic algorithm is an algorithm that aims to maximize a problem that will produce the best solution. Because in the Trigen algorithm we want to produce N-set triclusters, we need to do a genetic algorithm that is N-times.
For the Trigen algorithm, MSL results are added to the genetic algorithm fitness function FF(TRI) along with the individul size and overlap control [2]. MSL combined with six other factors to be a weighted average. The first three factors is 1−|TRIG||DG|, 1−|TRIC||DC|, and 1−|TRIT||DT| measure the number of genes, conditions, and time of TRI(TRIG,C,T) compared to the size of the dataset (|DG,C,T|). Because MSL minimizes the fitness function, therefore on these three factors are made 1- each proportion to produce a TRI with a larger size when the parameter wg, wc or wt is increased. The next three factors RG(TRI,SOL)|TRIG|×|SOLL|, RC(TRI,SOL)|TRIC|×|SOLL|, and RT(TRI,SOL)|TRIT|×|SOLL| measure the number of genes, conditions, and time or depth elements TRI on the set of solutions that have been found previously SOL(|TRIG,C,T|×|SOL|) to produce TRI with a small overlap as the wag, wac, or wat value increases. Finally, the main function MSL(TRI)2π measures the MSL(TRI) proportional value close to its maximum value of 2π to produce TRI with a small MSL value when the wf value is increased. The default total configuration value for wf, wg, wc, wt, wag, wac, wat is 1, with fix value for wf is 0.8 and the total others variables are 0.2.
FF(TRI)=(wf×MSL(TRI)2π+wg(1−|TRIG|DG)+wc(1−|TRIC|DC)+wt(1−|TRIT|DT)+wag×RG(TRI,SOL)|TRIG|×|SOL|+wac×Rc(TRI,SOL)|TRIC|×|SOL|+wat×RT(TRI,SOL)|TRIT|×|SOL|) | (2.16) |
[1] |
D. Gutiérrez-Avilés, C. Rubio-Escudero, F. Martínez-Álvarez, J. C. Riquelme, TriGen: A genetic algorithm to mine triclusters in temporal gene expression data, Neurocomputing, 132 (2014), 42–53. https://doi.org/10.1016/j.neucom.2013.03.061 doi: 10.1016/j.neucom.2013.03.061
![]() |
[2] | D. Gutiérrez-Avilés, C. Rubio-Escudero, MSL: a measure to evaluate three-dimensional patterns in gene expression data, Evol. Bioinf., 11 (2015), EBO-S25822. https://journals.sagepub.com/doi/full/10.4137/EBO.S25822 |