
There are two main factors involved in documents classification, document representation method and classification algorithm. In this study, we focus on document representation method and demonstrate that the choice of representation methods has impacts on quality of classification results. We propose a document representation strategy for supervised text classification named document representation based on global policy (DRGP), which can obtain an appropriate document representation according to the distribution of terms. The main idea of DRGP is to construct the optimization function through the importance of terms to different categories. In the experiments, we investigate the effects of DRGP on the 20 Newsgroups, Reuters21578 datasets, and using the SVM as classifier. The results show that the DRGP outperforms other text representation strategy schemes, such as Document Max, Document Two Max and global policy.
Citation: Longjia Jia, Bangzuo Zhang. A new document representation based on global policy for supervised term weighting schemes in text categorization[J]. Mathematical Biosciences and Engineering, 2022, 19(5): 5223-5240. doi: 10.3934/mbe.2022245
[1] | Yantao Song, Wenjie Zhang, Yue Zhang . A novel lightweight deep learning approach for simultaneous optic cup and optic disc segmentation in glaucoma detection. Mathematical Biosciences and Engineering, 2024, 21(4): 5092-5117. doi: 10.3934/mbe.2024225 |
[2] | Rafsanjany Kushol, Md. Hasanul Kabir, M. Abdullah-Al-Wadud, Md Saiful Islam . Retinal blood vessel segmentation from fundus image using an efficient multiscale directional representation technique Bendlets. Mathematical Biosciences and Engineering, 2020, 17(6): 7751-7771. doi: 10.3934/mbe.2020394 |
[3] | Yue Li, Hongmei Jin, Zhanli Li . A weakly supervised learning-based segmentation network for dental diseases. Mathematical Biosciences and Engineering, 2023, 20(2): 2039-2060. doi: 10.3934/mbe.2023094 |
[4] | Jianguo Xu, Cheng Wan, Weihua Yang, Bo Zheng, Zhipeng Yan, Jianxin Shen . A novel multi-modal fundus image fusion method for guiding the laser surgery of central serous chorioretinopathy. Mathematical Biosciences and Engineering, 2021, 18(4): 4797-4816. doi: 10.3934/mbe.2021244 |
[5] | Dehua Feng, Xi Chen, Xiaoyu Wang, Xuanqin Mou, Ling Bai, Shu Zhang, Zhiguo Zhou . Predicting effectiveness of anti-VEGF injection through self-supervised learning in OCT images. Mathematical Biosciences and Engineering, 2023, 20(2): 2439-2458. doi: 10.3934/mbe.2023114 |
[6] | Ran Zhou, Yanghan Ou, Xiaoyue Fang, M. Reza Azarpazhooh, Haitao Gan, Zhiwei Ye, J. David Spence, Xiangyang Xu, Aaron Fenster . Ultrasound carotid plaque segmentation via image reconstruction-based self-supervised learning with limited training labels. Mathematical Biosciences and Engineering, 2023, 20(2): 1617-1636. doi: 10.3934/mbe.2023074 |
[7] | Duolin Sun, Jianqing Wang, Zhaoyu Zuo, Yixiong Jia, Yimou Wang . STS-TransUNet: Semi-supervised Tooth Segmentation Transformer U-Net for dental panoramic image. Mathematical Biosciences and Engineering, 2024, 21(2): 2366-2384. doi: 10.3934/mbe.2024104 |
[8] | Zhenwu Xiang, Qi Mao, Jintao Wang, Yi Tian, Yan Zhang, Wenfeng Wang . Dmbg-Net: Dilated multiresidual boundary guidance network for COVID-19 infection segmentation. Mathematical Biosciences and Engineering, 2023, 20(11): 20135-20154. doi: 10.3934/mbe.2023892 |
[9] | Wenli Cheng, Jiajia Jiao . An adversarially consensus model of augmented unlabeled data for cardiac image segmentation (CAU+). Mathematical Biosciences and Engineering, 2023, 20(8): 13521-13541. doi: 10.3934/mbe.2023603 |
[10] | Jingyao Liu, Qinghe Feng, Yu Miao, Wei He, Weili Shi, Zhengang Jiang . COVID-19 disease identification network based on weakly supervised feature selection. Mathematical Biosciences and Engineering, 2023, 20(5): 9327-9348. doi: 10.3934/mbe.2023409 |
There are two main factors involved in documents classification, document representation method and classification algorithm. In this study, we focus on document representation method and demonstrate that the choice of representation methods has impacts on quality of classification results. We propose a document representation strategy for supervised text classification named document representation based on global policy (DRGP), which can obtain an appropriate document representation according to the distribution of terms. The main idea of DRGP is to construct the optimization function through the importance of terms to different categories. In the experiments, we investigate the effects of DRGP on the 20 Newsgroups, Reuters21578 datasets, and using the SVM as classifier. The results show that the DRGP outperforms other text representation strategy schemes, such as Document Max, Document Two Max and global policy.
To exemplify the phenomena of compounds scientifically, researchers utilize the contraption of the diagrammatic hypothesis, it is a well-known branch of geometrical science named graph theory. This division of numerical science provides its services in different fields of sciences. The particular example in networking [1], from electronics [2], and for the polymer industry, we refer to see [3]. Particularly in chemical graph theory, this division has extra ordinary assistance to study giant and microscope-able chemical compounds. For such a study, researchers made some transformation rules to transfer a chemical compound to a discrete pattern of shapes (graph). Like, an atom represents as a vertex and the covalent bonding between atoms symbolized as edges. Such transformation is known as molecular graph theory. A major importance of this alteration is that the hydrogen atoms are omitted. Some chemical structures and compounds conversion are presented in [4,5,6].
In cheminformatics, the topological index gains attraction due to its implementations. Various topological indices help to estimate a bio-activity and physicochemical characteristics of a chemical compound. Some interesting and useful topological indices for various chemical compounds are studied in [3,7,8]. A topological index modeled a molecular graph or a chemical compound into a numerical value. Since 1947, topological index implemented in chemistry [9], biology [10], and information science [11,12]. Sombor index and degree-related properties of simplicial networks [13], Nordhaus–Gaddum-type results for the Steiner Gutman index of graphs [14], Lower bounds for Gaussian Estrada index of graphs [15], On the sum and spread of reciprocal distance Laplacian eigenvalues of graphs in terms of Harary index [16], the expected values for the Gutman index, Schultz index, and some Sombor indices of a random cyclooctane chain [17,18,19], bounds on the partition dimension of convex polytopes [20,21], computing and analyzing the normalized Laplacian spectrum and spanning tree of the strong prism of the dicyclobutadieno derivative of linear phenylenes [22], on the generalized adjacency, Laplacian and signless Laplacian spectra of the weighted edge corona networks [23,24], Zagreb indices and multiplicative Zagreb indices of Eulerian graphs [25], Minimizing Kirchhoff index among graphs with a given vertex bipartiteness, [26], asymptotic Laplacian energy like invariant of lattices [27]. Few interesting studies regarding the chemical graph theory can be found in [28,29,30,31,32].
Recently, the researchers of [33] introduced a topological descriptor and called the face index. Moreover, the idea of computing structure-boiling point and energy of a structure, motivated them to introduced this parameter without heavy computation. They computed these parameters for different models compare the results with previous literature and found approximate solutions with comparatively less computations. This is all the blessings of face index of a graph. The major concepts of this research work are elaborated in the given below definitions.
Definition 1.1. [33] Let a graph G=(V(G),E(G),F(G)) having face, edge and vertex sets notation with F(G),E(G),V(G), respectively. It is mandatory that the graph is connected, simple and planar. If e from the edge set E(G), is one of those edges which surrounds a face, then the face f from the face set F(G), is incident to the edge e. Likewise, if a vertex α from the vertex set V(G) is at the end of those incident edges, then a face f is incident to that vertex. This face-vertex incident relation is symbolized here by the notation α∼f. The face degree of f in G is described as d(f)=∑α∼fd(α), which are elaborated in the Figure 1.
Definition 1.2. [33] The face index FI(G), for a graph G, is formulated as
FI(G)=∑f∈F(G)d(f)=∑α∼f,f∈F(G)d(α). |
In the Figure 1, we can see that there are two faces with degree 4, exactly two with five count and four with count of 6. Moreover, there is an external face with count of face degree 28, which is the count of vertices.
As the information given above that the face index is quite new and introduced in the year 2020, so there is not so much literature is available. A few recent studies on this topic are summarized here. A chemical compound of silicon carbides is elaborated with such novel definition in [34]. Some carbon nanotubes are discussed in [35]. Except for the face index, there are distance and degree-based graphical descriptors available in the literature. For example, distance-based descriptors of phenylene nanotube are studied in [36], and in [37] titania nanotubes are discussed with the same concept. Star networks are studied in [38], with the concept of degree-based descriptors. Bounds on the descriptors of some generalized graphs are discussed in [39]. General Sierpinski graph is discussed in [40], in terms of different topological descriptor aspects. The study of hyaluronic acid-doxorubicin ar found in [41], with the same concept of the index. The curvilinear regression model of the topological index for the COVID-19 treatment is discussed in [42]. For further reading and interesting advancements of topological indices, polynomials of zero-divisor structures are found in [43], zero divisor graph of commutative rings [44], swapped networks modeled by optical transpose interconnection system [45], metal trihalides network [46], some novel drugs used in the cancer treatment [47], para-line graph of Remdesivir used in the prevention of corona virus [48], tightest nonadjacently configured stable pentagonal structure of carbon nanocones [49]. In order to address a novel preventive category (P) in the HIV system known as the HIPV mathematical model, the goal of this study is to offer a design of a Morlet wavelet neural network (MWNN) [50].
In the next section, we discussed the newly developed face index or face-based index for different chemical compounds. Silicate network, triangular honeycomb network, carbon sheet, polyhedron generalized sheet, and generalized chain of silicate network are studied with the concept of the face-based index. Given that the face index is more versatile than vertex degree-based topological descriptors, this study will aid in understanding the structural characteristics of chemical networks. Only the difficulty authors will face to compute the face degree of a generalized network or structure, because it is more generalized version and taking degree based partition of edges into this umbrella of face index.
Silicates are formed when metal carbonates or metal oxides react with sand. The SiO4, which has a tetrahedron structure, is the fundamental chemical unit of silicates. The central vertex of the SiO4 tetrahedron is occupied by silicon ions, while the end vertices are occupied by oxygen ions [51,52,53]. A silicate sheet is made up of rings of tetrahedrons that are joined together in a two-dimensional plane by oxygen ions from one ring to the other to form a sheet-like structure. The silicate network SLn symbol, where n represents the total number of hexagons occurring between the borderline and center of the silicate network SLn. The silicate network of dimension one is depicted in Figure 2. It contain total 3n(5n+1) vertices are 36n2 edges. Moreover, the result required is detailed are available in Table 1.
Dimension | |f12| | |f15| | |f36| |
1 | 24 | 48 | 7 |
2 | 32 | 94 | 14 |
3 | 40 | 152 | 23 |
4 | 48 | 222 | 34 |
5 | 56 | 304 | 47 |
6 | 64 | 398 | 62 |
7 | 72 | 504 | 79 |
8 | 80 | 622 | 98 |
. | . | . | . |
. | . | . | . |
. | . | . | . |
n | 8n+16 | 6n2+28n+14 | n2+4n+2 |
Theorem 2.1. Let SLn be the silicate network of dimension n≥1. Then the face index of SLn is
FI(SLn)=126n2+720n+558. |
Proof. Consider SLn the graph of silicate network with dimension n. Suppose fi denotes the faces of graph SLn having degree i. that is, d(fi)=∑α∼fid(α)=i and |fi| denotes the number of faces with degree i. The graph SLn contains three types of internal faces f12, f15, f36, and single external face which is usually denoted by f∞.
If SLn has one dimension then sum of degree of vertices incident to the external face is 144 and when SLn has two dimension then sum of degree of incident vertices to the external face is 204 whenever SLn has three dimension then sum of degree of incident vertices to the external face is 264. Similarly, SLn has n−dimension then sum of degree of incident vertices to the external face is 60n+84.
The number of internal faces with degree in each dimension is mentioned in Table 1.
By using the definition of face index FI we have
FI(SLn)=∑α∼f∈F(SLn)d(α)=∑α∼f12∈F(SLn)d(α)+∑α∼f15∈F(SLn)d(α)+∑α∼f36∈F(SLn)d(α)+∑α∼f∞∈F(SLn)d(α)=|f12|(12)+|f15|(15)+|f36|(36)+(60n+84)=(8n+16)(12)+(6n2+28n+14)(15)+(n2+4n+2)(36)+60n+84=126n2+72n+558. |
Hence, this is our required result.
A chain silicate network of dimension (m,n) is symbolized as CSL(m,n) which is made by arranging (m,n) tetrahedron molecules linearly. A chain silicate network of dimension (m,n) with m,n≥1 where m denotes the number of rows and each row has n number of tetrahedrons. The following theorem formulates the face index FI for chain silicate network.
Theorem 2.2. Let CSL(m,n) be the chain of silicate network of dimension m,n≥1. Then the face index FI of the graph CSL(m,n) is
FI(CSL(m,n))={48n−12if m=1, n≥1;96m−12if n=1, m≥2;168m−60if n=2,m≥2;45m−9n+36mn−42if both m,n are even45m−9n+36mn−21otherwise. |
Proof. Let CSL(m,n) be the graph of chain silicate network of dimension (m,n) with m,n≥1 where m represents the number of rows and n is the number of tetrahedrons in each row. A graph CSL(m,n) for m=1 contains three type of internal faces f9, f12 and f15 with one external face f∞. While for m≥2, it has four type of internal faces f9, f12, f15 and f36 with one external face f∞. We want to evaluate the algorithm of face index FI for chain silicate network. We will discuss it in two different cases.
Case 1: When CSL(m,n) has one row (m=1) with n number of tetrahedrons as shown in the Figure 3.
The graph has three type of internal faces f9, f12 and f15 with one external face f∞. The sum of degree of incident vertices to the external face is 9n and number of faces are |f9|=2, |f12|=2n and |f15|=n−2. Now the face index FI of the graph CSL(m,n) is given by
FI(CSL(m,n))=∑α∼f∈F(CSL(m,n))d(α)=∑α∼f9∈F(CSL(m,n))d(α)+∑α∼f12∈F(CSL(m,n))d(α)+∑α∼f15∈F(CSL(m,n))d(α)+∑α∼f∞∈F(CSL(m,n))d(α)=|f9|(9)+|f12|(12)+|f15|(15)+(9n)=(2)(9)+(2n)(12)+(n−2)(15)+9n=48n−12. |
Case 2: When CSL(m,n) has more than one rows (m≠1) with n number of tetrahedrons in each row as shown in the Figure 4.
The graph has four type of internal faces f9, f12, f15 and f36 with one external face f∞. The sum of degree of incident vertices to the external face is
∑α∼f∞∈F(CSL(m,n))d(α)={18mif n=1, m≥1;27mif n=2, m≥1;30m+15n−30if both m,n are even30m+15n−33otherwise. |
The number of faces are |f9|, |f12|, f15 and |f36| are given by
|f9|={2if m is odd3+(−1)nif m is even.|f12|={2(2m+n−1)if m is odd4(⌊n+12⌋+2m−1)if m is even|f15|=(3m−2)n−m|f36|={(m−12)(n−1)if m is odd(2n+(−1)n−14)(m−22)nif m is even. |
Now the face index FI of the graph CSL(m,n) is given by
FI(CSL(m,n))=∑α∼f∈F(CSL(m,n))d(α)=∑α∼f9∈F(CSL(m,n))d(α)+∑α∼f12∈F(CSL(m,n))d(α)+∑α∼f15∈F(CSL(m,n))d(α)+∑α∼f36∈F(CSL(m,n))d(α)+∑α∼f∞∈F(CSL(m,n))d(α)=|f9|(9)+|f12|(12)+|f15|(15)+|f36|(36)+∑α∼f∞∈F(CSL(m,n))d(α). |
After some mathematical simplifications, we can get
FI(CSL(m,n))={48n−12if m=196m−12if n=1,∀m168m−60if n=2,∀m45m−9n+36mn−42if both m,n are even45m−9n+36mn−21otherwise. |
There are three regular plane tessellations known to exist, each constituted from the same type of regular polygon: triangular, square, and hexagonal. The triangular tessellation is used to define the hexagonal network, which is extensively studied in [54]. A dimensioned hexagonal network THk has 3k2−3k+1 vertices and 9k2−15k+6 edges, where k is the number of vertices on one side of the hexagon. It has 2k−2 diameter. There are six vertices of degree three that are referred to as corner vertices. Moreover, the result required detailed are available in the Table 2.
Dimension | |f12| | |f14| | |f17| | |f18| |
1 | 6 | 0 | 0 | 0 |
2 | 6 | 12 | 12 | 12 |
3 | 6 | 24 | 24 | 60 |
4 | 6 | 36 | 36 | 144 |
5 | 6 | 48 | 48 | 264 |
6 | 6 | 60 | 60 | 420 |
7 | 6 | 72 | 72 | 612 |
8 | 6 | 84 | 84 | 840 |
. | . | . | . | . |
. | . | . | . | . |
. | . | . | . | . |
k | 6 | 12(k−1) | 12(k−1) | 18k2−42k+24 |
Theorem 2.3. Let THk be the triangular honeycomb network of dimension k≥1. Then the face index of graph THk is
FI(THk)=324k2−336k+102. |
Proof. Consider THk be a graph of triangular honeycomb network. The graph TH1 has one internal and only one external face while graph THk with k≥2, contains four types of internal faces f12, f14, f17, and f18 with one external face f∞.
For TH1 the sum of degree of incident vertices to the external face is 18 and in TH2 the sum of degree of incident vertices to the external face is 66. Whenever the graph TH3, the sum of degree of incident vertices to the external face is 114. Similarly, for THk has n−dimension then sum of degree of incident vertices to the external face is 48k−30.
The number of internal faces with degree in each dimension is given in Table 2.
By using the definition of face index FI we have
FI(THk)=∑α∼f∈F(THk)d(α)=∑α∼f12∈F(THk)d(α)+∑α∼f14∈F(THk)d(α)+∑α∼f17∈F(THk)d(α)+∑α∼f18∈F(THk)d(α)+∑α∼f∞∈F(THk)d(α)=|f12|(12)+|f14|(14)+|f17|(17)+|f18|(18)+(48k−30)=(6)(12)+(12(k−1))(14)+(12(k−1))(17)+(18k2−42k+24)(18)+48k−30=324k2−336k+102. |
Hence, this is our required result.
Given carbon sheet in the Figure 6, is made by grid of hexagons. There are few types of carbon sheets are given in [55,56]. The carbon sheet is symbolize as HCSm,n, where n represents the total number of vertical hexagons and m denotes the horizontal hexagons. It contain total 4mn+2(n+m)−1 vertices and 6nm+2m+n−2 edges. Moreover, the result required detailed are available in Tables 3 and 4.
Dimension m | |f15| | |f16| | |f18| | |f∞| |
2 | 3 | 2(n−1) | n−1 | 20n+7 |
Dimension m | |f15| | |f16| | |f17| | |f18| | |f∞| |
2 | 3 | 2(n−1) | 0 | n−1 | 20n+7 |
3 | 2 | 2n | 1 | 3(n−1) | 20n+17 |
4 | 2 | 2n | 3 | 5(n−1) | 20n+27 |
5 | 2 | 2n | 5 | 7(n−1) | 20n+37 |
6 | 2 | 2n | 7 | 9(n−1) | 20n+47 |
. | . | . | . | . | . |
. | . | . | . | . | . |
. | . | . | . | . | . |
m | 2 | 2n | 2m−5 | 2mn−2m−3n+3 | 20n+10m−13 |
Theorem 2.4. Let HCSm,n be the carbon sheet of dimension (m,n) and m,n≥2. Then the face index of HCSm,n is
FI(HCSm,n)={70n+2ifm=236mn−14−2(n−4m)ifm≥3. |
Proof. Consider HCSm,n be the carbon sheet of dimension (m,n) and m,n≥2. Let fi denotes the faces of graph HCSm,n having degree i, which is d(fi)=∑α∼fid(α)=i, and |fi| denotes the number of faces with degree i. A graph HCSm,n for a particular value of m=2 contains three types of internal faces f15, f16, f17 and f18 with one external face f∞. While for the generalize values of m≥3, it contain four types of internal faces f15, f16 and f17 with one external face f∞ in usual manner. For the face index of generalize nanotube, we will divide into two cases on the values of m.
Case 1: When HCSm,n has one row or HCS2,n.
A graph HCSm,n for a this particular value of m=2 contains three types of internal faces |f15|=3, |f16|=2(n−1) and |f18|=n−1 with one external face f∞. For the face index of carbon sheet, details are given in the Table 3. Now the face index FI of the graph NT2,n is given by
FI(HCS2,n)=∑α∼f∈F(HCS2,n)d(α)=∑α∼f15∈F(HCS2,n)d(α)+∑α∼f16∈F(HCS2,n)d(α)+∑α∼f18∈F(HCS2,n)d(α)+∑α∼f∞∈F(HCS2,n)d(α)=|f15|(15)+|f16|(16)+|f18|(18)+20n+7.=3(15)+2(n−1)(16)+(n−1)(18)+20n+7.=70n+2. |
Case 2: When HCSm,n has m≥3 rows.
A graph HCSm,n for generalize values of m≥3 contains four types of internal faces |f15|=2, |f16|=2n, |f17|=2m−5 and |f18|=2mn−2m−3n+3 with one external face f∞. For the face index of carbon sheet, details are given in the Table 4. Now the face index FI of the graph NTm,n is given by
FI(HCSm,n)=∑α∼f∈F(HCSm,n)d(α)=∑α∼f15∈F(HCSm,n)d(α)+∑α∼f16∈F(HCSm,n)d(α)+∑α∼f17∈F(HCSm,n)d(α)+∑α∼f18∈F(HCSm,n)d(α)+∑α∼f∞∈F(HCSm,n)d(α)=|f15|(15)+|f16|(16)+|f17|(17)+|f18|(18)+20n+10m−13.=36mn−2n+8m−14. |
Given structure of polyhedron generalized sheet of C∗28 in the Figure 7, is made by generalizing a C∗28 polyhedron structure which is shown in the Figure 8. This particular structure of C∗28 polyhedron are given in [57]. The polyhedron generalized sheet of C∗28 is as symbolize PHSm,n, where n represents the total number of vertical C∗28 polyhedrons and m denotes the horizontal C∗28 polyhedrons. It contain total 23nm+3n+2m vertices and 33nm+n+m edges. Moreover, the result required detailed are available in Tables 3 and 5.
m | |f14| | |f15| | |f16| | |f17| | |f18| | |f20| | |f35| |
1 | 2n+1 | 2 | 4n−2 | 0 | 0 | 2n−1 | 0 |
2 | 2n+2 | 2 | 8n−2 | 2 | 2n−2 | 4n−2 | 2n−1 |
3 | 2n+3 | 2 | 12n−2 | 4 | 4n−4 | 6n−3 | 4n−2 |
. | . | . | . | . | . | . | . |
. | . | . | . | . | . | . | . |
. | . | . | . | . | . | . | . |
m | 2n+m | 2 | 4mn−2 | 2m−2 | 2mn−2(m+n)+2 | 2mn−m | 2mn−(m+2n)+1 |
Theorem 2.5. Let PHSm,n be the polyhedron generalized sheet of C∗28 of dimension (m,n) and m,n≥1. Then the face index of PHSm,n is
FI(PHSm,n)=210mn−2(3m+5n). |
Proof. Consider PHSm,n be the polyhedron generalized sheet of C∗28 of dimension (m,n) and m,n≥1. Let fi denotes the faces of graph PHSm,n having degree i, which is d(fi)=∑α∼fid(α)=i, and |fi| denotes the number of faces with degree i. A graph PHSm,n for the generalize values of m,n≥1, it contain seven types of internal faces f14,f15,f16,f17,f18,f20 and f35 with one external face f∞ in usual manner. For the face index of polyhedron generalized sheet, details are given in the Table 5.
A graph PHSm,n for generalize values of m,n≥1 contains seven types of internal faces |f14|=2n+m, |f15|=2, |f16|=4nm−2, |f17|=2(m−1), |f18|=2nm−2(m+n)+2, |f20|=2nm−2mn−m, and |f35|=2mn−m−2n+1 with one external face f∞. Now the face index FI of the graph PHSm,n is given by
FI(PHSm,n)=∑α∼f∈F(PHSm,n)d(α)=∑α∼f14∈F(PHSm,n)d(α)+∑α∼f15∈F(PHSm,n)d(α)+∑α∼f16∈F(PHSm,n)d(α)+∑α∼f17∈F(PHSm,n)d(α)+∑α∼f18∈F(PHSm,n)d(α)+∑α∼f20∈F(PHSm,n)d(α)+∑α∼f35∈F(PHSm,n)d(α)+∑α∼f∞∈F(PHSm,n)d(α)=|f14|(14)+|f15|(15)+|f16|(16)+|f17|(17)+|f18|(18)+|f20|(20)+|f35|(35)+37m+68n−35.=210mn−6m−10n. |
With the advancement of technology, types of equipment and apparatuses of studying different chemical compounds are evolved. But topological descriptors or indices are still preferable and useful tools to develop numerical science of compounds. Therefore, from time to time new topological indices are introduced to study different chemical compounds deeply. In this study, we discussed a newly developed tool of some silicate type networks and generalized sheets, carbon sheet, polyhedron generalized sheet, with the face index concept. It provides numerical values of these networks based on the information of faces. It also helps to study physicochemical characteristics based on the faces of silicate networks.
M. K. Jamil conceived of the presented idea. K. Dawood developed the theory and performed the computations. M. Azeem verified the analytical methods, R. Luo investigated and supervised the findings of this work. All authors discussed the results and contributed to the final manuscript.
This work was supported by the National Science Foundation of China (11961021 and 11561019), Guangxi Natural Science Foundation (2020GXNSFAA159084), and Hechi University Research Fund for Advanced Talents (2019GCC005).
The authors declare that they have no conflicts of interest.
[1] | M. Lan, S. Sung, H. Low, C. Tan, A comparative study on term weighting schemes for text categorization, in Proceedings 2005 IEEE International Joint Conference on Neural Networks, 1 (2005), 546–551. https://doi.org/10.1109/IJCNN.2005.1555890 |
[2] |
X. Li, A. Zhang, C. Li, J. Ouyang, Y. Cai, Exploring coherent topics by topic modeling with term weighting, Inf. Process. Manage., 54 (2018), 1345–1358. https://doi.org/10.1016/j.ipm.2018.05.009 doi: 10.1016/j.ipm.2018.05.009
![]() |
[3] |
M. Lan, C. Tan, J. Su, Y. Lu, Supervised and traditional term weighting methods for automatic text categorization, IEEE Trans. Pattern Anal. Mach. Intell., 31 (2008), 721–735. https://doi.org/10.1109/TPAMI.2008.110 doi: 10.1109/TPAMI.2008.110
![]() |
[4] |
E. H. Han, G. Karypis, V. Kumar, Text Categorization Using Weight Adjusted K-Nearest Neighbor Classification, Proc. Pacific Asia Conf. Knowl. Discovery Data Min., (2001), 53–65. https://doi.org/10.1007/3-540-45357-1_9 doi: 10.1007/3-540-45357-1_9
![]() |
[5] |
X. Quan, W. Liu, B. Qiu, Term weighting schemes for question categorization, IEEE Trans. Pattern Anal. Mach. Intell., 33 (2010), 1009–1021. https://doi.org/10.1109/TPAMI.2010.154 doi: 10.1109/TPAMI.2010.154
![]() |
[6] |
A. I. Kadhim, Survey on supervised machine learning techniques for automatic text classification, Artif. Intell. Rev., 51 (2019), 273–292. https://doi.org/10.1007/s10462-018-09677-1 doi: 10.1007/s10462-018-09677-1
![]() |
[7] |
M. M. Michał, J. Protasiewicz, A recent overview of the state-of-the-art elements of text classification, Expert Syst. Appl., 106 (2018), 36–54. https://doi.org/10.1016/j.eswa.2018.03.058 doi: 10.1016/j.eswa.2018.03.058
![]() |
[8] | C. Liu, Y. Sheng, Z. Wei, Y. Yang, Research of text classification based on improved TF-IDF algorithm, in 2018 IEEE International Conference of Intelligent Robotic and Control Engineering (IRCE), 2018. https://doi.org/10.1109/IRCE.2018.8492945 |
[9] | Y. Ko, A study of term weighting schemes using class information for text classification, in Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval, 2012. https://doi.org/10.1145/2348283.2348453 |
[10] | M. Yurochkin, S. Claici, E. Chien, F. Mirzazadeh, J. Solomon, Hierarchical optimal transport for document representation, preprint, arXiv: abs/1906.10827. |
[11] |
W. Zhang, Y. Li, S. Wang, Learning document representation via topic-enhanced LSTM model, Knowl. Based Syst., 174 (2019), 194–204. https://doi.org/10.1016/J.KNOSYS.2019.03.007 doi: 10.1016/J.KNOSYS.2019.03.007
![]() |
[12] |
L. Li, B. Qin, W. Ren, T. Liu, Document representation and feature combination for deceptive spam review detection, Neurocomputing, 254 (2017), 33–41. https://doi.org/10.1016/j.neucom.2016.10.080 doi: 10.1016/j.neucom.2016.10.080
![]() |
[13] |
S. Deerwester, S. Dumais, G. Furnas, T. Landauer, R. Harshman, Indexing by latent semantic analysis, J. Am. Soc. Inf. Sci., 41 (1990), 391–407. https://doi.org/10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9 doi: 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9
![]() |
[14] |
D. M. Blei, A. Ng, M. I. Jordan, Latent dirichlet allocation, J. Mach. Learn. Res., 3 (2003), 993–1022. https://doi.org/10.1016/B978-0-12-411519-4.00006-9 doi: 10.1016/B978-0-12-411519-4.00006-9
![]() |
[15] |
T. Mikolov, K. Chen, G. Corrado, J. Dean, Efficient estimation of word representations in vector space, Comput. Sci., 2013. https://doi.org/10.48550/arXiv.1301.3781 doi: 10.48550/arXiv.1301.3781
![]() |
[16] | Q. V. Le, T. Mikolov, Distributed representations of sentences and documents, Int. Conf. Mach. Learn. PMLR, 2014. |
[17] |
F. Sebastiani, Machine learning in automated text categorization, ACM Comput. Surv. (CSUR), 34 (2002), 1–47. https://doi.org/10.1145/505282.505283 doi: 10.1145/505282.505283
![]() |
[18] | L. Jia, B. Zhang, Optimal document representation strategy for supervised term weighting schemes in automatic text categorization, in 2019 9th International Conference on Information and Social Science, 2019. |
[19] |
Y. Q. Miao, M. Kamel, Pairwise optimized Rocchio algorithm for text categorization, Pattern Recogn. Lett., 32 (2011), 375–382. https://doi.org/10.1016/j.patrec.2010.09.018 doi: 10.1016/j.patrec.2010.09.018
![]() |
[20] |
C. Deng, X. He, Manifold adaptive experimental design for text categorization, IEEE Trans. Knowl. Data Eng., 24 (2011), 707–719. https://doi.org/10.1109/TKDE.2011.104 doi: 10.1109/TKDE.2011.104
![]() |
[21] | L. Man, C. L. Tan, H. B. Low, Proposing a new term weighting scheme for text categorization, AAAI, 6 (2006). |
[22] |
M. Revanasiddappa, B. Harish, A new feature selection method based on intuitionistic fuzzy entropy to categorize text documents, Int. J. Interact. Multim. Artif. Intell, 5 (2018), 106–117. https://doi.org/10.9781/ijimai.2018.04.002 doi: 10.9781/ijimai.2018.04.002
![]() |
[23] |
M. Goudjil, M. Koudil, M. Bedda, N. Ghoggali, A novel active learning method using SVM for text classification, Int. J. Autom. Comput., 15 (2018), 290–298. https://doi.org/10.1007/S11633-015-0912-Z doi: 10.1007/S11633-015-0912-Z
![]() |
[24] |
M. Haddoud, A. Mokhtari, T. Lecroq, Saïd Abdeddaïm Combining supervised term-weighting metrics for SVM text classification with extended term representation, Knowl. Inf. Syst., 49 (2016), 909–931. https://doi.org/10.1007/s10115-016-0924-1 doi: 10.1007/s10115-016-0924-1
![]() |
[25] | A. McCallum, K. Nigam, A comparison of event models for naive bayes text classification, in Proceeding AAAI Workshop Learning for Text Categorization, 1998. |
[26] |
Y. Yang, An evaluation of statistical approaches to text categorization, Inf. Retr., 1 (2004), 69–90. https://doi.org/10.1023/A:1009982220290 doi: 10.1023/A:1009982220290
![]() |
[27] |
E. Leopold, J. Kindermann, Text categorization with support vector machines. How to represent texts in input space, Mach. Learn., 46 (2002), 423–444. https://doi.org/10.1023/A:1012491419635 doi: 10.1023/A:1012491419635
![]() |
[28] |
S. Lee, K. Seo, Intelligent fault diagnosis based on a hybrid multi-class support vector machines and case-based reasoning approach, J. Comput. Theor. Nanosci., 10 (2013), 1727–1734. https://doi.org/10.1166/JCTN.2013.3116 doi: 10.1166/JCTN.2013.3116
![]() |
[29] | C. C. Chang, C. J. Lin, LIBSVM: a library for support vector machines, ACM Trans. Intell. Syst. Technol. (TIST), 2 (2011), 27:1–27:27. https://doi.org/10.1145/1961189.1961199 |
[30] |
J. Zhang, L. Chen, G. Guo, Projected-prototype based classifier for text categorization, Knowl.-Based Syst., 49 (2013), 179–189. https://doi.org/10.1016/j.knosys.2013.05.013 doi: 10.1016/j.knosys.2013.05.013
![]() |
[31] |
F. Ren, M. G. Sohrab, Class-indexing-based term weighting for automatic text classification, Inf. Sci., 236 (2013), 109–125. https://doi.org/10.1016/j.ins.2013.02.029 doi: 10.1016/j.ins.2013.02.029
![]() |
[32] |
I. Alsmadi, G. K. Hoon, Term weighting scheme for short-text classification: Twitter corpuses, Neural Comput. Appl., 31 (2019), 3819–3831. https://doi.org/10.1007/s00521-017-3298-8 doi: 10.1007/s00521-017-3298-8
![]() |
[33] |
Y. Ko, New feature weighting approaches for speech-act classification, Pattern Recogn. Lett., 51 (2015), 107–111. https://doi.org/10.1016/j.patrec.2014.08.014 doi: 10.1016/j.patrec.2014.08.014
![]() |
1. | Yasunari Matsuzaka, Ryu Yashiro, Applications of Deep Learning for Drug Discovery Systems with BigData, 2022, 2, 2673-7426, 603, 10.3390/biomedinformatics2040039 | |
2. | Hao Wang, Guangmin Sun, Kun Zheng, Hui Li, Jie Liu, Yu Bai, Privacy protection generalization with adversarial fusion, 2022, 19, 1551-0018, 7314, 10.3934/mbe.2022345 | |
3. | Yu Li, Hao Liang, Guangmin Sun, Zifeng Yuan, Yuanzhi Zhang, Hongsheng Zhang, A Land Cover Background-Adaptive Framework for Large-Scale Road Extraction, 2022, 14, 2072-4292, 5114, 10.3390/rs14205114 | |
4. | Haiying Yuan, Mengfan Dai, Cheng Shi, Minghao Li, Haihang Li, A generative adversarial neural network with multi-attention feature extraction for fundus lesion segmentation, 2023, 43, 1573-2630, 5079, 10.1007/s10792-023-02911-y | |
5. | Xue Xia, Kun Zhan, Yuming Fang, Wenhui Jiang, Fei Shen, Lesion‐aware network for diabetic retinopathy diagnosis, 2023, 33, 0899-9457, 1914, 10.1002/ima.22933 | |
6. | Tiwalade Modupe Usman, Yakub Kayode Saheed, Adeyemi Abel Ajibesin, Augustine Shey Nsang, 2024, Ens5B-UNet for Improved Microaneurysms Segmentation in Retinal Images, 979-8-3503-5815-5, 1, 10.1109/SEB4SDG60871.2024.10629958 | |
7. | Huma Naz, Neelu Jyothi Ahuja, Rahul Nijhawan, Diabetic retinopathy detection using supervised and unsupervised deep learning: a review study, 2024, 57, 1573-7462, 10.1007/s10462-024-10770-x | |
8. | Joshua E. Mckone, Tryphon Lambrou, Xujiong Ye, James M. Brown, Weakly supervised pre-training for brain tumor segmentation using principal axis measurements of tumor burden, 2024, 6, 2624-9898, 10.3389/fcomp.2024.1386514 | |
9. | Tiwalade Modupe Usman, Adeyemi Abel Ajibesin, Yakub Kayode Saheed, Augustine Shey Nsang, 2023, GAPS-U-NET: Gating Attention And Pixel Shuffling U-Net For Optic Disc Segmentation In Retinal Images, 979-8-3503-5883-4, 1, 10.1109/ICMEAS58693.2023.10429873 |
Dimension | |f12| | |f15| | |f36| |
1 | 24 | 48 | 7 |
2 | 32 | 94 | 14 |
3 | 40 | 152 | 23 |
4 | 48 | 222 | 34 |
5 | 56 | 304 | 47 |
6 | 64 | 398 | 62 |
7 | 72 | 504 | 79 |
8 | 80 | 622 | 98 |
. | . | . | . |
. | . | . | . |
. | . | . | . |
n | 8n+16 | 6n2+28n+14 | n2+4n+2 |
Dimension | |f12| | |f14| | |f17| | |f18| |
1 | 6 | 0 | 0 | 0 |
2 | 6 | 12 | 12 | 12 |
3 | 6 | 24 | 24 | 60 |
4 | 6 | 36 | 36 | 144 |
5 | 6 | 48 | 48 | 264 |
6 | 6 | 60 | 60 | 420 |
7 | 6 | 72 | 72 | 612 |
8 | 6 | 84 | 84 | 840 |
. | . | . | . | . |
. | . | . | . | . |
. | . | . | . | . |
k | 6 | 12(k−1) | 12(k−1) | 18k2−42k+24 |
Dimension m | |f15| | |f16| | |f18| | |f∞| |
2 | 3 | 2(n−1) | n−1 | 20n+7 |
Dimension m | |f15| | |f16| | |f17| | |f18| | |f∞| |
2 | 3 | 2(n−1) | 0 | n−1 | 20n+7 |
3 | 2 | 2n | 1 | 3(n−1) | 20n+17 |
4 | 2 | 2n | 3 | 5(n−1) | 20n+27 |
5 | 2 | 2n | 5 | 7(n−1) | 20n+37 |
6 | 2 | 2n | 7 | 9(n−1) | 20n+47 |
. | . | . | . | . | . |
. | . | . | . | . | . |
. | . | . | . | . | . |
m | 2 | 2n | 2m−5 | 2mn−2m−3n+3 | 20n+10m−13 |
m | |f14| | |f15| | |f16| | |f17| | |f18| | |f20| | |f35| |
1 | 2n+1 | 2 | 4n−2 | 0 | 0 | 2n−1 | 0 |
2 | 2n+2 | 2 | 8n−2 | 2 | 2n−2 | 4n−2 | 2n−1 |
3 | 2n+3 | 2 | 12n−2 | 4 | 4n−4 | 6n−3 | 4n−2 |
. | . | . | . | . | . | . | . |
. | . | . | . | . | . | . | . |
. | . | . | . | . | . | . | . |
m | 2n+m | 2 | 4mn−2 | 2m−2 | 2mn−2(m+n)+2 | 2mn−m | 2mn−(m+2n)+1 |
Dimension | |f12| | |f15| | |f36| |
1 | 24 | 48 | 7 |
2 | 32 | 94 | 14 |
3 | 40 | 152 | 23 |
4 | 48 | 222 | 34 |
5 | 56 | 304 | 47 |
6 | 64 | 398 | 62 |
7 | 72 | 504 | 79 |
8 | 80 | 622 | 98 |
. | . | . | . |
. | . | . | . |
. | . | . | . |
n | 8n+16 | 6n2+28n+14 | n2+4n+2 |
Dimension | |f12| | |f14| | |f17| | |f18| |
1 | 6 | 0 | 0 | 0 |
2 | 6 | 12 | 12 | 12 |
3 | 6 | 24 | 24 | 60 |
4 | 6 | 36 | 36 | 144 |
5 | 6 | 48 | 48 | 264 |
6 | 6 | 60 | 60 | 420 |
7 | 6 | 72 | 72 | 612 |
8 | 6 | 84 | 84 | 840 |
. | . | . | . | . |
. | . | . | . | . |
. | . | . | . | . |
k | 6 | 12(k−1) | 12(k−1) | 18k2−42k+24 |
Dimension m | |f15| | |f16| | |f18| | |f∞| |
2 | 3 | 2(n−1) | n−1 | 20n+7 |
Dimension m | |f15| | |f16| | |f17| | |f18| | |f∞| |
2 | 3 | 2(n−1) | 0 | n−1 | 20n+7 |
3 | 2 | 2n | 1 | 3(n−1) | 20n+17 |
4 | 2 | 2n | 3 | 5(n−1) | 20n+27 |
5 | 2 | 2n | 5 | 7(n−1) | 20n+37 |
6 | 2 | 2n | 7 | 9(n−1) | 20n+47 |
. | . | . | . | . | . |
. | . | . | . | . | . |
. | . | . | . | . | . |
m | 2 | 2n | 2m−5 | 2mn−2m−3n+3 | 20n+10m−13 |
m | |f14| | |f15| | |f16| | |f17| | |f18| | |f20| | |f35| |
1 | 2n+1 | 2 | 4n−2 | 0 | 0 | 2n−1 | 0 |
2 | 2n+2 | 2 | 8n−2 | 2 | 2n−2 | 4n−2 | 2n−1 |
3 | 2n+3 | 2 | 12n−2 | 4 | 4n−4 | 6n−3 | 4n−2 |
. | . | . | . | . | . | . | . |
. | . | . | . | . | . | . | . |
. | . | . | . | . | . | . | . |
m | 2n+m | 2 | 4mn−2 | 2m−2 | 2mn−2(m+n)+2 | 2mn−m | 2mn−(m+2n)+1 |