Research article

Product evaluation through multi-criteria decision making based on fuzzy parameterized Pythagorean fuzzy hypersoft expert set

  • Received: 07 February 2022 Revised: 21 March 2022 Accepted: 24 March 2022 Published: 06 April 2022
  • MSC : 03B52, 03E72, 03E75

  • In many real-world decision-making situations, uncertain nature of parameters is to be discussed to have unbiased and reliable decisions. Most of the existing literature on fuzzy soft set and its related structures ignored the uncertain parametric attitudes. The concept of fuzzy parameterization is launched to tackle the limitations of existing soft set-like models. Several extensions have already been introduced by using the concept of fuzzy parameterization. In this research, a novel extension, fuzzy parameterized Pythagorean fuzzy hypersoft expert set is aimed to be characterized. This model is more flexible and reliable as compared to existing models because it addresses their insufficiencies for the consideration of multi-argument approximate function. With the entitlement of this function, it tackles the real-life scenarios where each attribute is meant to be further classified into its respective sub-attribute valued disjoint set. The characterization of fuzzy parameterized Pythagorean fuzzy hypersoft expert set is accomplished by employing theoretic, axiomatic and algorithmic approaches. In order to validate the proposed model, an algorithm is proposed to study its role in decision-making while dealing with real-world problem. Moreover, the proposed model is compared with the most relevant existing models to assess its advantageous aspects.

    Citation: Muhammad Ihsan, Muhammad Saeed, Alhanouf Alburaikan, Hamiden Abd El-Wahed Khalifa. Product evaluation through multi-criteria decision making based on fuzzy parameterized Pythagorean fuzzy hypersoft expert set[J]. AIMS Mathematics, 2022, 7(6): 11024-11052. doi: 10.3934/math.2022616

    Related Papers:

    [1] Aliyu Ismail Ishaq, Abdullahi Ubale Usman, Hana N. Alqifari, Amani Almohaimeed, Hanita Daud, Sani Isah Abba, Ahmad Abubakar Suleiman . A new Log-Lomax distribution, properties, stock price, and heart attack predictions using machine learning techniques. AIMS Mathematics, 2025, 10(5): 12761-12807. doi: 10.3934/math.2025575
    [2] Mohammed Alharbey, Turki Mohammed Alfahaid, Ousama Ben-Salha . Asymmetric volatility spillover between oil prices and regional renewable energy stock markets: A time-varying parameter vector autoregressive-based connectedness approach. AIMS Mathematics, 2023, 8(12): 30639-30667. doi: 10.3934/math.20231566
    [3] Pedro J. Gutiérrez-Diez, Jorge Alves-Antunes . Stock market uncertainty determination with news headlines: A digital twin approach. AIMS Mathematics, 2024, 9(1): 1683-1717. doi: 10.3934/math.2024083
    [4] Zhi Liu . Forecasting stock prices based on multivariable fuzzy time series. AIMS Mathematics, 2023, 8(6): 12778-12792. doi: 10.3934/math.2023643
    [5] Chenyang Hu, Yuelin Gao, Eryang Guo . Fractional portfolio optimization based on different risk measures in fuzzy environment. AIMS Mathematics, 2025, 10(4): 8331-8363. doi: 10.3934/math.2025384
    [6] Javed Hussain, Saba Shahid, Tareq Saeed . Pricing forward-start style exotic options under uncertain stock models with periodic dividends. AIMS Mathematics, 2024, 9(9): 24934-24954. doi: 10.3934/math.20241215
    [7] Yunjae Nam, Dongsun Lee . Efficient one asset replacement scheme for an optimized portfolio. AIMS Mathematics, 2022, 7(9): 15881-15903. doi: 10.3934/math.2022869
    [8] Hasnain Iftikhar, Murad Khan, Josué E. Turpo-Chaparro, Paulo Canas Rodrigues, Javier Linkolk López-Gonzales . Forecasting stock prices using a novel filtering-combination technique: Application to the Pakistan stock exchange. AIMS Mathematics, 2024, 9(2): 3264-3288. doi: 10.3934/math.2024159
    [9] Zifu Fan, Youpeng Tao, Wei Zhang, Kexin Fan, Jiaojiao Cheng . Research on open and shared data from government-enterprise cooperation based on a stochastic differential game. AIMS Mathematics, 2023, 8(2): 4726-4752. doi: 10.3934/math.2023234
    [10] Peng Yang . Robust optimal reinsurance-investment problem for $ n $ competitive and cooperative insurers under ambiguity aversion. AIMS Mathematics, 2023, 8(10): 25131-25163. doi: 10.3934/math.20231283
  • In many real-world decision-making situations, uncertain nature of parameters is to be discussed to have unbiased and reliable decisions. Most of the existing literature on fuzzy soft set and its related structures ignored the uncertain parametric attitudes. The concept of fuzzy parameterization is launched to tackle the limitations of existing soft set-like models. Several extensions have already been introduced by using the concept of fuzzy parameterization. In this research, a novel extension, fuzzy parameterized Pythagorean fuzzy hypersoft expert set is aimed to be characterized. This model is more flexible and reliable as compared to existing models because it addresses their insufficiencies for the consideration of multi-argument approximate function. With the entitlement of this function, it tackles the real-life scenarios where each attribute is meant to be further classified into its respective sub-attribute valued disjoint set. The characterization of fuzzy parameterized Pythagorean fuzzy hypersoft expert set is accomplished by employing theoretic, axiomatic and algorithmic approaches. In order to validate the proposed model, an algorithm is proposed to study its role in decision-making while dealing with real-world problem. Moreover, the proposed model is compared with the most relevant existing models to assess its advantageous aspects.



    Due to the dynamic nature of stock market volatility, researchers have to formulate different types of stock selection strategies in different periods. The stock selection strategy based on the traditional econometric model cannot dynamically modify the model parameters when the trend of data changed. Therefore, such strategies cannot effectively fit the dynamic trends of the stock market, and it is difficult to select high-quality stocks in the stock market [1]. With the development of artificial intelligence, big data, cloud computing technology, people gradually know the quantitative investment, which issues trading instructions through quantitative methods and computer programming. Currently, most of quantitative stock selection methods are still based on the simple scoring method and the multi-factor stock selection model of regression method. Therefore, the paper combines the random forest algorithm with strong ability to extract information through data with a multi-factor stock selection model, which makes the stock selection strategy more universal and adaptable.

    In 1987, Merton proposed the rotation effect that the strength and weakness of different types of stocks change with the market. Therefore, domestic and foreign experts have made extensive research about problems that whether there is a style rotation effect in the stock market and how to use the style rotation effect to make a reasonable investment. In foreign studies, Quigley and Sinquefield(2000) [2] studied the relationship between the investment style of institutional investors and their investment income. Froot and Melvyn (2004) [3] studied different types of investment styles from large-cap stocks and small-cap stocks, non-cyclical industries and cyclical industries. In domestic research, Xiao and Wang (2006) [4] empirically tested the momentum of Chinese stock market and pointed out that style momentum reflects the predictability of stock prices. Guo and Wang (2019) [5] used BP neural network to predict the style rotation of the Chinese stock market, and found that the BP neural network model has a better prediction effect on the medium-term style rotation. When judging which stocks have investment value, it is generally necessary to select a factor pool with a high correlation with stock returns as a feature of constructing the multi-factor stock selection model. Mohanram(2005) [6] selected eight indicators from the three aspects of profitability and cash flow performance, growth capacity and book value to market ratio, and based on these indicators to build an index that can affect the establishment of an investment portfolio. Pan (2011) [7] used 9 indicators such as PEG and book-to-market ratio as reference for stock selection. Guerard et al. (2015) [8] combines stock returns, book value, cash flow and sales volume, and momentum in the stock selection model to identify mispriced securities. DeMiguel et al. (2017) [9] based on the investor utility, found that six company characteristics can be used to predict stock returns. Kozak et al. (2019) [10] used PCA to reduce the dimensionality of factors and found that only a few principal component factors are needed to predict the cross-sectional stock returns. In recent years, with the advent of the era of artificial intelligence and the continuous improvement of machine learning algorithms, quantitative technology has gradually become an important technical force in the field of financial investment. The early portfolio investment can be traced back to the “mean variance” model proposed by Markowitz (1952), which has a milestone significance for quantitative investment. After nearly 70 years of development, the technology that combines machine learning algorithms with various stock picking investment theories has become more and more mature. Khaidem and Saha. (2016) [11] used the random forest algorithm to predict stock returns to minimize the risk of stock market investments. Guerard, J.B et al. (2016) [12] generates stock selection models based on robust regression techniques, with significant returns. Phillip (2018) [13] applied neural networks to stock selection strategies and compared it with traditional regression models, drawing the conclusion that deep learning is more effective for short-term prediction. Dai and Zhu (2020) [14] believe that mixing existing forecasting models can significantly improve prediction performance of stock returns. Dai and Zhou (2020) [15] combine the sum-of-the-parts method, non-negative economic constraint strategy, momentum of return prediction strategy, and three-sigma strategy to improve prediction performance of stock returns. From the existing literature, the existing stock selection strategies are mostly based on simple multi-factor scoring strategies, lacking the processing of high-dimensional and collinear factors. Therefore, this paper proposes a multi-factor stock selection model based on the rotation effect and further introduces the random forest algorithm, so that the model with enhanced data generalization ability can handle high-dimensional data. In the selection of factors, this paper proposes to dynamically select characteristic factors according to the factor contribution of the previous period, which can closely follow the market conditions and the prediction results may be more effective.

    The rotation of large-cap stock and small-cap stock refers to the phenomenon that the dominant styles in the large-cap stocks and small-cap stocks are alternately replaced. In general, the CSI 300 Index is used as the representative index of large-cap stocks, and the GEM index is the representative index of small-cap stocks. In order to identify the dominant style of the current period, this article chooses an indicator of the relative strength between the large-cap stocks and small-cap stocks, that is, the relative strength index of the index yield: RSI (Relative Strength Index), as shown below:

    RSIt=lnHS_indextHS_indext1lnGEM_indextGEM_indext1 (3.1)

    Where HS_indext is the price of the CSI 300 index in day t; GEM_indextis the price of the GEM index in day t; RSItis the logarithm of the excess return of CSI 300 relative to the GEM in day t; HS_indext/HS_indext1is a month yield of the CSI 300 Index, GEM_indext/GEM_indext1is a month yield of the GEM index; RSIis the difference between the logarithm of a month return rate of the CSI 300 index and the logarithm of a month return rate of the GEM index, being used to judge which style is dominant.

    Combined with the discussion of Guerard and Markowitz (2015)[8], the process of multi-factor stock selection is shown in the following Figure 1:

    Figure 1.  Multi-factor stock selection flow chart.

    Multi-factor stock selection strategies generally include single factor information coefficient IC test, multi-factor stock selection strategy construction and multi-factor stock selection strategy effect test. The single factor information coefficient IC test mainly includes the selection of candidate factors, the validity test of stock selection factors for the correlation analysis of the factors and the next yield, and the elimination of the factors with higher correlation between the factors. The effective but redundant factors are eliminated. The construction of multi-factor stock selection strategy includes scoring and regression methods. Here we use similar and regression methods, which use random forest algorithm to predict the next period of profit based on current factors. Finally, the effectiveness of the multi-factor stock selection strategy is tested according to the system back-testing.

    Bagging is one of the parallel integrated learning methods. Given a data set containing m samples, first randomly take a sample into the sampling set, and then put the sample back into the initial data set, so that the sample may still be selected at the next sampling. After m random sampling operations performed, a sample set containing m samples is obtained. Some samples in the initial training set appear multiple times in the sample set, and some never appear.

    Hoob(x)=argmaxYTt=1I(ht(x)=y)I(xDt) (3.2)

    Where Dt epresents the training sample set actually used by ht.Hoob(x) is the out-of-packet prediction of sample x, that is, only the predictions of the base learners that do not use x training on x.ht(x) represents a single decision tree classification mode. yrepresents the output variable.I() is an indicative function.

    The out-of-package estimates for Bagging generalization errors are:

    oob=1|D|(x,y)DI(Hoob(x)y) (3.3)

    Random forest is an extended variant of Bagging. Based on the base learner of decision tree to build Bagging integration, the random forest introduces random attribute selection in the training process of decision tree.

    The article assumes that there are a total of d attributes in the sample set. Each decision tree randomly selects a subset of k attributes from the attribute set. In general, take

    k=log2d (3.4)

    The schematic diagram of random forest is shown in Figure 2 below:

    Figure 2.  The schematic diagram of random forest.

    After each decision tree judges the corresponding category, the learner hi in the random forest will predict a marker from the category marker set {c1,c2,,cN}. The most common combination strategy is to use the voting method, including absolute majority voting, relative majority voting and weighted voting [16].

    ● Majority voting

    H(x)={cj,ifTi=1hji(x)>0.5Nk=1Ti=1hki(x)reject,otherwise.; (3.5)

    If a mark scores more than a half, the forecast is the mark; otherwise, the prediction is rejected.

    ● Plurality Voting

    H(x)=cargmaxjTt=1hji(x) (3.6)

    That is the mark that is predicted to be the most votes. If multiple tags have the highest number of votes at the same time, we randomly select one of them.

    ● Weighted Voting

    H(x)=cargmaxjTt=1wihji(x) (3.7)

    wi is the weight of hi, usuallywi0, Tt=1wi=1.

    Random forest contains many important parameters, here are a few more important parameters such as n_estimators, max_features and max_depth.

    N_estimators

    The number of trees in the forest. The larger the value of this parameter is, the better the model is. But when the parameter reaches a certain level, the accuracy of the random forest often does not rise but start to fluctuate. The larger the parameter value is, the greater the amount of calculation and memory are required, and the training time will become longer and longer.

    Max_features

    Maximum number of features to consider when dividing trees. Generally speaking, if the number of sample features is not large, we can consider all the number of features when dividing.

    Max_depth

    The maximum depth of the tree. In order to maintain the robustness of the random forest, branches exceeding the maximum depth will be cut off.

    Min_samples_split

    A node must contain at least the number of training samples. This node is not allowed to branch when the number of samples is insufficient.

    According to Markowitz (1952) [17], the objectives and constraints of the 30-stock minimum variance portfolio are as follows:

    min30i=130j=1XiXjσijst{E=30i=1Xiui30i=1Xi=10Xi1fori=1,,30 (3.8)

    Where Xi, Xj is the investment ratio of the ith and jth stocks, ui is the average return of the ith stock in one year (252 days), and σij is the covariance of the 252-day daily return data of ith and jth stocks.

    The research ideas in this article are based on the rotation of large-cap stock and small-cap stock in China's A-share market, and the change in stock price is based on the fundamental, technical and macro-level economic situation of listed companies. First of all, we will analyze the current situation of whether there is the rotation of large-cap stock and small-cap stock in China's A-share market, and judge the market's style trend by the magnitude of the rise in the previous 3 months of the CSI 300 index (large-cap stock) and the GEM index (small-cap stock). Style as a stock pool. Combining Guerard and Markowitz (2015) to calculate the information coefficient IC for the factor, then we make a correlation analysis between factors and returns, and between factors, and select the growth ability, profitability, quality of return, valuation and market value factors as the alternative factors from the factor bank, which are closely related to return rate. We fix the frequency of shifts, screen the ten most significant characteristic factors through the factor importance in the previous period, use the random forest to predict the stock return rate based on the ten characteristic factors, and then buy the stocks with the highest number of votes after voting through many decision trees, combined with Markowitz (1952 [17], 1959 [18], and 1987 [19]). The number of selected stocks is 30, and the purchase weight is based on the minimum risk of Markowitz effective boundary [19,20,21]. Finally, a simple stop-loss strategy is added to obtain the Shanghai Composite Index's five-day gain before the entire strategy. If it is less than- 0.1, it will be cleared, and the following stock purchase operations will not be performed.

    In this article, the previous stock data of this article is used as training data, using January 1, 2012 to April 1, 2020 as the interval for strategy stock selection. The frequency of position swaps is one month (position adjustment at the end of each month), and the 30 selected stocks are bought based on minimizing risk weights. The initial capital is set to 100,000, the daily buying rate is set to 0.1%, and the selling rate is set to 0.4% (including 0.3% stamp duty).

    Figure 3.  Research ideas.

    Although it is intuitively obvious that there is a phenomenon on rotation of large-cap stocks and small-cap stocks in China's A-share market, in order to further study the current situation of China's rotation of large-cap stocks and small-cap stocks, this article uses historical data to verify it. If the expected absolute value of the RSI index is too small, the variance is too large, the value has an upward or downward trend, and the value fluctuates around 0, we can judge whether A-share market has the rotation of large-cap stocks and small-cap stocks by that index.

    From the line chart of RSI, we can see that the index basically fluctuates around 0, and therefore conclude that A-share market has the rotation of large-cap stocks and small-cap stocks.

    Figure 4.  Line chart of RSI.

    Due to the following characteristics of financial data:

    ● The data distribution has the characteristics of “spikes and thick tails”;

    ● Data fluctuation has the characteristics of high fluctuation agammaegation.

    And this article assumes:

    ● The average (or median) RSI is equal to 0, indicating that although the market has a rotation effect, the overall return rate tends to 0;

    ● The larger the standard deviation of the RSI is, the greater the tendency of the market to invest in different styles in different periods is.

    In such cases, the standard parametric test is not applicable, so the non-parametric symbol test combined with the standard deviation of descriptive statistics is used to determine whether the market has a rotation effect.

    Based on the line chart of RSI, to verify the existence of the rotation of large-cap stocks and small-cap stocks in China's A-share market, we firstly calculate the expected value of the RSI within the interval. The closer to 0 the RSI expectation value is, the more significant the rotation effect is; If the absolute mean of RSI is larger, it indicates that the market bias of a certain style is greater, and the rotation effect is not significant at this time. Secondly, we calculate the standard deviation of the RSI within the interval. The larger the RSI standard deviation is, the greater the market volatility and the significant rotation effect is; If the RSI standard deviation is smaller, that is, the market volatility is small, there is no rotation basis.

    According to the above logic, this article conducts statistical analysis on RSI, the steps of symbol checking are as follows:

    ● Constructing null hypothesis and alternative hypothesis:

    Null hypothesis H0:Me=0;

    Alternative hypothesis H1:Me0

    ● Constructing hypothetical statistics Hypothetical statistics S+=ni=1Yi, S=ni=1Zi, in the formula:

    Yi=I{xi>0}={1,xi>00,xi0 (4.1)
    Zi=I{xi0}={1,xi00,xi>0 (4.2)

    ● Finding the P value

    According to the symbol test, the following results are obtained:

    Table 1.  Statistical analysis of RSI.
    Relative Strength Index Expected Value Standard Deviation Maximum Minimum
    RSI 0.0010 0.0153 0.0398 -0.0384

     | Show Table
    DownLoad: CSV

    As can be seen from the above table, the mean of RSI is 0.0010, which is close to 0, and the standard deviation is 0.0153. There is a large fluctuation, so it is sufficient to prove that there is the rotation of large-cap stocks and small-cap stocks in China's A-share market.

    In this article, the median of RSI is tested by non-parametric symbolic test. The test results are as follows:

    Table 2.  Symbol checklist.
    Inspection Type Statistics S+ Statistics S- P-values
    Sign Test 51 48 0.84

     | Show Table
    DownLoad: CSV

    The P-value of the symbol test is greater than 0.05, and the null hypothesis cannot be rejected, so the median of the RSI can be considered close to 0. According to the above statistical analysis results, A-share market has the rotation of large-cap stocks and small-cap stocks.

    The specific steps of the rotation strategy of large-cap stocks and small-cap stocks are as follows:

    ● Get the magnitude of the rise of GEM index and the CSI 300 index in the previous 3 months of the test.

    ● If the magnitude of the rise of both indexes are less than 0, the clearance is selected to increase the profit and reduce the risk of retracement.

    ● If the magnitude of the rise of both indexes are not less than 0:

    If the magnitude of the rise of the CSI 300 index is more than that of the GEM index, we select 30 stocks with the maximum market value from the CSI 300 index stocks.

    If the magnitude of the rise of the GEM index is more than that of the CSI 300 index, we select 30 stocks with the minimum market value from the GEM index stocks.

    ● Obtain the closing price of the first 252 days of the selected 30 stocks to calculate the daily rate of return, determine the weight of 30 stocks according to the minimum variance, and buy and hold according to the weights.

    ● Add a simple stop-loss strategy: Get the Shanghai Composite Index's five-day gain before the entire strategy, if it is less than -0.1, then the strategy clears the position, and does not perform the following stock purchase operations.

    The back-testing results are shown in Table 3 and Figure 5:

    Table 3.  Multi-disk rotation back-testing information table.
    Annualized Rate of Return Benchmark CSI 300 Index α β
    25.80% 5.80% 21.10% 0.52
    Information Ratio Maximum Withdrawal Annual Turnover Rate Sharpe Ratio
    0.71 45.1% 1314.21% 0.86

     | Show Table
    DownLoad: CSV
    Figure 5.  Back-up diagram of the rotation strategy of Large-cap stocks and Small-cap stocks.

    According to the rotation strategy of large-cap stocks and small-cap stocks, if the current large-cap stocks style dominates, the 60 stocks with the maximum market value will be selected from the CSI 300 index stocks as the forecast stock pool of the random forest. If the current small-cap stocks style dominates, 60 stocks with the minimum market value are selected from the GEM index stocks as the forecast stock pool of the random forest.

    By correlating between the 237 factors in the factor pool from June 1, 2010 to December 31, 2011, and the next return rate and between the factors, we select the factors with larger correlation coefficient with the rate of return and eliminate the factors with greater multicollinearity.

    rxky=kii=1(xkiˉxk)(yiˉy)kii=1(xkiˉxk)2ni=1(yiˉy)2 (5.1)
    rxkxj=kii=1(xkiˉxk)(xjiˉxj)kii=1(xkiˉxk)2jii=1(xjiˉxj)2 (5.2)

    rxky, rxjy represent the correlation coefficient between factor xk and yield y, factor xk and factor xj, respectively. Select the first 40 factors with the largest absolute value of the correlation coefficient, that is maxtop|rxky|. If |rxkxj|>0.8, only select the factor with the largest correlation coefficient with the yield, that is xs=max(rxky,rxkjy).

    Since there are many factors used for stock analysis in the market, not all factors have an impact on stock returns. According to the previous literature review, this paper combines “Analysis of Quantitative Stock Selection Based on Multi-Factor Model” (Xu, 2017) [22] with “Research on Multi-Factor Quantitative Stock Selection Based on Self-Attention Neural Network” (Zhang, 2020) [23] to select 35 factors in terms of quality, value, growth, momentum, technical indicator, sentiment and basic subject to measure the performance of stocks. Then according to the above steps, a total of 19 factors with high correlation with yield were selected. Then combine these 19 factors with some common growth ability factors, profitability and income quality factors, as well as valuation and market value factors, including 35 factors such as PE, PB, ROA, ROE, etc. The total of 54 factors are used as candidate factors shown in the following table:

    Table 4.  Factor classification.
    Classification Short name Factor name
    Technical indicators Mass Index Mass Index
    MTM Momentom Index
    EMV6 Ease of Movement Value
    Swing Index Swing Index
    Chaikin Volatility Chaikin Volatility
    Growth EGRO Five-year earnings growth
    Net Asset Grow Rate Net assets growth rate
    Net Profit Grow Rate Net profit growth rate
    Operating Revenue Grow Rate Operating revenue growth rate
    Total Profit Grow Rate Total profit growth rate
    Operating Profit Grow Rate Operating profit growth rate
    Financing Cash Grow Rate Net cash flow growth rate from financing activities
    Total Asset Grow Rate Total assets growth rate
    OperCash Grow Rate Net cash flow growth rate from operating activities
    SUOI Standardized unexpected gross income
    Momentum RC24 Rate of Change
    Aroon The number of periods that have elapsed since the price reached the highest and lowest recent values
    Aroon Down Aroon factor intermediate variable
    PLRC6 Price Linear Regression Coefficient
    EARNMOM Change tendency of net profit in the past eight quarters
    Value PE Price-earnings ratio
    CTOP Cash flow to price
    ETOP Earnings to price
    LFLO Natural logarithm of float market values
    LCAP Natural logarithm of total market values
    PB Price-to-book ratio
    PCF Price-to-cash-flow ratio
    PS Price-to-sales ratio
    TA2EV Assets to enterprise value
    Mood indicators ACD6 Accumulative Distribution
    VOL5 Turnover Rate
    TVSTD6 Turnover Value Standard Deviation
    VDEA Volume Difference Exponential Average
    VR Volume Ratio
    WVAD William's variable accumulation distribution
    BR Willingness index
    Klinger O scillator Volume swing indicator
    VSTD10 Volume Standard Deviation
    Quality DEGM Growth rate of gross income ratio
    ROA Return on assets
    ROE Return on equity
    Sale Service Cash To OR Sale service cash to operating revenues
    Admini Expense Rate Administrative expense rate
    NOCF To Operating NI Net cash flow from operating activities Ratio to net income
    Cash To Current Liability Cash to current liability
    Cash Rate Of Sales Cash rate of sales
    OperCash In To Current Liability Cash provided by operations to current liability
    CFO2EV Cash provided by operations to enterprise value
    NP To TOR Net profit to total operating revenues
    Operating Profit Ratio Operating profit ratio
    Net Profit Ratio Net profit ratio
    Gross Income Ratio Gross income ratio
    Basic Subjects ASSI Natural logarithm of total assets
    Index for each stock EPS Earnings per share

     | Show Table
    DownLoad: CSV

    Each time the position is changed, it is necessary to dynamically select the feature factor with the greater importance of the factor in the current period according to the previous market situation. First, the stocks in the stock pool are ranked by factors, and the top 20% and the bottom 20% stocks are selected to form two combinations; This article divides the average return difference of the two portfolios and the average return difference of the top 20% and the last 20% of all stocks in the stock pool, and the ratio is the factor contribution; The ten important characteristic factors with the largest factor contribution in the previous day for each back-testing day are selected. Each decision tree in the random forest predicts the stock price return rate through ten random factors, and opens (changes positions) to purchase the 30 stocks with the highest votes.

    ci=abs(IFiILiMFiMLi) (5.3)
    xk=max10(ci) (5.4)
    i=1,2,6,k=1,2,,10

    IFi is the average rate of return for the top 10% of the factor values of a certain factor of a certain index component, ILi is the average rate of return for the last 10% of the factor values of a certain factor of a certain index component, MFi is the average rate of return for the top 10% of the factor values of a certain factor in the market. MLi is the average rate of return for the last 10% of the factor values of a certain factor in the market. xk is the feature factor of the top 10 largest factor contributions selected from 46 candidate factors.

    If the data is missing, assign it a value of 0. Select 40% stocks before and after 60 stocks to eliminate the noise in the middle. Binarize the average return rate of the first five trading days of each stock. If it is greater than the average return rate of all selected stocks in the previous month, it is recorded as +1. If it is less than that, record it as -1. 10 feature factors with maximum factor contribution are used as input variables, and the average rise and fall after binarization are used as output labels. The strategy randomly divides the previous period factor data and output labels at the time of warehouse swap into 90% training set and 10% test set, outputs the training model whose test accuracy reaches a certain standard, and predicts the current rate of return of each constituent stock in the stock pool. Finally, select the 30 stocks with the highest yield prediction value, obtain the closing price of the first 252 days of the selected 30 stocks to calculate the daily yield, determine the weight of 30 stocks according to the minimum variance, and buy and hold according to the weights. Obtain the first five days of the Shanghai Composite Index's gains before the entire strategy. If it is less than -0.1, the strategy will clear the position and do not perform the following stock purchase operations. The overall idea of the strategy is as follows:

    Figure 6.  The overall idea of the combination strategy.

    Set the number of decision trees of random forest to 100,500 and 1000 respectively. The backtest results from January 1, 2012 to April 1, 2020 are shown in Table 5:

    Table 5.  Combination strategy back-testing information table.
    Number of decision trees annualized return Benchmark annualized return α β Information ratio Maximum drawdown Annualized turnover Sharp Rate
    100 17.80% 5.80% 13.10% 0.52 0.45 39.10% 1398.55% 0.54
    500 29.40% 5.80% 24.70% 0.53 0.77 44.20% 1439.53% 0.92
    1000 24.50% 5.80% 19.80% 0.53 0.64 39.30% 1496.74% 0.75

     | Show Table
    DownLoad: CSV

    The effect of combining the random forest of 500 decision and the rotation effect is better, and the back-testing results are shown in Figure 7:

    Figure 7.  Back-testing chart of combined strategy.

    The average accuracy of the random forest on the validation set is 0.66, and the line chart is drawn as shown in Figure 8:

    Figure 8.  Random forest accuracy.

    The back-testing index pairs of the two strategies are shown in Table 6:

    Table 6.  Back-testing table of two strategies.
    Strategy type annualized return α β Sharp Rate Maximum drawdown Annualized turnover
    Rotation strategy 25.80% 21.10% 0.52 0.86 45.10% 1314.21%
    Combination strategy 29.40% 24.70% 0.53 0.92 44.20% 1439.53%

     | Show Table
    DownLoad: CSV

    From the comparison of the back-testing results, the annualized return rate of the rotation strategy is 25.80%, and the combined strategy is 29.40%. The combined strategy effect is slightly higher than the rotation strategy effect. The β value of the rotation strategy is 0.52, and the β value of the combination strategy is 0.53, which indicates that the combination strategy is more sensitive to the systemic risk of the stock market. The Sharpe ratio of the rotation strategy is 0.86, and the Sharpe ratio of the combination strategy is 0.92. From the perspective of the excess return of unit risk, the rotation strategy has more advantages than the combination strategy. The maximum drawdown of the rotation strategy is 45.10%, and the maximum drawdown of the combination strategy is 44.20%, which indicates that the combined strategy has better risk control than the rotation strategy.

    Comparing the strategy of this article with the quantitative investment funds in the market, the three-year annualized return rate comparison is shown in Table 7:

    Table 7.  Back-testing table of two strategies.
    Strategy / fund company name annualized return Strategy / fund company name annualized return
    Rotation strategy 25.80% Cathay Juxin Value Advantage Hybrid Fund A 21.97%
    Combination strategy 29.40% Changsheng Medical Industry Quantitative Allocation of Stock Fund 21.72%
    Bocom Alpha Core Hybrid Fund 27.07% Cathay Juxin Value Advantage Hybrid Fund C 21.12%
    Agricultural Bank of China Strategic Hybrid Fund 23.10% Everbright Industry Rotation Hybrid Fund 19.31%

     | Show Table
    DownLoad: CSV

    It can be seen that the rotation strategy and the combination strategy of this article can outperform the quantitative investment funds in the market, indicating that the strategy of this article has the value of in-depth research.

    The quantitative stock selection strategy for the market periodic rotation effect and random forest constructed in this paper is improved on the basis of the traditional style stock selection and multi-factor stock selection strategy. The random forest algorithm in machine learning, to some extent overcome the subjectivity, blindness and speculation of investment. It is a rational investment with significant returns. The selection of characteristic factors in the random forest changes with market conditions, and the frequency of changing positions is appropriate. It keeps up with the market while ensuring transaction costs and avoids the lag of stock market information. The average prediction accuracy of random forest is 0.67, and empirical results show that the strategy based on the rotation effect and the random forest algorithm has a good performance. Therefore, the strategy of this paper has certain efficiency in stock selection of China's A-share market and has certain reference value.

    However, due to the limited level of research, the feature factor selection in this paper only roughly selected 54 important factors based on the analysis of correlation and collinearity among more than 200 stock market indicators. The random forest algorithm also does not use the rolling prediction method and loses some accuracy. In terms of risk control, the stop loss strategy is relatively simple and needs further research and optimization.

    Research project of Humanities and Social Sciences in general colleges and universities of Guangdong Province (2015WQNCX038); National University Student Innovation Training Project (201910592003); Guangdong provincial department of education young creative talents project (2016WQNCX046).

    All authors declare no conflicts of interest in this paper.



    [1] L. A. Zadeh, Fuzzy sets, Inf. Control, 8 (1965), 338–353. http://dx.doi.org/10.1016/S0019-9958(65)90241-X doi: 10.1016/S0019-9958(65)90241-X
    [2] D. Molodtsov, Soft set theory–first results, Comput. Math. Appl., 4 (1999), 19–31, http://dx.doi.org/10.1016/S0898-1221(99)00056-5 doi: 10.1016/S0898-1221(99)00056-5
    [3] N. Çaǧman, S. Karataş, Intuitionistic fuzzy soft set theory and its decision making, J. Intell. Fuzzy Syst., 24 (2013), 829–836. http://dx.doi.org/10.3233/IFS-2012-0601 doi: 10.3233/IFS-2012-0601
    [4] X. D. Peng, Y. Yang, J. Song, Y. Jiang, Pythagorean fuzzy soft set and its application, Comput. Eng., 41 (2015), 224–229.
    [5] F. Feng, Z. Xu, H. Fujita, M. Liang, Enhancing PROMETHEE method with intuitionistic fuzzy soft sets, Int. J. Intell. Syst., 35 (2020), 1071–1104. https://doi.org/10.1002/int.22235 doi: 10.1002/int.22235
    [6] Z. Chen, P. Liu, Z. Pei, An approach to multiple attribute group decision making based on linguistic intuitionistic fuzzy numbers, Int. J. Comput. Intell. Syst., 8 (2015), 747–760.
    [7] J. Liu, J. Mai, H. Li, B. Huang, Y. Liu, On three perspectives for deriving three-way decision with linguistic intuitionistic fuzzy information, Inform. Sci., 588 (2022), 350–380. https://doi.org/10.1016/j.ins.2021.12.072 doi: 10.1016/j.ins.2021.12.072
    [8] J. Liu, H. Li, B. Huang, Y. Liu, D. Liu, Convex combination-based consensus analysis for intuitionistic fuzzy three-way group decision, Inform. Sci., 574 (2021), 542–566. https://doi.org/10.1016/j.ins.2021.06.018 doi: 10.1016/j.ins.2021.06.018
    [9] F. Meng, J. Tang, H. Fujita, Linguistic intuitionistic fuzzy preference relations and their application to multi-criteria decision making, Inform. Fusion, 46 (2019), 77–90.
    [10] T. M. Athira, S. J. John, H. Garg, Entropy and distance measures of Pythagorean fuzzy soft sets and their applications, J. Intell. Fuzzy Syst., 37 (2019), 4071–4084. https://doi.org/10.3233/JIFS-190217 doi: 10.3233/JIFS-190217
    [11] T. M. Athira, S. J. John, H. Garg, A novel entropy measure of Pythagorean fuzzy soft sets, AIMS Math., 5 (2020), 1050–1061.
    [12] F. Feng, H. Fujita, M. I. Ali, R. R. Yager, X. Liu, Another view on generalized intuitionistic fuzzy soft sets and related multiattribute decision making methods, IEEE T. Fuzzy Syst., 27 (2018), 474–488.
    [13] F. Xiao, W. Ding, Divergence measure of Pythagorean fuzzy sets and its application in medical diagnosis, Appl. Soft Comput., 79 (2019), 254–267.
    [14] F. Xiao, CEQD: A complex mass function to predict interference effects, IEEE T. Cybernetics, 2021, 1–13. https://doi.org/10.1109/TCYB.2020.3040770
    [15] P. K. Maji, P. K. Biswas, A. Roy, Fuzzy soft sets, J. Fuzzy Math., 9 (2001), 589–602.
    [16] S. Alkhazaleh, A. R. Salleh, Soft expert sets, Adv. Decision Sci., 2011 (2011), 1–12. https://doi.org/10.1155/2011/757868 doi: 10.1155/2011/757868
    [17] M. Ihsan, M. Saeed, A. U. Rahman, A rudimentary approach to develop context for convexity cum concavity on soft expert set with some generalized results, Punjab Univ. J. Math., 53 (2021), 621–629. https://doi.org/10.52280/pujm.2021.530902 doi: 10.52280/pujm.2021.530902
    [18] S. Alkhazaleh, A. R. Salleh, Fuzzy soft expert set and its application, Appl. Math., 5 (2014), 1349–1368. https://doi.org/10.4236/am.2014.59127 doi: 10.4236/am.2014.59127
    [19] M. Ihsan, A. U. Rahman, M. Saeed, H. A. E. W. Khalifa, Convexity-cum-concavity on fuzzy soft expert set with certain properties, Int. J. Fuzzy Log. Intell. Syst., 21 (2021), 233–242. https://doi.org/10.5391/IJFIS.2021.21.3.233 doi: 10.5391/IJFIS.2021.21.3.233
    [20] N. Çaǧman, S. Karataş, Intuitionistic fuzzy soft set theory and its decision making, J. Intell. Fuzzy Syst., 24 (2013), 829–836. https://doi.org/10.3233/IFS-2012-0601 doi: 10.3233/IFS-2012-0601
    [21] F. Smarandache, Extension of soft set to hypersoft set, and then to plithogenic hypersoft set, Neutrosophic Sets Syst., 22 (2018), 168–170. https://doi.org/10.5281/zenodo.2838716 doi: 10.5281/zenodo.2838716
    [22] M. Saeed, A. U. Rahman, M. Ahsan, F. Smarandache, An inclusive study on fundamentals of hypersoft set, Brussel: Pons Publishing House, 2021.
    [23] M. Ahsan, M. Saeed, A. U. Rahman, A theoretical and analytical approach for fundamental framework of composite mappings on fuzzy hypersoft classes, Neutrosophic Sets Syst., 45 (2021), 268–285. https://doi.org/10.5281/zenodo.5486295 doi: 10.5281/zenodo.5486295
    [24] M. Saeed, M. K. Siddique, M. Ahsan, M. R. Ahmad, A. U. Rahman, A novel approach to the rudiments of hypersoft graphs, Brussel: Pons Publication House, 2021.
    [25] M. Saeed, M. Ahsan, A. U. Rahman, A novel approach to mappings on hypersoft classes with application, Brussel: Pons Publication House, 2021.
    [26] M. Saeed, M. Ahsan, A. U. Rahman, M. Saeed, A. Mehmood, An application of neutrosophic hypersoft mapping to diagnose brain tumor and propose appropriate treatment, J. Intell. Fuzzy Syst., 41 (2021), 1677–1699. https://doi.org/10.3233/JIFS-210482 doi: 10.3233/JIFS-210482
    [27] M. Saeed, A. U. Rahman, M. Arshad, A study on some operations and products of neutrosophic hypersoft graphs, J. Appl. Math. Comput., 67 (2021), 1–28. https://doi.org/10.1007/s12190-021-01614-w doi: 10.1007/s12190-021-01614-w
    [28] A. U. Rahman, M. Saeed, F. Smarandache, M. R. Ahmad, Development of hybrids of hypersoft set with complex fuzzy set, complex intuitionistic fuzzy set and complex neutrosophic set, Neutrosophic Sets Syst., 38 (2020), 335–354. https://doi.org/10.5281/zenodo.4300520 doi: 10.5281/zenodo.4300520
    [29] A. U. Rahman, M. Saeed, F. Smarandache, Convex and concave hypersoft sets with some properties, Neutrosophic Sets Syst., 38 (2020), 497–508. https://doi.org/10.5281/zenodo.4300580 doi: 10.5281/zenodo.4300580
    [30] A. U. Rahman, A. Hafeez, M. Saeed, M. R. Ahmad, U. Farwa, Development of rough hypersoft set with application in decision making for the best choice of chemical material, Brussel: Pons Publication House, 2021.
    [31] A. U. Rahman, M. Saeed, M. Arshad, A. Dhital, A novel approach to neutrosophic hypersoft graphs with properties, Neutrosophic Sets Syst., 46 (2021), 336–355. https://doi.org/10.5281/zenodo.5553538 doi: 10.5281/zenodo.5553538
    [32] A. U. Rahman, M. Saeed, A. Khalid, M. R. Ahmad, S. Ayaz, Decision-making application based on aggregations of complex fuzzy hypersoft set and development of interval-valued complex fuzzy hypersoft set, Neutrosophic Sets Syst., 46 (2021), 300–317. https://doi.org/10.5281/zenodo.5553532 doi: 10.5281/zenodo.5553532
    [33] A. U. Rahman, M. Saeed, A. Hafeez, Theory of bijective hypersoft set with application in decision making, Punjab Univ. J. Math., 53 (2021), 511–527. https://doi.org/10.5281/zenodo.4721481 doi: 10.5281/zenodo.4721481
    [34] M. Ihsan, A. U. Rahman, M. Saeed, Hypersoft expert set with application in decision making for recruitment process, Neutrosophic Sets Syst., 42 (2021), 191–207. https://doi.org/10.5281/zenodo.4711524 doi: 10.5281/zenodo.4711524
    [35] M. Ihsan, M. Saeed, A. U. Rahman, Fuzzy hypersoft expert set with application in decision making for the best selection of product, Neutrosophic Sets Syst., 46 (2021), 318–335. https://doi.org/10.5281/zenodo.5553534 doi: 10.5281/zenodo.5553534
    [36] N. Çaǧman, F. Çitak, S. Enginoglu, FP-soft set theory and its applications, Ann. Fuzzy Math. Inform., 2 (2011), 219–226.
    [37] S. Alkhazaleh, A. R. Salleh, Fuzzy soft expert set and its application, Appl. Math., 5 (2014), 1349–1368. https://doi.org/10.4236/am.2014.59127 doi: 10.4236/am.2014.59127
    [38] Y. Tella, A. Peter, S. B. Hosea, Fuzzy parameterized fuzzy soft set for multiple criteria decision process under multiple expert assessments, Int. J. Eng. Sci. Manag., 5 (2015), 180–185.
    [39] K. Zhu, J. Zhan, Fuzzy parameterized fuzzy soft sets and decision making, Int. J. Mach. Learn. Cyb., 7 (2016), 1207–1212. https://doi.org/10.1007/s13042-015-0449-z doi: 10.1007/s13042-015-0449-z
    [40] E. Sulukan, N. Çaǧman, T. Aydın, Fuzzy parameterized intuitionistic fuzzy soft sets and their application to a performance-based value assignment problem, J. New Theor., 29 (2019), 79–88.
    [41] M. Bashir, A. R. Salleh, Fuzzy parameterized soft expert set, Abstr. Appl. Anal., 2012 (2012), 1–15. https://doi.org/10.1155/2012/258361 doi: 10.1155/2012/258361
    [42] A. Hazaymeh, I. B. Abdullah, Z. Balkhi, R. Ibrahim, Fuzzy parameterized fuzzy soft expert set, Appl. Math. Sci., 6 (2012), 5547–5564.
    [43] G. Selvachandran, A. R. Salleh, Fuzzy parameterized intuitionistic fuzzy soft expert set theory and its application in decision making, Int. J. Soft Comput., 11 (2016), 52–63. https://doi.org/10.36478/ijscomp.2016.52.63 doi: 10.36478/ijscomp.2016.52.63
    [44] V. Chinnadurai, A. Arulselvam, Q-Pythagorean fuzzy soft expert set and its application in multi-criteria decision making process, J. Phys.: Conf. Ser., 1850 (2021).
    [45] A. U. Rahman, M. Saeed, S. Zahid, Application in decision making based on fuzzy parameterized hypersoft set theory, Asia Mathematika, 5 (2021), 19–27. https://doi.org/10.5281/zenodo.4721481 doi: 10.5281/zenodo.4721481
    [46] A. U. Rahman, M. Saeed, S. S. Alodhaibi, H. A. E. W. Khalifa, Decision making algorithmic approaches based on parameterization of neutrosophic set under hypersoft set environment with fuzzy, intuitionistic fuzzy and neutrosophic settings, CMES-Comp. Model. Eng., 128 (2021), 743–777. https://doi.org/10.32604/cmes.2021.016736 doi: 10.32604/cmes.2021.016736
    [47] A. U. Rahman, M. Saeed, A. Dhital, Decision making application based on neutrosophic parameterized hypersoft set theory, Neutrosophic Sets Syst., 41 (2021), 1–14. https://doi.org/10.5281/zenodo.4625665 doi: 10.5281/zenodo.4625665
    [48] A. Yolcu, F. Smarandache, T. Y. Öztürk, Intuitionistic fuzzy hypersoft sets, Commun. Fac. Sci. Univ., 70 (2021), 443–455. https://doi.org/10.31801/cfsuasmas.788329 doi: 10.31801/cfsuasmas.788329
    [49] S. Debnath, Fuzzy hypersoft sets and its weightage operator for decision making, J. Fuzzy Ext. Appl., 2 (2021), 163–170. https://doi.org/10.22105/JFEA.2021.275132.1083 doi: 10.22105/JFEA.2021.275132.1083
  • This article has been cited by:

    1. Lei Zhang, Yu Yin, Yi You, 2021, Chapter 27, 978-3-030-79205-3, 352, 10.1007/978-3-030-79206-0_27
    2. Hongzeng He, Shufen Dai, A prediction model for stock market based on the integration of independent component analysis and Multi-LSTM, 2022, 30, 2688-1594, 3855, 10.3934/era.2022196
    3. Tatyana Novikova, Svetlana Evdokimova, Gotsui Wu, Development of a quantitative investment algorithm based on Random Forest, 2022, 15, 2219-0767, 53, 10.12737/2219-0767-2022-15-4-53-60
    4. Yuqi Guo, Bingzhen Sun, Juncheng Bai, Jin Ye, Xiaoli Chu, A new portfolio approach integrating three-way decision and Encoder–Decoder network, 2024, 258, 09574174, 125233, 10.1016/j.eswa.2024.125233
  • Reader Comments
  • © 2022 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(1986) PDF downloads(65) Cited by(3)

Figures and Tables

Figures(10)  /  Tables(9)

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog