Fuzzy Gaussian Lasso clustering with application to cancer data

Miin-Shen Yang; Wajid Ali; Miin-Shen Yang; Wajid Ali

doi:10.3934/mbe.2020014

Mathematical Biosciences and Engineering

2020, Volume 17, Issue 1: 250-265. doi: 10.3934/mbe.2020014

Previous Article Next Article

Research article Special Issues

Fuzzy Gaussian Lasso clustering with application to cancer data

Miin-Shen Yang ^,,
Wajid Ali

Department of Applied Mathematics, Chung Yuan Christian University, Chung-Li 32023, Taiwan

Received: 23 May 2019 Accepted: 05 September 2019 Published: 30 September 2019

Recently, Yang et al. (2019) proposed a fuzzy model-based Gaussian (F-MB-Gauss) clustering that combines a model-based Gaussian with fuzzy membership functions for clustering. In this paper, we further consider the F-MB-Gauss clustering with the least absolute shrinkage and selection operator (Lasso) for feature (variable) selection, termed a fuzzy Gaussian Lasso (FG-Lasso) clustering algorithm. We demonstrate that the proposed FG-Lasso is a good clustering algorithm with better choice for feature subset selection. Experimental results and comparisons actually present these good aspects of the proposed FG-Lasso clustering algorithm. Cancer is a disease with growth of abnormal cells in a body. WHO reported that it is the first or second main leading cause of death. It spreads and affects the other parts of body if there is not properly diagnosed. In the paper, we apply the proposed FG-Lasso to cancer data with good feature selection and clustering results.

Keywords:

fuzzy sets,
model-based clustering,
fuzzy model-based Gaussian,
feature selection,
Lasso,
Fuzzy Gaussian Lasso (FG-Lasso) clustering

Citation: Miin-Shen Yang, Wajid Ali. Fuzzy Gaussian Lasso clustering with application to cancer data[J]. Mathematical Biosciences and Engineering, 2020, 17(1): 250-265. doi: 10.3934/mbe.2020014

Related Papers:

[1]	D. Marene Larruskain, Inmaculada Zamora, Oihane Abarrategui, Garikoitz Buigues, Víctor Valverde, Araitz Iturregi . Adapting AC Lines to DC Grids for Large-Scale Renewable Power Transmission. AIMS Energy, 2014, 2(4): 385-398. doi: 10.3934/energy.2014.4.385
[2]	Arben Gjukaj, Rexhep Shaqiri, Qamil Kabashi, Vezir Rexhepi . Renewable energy integration and distributed generation in Kosovo: Challenges and solutions for enhanced energy quality. AIMS Energy, 2024, 12(3): 686-705. doi: 10.3934/energy.2024032
[3]	Chukwuebuka Okafor, Christian Madu, Charles Ajaero, Juliet Ibekwe, Happy Bebenimibo, Chinelo Nzekwe . Moving beyond fossil fuel in an oil-exporting and emerging economy: Paradigm shift. AIMS Energy, 2021, 9(2): 379-413. doi: 10.3934/energy.2021020
[4]	Victoria Gartman, Kathrin Wichmann, Lea Bulling, María Elena Huesca-Pérez, Johann Köppel . Wind of Change or Wind of Challenges: Implementation factors regarding wind energy development, an international perspective. AIMS Energy, 2014, 2(4): 485-504. doi: 10.3934/energy.2014.4.485
[5]	Albert K. Awopone, Ahmed F. Zobaa . Analyses of optimum generation scenarios for sustainable power generation in Ghana. AIMS Energy, 2017, 5(2): 193-208. doi: 10.3934/energy.2017.2.193
[6]	Ashebir Dingeto Hailu, Desta Kalbessa Kumsa . Ethiopia renewable energy potentials and current state. AIMS Energy, 2021, 9(1): 1-14. doi: 10.3934/energy.2021001
[7]	Gustavo Henrique Romeu da Silva, Andreas Nascimento, Christoph Daniel Baum, Nazem Nascimento, Mauro Hugo Mathias, Mohd Amro . Renewable energy perspectives: Brazilian case study on green hydrogen production. AIMS Energy, 2025, 13(2): 449-470. doi: 10.3934/energy.2025017
[8]	Fazri Amir, Hafiz Muhammad, Nasruddin A. Abdullah, Samsul Rizal, Razali Thaib, Hamdani Umar . Performance analysis of heat recovery in Heat Pipe Heat Exchanger on room air conditioning systems. AIMS Energy, 2023, 11(4): 612-627. doi: 10.3934/energy.2023031
[9]	María del P. Pablo-Romero, Rafael Pozo-Barajas . Global changes in total and wind electricity (1990–2014). AIMS Energy, 2017, 5(2): 290-312. doi: 10.3934/energy.2017.2.290
[10]	Surender Reddy Salkuti . Sustainable energy technologies for emerging renewable energy and electric vehicles. AIMS Energy, 2024, 12(6): 1264-1270. doi: 10.3934/energy.2024057

Abstract

1. Introduction

1.1. Background

International as well as national environmental and energy policies have resulted in a boost of bioenergy during the past two decades. Particularly in Germany the Renewable Energy Sources Act (EEG), passed in 2000, and the so called ´energy transition´, which was initiated in 2011 have led to a significant expansion of renewable energy. Biomass plays an important role among the different sources of renewable energy. Since the late 1990s there was a strong tendency in Germany to expand the cultivation of energy crops which is shown by an increase of over 500% from 1997 to 2010 ^[1]. According to the 2006 action plan "Biomass" there is a potential for a share of 8-10 % in the national primary energy consumption ^[2]. The focus lies on biogas produced from silage maize or corn. For agriculture in rural areas the use of renewable energy products potentially creates an added value through new production and market alternatives.

Moreover, the substitution of fossil energy has positive effects on the balance of greenhouse gases ^[3], but at the same time, the intensification of agricultural production can lead to severe impacts on soil, groundwater ^[4,5] and biodiversity ^[6,7], which opens many conflicts concerning commitments of the European Union (EU) to maintain biodiversity and to protect surface and ground water ^[8,9].

Modeling approaches provide an important contribution to analyze the impact of increasing competition in the agricultural biomass production for food, feed and fuels. In order to be able to depict the diverse economic and ecological effects of agricultural production, integrated model approaches are necessary. During the past decades numerous approaches were developed for specific modelling levels, such as: farm, region, country, EU and global ^{[10,11,12,13,14,15,16,17,18]}. Against the background of ecosystem services used by agriculture Kirchner et al. ^[15] provide a very detailed overview on existing modelling approaches for the assessment and evaluation of economic, biotic and abiotic effects.

1.2. Objectives

Environmental impacts and conflicting nature conservation targets play a vital element in the actual debate on how much intensification can be still considered as a sustainable solution to meet food and energy demand. Restrictions on the share of energy crops or compensation for impacts by enforcing nature conservation measures represent key policies to control intensification and are considered in our study. We ask for their efficiency in a scenario experiment which is based on geographic information processing and numeric models and addresses the field of integrated land-use modelling. In contrast to Kirchner et al. ^[15], who considers land-use change under climate scenarios, our approach considers the effects of land-use decisions on biodiversity and soil degradation. And as a second question, we analyze economic, spatial and environmental effects from restrictions deduced from worthwhile nature conservation targets.

So the objective of the study is to support policy making by answering the following questions when facing intensification by biomass production for energy use:

- If there is an increase of using energy from biomass, which effects on farm production portfolio and farmer's income are to be expected?

- Which effects on soil erosion, nitrate leaching and greenhouse gas emissions are to be expected, which effects on biodiversity?

- Is there a target conflict between ability for intensification and ecological and environmental standards?

- Do restrictions on the availability of farm land from nature conservation reasons enable more sustainability?

These general questions are answered by a case study which is set up for the German federal state of Baden-Württemberg. The study suggests an integrated modelling approach (IMA) which implements a model chain and which is able to combine spatially aggregated economic optimization and spatially explicit ecological evaluation schemes. The approach tackles an ongoing challenge in economic-ecological modelling: to work on different spatial and time scales ^[12,18,19] including the choice of an appropriate method when linking models of different spatial scales which is of particular importance for the quality of modelling results ^[20,21].The applied methodology uses scenarios as defined by van Notten ^[22] and Sparrow ^[23] to quantify trade-offs and synergies between the expansion of area for the cultivation of energy crops and the target to protect ecosystem integrity and services.

2. Materials and Method

2.1. Integrated modelling approach

The integrated modelling approach (IMA) reported in this paper follows to the general idea (i) to calculate agro-economic parameters using an optimization approach that leads to spatially aggregated crop shares and (ii) to disaggregate the results at a spatially explicit 1-hectare (ha) raster level which sufficiently allows site specific calculations and balances of ecological effects. We followed a strategy of loosely coupling the different components of a model chain by file exchange when built up the IMA which is sketched in Figure 1.

Figure 1. Structure of the integrated modeling approach (IMA)

DownLoad: Full-Size Img PowerPoint

The approach uses a comparative static linear optimization model called Economic Farm Emission Model (EFEM; see chapter 2.3) which maximizes total gross margins on farm level. The results from EFEM are subsequently analyzed in regard to ecological impacts at a spatially explicit level. For this purpose a spatial distribution model for cultivated crops is applied (see chapter 2.6). Environmental impacts on soil and water are reported following calculations from the Environmental Policy Integrated Climate model (Ehttps://www.aimspress.com/aimspress-data/aimsagri/2017/1/PIC) (see 2.4). Impacts on species habitat on the other hand are analyses using a GIS approach (see 2.5).

2.2. Study area

We applied the approach to the territory of the German Federal State of Baden-Württemberg on the basis of eight agri-ecological regions (see Figure 2). The regions are characterized by different agricultural production potentials due to a broad range of annual precipitation between 600 mm (minimum) and 1600 mm (maximum) per year, an average annual temperature between 5 °C (minimum) and 9 °C (maximum) and by different soil types. Predominant soil types in Baden-Württemberg are Cambisoles, haplic Luvisoles and Fluvisoles with mainly silty-loam but partly also sandy or clayey soil types. The utilized agricultural area (UAA) is about 14,000 km² (60%arable land and 40% grasslands) and covers 46% of the State territory.

Figure 2. Location of the German Federal State of Baden-Württemberg subdivided into eight agri-ecological regions (AER) which cover the study area

DownLoad: Full-Size Img PowerPoint

2.3. Agricultural sector modelling

The economic-ecological model EFEM (Figure 1) is an agricultural sector model, follows a bottom-up approach and describes all predominant farming systems in Germany at farm and at regional scale. EFEM is a comparative static supply model that maximizes the gross margin at farm level based on linear programming ^[24,25]. The predominant farm models are classified according to the farm type systems of the EU classification scheme and derived by analyzing datasets of the Farm Accountancy Data Network "FADN" ^[26]. The capacities of typical farm models, such as stable places, farmland, labour ability etc., are the basis for the farm type module and restrict the linear optimization processes in EFEM. The prices for producers and for means of production and the production capacities for typical farms are exogenously determined.

The activities for crop and livestock production are integrated in the production module of EFEM. They differ regionally in terms of yields, intensities, revenues and costs. To clear annual variations, estimates for prices, yields and costs are based on an average of five years. The model includes all important agricultural production methods. Beside conventional production methods, environmentally friendly production methods for crop production and grassland cultivation that are supported by the agri-environmental programme of Baden-Württemberg (MEKA) are integrated in the model.

Grassland can be used for grazing or hay and grass silage production. Depending on the frequency of use, different fertilization intensities are formulated in the model. The yields of the different grassland management options are simulated with a model internal yield module, depending on the amount of fertilizer applied. In the field of bioenergy crop production, beside annual crops, perennial energy crops such as short rotational forestry plantations (willow, poplar) and miscanthus are taken into consideration. Some crops can be used as food or as feeding stuff or for energy production (Table 1). Based on the comparative static approach, crop rotations are specified as a maximum cultivation share for all crops as model restrictions. The grassland biomass can alternatively be used for biogas or animal production. Perennial energy crops are only used for the production of thermal energy.

Table 1. Utilization options and energy yields for the modelled crops (own calculations according to KTBL ^[27] and Öko-Institut ^[28])^a.

	Food/Feeding stuff	Bioenergy
	Food/Feeding stuff	Biogas plant	Combustion	Biofuel	Net energy production
Crop					kWh/ha	kWh/t DM^c
Cereals^b	X		X (only straw)		20,407	34.5
Winter rapeseed	X			X	12,949	38.4
Sunflower	X
Sugar beet	X
Potatoes	X
Maize (corn, silage)	X	X			29,382	19.1
Clover grass	X	X			25,345	18.2
Grassland^d	X	X			4270	18.1
Miscanthus			X		61,966	33.0
Short rotation plantations (willow, poplar)			X		31,930	27.9
^a All values are exemplary for AER1 ^b Straw of all cereals cultivated in EFEM can be used. The values here are exemplary for winter wheat ^c Dry matter ^d Exemplary for a 3-cut grassland silage

| Show Table

DownLoad: CSV

The modelled livestock production activities comprise dairy farming including heifers for restocking, male and female calves, fattening bulls, suckler cows, pig production (fattening pigs and sows) and poultry production (layer hens and broilers). EFEM considers all variable costs including machinery costs but not investment and fixed costs. Perennial crops are characterised by high initial costs and irregular revenues over long operation lengths. Therefore, the specific costs are amortised over the entire duration of operation and are considered as annual average costs in EFEM.

In the production module also greenhouse gas emissions from the agricultural sector are quantified. Thereby the emissions are distinguished according to their sources along the production process chain (Figure 3): upstream processes, agricultural production and downstream processes for the conversion of energy crops into bioenergy. In this way, the greenhouse gas balance shows the net substitution effect from agricultural bioenergy production, because the emissions of the conversion processes are taken into account ^[29]. Using global warming potential for 100 years the different greenhouse gases are added to CO₂ equivalents (CO₂e).

Figure 3. Considered sources of greenhouse gases and account boundaries in EFEM

DownLoad: Full-Size Img PowerPoint

IMA works on eightagri-ecological regions as aggregated spatial units. They subdivide Baden-Württemberg into homogeneous regions regarding natural conditions for agriculture. This approach was chosen, because it allows identifying and distinguishing natural site conditions as well as it considers a sufficient number of predominant farms from the FADN data sets while assuring data privacy.

By linking the production module and the farm type module and subsequently performing the optimization process, agri-economic values like gross margin, factor input, amount and structure of agricultural production and associated environmental impacts at farm level were estimated. The results at farm level were extrapolated to regional scale with expansion factors that were identified by a linear optimization approach (extrapolation module), independently from EFEM. Thus upscaling factors are generated which help to cover the regional capacities of the six designated farm type models. A detailed description of this approach and of the EFEM is given in Schäfer ^[30].

For the scenarios EFEM analyses exclusively the supply side and not the overall agriculture biomass market. Therefore, a full elasticity of demand is assumed for the scenarios BioE1 and BioE2 (refer to Table 3). Consequently, biomass produced for energy generation completely finds a purchaser accepting the assumed prices.

EFEM was calibrated on the basis of statistical data for the reference year 2003. The calibration results showed a high accordance with the statistical data deviating for crop production of maximum 7% and for livestock production of maximum 3%, due to integrated restrictions for housing capacities.

2.4. Analysis of impacts on soil and water

By a geometrical intersection of land use (cultivated crops), associated climate station data and soil association layers, quasi homogeneous spatial entities were generated with a minimum area of 1 ha, the so-called Land Use Soil Association Climate (LUSAC) land units. As an input for LUSAC generation and properties assignment theSoil and Land Resource Information System (SLISYS) was applied ^[31]. SLISYS couplesGIS and the Ehttps://www.aimspress.com/aimspress-data/aimsagri/2017/1/PIC models described below, documents soil and terrain data from 311 soil profiles in southwest Germany according to the international Soil and Terrain Database (SOTER) standard ^[32] to be mapped to the main mapping units of the general soil map in the scale 1:200,000, edited by the Geological State Office. Secondly LUSACs are generated from 20 climate stations for which daily values of temperature (minimum and maximum), precipitation and days of rain, air humidity, wind speed and global radiation are recorded.

Operating on SLISYS and concerning LUSACs the environmental cropping models in Ehttps://www.aimspress.com/aimspress-data/aimsagri/2017/1/PIC (version 0509) ^[33,34] were applied for soil erosion by water(according to the Universal Soil Loss Equation (USLE)), for nitrate (NO₃) leaching and for greenhouse gas emissions (CO₂, N₂O) from soils. The estimationsare based on climate parameters and soil properties and depend on different field management systems.CO₂ soil emissionwas calculated based on the change of organic carbon in the soil (SOC) as an Ehttps://www.aimspress.com/aimspress-data/aimsagri/2017/1/PIC output(CO₂emission = SOC Change x 3.667). The direct and indirect N₂O soil emission calculationwas done by multiplying and addingdifferent Ehttps://www.aimspress.com/aimspress-data/aimsagri/2017/1/PIC output parameters with conversion factors according to Schmid et al. ^[35] and Khalil et al. ^[36].

Ehttps://www.aimspress.com/aimspress-data/aimsagri/2017/1/PIC did not include parameters for three energy crops considered in the Scenarios. Therefore, we adapted and extended existing crop parameter sets. The short rotation forestry plantation was simulated in Ehttps://www.aimspress.com/aimspress-data/aimsagri/2017/1/PIC with the already implemented poplar. For miscanthus an extra dataset was provided by Schmid [personal notification in 2008]. Red clover and red fescue represented clover-grass mixture. In addition, fertilizer application rates resulting from EFEM were used as input data for the Ehttps://www.aimspress.com/aimspress-data/aimsagri/2017/1/PIC model. To avoid misleading calculations and interpretations crop rotation was considered. Finally, weighted according to the spatial extent of the LUSACs, the output parameters assigned to the LUSACs were summed up.

2.5. Evaluation of effects on target species habitat

The result of EFEM consistsin alist of crop cultivation area for different crops and for each agri-ecological region. So, as landscape related ecological models need land-use patterns as a spatially explicit input, a disaggregationprocedure was applied which assigns to each of 1ha-gridcells covering the study area a specific crop. The method applied per cropfollows two steps: (1) evaluation of agricultural land in regard to its soil suitability/priority according to UMBW ^[37] and site requirements of relevant crops ^[38,39] implemented in SLISYS and considering slope, coarse fragment, soil depth, texture, average annual temperature, average annual precipitation and sea level.(2) selection of as much land as indicated by the result of EFEM. The procedure starts allocating the crop with the highest gross margin followed successively by the other crops according to their gross margin. The advantage of the applied procedure, compared to crop share downscaling approaches suggested by Britz and Leip ^[40], Chakir ^[41]and Henseler et al. ^[12], leads to an explicit occupation of a 1-ha-rastercell by a unique crop. The procedure was implemented as an ESRI ArcGIS™ geo-processing model and successfully applied before in a watershed management project ^[42,43].

The impacts of scenario land-use patterns concerning species habitat performance were then analyzed in GIS-overlaying the crop type distribution asgenerated by the disaggregation and allocation step described above and existing areas of occurrence for target species from ^[44,45,46]. So an area summary statistics of crop types inside the target species areas was produced. The crop types were classified by an expert-based impact evaluation as documented in Table 2. In a next step, this evaluation scheme led to an area summary statistics for selected target species areas, which was calculated for each of the assessment levels in Table 2. The different scenarios then were compared with regard to their implications of an improvement or a decline of habitat quality. Following the idea of target species as a key species for monitoring species diversity, the different scenarios can be described by this method as more or as less biodiversity friendly.

Table 2. Influence of different crops on habitat quality for Sky Lark (Alaudaarvensis).

Crop	Assessment
Wheat, spring barley, oat,	Very beneficial (++)
Winter wheat, winter barley, winter rye, winter rapeseed, sugar beet, potatoes, intertillage	Beneficial (+)
Corn, sun flowers	Neutral (O)
Maize, clover grass, miscanthus	Adverse (-)
Short rotation woods	Very adverse (--)

| Show Table

DownLoad: CSV

Table S1 in supplementary material indicates the habitat quality of land according to different intensification levels considering the change in grassland use according to ^[47]. Here it is not possible to formulate reasonable rules for a disaggregation as we did for crop cultivation. In the case of the cultivation of grassland we only consider the area of the grassland of a specific type respectively the area of land covered by the assigned evaluation class for the AERs. It is to be mentioned, that this only indicates the decline of the considered land of a specific habitat quality but not of actually occupied habitat area.

2.6. Implementation of nature conservation restrictions

To answer the question of conflicts between energy production and nature conservation we assumed, that a proactive policy follows the strategy of introducing measures targeting nature conservation and protection of aquatic habitat to achieve higher environmental standards in agriculture. So we formulated restrictions in regard to available land for agriculture and intensification. A list of measures you can find in TableS3 in supplementary material. To prepare the scenario calculations three levels of implementation intensity were defined by an estimate of the overall percentage of agricultural land dedicated to the measures. The estimate included complex GIS analyses and the intensively useddatabase of the state level agri-environmental program called "MEKA" ^[47].The amount of area being funded by the MEKA program in the year 2000 was fixed as "actual standard" level

Level "legal minimum standard" assumes a more rigorous farm strategywhich means that mandatory conservation measures in legally binding conservation areas are the only restrictions assumed for the farmers. As appropriate areas are assumed to be: (a) those sites that are legally regulated by limitations for the intensity of agricultural production (mainly grasslands protected by law for reasons of nature conservation or ground water protection), (b) grasslands on dry soils which have to be cut once per year and which grow up without being fertilized and (c) grasslandson humid soils which have to be cut twice per year and fertilized only up to a maximum of 40 kg N ha⁻¹a⁻¹ (TableS3).

To delineate the area restricted by "improved nature conservation" conservation measures were considered as laid down in the "Action Plan for the Environment 2007-2012" of Baden-Württemberg. The target of those measures consists in a considerable reduction of the loss of species and habitats, corresponding to national and EU targets for the protection of biodiversity. Target species for arable land and grasslands were selected from the State's governmental "Target Species Concept" ^[45,46] (see TableS2) following the multi-species approach of Lambeck ^[48]. Then the re-establishment area for the selected target species was estimated by the following steps:

1. Population size was assumed at an approximate level as stated for the 1980s.

2. For birds, typical population densities on farmland enhanced by conservation measures were assumed as an expert attuned mean value derived from literature analysis ^[49].

3. Using a Geographical Information System (GIS), arable land or grassland area potentially suitable for each species was extracted from GIS land-use layers by eliminating species-specifically disturbed buffer-zones around built-up area, roads and forests. This setting was overlaid by the current spots of occurrence of the species to identify the currently populated area (data was provided by the State Agency for the Environment, Measurements and Nature Conservation of Baden-Württemberg).

4. To demarcate the additional area necessary to support the population size assumed for the 1980s, the largest suitable and coherent area within a 2-km buffer-zone around the currently populated area was taken and added. This step was repeated until the desired area was reached. This approach follows the idea that the development of new habitat is most promising when starting from the largest remaining populations.

5. In addition, areas with a ban on short rotation forestry and miscanthus plantations were introduced. This is due to the fact that breeding birds of open farmland like sky lark, corn bunting or lapwing tend to avoid vegetation structures of several meters of height like woods, groves or higher hedges. This step anticipates that landscape planning needs to exclude short rotation forestry and miscanthus plantations from the territories of these birds.

The area according to the three levels of measures (Table S3) were handed over to EFEM and treated there as being restricted for crop and grassland production. As data input for EFEM, the area of applied conservation and protection measures was set up for each of these levels and for the territory of the whole State of Baden-Württemberg and was then aggregated by the agri-ecological regions.

2.7. Scenarios

The target of the scenario analyses using IMA is to quantify trade-offs and synergies between the expansion of area for the cultivation of energy crops and the target to protect ecosystem integrity and services. Thus scenarios are defined that compare a specified business-as-usual situation with scenarios assuming (a) an extended production of energy crops and (b) an extension of restrictions on agricultural production to support nature conservation. To evaluate the scenarios and applying the IMA a wide range ofeconomic and ecological aspects are considered: gross margin, crop shares, energy supplygreenhouse gas emissions from agricultural and bioenergy production, conflicts concerning habitat of endangered species, and soil erosion, nitrate leaching and nutrient loss.

The scheme of analyzed scenarios is documented in Table 3. We first defined an empirical baseline (BL) for all necessary key parameters. Based on this EFEM model was validated and calibrated (BLMod). Next a business-as-usual (BAU) scenario targeting at the year 2015 was defined considering all scheduled measures of the EU Common Agricultural Policy as well as projections of prices and crop yields. Our study then compares the fixed BAU situation with two types of scenarios:

Table 3. Overview on scenarios and basic assumptions used; for abbreviations see text

	References		Scenarios
	BL	BLMod	BAU/BAU_Nat	BioE1/BioE1_Nat	BioE2/BioE2_Nat
Data type	Statistical	Modelled	Modelled	Modelled	Modelled
Share of perennial energy crops on arable land	No perennial energy crops	No perennial energy crops	No perennial energy crops	No perennial energy crops	Cultivation on max. 30% of suitable arable land
Share of annual energy crops on land use area	unknown	Max. 4.3% of crop rotation	Max. 5% of crop rotation	No restriction	No restriction
Max. Share of cereals in crop rotation	57%	75%	75%	75%	75%
Max. Share of winter rape in crop rotation	8%	13%	18%	25%	25%
Max. Share of maize silage in crop rotation	9%	50%	70%	50-70%	50-70%
Level of nature conservation	Not relevant	Not relevant	Actual standard/Improved nature conservation	Legal minimum standard/Improved nature conservation	Legal minimum standard/Improved nature conservation

| Show Table

DownLoad: CSV

1. Unfixed option for enlargement of agricultural bioenergy crop production without (BioE1) and including perennial crops (BioE2) (specifications see Table 3)

2. Enlargement of nature conservation measures: dedication of new conservation areas, additional subsidies, incentives, compensation measures and areas (see Table 3) and generate "nature friendly" scenarios BAU_Nat, BioE1_Nat, BioE2_Nat.

3. Results

We first report on the economic and environmental impacts of the scenario assumptions of an increased crop cultivation for bioenergy production (3.1). Subsequently and respecting the results of this first analysis the described area restrictions on agricultural production targeting improved nature, soil and water protection are assumed to be introduced and related to income effects (3.2).

3.1. Impacts of an increased cultivation of crops for bioenergy

3.1.1. Effects on agricultural and bioenergy production (EFEM results)

Under the given assumptions of the model the cultivation of energy crops is significantly expanded. Thereby, the cultivation of grain in Baden-Württemberg decreased considerable in both bioenergy scenarios compared to the business as usual scenario (BAU, Figure 4). The straw of the cereals is almost completely combusted. Therefore, the humus balance is obtained by cultivation of catch crops. The option to expand energy crop cultivation led to a large expansion of maize area. In BioE1 maize would account for 40% of arable land. A large proportion of cultivated maize was used for energy production in biogas plants; in some regions more than 90%. In BioE2 perennial energy crops, particularly miscanthus that is economically preferred, grown on 18% of the arable land. Wobei jedoch die ökonomische Vorteilhaftigkeit dieser Kulturen regional sehr unterschiedlich ist. Thus, the proportion of field cropping regions increases to 30% of arable land, in forage growing regions, however, to only slightly more than 1%.Overall, the bioenergy scenarios rarely changed the animal husbandry.

Figure 4. Cultivation areas of the main crops in Baden-Württemberg

DownLoad: Full-Size Img PowerPoint

Through conversion of the cultivated energy crops and use of agricultural residues in BioE1 74,200 TJ and in Bio2 86,800 TJ net energyis produced. This would cover 5.3 or 6.2% of final energy demand and is nearly 60% of the present part of renewable energy sources in Baden-Württemberg ^[49].

The bio energy scenarios lead to an increase in agricultural gross margins, namely by 15% in BioE1 and by 18% in BioE2 (see Figure 5). In both bioenergy scenarios, equal or more emissions areavoided downstream by bioenergy production than caused by agricultural production. This results in a compensated or negative greenhouse gas balance where the net substitution effect was higher than agricultural emissions. The most savings were reached due to cultivation of perennials crops (BioE2). As the analysis is based on extrapolation of farm gross margins, all bioenergy scenarios would lead to mitigation benefits on farm level. A detailed evaluation of potential mitigation costs, as a result of increased energy crop cultivation, would require a macroeconomic analysis including subsidies, taxes and market effects and is not possible with the IMA presented here.

Figure 5. Development of gross margins, agricultural energy production and greenhouse gasemission balances on state level

DownLoad: Full-Size Img PowerPoint

The effects of the scenarios highly depend on region and farm conditions. Table 4 shows these different reactions on the basis of key parameters for one appropriate farm type model in selected regions. The cultivation structure of almost all farm types showed a clear trend from food production to energy crop production. Energy crops replaced especially the production of winter wheat. Particularly noticeable was the expansion of maize by the factor of seven in intensive livestock farms. While in field cropping and intensive livestock farms maize was almost entirely used for the fermentation in biogas plants, it was mainly fed in forage growing farms. The opportunity to produce bioenergy led to an intensification of grassland particularly for forage-growing farms where a considerable part of the grassland now is used for bioenergy production. For farms with a high percentage of arable land, the cultivation of perennial bioenergy plants, especially miscanthus, was an economically attractive alternative and was often extended up to model constraints.

Table 4. Results for the different farm types in selected AER

		Field cropping farm (AER 1: "UnterlandGäue")			Mixed crops-livestock farm (AER 2: "Rhein/Bodensee")
		BAU	BioE1	BioE2	BAU	BioE1	BioE2
Gross margin	€/ha	546	755	872	811	1057	1142
Power generation	kWh/ha	1312	29,275	22,043	1065	23,034	18,328
Total balance	t CO₂e/ha	2.7	−3.8	−6.4	2.9	−2.7	−4.1
		Forage growing farm(AER 5: "Allgäu")			Intensive livestock farm (AER 8: "Bauland/Hohenlohe")
		BAU	BioE1	BioE2	BAU	BioE1	BioE2
Gross margin	€/ha	2256	2334	2334	4566	4898	4934
Power generation	kWh/ha	165	3666	3666	1533	30,998	28,475
Total balance	t CO₂e/ha	7.8	7.9	7.9	5.7	−2.8	−3.9

| Show Table

DownLoad: CSV

Because of a high flexibility in cultivation, especially field cropping farms profited from the option of cultivating energy crops. This can also be shown by the gross margins. While the high gross margins of the forage growing and intensive livestock farms were subject to only small fluctuations, those of field cropping and mixed crops-livestock farms showed a clear increase by 25-60%. The highest energy production was gained by field cropping farms, especially when the cultivation of perennial energy crops was possible (BioE2). The energy production of these farms was about twenty times higher than that of forage growing farms. The forage farm model is least affected by the bioenergy scenarios, because farmland is needed for dairy cows feeding. In contrast, the intensive livestock farm can compensate bioenergy crop cultivation by purchasing feed. The highest savings of emissions are gained in field cropping farms if the cultivation of perennials is possible.

3.1.2. Effects on soil and water

LUSAC-units serve as indicator units being characterized by a broad range of abiotic parameters. Before we discuss the effects of energy crop intensification on soil and water parameters as indicated by Table 6 it is important to state that inherent to scenarios BioE1 and BioE2 (a) there is an increase in total cropping area due to conversion of grassland to arable land and (b) there is a decline in crop and land unit diversity (Table 5). Both land use change effects aredetrimental to soil and water quality.

Table 5. Number of LUSACs, crop diversity and total cropping area in Scenarios BioE1 and BioE2 compared to BL

Scenario	Number of different LUSACs	Number of different crops	Total cropping area in ha
BL	8767	13	837,300
BioE1	4818	9	859,700
BioE2	5850	11	863,700

| Show Table

DownLoad: CSV

Table 6. Balance and alterations of soil and water parameters in Scenarios BioE1 and BioE2 compared to BL.

Indicator	Units	Balance	Change compared to BL
		BL	BioE1	BioE2
Water erosion	Gga⁻¹ in BW	1606	+228	−9
	kgha⁻¹ a⁻¹	1920	+220	−70
Soil Organic Carbon (SOC)	Gg a⁻¹in BW	0¹	−679	−376
	kg ha⁻¹ a⁻¹	0¹	−787	−434
Nitrous oxide-N	Gg a⁻¹in BW	4338	+1890	+1623
	kg ha⁻¹ a⁻¹	5.2	+2.1	+1.7
Nitrate-N	Gg a⁻¹in BW	19,889	+7422	+5968
	kg ha⁻¹ a⁻¹	24	+8.0	+6.2
¹ If plant production is not changed, an equalised soil-C-balance is assumed at least in the medium term.

| Show Table

DownLoad: CSV

Soil erosion by water, an indicator for the need of soil conservation, summed up for the whole investigation area increased in the scenario BioE1 in comparison to BL (see Table 4) due to increase in monoculture silage maize production for energy use. On the other hand, soil erosion slightly decreased in the scenario BioE2 due to increase in perennial crops. Overall, the soil erosion was on a low level on average, but on sites susceptible to erosion, rates of over 50 Mgha⁻¹a⁻¹ were indicated. The differences were more obvious in scenario BioE1 without perennial energy crops compared to scenario BioE2, which includes cultivation of perennial energy crops. By relating the erosion rate to the LUSAC-area per hectare, a very slight decrease can be stated in BioE2 in the average of all LUSACs, mainly because perennial energy crops covered the soil throughout the year. The model simulations, carried out separately for the eight agri-ecological regions, showed regional differences. The reason was the diverse development of cultivation areas of monocultures, e.g. changes in the proportion of summer crops.

Nitrate leaching as an indicator for the need of groundwater protection increased in the bioenergy scenarios in comparison to BL. This increase was observed regarding the sum over the whole study area as well as relating the rates to LUSAC-area per hectare (see Table 4). Losses in nitrate were higher in scenario BioE1 than in scenario BioE2. This trend is parallel to the proportion of silage maize and perennial energy cropping. The higher amount of nitrogen fertilizer of up to 200 kg Nha⁻¹a⁻¹ on maize assumed in both bioenergy scenarios caused the increase in nitrate leaching. This nitrogen fertilizer originated partly from digestate. The partial substitution of maize for extensively fertilized miscanthus (on average 60 kg Nha⁻¹a⁻¹) and unfertilized short rotation forestry plantations resulted in a lower increase of nitrate leaching in scenario BioE2. However, a leaching of 30 kg Nha⁻¹a⁻¹ from soils with representative site conditions in South West Germany could reach the EU threshold for drinking water, which is at 50 mg nitrate/L ^[50].

Soil-borne CO₂ and N₂O emissions as an indicator for the relevance to climate protection concerns were also calculated with Ehttps://www.aimspress.com/aimspress-data/aimsagri/2017/1/PIC using the changes in soil organic carbon (SOC) and the changes in selected parameters of the nitrogen budget. According to this, the annual decrease in SOC stocks, associated with decreasing soil fertility and increasing CO₂ emissions, was higher in BioE1 with annual energy crops, compared to the initial stocks in 2003 than in BioE2 with perennial energy crops. This is comparable with the rates of change, summed up over the whole study area as well as for the annual changes related to LUSAC-area per hectare. A plausible reason is the conversion of grassland into arable land leading to SOC losses. Another reason could be the increase of silage maize cropping that led to lower amounts of harvest residues and thus lower carbon accumulation in the topsoil. The development of the N₂O emissions showed similar characteristics. The increase of nitrogen fertilizer application (see annotation on nitrate leaching) resulted also in higher N₂O emissions. Perennial energy crop cultivation such as miscanthus and short rotation forestry led to lower changes in SOC stocks and in N₂O emissions in scenario BioE2, because they do not affect the soil and thus prevent SOC degradation. Therefore, there are less soil-borne CO₂ emissions which, together with the omitted application of nitrogen fertilizer, led to the prevention of soil-borne N₂O emissions.

3.1.3. Effects on target species

Figure 6 shows the area per habitat impact class as fixed in Table 5 for the sky lark. Crop cover with the attributes "very beneficial" or "beneficial" for habitat suitability declined in the scenario BioE1 and more in BioE2 compared to the BAU scenario. This indicates the effects of the reduction in winter wheat production andon a minor extent also in winter rapeseed, sugar beet and potato production. The area covered with "adverse" or "very adverse" crops clearly increased because of a three- to four-fold expansion in maize production in the bioenergy scenarios and, additionally in BioE2, short rotational forestry and miscanthus production, which are classified as "very adverse".

Figure 6. Impact analysis considering the target species Sky Lark (Alaudaarvensis) in arable land and reporting area per habitat suitability class within recent species range.

DownLoad: Full-Size Img PowerPoint

As an example for an impact analysis considering the use of grassland, Figure 7 shows the differences in the area per habitat suitability/impact class as fixed in Table 5 for the bush cricket (Polysarcusdenticauda) in the agri-ecological region "Schwäbische Alb/Baar". This hilly area, with a high portion of extensive use of grasslands, is the most important of only two relicts of occurrence of the Bush Cricket in Germany. Although the aggregate analysis considered the whole agri-ecological region, the sharp decline in extensive production in the bioenergy scenarios causes a risk for further loss of populations of this highly threatened species.

Figure 7. Impact analysis considering the target species Bush cricket (Polysarcusdenticauda) in mesophile grasslands and reporting area per habitat suitability class within the AER "Schwäbische Alb/Baar".

DownLoad: Full-Size Img PowerPoint

We also analysed in more detail the conflicts between breeding birds of open farmlands and the high-growing miscanthus and short rotation forestry plantations. In scenario BioE2 and as a result of the spatial disaggregation described in chapter 2.5 the area of short rotation forestry plantations covered2,5% of the arable land in the State territory, whereof about 60% was located inside zones that are considered as a conflicting area. On the other hand, simulated spatial distribution ofmiscanthus cultivation in BioE2 covered 14% of the arable land in the State territory, whereof only 14% (17,900 ha) was located inside of ecologically disfavoured zones.

3.2. Impacts of improved nature conservation standards

The following results intend to demonstrate the effects of enlargement of nature conservation measures by dedication of new conservation areas, additional subsidies and compensation measures (see TablesS1 and S2 in supplementary material) on selected EFEM results. In doing so, the previously examined model scenarios are compared to the corresponding "nature friendly" scenarios BAU_Nat, BioE1_Nat, BioE2_Nat.

In most of the regions, improved nature conservation policies led to intensification of the remaining grassland and of the arable land, which were not part of these measures. The strong influence of nature conservation measures on grassland management results in two adjustment measures of the farm models in order to meet the feeding needs for cattle.On the one hand, the cutting frequency and the degree of fertilizing intensity will be considerably increased on the remaining grassland and on the other hand, cultivation of silage maize will be increased Latter is particularly evident at BioE2_Nat where the cultivation of maize silage increases another 14% in comparison to BioE2 (see Figure 8). In grainfarming it becomes evident that in BioE1_Nat and BioE2_Nat spring grain becomes more attractive than winter grain. Furthermore, the cultivation of winter cereals is falling sharply.For example, the winter cereal area in BioE1_nat decline by around 91,500 ha, a drop of 33%.This adjustment was caused due to nature conservation measures, e.g.: Sky lark plots, which have more severe effects on the growth of winter cereals. This illustrates the competition among the requirements for nature conservation and intensive production of bioenergy, feed or food. Spare winter cereal areawas cultivated with summer corn and silo maize.

Figure 8. Impact of improved conservation measures on the growing share of the main crops in Baden-Württemberg.

DownLoad: Full-Size Img PowerPoint

Despite the severe impact of implemented nature conservation measures relative to the utilized agricultural areas, the gross margin decrease is relatively small. The highest decrease amounts to 5% at BioE2 versus BioE2_Nat.

This is based on the fact that most measures can be enhanced by agri-environmental schemes. However, this would entail an increase in MEKA expenditure in Baden-Württemberg. The highest increase would amount to 27% again at BioE2 versus BioE2_Nat. The power generation per hectare utilised agricultural area will decrease by 13% in BioE1_Nat and by 22% in BioE2_Nat. Conservation measures in the BAU scenario will result in a slight reduction of greenhouse gas emissions due to induced extensification effects. In contrast, they decrease the sink effects of greenhouse gas emissions in bioenergy scenarios by a reduced substitute amount of fossil energy sources.

4. Discussion

4.1. Agro-economic results

A central result of the agro-economic modelling with EFEM is the estimated future increase in arable land used for energy crop production. The model calculations show an increase in agricultural gross margin in Baden-Württemberg by up to 18%. Arable farmer would benefit more from this development than forage growing farmer.

However, the simulated cultivation area for energy crops might be overestimated, because the agro-economic model EFEM is a statistically linear programming model that deals with exogenously determined prices. Such a strong limitation of the food and feed production, as the modelling results show, would in reality lead to changes in price of these products. As these price interdependencies could not be represented within the modelling approach, sensitivity analyses with different price assumptions were carried out. They showed that the modelling results were very stable regarding the scale of cultivation of bioenergy resources, especially for the cultivation of perennial energy crops. An alternative option to better show the correlation between farm decisions and market effects is coupling farm based models like EFEM with a partial equilibrium for the agricultural sector ^[51,52].

From a business point of view the high relative excellence of perennial crops in our analysis is also due to the assumed costs e.g. for fertilizer, pesticides and energy in our model scenarios. Thus, energy crops with low input demand like short rotation forestry or miscanthus would become more competitive. However, in reality, factors such as the comparable high costs of investment for planting and the farmers' attitude towards entrepreneurial risk play an important role for farmers when considering long-term fixing of land-use. These factors were not incorporated in this analytical approach. Model approaches, e.g. Real Options Approaches, can better explain the reluctance for planting observed in reality by taking into consideration complex planning approaches ^[53,54].

Scenario calculations by EFEM are based on a completely elastic demand, i.e., it is assumed that cultivation of bioenergy crops simultaneouslyallows generate the setup of biogas or combustion for crop conversion. Subsequently, the results represent an economic production potential on farm level.In order to be able to make further statements regarding a realistic demand for agricultural energy crops, EFEM could be coupled with energy market models as TIMES PanEU ^[55].

Modelling results showed only small reductions in income caused by the implementation of nature conservation measures in the different scenarios. In reality a higher reduction in income would be expected, because the adaptability of farm models was overestimated with this optimized approach. Furthermore no limitation of the payments by agri-environment schemes was assumed. Possible governmental transfer payments for incentives as well as potential costs for monitoring of the nature conservation services need to be added.

By balancing the greenhouse gases, EFEM as a farm based model accounts the emissions from the upstream processes for means of production, agricultural production itself and downstream processes for bioenergy supply. Hereby the greenhouse gas balance in Figure 6 includes also emissions that have been caused by an increase in the purchase of additional feeding stuff as a result of the strong competition in area. A reduction in the degree of self-sufficiency by using food crops for energy production and extending the cultivation of energy crops has not been considered in the EFEM so far. Increased imports of food items would lead to a shift in greenhouse gas emissions into other regions or countries.

4.2. Impacts on soil and water

The scenario results from the EFEM model for maize production area, conversion of grassland and amount of applied fertilizer served as input data for subsequent calculations with the Ehttps://www.aimspress.com/aimspress-data/aimsagri/2017/1/PIC model. The change in soil and water parameters was described applying the principle of sustainability assessment of agriculture with key indicators ^[56,57,58], e.g. soil erosion for soil conservation, nitrate leaching for water protection or soil-borne greenhouse gas emissions for climate change mitigation ^[59,60]. With regard to such indicators, the plausibility of the Ehttps://www.aimspress.com/aimspress-data/aimsagri/2017/1/PIC modelling results is satisfying in comparison to single and representative surveys. The modelled mean values of C content are comparable to several surveys on increases and decreases of land use changes ^[61,62,63]. The modelled nitrogen losses by nitrate leaching is still in the frame derived from the results of governmental controlling campaigns in southwestern Germany ^[64]. The nitrous oxide emissions that have been estimated by using factors cited by Schmid ^[36] ranged in the frame as reviewed by Kaiser ^[65] on soils in Germany. The modelled erosion is equal to merely about 20% of the mass observed in south west Germany and Bavaria ^[66,67].

The changes in the investigated parameters indicating resource conservation are less distinct when integrating perennial energy crops such as short rotation forestry plantations or miscanthus into the cropping system (refer to Table 6). However, not included in the considerations are additional conservation measures like reduced soil tillage that offer the possibility for compensation ^[4]. Model results from Ehttps://www.aimspress.com/aimspress-data/aimsagri/2017/1/PIC (refer to Table 6) showed that adverse environmental effects increased if energy crop production increased, mainly because of grassland conversion to crop land. Annual energy crops such as silage maize (scenario BioE1) caused higher adverse ecological impacts than perennial energy crops like short rotation forestry or miscanthus (scenario BioE2). In the modelled area, positive effects of perennial energy crop production compensated negative effects of grassland conversion to crop land and annual energy crop production (see scenario BioE2 in Table 6).

4.3. Species habitat effects

From the results we have to state a clear decrease in habitat potentials for the selected target species and the contribution to biodiversity they represent when energy crop share increases (Figures 6 and 7). Main factors are the drastic increase in maize cultivation, the intensification of grassland use and grassland conversion to crop land. These findings have been expected, but we have shown that nature conservation regulations can prevent such effects without severe economic effects (Figure 9). Everaars et al. ^[6] report from modelling experiments and state considering an intensive bioenergy scenario, "that the decrease in breeding pair density … could be fully mitigated for all the considered bird species through 10% set-aside". But they also state that, if there is an increase of dominance or a spatial agglomeration of a single energy crop (e.g., maize), mitigation is very difficult. The first findings confirm our result that nature conservation is not really a kick out argument for cost effective energy crop agriculture, and the second statement meets our findings for BioE2, where maize is the dominating crop and severely affects sky lark habitat quality.

Figure 9. Impact of improved conservation measures on gross margin, power generation and greenhouse gas emissions per utilized agricultural area in Baden-Württemberg.

DownLoad: Full-Size Img PowerPoint

As stated in 3.2 the bioenergy scenarios without applying "improved nature conservation", grasslands are massively converted to arable land until the Baden-Württemberg specific legal limit for grassland conversion is reached. Especially in cases of low intensity grassland, a dramatic loss of biodiversity and negative effects concerning soil ecosystem services is to be expected. These processes are well known and were approved by this study. The second question was whether trade-offs are occurring if the requirements for "improved nature conservation" are applied. The study shows that even under "improved nature conservation" restrictions, bioenergy crop production can increase considerably (refer to Figure 9 and compare in the left chart columns 4 and 6 with column 2), while the greenhouse gas balance for the agricultural sector is nearly equalized (Figure 9 right chart, columns 4 and 6). In this case, the funding of conservation measures underagri-environmental programs needs to be flexibly increased to keep up with rising income effects from energy crop cultivation. In addition, a planned regionalization of specific conservation measures based on the distribution of indicator species would highly improve their efficiency.Our study depicts rather positive effects of perennial energy crops on agricultural land. Aversion of farmers to long-term fixation of land use could be reduced by planting premiums, but for this aspect more research is needed.

5. Conclusion

Our approach indicatesuncovered effects, conflicts and synergies between agriculture and ecology. It suggests a successful method to consider trade-off scenarios which are able to support policy making. The results show that a coexistence of energy use driven intensification and nature conservation is not a utopian promise. It is just a question of control considering where (locations in respect to soil conditions) and what (share of crops and short rotation trees). And politicians must not fear to demand that this is done rigorously with due respect. Farmers income must not come under pressure and this is a good basis to convince them that it is their own benefit if ecosystem services like pollination are secured by the integrity of the ecosystem as a whole.

Acknowledgments

This article includes results which are fundamental to the doctoral theses of Dr. Heike Weippertand Dr. Angelika Konold. The research reported in this article was funded by the environmental research fund of the German State of Baden-Württemberg BWPLUS (project IDs BWB27003, BWB27006, BWK27003). We are grateful to Erwin Schmid (University of Natural Resources and Life Sciences, Vienna, Austria) for providing a data set with crop parameters formiscanthus. Many thanks to Susanne Wagner for proof-reading and fruitful discussions.

Conflict of Interest

The authors declare that there are no competing interests.

References

[1]	L. Kaufman, P.J. Rousseeuw, Finding Groups in Data: An Introduction to Cluster Analysis, Wiley-Interscience, New York, 2009.
[2]	J. C. Bezdek, Pattern Recognition with fuzzy objective function algorithms, Plenum Press, New York, 1981.
[3]	D. Jiang, C. Tang and A. Zhang, Cluster analysis for gene expression data: A survey, IEEE Trans. Knowl. Data Eng.,16 (2004), 1370-1386.
[4]	J. M. T. Wu, C. W. Lin, P. Fournier-Viger, et al., The density-based clustering method for privacy-preserving data mining, Math. Biosci. Eng., 16 (2019), 1718-1728.
[5]	M. S. Yang, C. Y. Lai and C. Y. Lin, A robust EM clustering algorithm for Gaussian mixture models, Pattern Recognit., 45 (2012), 3950-3961.
[6]	A. K. Jain, Data clustering: 50 years beyond k-means, Pattern Recognition Lett., 31 (2010), 651-666.
[7]	A. Baraldi and P. Blonda, A survey of fuzzy clustering algorithms for pattern recognition-part I and part II, IEEE Trans. Syst. Man Cybern. B, 29 (1999), 778-785.
[8]	M. S. Yang and Y. Nataliani, Robust-learning fuzzy c-means clustering algorithm with unknown number of clusters, Pattern Recogni t., 71 (2017), 45-59.
[9]	R. Krishnapuram and J. M. Keller, A possibilistic approach to clustering, IEEE Trans. Fuzzy Syst., 1 (1993), 98-110.
[10]	M. S. Yang, S. J. Chang-Chien and Y. Nataliani, A fully-unsupervised possibilistic c-means clustering method, IEEE Access, 6 (2018), 78308-78320.
[11]	A. P. Dempster, N. M. Laird and D. B. Rubin, Maximum likelihood from incomplete data via the EM algorithm (with discussion), J. R. Stat. Soc. Ser. B, 39 (1977), 1-38.
[12]	W. Pan and X. Shen, Penalized model-based clustering with application to variable selection, J. Mach. Learn. Res., 8 (2007), 1145-1164.
[13]	R. Tibshirani, Regression shrinkage and selection via the Lasso, J. R. Stat. Soc. Ser. B, 58 (1996), 267-288.
[14]	J. D. Banfield and A. E. Raftery, Model-based Gaussian and non-Gaussian Clustering, Biometrics, 49 (1993), 803-821.
[15]	A. J. Scott and M. J. Symons, Clustering methods based on likelihood ratio criteria, Biometrics, 27 (1971), 387-397.
[16]	M. J. Symons, Clustering criteria and multivariate normal mixtures, Biometrics,37 (1981), 35-43.
[17]	R. Wehrens, L. M. C. Buydens, C. Fraley, et al., Model-based clustering for image segmentation and large datasets via sampling, J. Classif., 21 (2004), 231-253.
[18]	W. C. Young, A. E. Raftery and K. Y. Yeung, Model-based clustering with data correction for removing artifacts in geneexpression data, Ann. Appl. Stat., 11 (2017), 1998-2026.
[19]	T. Akilan, Q. M. J. Wu and Y. Yang, Fusion-based foreground enhancement for background subtraction using multivariate multi-model Gaussian distribution, Inf. Sci., 430-431 (2018), 414-431.
[20]	M. S. Yang, S. J. Chang-Chien and Y. Nataliani, Unsupervised fuzzy model-based Gaussian clustering,Inf. Sci., 481 (2019), 1-23.
[21]	L. A. Zadeh, Fuzzy sets, Inf. Control, 8 (1965), 338-353.
[22]	M. S. Yang and Y. Nataliani, A feature-reduction fuzzy clustering algorithm based on feature-weighted entropy,IEEE Trans. Fuzzy Syst., 26(2018), 817-835.
[23]	K. Voevodski, M. F. Balcan, H. Röglin, et al., Active clustering of biological sequences, J. Mach. Learn. Res., 13 (2012), 203-225.
[24]	D. Gaweł and K. Fujarewicz, On the sensitivity of feature ranked lists for large-scale biological data, Math. Biosci. Eng., 10 (2013), 667-690.
[25]	J. Xiong, Essential Bioinformatics, Cambridge University Press, New York, 2006.
[26]	R. Jiang, X. Zhang, M. Q. Zhang, Basics of Bioinformatics, Springer-Verlag Berlin An, 2013.
[27]	E. H. Ruspini, A new approach to clustering,Inf. Control, 15 (1969), 22-32.
[28]	D. M. Witten and R. Tibshirani, A framework for feature selection in clustering, J. Am. Stat. Assoc.,105 (2010), 713-726.
[29]	E. A. Castro and X. Pu, A simple approach to sparse clustering, Comput. Stat. Data Anal.,105 (2017), 217-228.
[30]	X. Qiu, Y. Qiu, G. Feng, et al., A sparse fuzzy c-means algorithm base on sparse clustering framework, Neurocomputing,157 (2015), 290-295.
[31]	X. Chang, Q. Wang, Y. Liu, et al., Sparse regularization in fuzzy c-means for high-dimensional data clustering,IEEE Trans. Cybern., 47 (2017), 2616-2627.
[32]	T. Hastie, R. Tibshirani and M. Wainwright, Statistical Learning with Sparsity: The lasso and Generalization, Chapman and Hall/CRC press, New York, (2015).
[33]	C. L. Blake and C. J. Merz, UCI repository of machine learning database, a huge collection of artificial and real-world data sets, (1988).
[34]	N. K. Phan, Biological therapy: A new age of cancer treatment, Biomed. Res. Ther., 1 (2014), 32-34.
[35]	Global Health Observatory (GHO) data, World Health Organization, Geneva, 2018. Available from: https://www.who.int/gho/en/.
[36]	F. Bray, J. Ferlay, I. Soerjomataram, et al., A. Jemal, Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries, CA A Cancer J. Clin., 68 (2018), 394-424.
[37]	D. N. K. Boulos and R. R. Ghali, Awareness of breast cancer among female students at Ain Shams University, Egypt, Glob. J. Health Sci., 6 (2014), 154-161.
[38]	K. McPherson, C. M. Steel and J. M. Dixon, Breast cancer-epidemiology, risk factors, and genetics, BMJ, 321 (2000), 624-628.
[39]	R. R. Janghel, A. Shukla, R. Tiwari, et al., Intelligent decision support system for breast cancer, International Conference in Swarm Intelligence, Beijing, China, 2010, 351-358. Available from: https://link_springer.gg363.site/chapter/10.1007/978-3-642-13498-2_46#citeas.
[40]	W. N. Street, W. H. Wolberg and O. L. Mangasarian, Nuclear feature extraction for breast tumor diagnosis, Biomedical image processing and biomedical visualization, 1905 (1993), 861-870. Available from: https://doi.org/10.1117/12.148698.
[41]	A. R. Marley and H. Nan, Epidemiology of colorectal cancer, Int. J. Mol. Epidemiol. Genet., 7 (2016), 105-114.
[42]	M. Arnold, M. S. Sierra, M. Laversanne, et al., Global patterns and trends in colorectal cancer incidence and mortality, Gut, 66 (2017), 683-691.
[43]	Cancer Stat Facts: Leukemia, National Cancer Institute, Surveillance Epidemiology and End Results Program, 2006-2010. Available from: http://seer.cancer.gov/statfacts/html/leuks.html.
[44]	A. S. Davis, A. J. Viera and M. D. Mead, Leukemia: An overview for primary care, Am. Fam. Physician, 89 (2014), 731-738.
[45]	T. R. Golub, D. K. Slonim, P. Tamayo, et al., Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring, Science,286 (1999), 531-537.

This article has been cited by:

1.	Eckart Petig, Andreas Rudi, Elisabeth Angenendt, Frank Schultmann, Enno Bahrs, Linking a farm model and a location optimization model for evaluating energetic and material straw valorization pathways-A case study in Baden-Wuerttemberg, 2019, 11, 17571693, 304, 10.1111/gcbb.12580
2.	Salwa Haddad, Wolfgang Britz, Jan Börner, Economic Impacts and Land Use Change from Increasing Demand for Forest Products in the European Bioeconomy: A General Equilibrium Based Sensitivity Analysis, 2019, 10, 1999-4907, 52, 10.3390/f10010052
3.	P. Schlager, C. Ruppert-Winkel, K. Schmieder, Assessing the potential impacts of bioenergy cropping on a population of the ground-breeding bird Alauda arvensis: a case study from southern Germany, 2020, 45, 0142-6397, 1000, 10.1080/01426397.2020.1808963
4.	Eckart Petig, Hyung Sik Choi, Elisabeth Angenendt, Pascal Kremer, Harald Grethe, Enno Bahrs, Downscaling of agricultural market impacts under bioeconomy development to the regional and the farm level—An example of Baden‐Wuerttemberg, 2019, 11, 1757-1693, 1102, 10.1111/gcbb.12639
5.	Elisabeth Angenendt, Witold-Roger Poganietz, Ulrike Bos, Susanne Wagner, Jens Schippl, 2018, Chapter 9, 978-3-319-68151-1, 289, 10.1007/978-3-319-68152-8_9
6.	Jiaxin Zhou, Wei Li, Philippe Ciais, Thomas Gasser, Jingmeng Wang, Zhao Li, Lei Zhu, Mengjie Han, Jiaying He, Minxuan Sun, Li Liu, Xiaomeng Huang, Contributions of countries without a carbon neutrality target to limit global warming, 2025, 16, 2041-1723, 10.1038/s41467-024-55720-x

Reader Comments

Your name:*

Email:*
© 2020 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Mathematical Biosciences and Engineering

4.4

Metrics

Article views(5403) PDF downloads(604) Cited by(9)

Preview PDF

Download XML

Export Citation

Article outline

Show full outline

Figures and Tables

Figures(2) / Tables(7)

Mathematical Biosciences and Engineering

Fuzzy Gaussian Lasso clustering with application to cancer data

Related Papers:

Abstract

1. Introduction

1.1. Background

1.2. Objectives

2. Materials and Method

2.1. Integrated modelling approach

2.2. Study area

2.3. Agricultural sector modelling

2.4. Analysis of impacts on soil and water

2.5. Evaluation of effects on target species habitat

2.6. Implementation of nature conservation restrictions

2.7. Scenarios

3. Results

3.1. Impacts of an increased cultivation of crops for bioenergy

3.1.1. Effects on agricultural and bioenergy production (EFEM results)

3.1.2. Effects on soil and water

3.1.3. Effects on target species

3.2. Impacts of improved nature conservation standards

4. Discussion

4.1. Agro-economic results

4.2. Impacts on soil and water

4.3. Species habitat effects

5. Conclusion

Acknowledgments

Conflict of Interest

References

This article has been cited by:

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Figures and Tables

Other Articles By Authors

Catalog

Mathematical Biosciences and Engineering

Fuzzy Gaussian Lasso clustering with application to cancer data

Related Papers:

Abstract

1. Introduction

1.1. Background

1.2. Objectives

2. Materials and Method

2.1. Integrated modelling approach

2.2. Study area

2.3. Agricultural sector modelling

2.4. Analysis of impacts on soil and water

2.5. Evaluation of effects on target species habitat

2.6. Implementation of nature conservation restrictions

2.7. Scenarios

3. Results

3.1. Impacts of an increased cultivation of crops for bioenergy

3.1.1. Effects on agricultural and bioenergy production (EFEM results)

3.1.2. Effects on soil and water

3.1.3. Effects on target species

3.2. Impacts of improved nature conservation standards

4. Discussion

4.1. Agro-economic results

4.2. Impacts on soil and water

4.3. Species habitat effects

5. Conclusion

Acknowledgments

Conflict of Interest

References

This article has been cited by:

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Figures and Tables

Other Articles By Authors

Related pages

Tools

Export File

Citation

Format

Content

Catalog