Citation: Shagufta Henna, Shyam Krishnan Kalliadan, Mohamed Amjath. Optimizing B2B customer relationship management and sales forecasting with spectral graph convolutional networks: A quantitative approach[J]. Quantitative Finance and Economics, 2025, 9(2): 449-478. doi: 10.3934/QFE.2025015
Abstract
Customer relationship management (CRM) in business-to-business (B2B) environments requires robust strategies and informed decision-making to cultivate strong inter-business relationships, which are pivotal for achieving competitive advantage and maximizing profitability. Traditional CRM analytics, which leverages conventional data mining, machine learning, and deep learning techniques, often fails to address the intricate and interdependent nature of B2B systems. To overcome this limitation, we proposed a spectral graph convolutional neural network (GCN) approach that utilized graph-based modeling to capture the structural complexity of B2B CRM. Companies were represented as nodes, and their interactions as edges within a graph, enriched with Eigenvector centrality and shortest-path graph features, which were particularly suited for spectral GCN operations. Using graph Laplacian-based convolutions, the spectral GCN effectively aggregated global and local relational information, enabling accurate and scalable B2B sales predictions. Experimental evaluations demonstrated that GCN models with spectral attributes significantly outperformed state-of-the-art machine learning and deep learning models, including random forests, convolutional neural networks, feed-forward neural networks, Extreme Gradient Boosting (XGBoost), and Categorical Boosting (CATboost), in terms of accuracy, F1 Score, precision, and specificity. Among the models, the GCN with Eigenvector features achieved the best classification performance, with a high area under the Receiver Operating Characteristic curve (ROC AUC) of 0.924, further demonstrating its robustness against variations in feature correlations.
1.
Introduction
An intelligent customer relationship management (CRM) system has the potential to significantly enhance the customer experience, improve brand awareness, and drive profitability. However, developing a cost-effective solution that meets the complex demands of a business-to-business (B2B) value chain remains a formidable challenge. A data-driven approach to model CRM systems can address customer-centric goals by extracting, storing, analyzing, and predicting potential customers and business opportunities. Graph-based modeling is particularly effective for this purpose, as it can abstract and capture complex data relationships that go beyond the limited scope of traditional business analytics. Despite the growing adoption of CRM data lakes, existing analytics methods often struggle to capture intricate relationships that are crucial to gain competitive advantage in complex B2B environments.
The rise of CRM data lakes has opened up new opportunities to leverage graph-based storage and analytics to improve business performance. Recent efforts to apply machine learning to CRM analytics have led to increased efficiency in certain areas. For instance, Zhang et al. (2019) employed semi-supervised spectral clustering for customer behavior analysis, while Yang et al. (2018) developed a recommender system for B2B sales and marketing. Furthermore, Konno et al. (2017) applied graph models to enhance business operations. However, despite these advances, the application of graph-based techniques in CRM remains underexplored, particularly within B2B contexts. Recent studies underscore the transformative potential of intelligent CRM systems that integrate artificial intelligence and big data for real-time strategic decision making (Nugmanova et al., 2019; Taleb et al., 2020; Kim et al., 2019).
Relational database management systems (RDBMS) fail to model the meaningful and critical relationships required for intelligent enterprise CRM solutions. In contrast, graph databases such as Neo4j are specifically designed to capture the inherent relationships present in complex CRM scenarios. Neo4j, a leading open source graph database, offers robust, scalable storage solutions and powerful graph querying and mining capabilities (Huang and Dong, 2013). It uses a flexible schema, Atomicity, Consistency, Isolation, Durability (ACID) compliance, and the Cypher query language to facilitate advanced graph analytics. Although graph databases have been extended to analyze customer relationships through centrality measures (Kimura et al., 2011; McClanahan and Gokhale, 2016), key graph features, such as Eigenvectors and shortest-path distances, remain largely underutilized in CRM analytics.
Graph-based deep learning techniques, particularly graph convolutional networks (GCNs), have emerged as powerful tools to extract meaningful patterns and insights from graph data (Gheisari et al., 2017; Scarselli et al., 2009). Spectral GCNs, which leverage Fourier transforms and Laplacian-based graph convolutions, efficiently aggregate neighborhood information while capturing global and local structural properties (Bruna et al., 2014; Defferrard et al., 2016).
Despite recent advances in CRM, which have incorporated machine learning and graph-based models, these efforts have been limited in their ability to capture the complex relational dependencies inherent in B2B CRM systems. Traditional models often focus on individual customer behaviors or static relationships, overlooking the broader, dynamic networks of inter-business connections that are central to B2B CRM. This limitation hinders the ability to model the full spectrum of relationships that impact sales outcomes, customer loyalty, and business opportunities. Moreover, critical graph features—such as Eigenvector centrality and shortest-path distances—are underexplored, despite their potential to capture important relational structures in B2B networks.
This study addresses these gaps by proposing a graph-based deep learning framework for CRM in B2B settings. We design a graph model using the Neo4j platform to represent B2B sales data, incorporating key relational features such as Eigenvector centrality and shortest-path distances. We then apply a spectral GCN to this graph model, enabling accurate and scalable sales predictions. The proposed approach demonstrates significant improvements over traditional machine learning and deep learning models, including random forests (RFs), Extreme Gradient Boosting (XGBoost), Categorical Boosting (CATboost), convolutional neural networks (CNNs), and feed-forward neural networks (FNNs). The major contributions of the paper are summarized as follows:
● We develop a graph model for a B2B sales dataset using Neo4j, which supports advanced graph data mining and querying using Cypher, providing valuable insights into CRM interactions.
● We apply spectral GCN to leverage graph features such as Eigenvector centrality and shortest-path distances for sales predictions, showcasing the effectiveness of graph-based learning.
● Experimental results demonstrate that spectral GCN outperforms RFs, gradient-boosting classifiers, CNNs, and FNNs across multiple evaluation metrics, including accuracy, F1 score, and Area Under the Curve (AUC).
The remainder of this paper is structured as follows. Section 2 critically reviews the existing literature on business analytics, with an emphasis on machine learning and deep learning approaches relevant to CRM systems. Section 3 details the B2B dataset employed in this study, highlighting its structure and relevance to the proposed approach. Section 4 introduces the design of the graph model, derived from the Neo4j platform, utilizing advanced graph data mining techniques. In Section 5, we describe the representation features of the graph model that are subsequently utilized for GCN-based inference. Section 6 elaborates on the proposed spectral GCN-based approach for CRM analytics, including its architecture and implementation. Section 7 presents a comprehensive evaluation of the proposed GCN-based model, encompassing exploratory data analysis using Cypher queries, as well as detailed results and performance analysis of the GCN model under various graph feature configurations. Finally, Section 8 concludes the paper by summarizing the key contributions and suggesting promising avenues for future research.
2.
Related work
This section investigates various works relevant to graph databases and graph-based deep learning for business analytics.
2.1. Graph database and graph models
Traditional databases and data warehouses cannot process or store unstructured data. Not Only SQL (NoSQL) databases and Hadoop distributed storage systems are alternative solutions under such scenarios. The NoSQL database framework, however, requires efficient graph data processing, where a graph represents real-life entities and the relationships between them in a more meaningful manner. Graph representation of big data has significant applications in genetics, social networks, molecular chemistry, finance, and drug testing (Huang and Dong, 2013). A graph database based on a graph model can store, process, and perform efficient graph analysis, such as graph-based deep learning. Huang and Dong (2013) analyzed the performance of Neo4j Cypher queries in various B2B use cases. Neo4j stores the graph data as a record file and uses two caching mechanisms for faster data retrieval and visualization. The first mechanism stores the reference relationships of nodes with minimal information in the file system cache. The second mechanism stores major graph connectivity and node attribute information in the object cache. Das et al. (2020) analyzed the performance of different NoSQL databases and recommended Neo4j and the Arango database as highly scalable graph databases for enterprises. Neo4j-based graph analysis and data mining have applications in various domains, including business-to-consumer (B2C) and B2B. Hoksza and Jelínek (2015) applied Neo4j to perform data mining on various protein graphs. Despite its significant benefits, Neo4j has performance limitations, with graph model complexity growing in proportion to dataset size. Similarly, Cypher query performance degrades on larger subgraphs with a higher degree of connectivity. Neo4j graph database analysis has been widely investigated and adopted in the healthcare domain. Zhao et al. (2019) proposed a graph model in Neo4j with the help of Cypher queries for data analysis and disease prediction. Cypher queries can extract hidden patterns to reveal meaningful insights into various business trends. Zou and Liu (2020) analyzed air crash incidents from 1908 to date using Neo4j and Cypher query-enabled data mining, comparing the performance of various data import methods in Neo4j. Among the methods used for the import operation, batch-wise import and Neo4j admin import are suggested as the fastest for large datasets. Another work, by Needham and Hodler (2019), extracts and analyzes various hidden patterns and relationships in a file dataset using the Neo4j graph database platform and Cypher queries. The application of graph representation learning and analysis for CRM based on the social network visual analytic tool (VisCRM) was introduced by Ye et al. (2008). The proposed model extracts hidden features of the customers in a social network with the help of a graph model. In practical applications, the model is limited to visual graphs and exploratory analysis and is unable to perform predictions. Another application of Neo4j in CRM is goods recommendation using a retail knowledge graph and the Jess reasoning engine (Konno et al., 2017). The work constructs a graph model based on a retail ontology that is queried and analyzed using the Neo4j framework, with recommendations offered by the Jess reasoning engine. According to Wang et al. (2014), a social network-based enterprise relationship graph can deliver higher customer value and business success rates.
Aasman (2017) presented a knowledge graph using customer data with a 360-degree view of a business's client data catalog. The work considered the customer perspective through operational, product, and service knowledge graphs, together with an analysis of the enterprise data lake. Saha and Sahoo (2018) implemented a graph clustering technique using a telecommunication call records dataset. The authors compared GCN with the K-means algorithm using various features. Zhang et al. (2019) proposed a convolutional graph neural network clustering method using a sentiment lexicon extracted from a social network graph model. In the work, the authors implemented three specific models for constructing a topic-specific sentiment lexicon: a filtering text model, a sentiment relationship graph model, and a graph clustering model. The proposed graph model with clustering outperforms the traditional lexicon model with the help of sentiment analysis. However, the performance of the proposed model decreases with an exponential increase in data collected from the social network.
2.2. Graph-based deep learning
Graph-based deep learning has been widely adopted in different domains, including healthcare, business, and social networks. It is mainly used for graph representation learning and graph classification. Graph representation learning is the process of encoding the structural data of a graph network (Cai et al., 2018). The encoded information is mapped to a low-dimensional vector space, such as adjacency or Laplacian matrices. These matrices are used in machine learning and data analysis operations. An example of graph representation learning using graph neural networks (GNNs) and their variants was presented by Scarselli et al. (2009). Graph classification methods can perform node-, link-, and graph-level classification to classify latent features and knowledge based on the graph model (Zhang et al., 2022). The applications of graph classification methods include network behavior prediction, graph matching, and graph generation. Node-level classification is used for node clustering, node recommendation, link prediction, node prediction, and retrieval. Various variants of GCNs, such as spectral and spatial models, have been developed to address real-world challenges, including those in the B2B domain (Zhang et al., 2019; Bruna et al., 2014). Spectral GCNs, for instance, initially relied on Chebyshev polynomial-based approximations (Defferrard et al., 2016), which were later simplified by Kipf and Welling (2017) using first-order spectral propagation. While these methods introduced mathematical rigor to graph-based learning, their scalability to larger, complex graphs with dynamic relationships remains a challenge due to computational inefficiencies in spectral approaches. Spatial graph convolutional networks aim to overcome these limitations by directly aggregating information from a node's local neighborhood. These models, which benefit from learnable parameters that are independent of graph size, generalize better across diverse scenarios (Duvenaud et al., 2015).
However, spatial GCNs often rely on localized structural information, limiting their ability to capture global relational patterns that are critical for complex tasks like B2B CRM. More advanced models, such as the diffusion convolutional neural network proposed by Zhuang and Ma (2018), compute node receptive fields based on diffusion transition probabilities. While this approach enhances relational modeling by incorporating probabilistic transitions, its performance often degrades when applied to dynamic and large-scale graphs due to inherent inefficiencies in diffusion-based methods. Similarly, approaches such as random-walk-based convolution (Alomrani et al., 2024) exhibit constrained performance on larger graphs due to fixed walk lengths, which fail to fully exploit the underlying graph structure.
Graph sample and aggregated embeddings (GraphSAGE) is a spatial GCN designed for scalable representation learning on large and dynamic graphs by aggregating features from a node's neighborhood (Hamilton et al., 2017). It introduces flexible aggregation techniques, including mean, long short-term memory (LSTM), and pooling functions, to capture local structural information. These aggregated features are then propagated through a neural network for downstream tasks such as prediction or classification. While GraphSAGE significantly improves scalability and adaptability for dynamic graphs, it primarily relies on localized feature aggregation, which can limit its ability to capture complex global graph structures. Mixture model networks (MoNET) take this a step further by generalizing graph learning techniques through the integration of spatial and spectral graph convolution approaches (Monti et al., 2017). By employing a parametric kernel on pseudo-coordinates, MoNET efficiently models a node's neighborhood and learns shareable features across the graph. This hybrid approach enables MoNET to leverage both local and global structural properties, making it more versatile in addressing diverse graph-based problems.
Recently, self-attention and multi-head attention mechanisms have been integrated into GCNs to address their inherent limitations, giving rise to graph attention networks (GATs) (Veličković et al., 2018). These approaches employ attention mechanisms to dynamically identify critical nodes in variable-sized input graphs and learn node representations through weighted convolution operations. By extracting hidden features of each node using self-attention, GATs exhibit superior adaptability and learning capabilities, particularly for complex and unseen graphs. An advanced version of GATs incorporates a multi-head attention mechanism (Yu et al., 2018; Chaudhari et al., 2021), which employs parallelized self-attention layers to assign differentiated priorities to various sets of nodes. This enables the model to simultaneously learn from multiple perspectives within the graph, improving representational diversity and learning stability. While these attention-based methods have demonstrated impressive performance gains, particularly in capturing local dependencies and identifying key nodes, they exhibit notable limitations. One critical drawback is their constrained scalability: the computational cost of attention mechanisms grows with the number of edges (quadratically in the number of nodes for dense graphs), making them less suitable for large-scale or high-density datasets.
Martínez et al. (2019) employed GCNs for customer prediction within a customer-supplier graph network. The study developed a risk assessment model leveraging topological graph metrics, such as clustering coefficient, node degree, and PageRank, integrated with GCNs. The customer-supplier graph captured relationships based on contact sharing, financial flows, and other domain-specific features, with a dataset comprising 168,305 company nodes and 310,084 edges. While this model outperformed a baseline GCN model and emphasized the importance of relationship types in the graph structure, its approach was limited by its reliance on static graph metrics, which may not adequately capture the nuanced, evolving relationships critical in large-scale, dynamic B2B CRM scenarios. Similarly, Kim et al. (2019) proposed a GCN-based model for supply-demand prediction in a public bike-sharing environment, utilizing temporal and spatial node features to predict hourly demand. The model demonstrated responsiveness to sudden changes in global features, such as weather conditions, showcasing the utility of GCNs in dynamic business contexts. However, both studies focused primarily on spatial features and static graph metrics, neglecting the incorporation of spectral features—such as Eigenvector centrality or spectral clustering—which are crucial for capturing global graph properties and hierarchical relationships in complex networks.
An investigation of current approaches reveals that graph-based deep learning has demonstrated significant promise across various domains, including business analytics. Spectral GCNs, which utilize global graph properties derived from Laplacian matrices, have been effective in capturing overarching structural patterns. However, their scalability remains a challenge for large, dynamic graphs due to high computational costs (Kipf and Welling, 2017; Defferrard et al., 2016). On the other hand, spatial GCNs, such as GraphSAGE, focus on aggregating local neighborhood features, offering improved scalability but often at the cost of missing global relational insights crucial for complex B2B CRM tasks (Hamilton et al., 2017).
Hybrid approaches, such as MoNET, aim to combine spectral and spatial methods to provide more comprehensive models, but still face limitations in handling large-scale datasets due to computational inefficiencies (Monti et al., 2017). Attention-based models like GATs introduce self- and multi-head attention mechanisms to enhance adaptability and learning from diverse node features (Veličković et al., 2018; Chaudhari et al., 2021). While these models show promise in identifying critical nodes and relationships, their scalability remains constrained by the steep computational cost of attention mechanisms. In CRM applications, existing models such as those for customer prediction and supply-demand forecasting have predominantly relied on static graph metrics and localized node features (Martínez et al., 2019; Kim et al., 2019). These methods fail to capture spectral features, like Eigenvector centrality and spectral clustering, which are essential for modeling global, hierarchical relationships in complex B2B networks.
Current B2B sales prediction models are limited by their inability to fully integrate spectral features and global relationships into graph-based learning frameworks. This oversight hinders their performance in capturing the intricate, dynamic relationships in B2B CRM scenarios. Therefore, there is a need for scalable graph-based models that combine both spectral and spatial features to enhance predictive accuracy and address the complexity of real-world B2B environments.
3.
Description of dataset
In our work, we have used the publicly available B2B dataset, a real-world dataset from Salvirt Limited (B2B Dataset, 2020). The selected dataset does not contain any sensitive information regarding clients, business products, or strategies. It consists of anonymized sales data from a real-world organization trading in software solutions and services at the global level. The B2B marketing and sales process follows an auxiliary approach and a structural procedure for establishing connections among clients, which plays a significant role in CRM. The dataset includes 448 training instances with 23 features/attributes, with a sales status column as the class label. Initially, the raw dataset does not include a unique identifier for each sale. Therefore, we added a column, "sales_enquiry_id", during the data preprocessing step, representing the unique sales ID for each sale transaction. All features in the dataset are of the object data type, representing categorical variables. After preprocessing, the dataset consists of 449 training samples with 24 attributes, including one labeled sales status column and the newly created "sales_enquiry_id" column with 448 unique values. A detailed description of each dataset feature is presented in Table 1.
4.
Graph model design
To realize the full benefits of graph-based deep learning, in this section, we present the design of two graph models for CRM, called the exploratory data analysis graph model (EDA-Graph model) and the GCN-Graph model, respectively. The graph models capture all the meaningful information from the CRM dataset (B2B Dataset, 2020). The EDA-Graph model is useful for query-based data mining and exploratory analysis to identify patterns, coupled with interactive query-answering abilities. The second graph model represents the interconnectivity of "sale_enquiry_id" nodes. Both graph models are the outcome of testing several candidate designs, evaluated in terms of efficient data mining and querying capabilities. In both graph models, nodes and relationships model the primary entities of the selected use case. The EDA-Graph model is defined using 7 labels and 7 different relationships between node entities. The EDA-Graph model is presented in Figure 1, representing various entities/objects in the dataset. In the graph, the start node and target node depict the direction of the relationship. Neo4j has equal traversal performance in both directions and can query an association without specifying any direction (Robinson et al., 2020).
To build the graph model, we consider 7 attributes from the dataset to represent real-life entities, i.e., nodes with labels. The remaining attributes are assigned as features of each node or as relationship properties. For the sales label, the "sales_enquiry_id" column is assigned as the first node attribute, which creates 448 nodes of unique sale inquiries. Similarly, for the label "product", 14 unique product labels are created by assigning the product column of the dataset as the first node attribute. Other features that define sales constraints, i.e., "Forml_tend", "RFI", "RPF", and "Cross_sale", are defined as node attributes of that label. Each sales node contains its corresponding values described by the assigned node attributes. Similarly, the client label is described by assigning the "client" column as a node value. Other dataset features, such as "Comp_size", "Budget_alloc", and "Growth", that describe individual properties of a client are set as node attributes. The "Competitors" and "Strat_deal" columns of the dataset are assigned as attributes of the relationship "selling_to", which is defined between the sale and client nodes. At the implementation level, constraints, nodes, relationships, and properties are defined using Neo4j Cypher queries. The dataset is imported and represented as the graph model using the Neo4j user interface. Figure 1 illustrates the EDA-Graph model. Figure 2 presents the graph model of sale id-1001 extracted using the Cypher query language. Each label and its related nodes are represented using different colors. The number of nodes under a specific label is also evident in Figure 2.
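As an illustrative sketch (not the paper's actual Cypher implementation), the mapping of one dataset row to labeled nodes and a "selling_to" relationship can be mimicked in Python with NetworkX. The attribute values and the "about" edge label below are hypothetical stand-ins:

```python
import networkx as nx

# Hypothetical sketch: one sale row mapped to labeled nodes and a
# "selling_to" relationship, mirroring the EDA-Graph model idea.
row = {"sales_enquiry_id": "id-1001", "product": "A", "client": "client_7",
       "Comp_size": "Big", "Competitors": "No", "Strat_deal": "Average"}

G = nx.MultiDiGraph()
# One node per entity, each carrying a label and per-label attributes.
G.add_node(row["sales_enquiry_id"], label="Sale")
G.add_node(row["product"], label="Product")
G.add_node(row["client"], label="Client", Comp_size=row["Comp_size"])
# Relationship attributes come from the Competitors/Strat_deal columns.
G.add_edge(row["sales_enquiry_id"], row["client"], key="selling_to",
           Competitors=row["Competitors"], Strat_deal=row["Strat_deal"])
G.add_edge(row["sales_enquiry_id"], row["product"], key="about")

print(G.nodes(data=True))
```

In the actual system this mapping is expressed as Cypher CREATE/MERGE statements executed by Neo4j; the NetworkX version only illustrates the node/relationship layout.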
Figure 2.
EDA-Graph model example: sale id: id-1001.
The value corresponding to each label for a node is shown inside the node circle. As shown in Figure 2, sale id-1001 denotes a sale attempt of product A to an existing client based on business requirements. Node "seller 6" represents the seller in charge of the sale, and the source of the sale inquiry is the node "referral". From the figure, it is observed that sale id-1001 has a successful sales outcome and satisfies the project requirements. Figure 3 presents the graph model of sale id-1001 along with all other sales nodes whose inquiry source is also "Referral". The EDA-Graph model supports such direct visualizations extracted using Cypher queries.
Figure 3.
EDA-Graph model: sale ID 1001 and all sale IDs with enquiry source = Referral.
The second graph model, i.e., the GCN-Graph model, is extracted from the first graph model by defining the interconnectivity of "sale_enquiry_id" nodes. Figure 4 depicts the single-label graph schema visualization of the GCN-Graph model. The arrow indicates the connections between nodes of the sales label. A densely connected graph network of sales nodes is created using the "Up_sale", "Client", "Competitors", "Product", and "Seller" attributes of the dataset. These features are selected based on their importance and influence on the sales status prediction, and we use them to define the relational connectivity between the nodes. The newly created GCN-Graph model consists of 367 nodes and 423 relationships. Figure 5 depicts a 50-node instance of the GCN-Graph model. We use this model to extract the node list and the attribute list to implement the GCN model.
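A minimal sketch of how such a sale-to-sale edge list can be loaded and turned into an adjacency matrix, using the NetworkX library that the paper employs for feature extraction. The node IDs and lines below are illustrative stand-ins for the contents of "CRM.edgelist":

```python
import networkx as nx

# Illustrative stand-ins for "CRM.edgelist": each line links two
# sale_enquiry_id nodes that share a client, seller, product, etc.
edge_lines = ["id-1001 id-1027", "id-1001 id-1130", "id-1027 id-1204"]

G = nx.parse_edgelist(edge_lines, nodetype=str)
# Adjacency matrix with a fixed node ordering, as used by the GCN.
A = nx.to_numpy_array(G, nodelist=sorted(G.nodes()))
print(G.number_of_nodes(), G.number_of_edges())  # 4 3
```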
5.
Graph model representation features
To prepare features appropriate for the GCN model, in this section, we discuss the feature engineering tasks. First, we transform the status column to an integer: the class labels "Won" and "Loss" are converted to the binary values 1 and 0. We also drop the "sale_enquiry_id" column, which does not influence node classification or label prediction. Further, we apply one-hot encoding to the categorical features for training the GCN. The dataset is partitioned into training and test sets with an 80:20 ratio. For an unbiased evaluation, the GCN model is tested on unseen data to establish its generalization. The evaluation results demonstrate that the GCN model performs sales classification with generalization.
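The preprocessing steps above can be sketched in plain Python as follows; the rows and column values are illustrative, not taken from the actual dataset:

```python
import random

# Illustrative rows standing in for the B2B dataset.
rows = [{"status": "Won", "Comp_size": "Big"},
        {"status": "Loss", "Comp_size": "Small"},
        {"status": "Won", "Comp_size": "Mid"},
        {"status": "Loss", "Comp_size": "Big"},
        {"status": "Won", "Comp_size": "Small"}]

# Class labels: "Won" -> 1, "Loss" -> 0.
y = [1 if r["status"] == "Won" else 0 for r in rows]

# One-hot encode a categorical column.
categories = sorted({r["Comp_size"] for r in rows})
X = [[1 if r["Comp_size"] == c else 0 for c in categories] for r in rows]

# 80:20 train/test split over shuffled indices.
random.seed(0)
idx = list(range(len(rows)))
random.shuffle(idx)
split = int(0.8 * len(idx))
train_idx, test_idx = idx[:split], idx[split:]
print(len(train_idx), len(test_idx))  # 4 1
```

In practice the paper would perform these steps over all 23 categorical attributes; the sketch shows the mechanics for a single column.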
The GCN model is trained using the GCN-Graph model discussed earlier. The GCN-Graph model consists of an edge list, "CRM.edgelist", from which the adjacency matrix is derived, and an attribute file, "CRM.attributes", which provides the feature and label matrices. "CRM.edgelist" defines the node-to-node connectivity/relationships in the GCN-Graph model. "CRM.attributes" contains the list of sales nodes, a node label for performance evaluation, and a pair of labeled nodes, i.e., one node with the label "Won" and another with the label "Loss", for GCN model training. The feature matrix of each node is extracted dynamically from the GCN-Graph model.
We have selected two separate nodes with sale status "Won" and "Loss" and flagged them separately in the dataset for training purposes. Because the GCN-Graph model already embeds the important dataset features, the original node feature matrix is excluded to avoid bias in the prediction. Instead of using a feature matrix corresponding to the dataset sales instances, a set of new graph-specific characteristics is extracted from the network and used as the feature matrix of the GCN-Graph model. The graph-specific features are extracted using both the Neo4j Graph Data Science extension and the Python NetworkX library. The feature embeddings used for the GCN-Graph model are as follows:
● Shortest-path: the shortest-path feature of a node is the length of the shortest path from that node to a specified target node. The shortest path of each node in the network is computed by assigning the labeled training nodes as targets. The extracted feature matrix is an $N \times 2$ matrix containing the shortest path of each node to the two separate target nodes, i.e., the nodes labeled "Lost" and "Won".
● Eigenvector centrality (Newman, 2008) of a node measures the influence of that node in the graph. A node connected to high-scoring nodes receives a higher eigenvector score than one connected to low-scoring nodes in the CRM graph.
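Both graph features can be computed without Neo4j; the following is a minimal pure-Python sketch (standing in for the NetworkX calls used in the paper, on an invented four-node path graph with the two labeled targets at its ends):

```python
from collections import deque

def bfs_distances(adj, source):
    """Unweighted shortest-path lengths from source to every node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist

def eigenvector_centrality(adj, iters=100):
    """Power iteration on the adjacency matrix; neighbors with high
    scores raise a node's own score."""
    score = {v: 1.0 for v in adj}
    for _ in range(iters):
        nxt = {v: sum(score[u] for u in adj[v]) for v in adj}
        norm = max(nxt.values()) or 1.0
        score = {v: s / norm for v, s in nxt.items()}
    return score

# Toy path graph a-b-c-d; "a" and "d" stand in for the labeled targets.
adj = {"a": ["b"], "b": ["a", "c"], "c": ["b", "d"], "d": ["c"]}
to_won = bfs_distances(adj, "a")    # distances to the "Won" target
to_lost = bfs_distances(adj, "d")   # distances to the "Lost" target
features = {v: (to_won[v], to_lost[v]) for v in adj}  # the N x 2 matrix
centrality = eigenvector_centrality(adj)
```

The interior nodes "b" and "c" end up with the highest centrality, matching the intuition that well-connected sales nodes exert more influence.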
6. GCN-based analytics
This section applies the GCN to classify sales based on the GCN-Graph model discussed in Section 4. The GCN is an advanced message-passing neural network popular for graph-based classification (Bruna et al., 2014). The GCN model performs convolutions on the graph model and aggregates node information from the neighborhood. The expressive power of the GCN in graph representation makes it a powerful tool for graph classification tasks, such as node and edge classification. The goal of the GCN is to learn a function of signals or features on a graph G = (V, E) that takes a feature matrix and an adjacency (node connection) matrix as input. GCN-based classification using the GCN-Graph model extracted in Section 4 is given in Algorithms 1 and 2.
3: Setting labeled nodes for training, i.e., 'Won' or 'Lost'
4: Setting all unlabeled nodes for testing
5: Partition data into test and train sets
6: Feed the GCN model training data
7: Perform predictions
Given a graph network $G(V, E)$, where $V$ represents nodes connected with edges $E$, Algorithm 1 extracts the feature matrix $X$ corresponding to the graph $G$ with dimension $N \times F$, where $N$ denotes the number of nodes and $F$ the number of features. Line 1 in Algorithm 1 generates an adjacency matrix $A$ of size $N \times N$. The adjacency matrix contains information on the weighted edges: $A_{ij}$ is set to 1 if an edge exists between nodes $v_i$ and $v_j$, and $A_{ij} = 0$ otherwise. Lines 4 to 5 in Algorithm 1 generate an $N \times C$ binary label matrix, where $C$ represents the number of classes. The generated matrix contains a single instance of each label. The label matrix corresponding to the test set is used later for performance evaluation.
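As a sketch of this matrix construction, the following hypothetical helper builds the adjacency matrix from an edge list and adds self-loops so each node retains its own features (node names are invented):

```python
def adjacency_with_self_loops(nodes, edges):
    """Build A from an edge list, then add self-loops (A_hat = A + I)."""
    idx = {n: i for i, n in enumerate(nodes)}
    n = len(nodes)
    A = [[0] * n for _ in range(n)]
    for u, v in edges:
        A[idx[u]][idx[v]] = 1
        A[idx[v]][idx[u]] = 1   # undirected sales graph
    for i in range(n):
        A[i][i] = 1             # self-loop keeps each node's own features
    return A

# Three toy sales nodes with a single edge between the first two.
A_hat = adjacency_with_self_loops(["s1", "s2", "s3"], [("s1", "s2")])
```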
The adjacency matrix $A$ and feature matrix $X$ are given as input to the forward propagation. The adjacency matrix is passed to the forward propagation in Equation 1 after adding a self-loop so that each node's own features are included along with the neighboring node features. The forward propagation mechanism then generates node features as a summation of the features of the neighboring nodes along with the node's own features. The self-loop is added to the adjacency matrix using the identity matrix $I$, as follows in Equation 1:

$$\hat{A} = A + I \tag{1}$$
Lines 5 to 7 in Algorithm 1 normalize the features using the inverse degree matrix $\hat{D}^{-1}$ (Chung et al., 2003), where $\hat{D}$ is the degree matrix of $\hat{A}$, expressed as given in Equation 2. This normalization ensures that each node's feature contribution is scaled relative to its connectivity, preventing high-degree nodes from dominating the aggregation process during graph convolution.

$$\hat{D}_{ii} = \sum_{j} \hat{A}_{ij} \tag{2}$$
Each hidden layer in the GCN is defined by the propagation rule given in Equation 3, where $H^{(l)}$ represents the $l$-th hidden layer, $f$ is the propagation function, and $H^{(0)} = X$. Each propagated row of the feature matrix is computed according to the spectral rule based on Equation 3:

$$H^{(l+1)} = f\left(H^{(l)}, A\right) \tag{3}$$
The propagation function is given in Equation 4:

$$f\left(H^{(l)}, A\right) = \sigma\left(\hat{A} H^{(l)} W^{(l)}\right) \tag{4}$$
Here, $W^{(l)}$ is the weight matrix for the $l$-th layer and $\sigma$ represents the ReLU activation function. Based on Kipf and Welling's (2017) spectral propagation rule, we can rewrite this equation as Equation 5, applied in Lines 6 to 7. Each hidden layer in the GCN performs a localized spectral convolution, where node features are aggregated from their neighbors using the normalized adjacency matrix. Specifically, at each layer $l$, the model updates node embeddings by applying the propagation rule defined in Equation 5, which combines feature transformation through the weight matrix $W^{(l)}$ with the nonlinear ReLU activation. The hidden layers progressively learn richer feature representations by capturing multi-hop dependencies in the graph structure, enabling the model to infer complex relational patterns critical for node classification and prediction tasks.

$$H^{(l+1)} = \sigma\left(\hat{D}^{-1/2} \hat{A} \hat{D}^{-1/2} H^{(l)} W^{(l)}\right) \tag{5}$$
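Equations 1 to 5 condense into a single layer computation. The following pure-Python sketch (no deep learning framework; toy sizes and hypothetical weights, not the paper's trained model) applies the symmetric normalization and ReLU of Equation 5:

```python
def matmul(A, B):
    """Naive dense matrix multiply over lists of lists."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)]
            for row in A]

def gcn_layer(A_hat, H, W):
    """One spectral layer: relu(D^-1/2 A_hat D^-1/2 H W)."""
    deg = [sum(row) for row in A_hat]           # degrees of A_hat
    inv_sqrt = [d ** -0.5 for d in deg]
    # symmetric normalization of A_hat
    A_norm = [[inv_sqrt[i] * A_hat[i][j] * inv_sqrt[j]
               for j in range(len(A_hat))] for i in range(len(A_hat))]
    Z = matmul(matmul(A_norm, H), W)
    return [[max(0.0, z) for z in row] for row in Z]  # ReLU

# Two-node graph with self-loops already added; 2 features -> 2 hidden units.
A_hat = [[1, 1], [1, 1]]
H0 = [[1.0, 0.0], [0.0, 1.0]]                   # H^(0) = X
W0 = [[1.0, -1.0], [1.0, -1.0]]                 # invented weights
H1 = gcn_layer(A_hat, H0, W0)
```

Stacking such layers is what lets the model aggregate multi-hop neighborhood information.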
Algorithm 2 outlines the complete pipeline for training and testing the GCN model using the CRM dataset. The process begins by loading the network structure from the edge list and node-level features from the attribute file (lines 1–2). These are used to construct the graph and feature matrix required by the GCN. In line 3, the dataset is split into training data, consisting of labeled nodes that represent known sales outcomes (e.g., "Won" or "Lost"). Line 4 identifies test data: unlabeled nodes that need prediction. Both training and test features are then flattened (line 5) to match the input shape expected by the model. Line 6 trains the GCN model using the labeled node features and their corresponding sales outcome labels. The model learns by optimizing a cross-entropy loss function through backpropagation, updating the weights to minimize classification error. The learned features $H^{(l)}$ are passed through layer-wise spectral propagation, where each GCN layer aggregates neighborhood information using the ReLU activation function.
Finally, in line 7, the trained GCN model is used to predict the labels of unseen nodes in the test set. The predictions are made using a logistic regression classifier applied to the final node embeddings, as shown in line 10 of Algorithm 1. This enables binary classification of sales outcomes (for example, predicting whether a sales attempt is likely to be successful).
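The final prediction step, a logistic classifier over the learned node embeddings, might be sketched as follows; the 1-D embeddings, labels, and training loop are invented toy values standing in for the GCN's output, not the paper's data:

```python
import math

def train_logistic(X, y, lr=0.5, epochs=200):
    """Plain logistic regression fitted by per-sample gradient descent."""
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            p = 1 / (1 + math.exp(-(sum(wj * xj for wj, xj in zip(w, xi)) + b)))
            g = p - yi                              # gradient of log-loss
            w = [wj - lr * g * xj for wj, xj in zip(w, xi)]
            b -= lr * g
    return w, b

def predict(w, b, x):
    """Threshold the sigmoid output at 0.5 for a binary decision."""
    p = 1 / (1 + math.exp(-(sum(wj * xj for wj, xj in zip(w, x)) + b)))
    return 1 if p >= 0.5 else 0

# Toy 1-D embeddings: successful ("Won") nodes cluster high, "Lost" low.
emb = [[0.9], [0.8], [0.1], [0.2]]
labels = [1, 1, 0, 0]
w, b = train_logistic(emb, labels)
preds = [predict(w, b, x) for x in emb]
```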
7. Performance evaluations
This section evaluates the performance of the GCN and compares it with CNN, FNN (LeCun et al., 2015), an ensemble model, i.e., RF (Zhu et al., 2017), and boosting classifiers, i.e., XGBoost and CATBoost. For the performance evaluations, we performed hyperparameter tuning for each model using grid search and cross-validation to identify the best-performing configurations. For the GCN model, hyperparameters such as the number of layers (2 to 4), number of hidden units per layer (32, 64, or 128), learning rate (0.001, 0.01, 0.1), dropout rate (0.2, 0.5), and activation function (ReLU, Leaky ReLU) were tuned. Similarly, for the CNN model, we optimized the number of convolutional layers (2 to 4), number of filters per layer (32, 64, 128), kernel size (3x3, 5x5), learning rate (0.001, 0.01), and batch size (32, 64). For the FNN model, we tested different architectures with layer depths (2 to 4 hidden layers), units per layer (32, 64, 128), learning rates (0.001, 0.01), and activation functions (ReLU, Sigmoid). The RF model's hyperparameters, such as the number of trees (50, 100, 200), maximum depth (10, 20, 30), and minimum samples split (2, 5, 10), were tuned. For XGBoost and CATBoost, learning rates (0.001, 0.01, 0.1), numbers of estimators (50, 100, 200), and tree depths (3, 6, 10) were optimized. For the performance evaluations, we train the FNN after applying one-hot encoding on the training dataset and the CNN with a 2D convolution operation using 95 training features. The deep learning and machine learning classifiers are implemented and tested on an NVIDIA GeForce GTX-1050 Ti graphics processing unit (GPU). The GPU consists of 768 NVIDIA CUDA cores, 4 GB shared memory (VRAM), and a 1392 MHz clock speed. The implementation uses Python 3.6 and Cypher (Francis et al., 2018) for data analysis, model development, and data mining.
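The grid search described above enumerates the cross-product of the hyperparameter values. A small sketch of how such a search space might be expanded (the dictionary keys are assumptions; the value ranges mirror those listed for the GCN in the text):

```python
from itertools import product

# Hypothetical GCN search space mirroring the ranges in the text.
grid = {
    "layers": [2, 3, 4],
    "hidden_units": [32, 64, 128],
    "learning_rate": [0.001, 0.01, 0.1],
    "dropout": [0.2, 0.5],
    "activation": ["relu", "leaky_relu"],
}

# Expand the cross-product into a list of candidate configurations.
configs = [dict(zip(grid, values)) for values in product(*grid.values())]
# 3 * 3 * 3 * 2 * 2 = 108 candidate configurations for the grid search
```

In practice, each configuration would be scored with cross-validation and the best one retained.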
Cypher by Neo4j is a declarative graph query language developed for efficient and expressive graph data processing, i.e., graph model creation and graph feature extraction. The exploratory analysis uses Python and Cypher. The GCN model is implemented and tested using Keras and Apache MXNet to support distributed learning. Other libraries include NetworkX, scikit-learn, TensorFlow, py2neo, and APOC.
7.1. EDA graph model-based exploratory analysis
This section performs exploratory analysis on the graph models presented in Section 4, using Cypher-based data mining on the EDA-Graph model that abstracts the B2B sales data as given in Figure 1. The GCN-Graph model represents the interconnectivity between the sales nodes identified using the "sales_enquiry_id". Connectivity between the sales nodes is defined using five features, i.e., Up_sale, Client, Competitors, Product, and Seller.
EDA-Graph model-based inventory statistics are given in Table 2. The model consists of 1386 nodes with 7 labels. The total number of relationships/edges among nodes is 3136, with 7 relationship types. Among the 7 assigned labels, 5 are unique labels created using the Cypher "unique constraint" query. In the table, unique labels merge same-labeled nodes with similar values during the data import procedure. The database size is 2.03 MB, larger than the raw dataset of 67 KB. The sales label consists of 448 unique sale_enquiry_id nodes, whereas the product label defines 14 unique products sold by the company. There are 18 sellers represented using 18 distinct seller nodes in the graph model. The sales inquiries are integrated from eight different sources, represented by eight individual nodes, while the sale status label has unique values of "Won" and "Lost".
The inventory statistics of the GCN-Graph model are summarized in Table 3, mined from the metadata and schema with the help of Cypher queries. The GCN-Graph model consists of a single sales label with 367 sale ID nodes. There are 423 relationships between sale nodes, with only one relationship type. The database size is 1.53 MB, i.e., less than that of the EDA-Graph model, due to the smaller number of labels, nodes, and relationships. The average relationship count of the sales label is 2.305 per node, whereas the maximum relationship count of a node is 33 (dense) and the minimum is 1 (sparse).
This section exploits the interactive capabilities of Neo4j Cypher queries for exploratory analysis. As all the features in the CRM dataset are categorical, quantitative analysis based on statistical measures is not applicable here. Further, the analysis in this section focuses only on critical features, with particular attention to the sales status count. The dataset consists of 448 samples/training examples of sale_enquiry_id with a nearly equal number of "Loss" and "Won" sale outcomes. Both the EDA-Graph model and the GCN-Graph model share this characteristic: the count of the sale status class is evenly distributed. The sales status count distribution of sale IDs is listed in Table 4. Some sales nodes are removed during GCN-Graph model construction to avoid the formation of a separate graph network with no relationship to the main graph network. In the generated graph model, the proportions of sales instances with positive and negative sales outcomes are approximately the same, as is evident from Figure 6.
Table 4.
Frequency of sales ID based on sales outcome.
The EDA-Graph model consists of 227 successful sales attempts and 221 sale instances with negative sale outcomes. On the other hand, the GCN-Graph model has a total of 367 sale nodes, with 177 sale instances/training samples with a positive sale status and 190 instances with a negative outcome.
Figure 7 (a) shows the count of the seller and product features with the sales status. It is clear from the figure that Product B and Product D are the two most successful products, with higher sales rates. Although Product D receives the highest number of requests, Product B still achieves a higher sales success count; in the figure, the positive sales status of Product D is less than its negative sales status. Based on the selected dataset and use case, the successful sales count of Product B is just below 70, and the number of failed sales attempts is below 60. Products L, J, G, and K have very few inquiries and no successful sales results. From the seller vs. sales status plot in Figure 7 (b), we can see that Seller 1 is the most successful salesperson in charge. The number of positive sales performed by Seller 1 is just below 120, with fewer than 60 failed sales outcomes. Seller 2 and Seller 9 are the two other sellers with comparatively better positive sales outcomes than the remaining 15 sellers. The product and seller features are use-case specific and do not generalize to other B2B sales attempts. Although not considered in this work, it is also important to consider external factors like innovation, geographic features, and market needs when making seller and product sales assumptions.
Figures 8 (a) and 8 (b) show the features that generalize well in the B2B sales process. These features are important for CRM, with a significant effect on sales outcomes, and are transferable with an ability to generalize in similar business use cases. It can be seen in Figure 8 (a) that the success rate of a company with no competitors is higher than that of others. A similar trend is observed in Figure 8 (b) for the clients: for current or existing clients, the sales success rate is twice the count of negative sales outcomes. In this scenario, the sales success count is positively impacted by the potential number of clients; for potential clients, the number of positive sales results is greater than the number of failed sales attempts. It is evident from Figure 9 (a) and Figure 10 that large companies produce increased sales in contrast to small and medium-sized companies. From the analysis of the "Up_sale" feature in Figure 9 (b), it is clear that clients in need of upscale products increase the sales success count. As summarized in Figure 11, sales attempts with no competitors, large company size, existing clients, good budget allocations, strong partnerships, and clients with upscale requirements positively impact the sales success rate.
Figure 8.
Competitors and clients with sales status.
In this section, we compare the performance of the GCN models with RF, CNN, and FNN using several evaluation metrics, including accuracy, precision, sensitivity, specificity, F1 score, and the area under the ROC curve (AUC). Precision measures the proportion of correctly predicted positive sales outcomes among all instances predicted as positive. Sensitivity, also known as recall, indicates the percentage of actual positive sales outcomes that are correctly identified. Specificity represents the proportion of correctly classified negative sales outcomes relative to all actual negative instances. The F1 score, calculated as the harmonic mean of precision and recall, provides a balanced metric that captures the trade-off between these two aspects of performance.
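These metrics follow directly from the confusion-matrix counts; a small self-contained sketch with illustrative counts (not the paper's results):

```python
def metrics(tp, fp, tn, fn):
    """Standard binary-classification metrics from confusion counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    sensitivity = tp / (tp + fn) if tp + fn else 0.0   # recall
    specificity = tn / (tn + fp) if tn + fp else 0.0
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    f1 = (2 * precision * sensitivity / (precision + sensitivity)
          if precision + sensitivity else 0.0)
    return {"precision": precision, "sensitivity": sensitivity,
            "specificity": specificity, "accuracy": accuracy, "f1": f1}

# Illustrative counts only: zero false positives, a few false negatives.
m = metrics(tp=30, fp=0, tn=35, fn=5)
```

With zero false positives, precision and specificity are both 1.0 while sensitivity drops below 1, mirroring the FP/FN trade-off discussed for the GCN variants below.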
In addition to the comparisons presented above, we further evaluate the performance of GCNs using graph-level features introduced in Section 5, namely the shortest-path distances and eigenvectors. For ease of reference, we denote the GCN model utilizing shortest-path features as GCN-1, and the model leveraging eigenvectors as GCN-2. Moreover, we extend our analysis by comparing these GCN variants against traditional boosting-based classifiers, specifically XGBoost and CATboost.
Figure 12 presents the confusion matrices for all models. In the confusion matrices, the negative label "0" represents a failed sales outcome and the positive class "1" a successful sale. The dark blue color indicates true positives (TPs) and true negatives (TNs), whereas the white color shows false positives (FPs) and false negatives (FNs). It is clear from Figures 12 (a), 12 (b), and 12 (f) that GCN-1 with the shortest-path feature has zero "FPs" and a 0.14 "FN" rate compared to the boosting classifiers, i.e., XGboost and CATboost, with CATboost at 0.16 "FPs" and 0.2 "FNs", and XGboost at 0.18 "FPs" and 0.17 "FNs". Similarly, for the selected ensemble model, RF in Figure 12 (c), misclassification is considerably lower for GCN-1. In addition, GCN-1 outperforms CNN and FNN (Figure 12 (d)) with fewer misclassifications. This performance difference is primarily due to GCN's ability to incorporate neighborhood information and relational dependencies between nodes in the graph structure, capabilities that are lacking in traditional models like CNN, FNN, and tree-based methods. Figures 12 (f) and 12 (g) show that the GCN with the shortest-path graph feature performs better than the one with the Eigenvector graph feature, with only 0.14 "FNs" in contrast to 0.26 "FNs". The shortest-path feature allows the model to effectively capture proximity-based influence in the graph, enhancing prediction accuracy. This explains why the shortest path between nodes in a graph leads to better classification accuracy than Eigenvectors as representative graph features.
Figure 12.
Confusion matrices comparison of GCN-1 and GCN-2 with baseline models.
Figure 13 (a) compares the performance of GCN-1 and GCN-2 with FNN and CNN for varying numbers of epochs from 10 to 60. It is clear from the figure that GCN-1 with the shortest-path feature outperforms all other deep learning models in terms of loss and accuracy. GCN-1 has the highest accuracy among all models and converges after 20 epochs. This improved performance can be attributed to GCN-1's ability to leverage the graph structure and shortest-path features, which provide rich relational information that is particularly valuable for structured data like sales interactions. GCN-2 with the Eigenvector feature also performs better than the other deep learning models; however, it takes longer to learn the right weights and converge than the CNN, ensemble, and gradient-boosting classifiers. In comparison, CNN is optimized for grid-like data (e.g., images) and does not capture the underlying graph relationships inherent in the dataset, leading to suboptimal feature extraction. Ensemble and gradient-boosting classifiers like RF and XGboost, while powerful for tabular data, fail to model inter-instance relationships, which limits their ability to generalize complex dependencies in graph-based sales data. A similar trend for the loss is observed for GCN-2, as is evident in Figure 13 (b), where it is initially higher and takes longer to converge than the other models. Figures 14 (a), 14 (b), 15 (a), and 15 (b) present a comparative analysis of GCN-1 and GCN-2 against RF, CNN, FNN, XGBoost, and CATboost across several performance metrics, including accuracy, precision, specificity, sensitivity, and F1 score. As illustrated, GCN-1 outperforms all other models in terms of accuracy, precision, specificity, and F1 score. Specifically, Figure 15 (b) highlights that GCN-2 exhibits a lower sensitivity (0.669) compared to GCN-1, FNN, CNN, RF, and the boosting-based methods.
In contrast, GCN-1 demonstrates a better balance between metrics, with a high specificity and a sensitivity of 0.857. In general, both GCN models surpass RF, FNN, and CNN in terms of precision, sensitivity, specificity, and F1 score. In particular, GCN-1 achieves perfect accuracy (1.0), outperforming all competing models. RF and GCN-2, by comparison, reach accuracies of 0.86 and 0.85, respectively. Interestingly, while GCN-2 lags in performance, FNN achieves a higher F1 score (0.85), suggesting better generalization compared to GCN-2 when using Eigenvector-based features, as given in Figure 16. Furthermore, the results shown in Figure 15 (b) reveal that both CNN and FNN attain a sensitivity of 0.847, which is notably higher than that of GCN-2. However, CNN and FNN are inherently limited in modeling graph-structured relationships, which reduces their ability to capture context between sales instances. RF, while robust for tabular data, does not leverage any relational or topological information, leading to relatively lower performance when such structure exists in the data. Table 5 summarizes the number of correct and incorrect sales outcome predictions for all models. The true positive rate of all models, except GCN-2, exceeds 80%, indicating a generally strong ability to identify positive sales outcomes. Both GCN models achieve the highest true negative rates, demonstrating superior performance in correctly classifying unsuccessful sales attempts. Among all models, GCN-1 performs the best overall, with a perfect true negative rate of 1.00 and a true positive rate of 0.86. Although RF achieves the highest true positive rate, slightly above that of GCN-1, both GCN-1 and GCN-2 outperform the remaining models in overall sales outcome classification, highlighting their effectiveness in handling both positive and negative cases.
Figure 13.
Comparison of GCN-1 and GCN-2 with deep learning models.
Figure 17 (a) shows the ROC curves of all the models. The figure shows that the GCN-1 model achieves the highest AUC value of 0.929. It can be observed from the figure that GCN-2 underperforms RF and the artificial neural network (ANN), with an AUC value of 0.853. These results are obtained from a balanced dataset for training and testing. Figure 17 (b) shows that the GCN-1 model achieves a better trade-off between precision and recall, with an area of 0.963. On the other hand, the area of GCN-2 is comparable with the other deep learning, gradient boosting, and ensemble models. In our selected CRM use case, prediction of both true positives and true negatives is equally important to devise sales and marketing strategies. Here, the GCN-1 model demonstrates the best performance in predicting both positive and negative sales outcomes. Based on the GCN model evaluations and comparison with other models, it is clear that GCN-1 with the shortest-path feature is the best-fit model for B2B sales outcome prediction. The better performance of the GCN-1 model is attributed to the node-to-node relationships among sales nodes. We used the labeled nodes as targets to calculate the shortest-path distance of each node, which means that the shortest-path graph feature has a strong impact on node classification. The Eigenvector of a node allows high-scoring nodes to contribute more toward the score of the target node, compared to equal connections to low-scoring nodes. The node-wise relationship between sales nodes is an influencing factor in terms of its community/cluster behavior and contributes to better predictions. Experimental results reveal that GCN-1 learns by performing convolutions on a set of nodes to extract meaningful features/attributes from neighboring nodes. Here, the extracted features, like the shortest path and Eigenvectors, are graph-level features. In terms of convolutional learning, the GCN model can elicit insights from neighboring nodes.
Using GCN, it is possible to process indirect features like shortest-path, Eigenvector, and other hidden graph features. These graph features show a significant impact on the classification performance in addition to the dataset features.
Figure 17.
AUC and Recall vs. Precision of GCN-1 and GCN-2 with ensemble, gradient boosting, and deep learning models.
The results clearly indicate that the GCN-1 model, which incorporates the shortest-path feature, significantly outperforms all other models, including RF, CNN, FNN, XGBoost, and CATboost, across multiple evaluation metrics such as accuracy, precision, sensitivity, specificity, F1 score, and AUC. In particular, GCN-1 achieves perfect accuracy (1.00), with zero false positives and a remarkably low FN rate of 0.14. This performance suggests that incorporating shortest-path information effectively captures the strength of node relationships within the graph, which is particularly important for B2B sales outcome prediction. In contrast, GCN-2, which uses Eigenvector-based features, exhibits slightly lower performance, especially in terms of sensitivity and F1 score. The shorter convergence time and consistently higher performance of GCN-1 further emphasize the advantage of using shortest-path-based graph features. Moreover, both GCN models demonstrate strong capabilities in accurately predicting both TPs and TNs, an essential requirement for CRM applications in sales and marketing strategy. In general, the findings strongly support that GCN-1, leveraging shortest-path features, is the most effective model for predicting B2B sales outcomes, outperforming both traditional machine learning and deep learning approaches.
8. Conclusions and future work
This study presents a novel application of graph-based deep learning, specifically spectral GCNs for predicting and analyzing customer relationships in B2B environments. Traditional CRM approaches, which typically rely on classical machine learning and deep learning models, often fail to capture the intricate, dynamic interdependencies inherent in B2B contexts. To address this gap, we propose a graph-centric methodology that models the complex relational structures among companies using two graph frameworks: EDA-Graph and GCN-Graph. These models were constructed from real-world sales datasets using the Cypher query language within the Neo4j graph database.
Our approach involved a two-phase process: (1) EDA to identify key features affecting B2B sales outcomes, and (2) the deployment of a spectral GCN on the GCN-Graph model for sales outcome classification. We further enhanced the GCN models by incorporating graph-level features, namely, shortest-path and Eigenvector centrality, which are particularly effective in spectral GCN operations. These enhanced models, referred to as GCN-1 and GCN-2, were evaluated against several state-of-the-art baseline models, including RF, CNN, XGBoost, and CATboost.
Experimental results demonstrate that the proposed GCN models, particularly GCN-1, consistently outperformed traditional and deep learning models across multiple performance metrics, including accuracy, F1 score, precision, specificity, and AUC. GCN-1 achieved the highest accuracy score and a true negative rate of 1.00, highlighting its exceptional generalizability. Meanwhile, GCN-2, although slightly lower in overall accuracy, showed superior performance in precision and specificity, making it valuable in contexts where identifying negative sales outcomes is critical.
Comparative analysis also revealed that GCN models outperform gradient boosting classifiers in identifying true negatives and detecting subtle relational patterns that conventional models often overlook. In particular, the performance of the GCN models remained robust across different feature subsets, indicating their resilience to feature redundancy and the high-dimensional nature of CRM data. In addition, we observed that the quality of features, particularly their relational significance, plays a more critical role than their quantity in influencing model performance.
Author contributions
Shagufta Henna (first author) contributed to the research, leading the overall study conception, methodology development, model implementation, and analysis. Shyam Krishnan Kalliadan contributed to the experimentation for the paper. Mohamed Amjath primarily contributed to the literature review and the overall structure of the paper.
Use of AI tools declaration
The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.
Conflict of interest
The authors declare that there is no conflict of interest.
References
[1]
Aasman J (2017) Transmuting information to knowledge with an enterprise knowledge graph. IT Professional 19: 44–51. https://doi.org/10.1109/MITP.2017.4241469 doi: 10.1109/MITP.2017.4241469
[2]
Alomrani M, Biparva M, Zhang Y, et al. (2024) DyG2Vec: Efficient Representation Learning for Dynamic Graphs. Trans Mach Learn Res. https://doi.org/10.48550/arXiv.2210.16906
[3]
Asniar and Surendro K (2019) Predictive analytics for predicting customer behavior. 2019 International Conference of Artificial Intelligence and Information Technology (ICAIIT), Yogyakarta, Indonesia, 230–233. https://doi.org/10.1109/ICAIIT.2019.8834571
[4]
Brahim AB, Raboudi W (2020) Improving customer relationship management using machine learning techniques: A Tunisian case study. 2020 International Multi-Conference on: "Organization of Knowledge and Advanced Technologies" (OCTA), Tunis, Tunisia, 1–16. https://doi.org/10.1109/OCTA49274.2020.9151465
[5]
Bruna J, Zaremba W, Szlam A, et al. (2014) Spectral networks and locally connected networks on graphs. Int Conf Learn Representations. https://doi.org/10.48550/arXiv.1312.6203
Cai H, Zheng VW, Chang KC (2018) A comprehensive survey of graph embedding: Problems, techniques, and applications. IEEE Trans Knowl Data Eng 30: 1616–1637. https://doi.org/10.1109/TKDE.2018.2807452 doi: 10.1109/TKDE.2018.2807452
[8]
Chaudhari S, Mithal V, Polatkan G, et al. (2021) An attentive survey of attention models. ACM Trans Intell Syst Technol 12: 1–32. https://doi.org/10.1145/3465055 doi: 10.1145/3465055
[9]
Chung F, Lu L, Vu V (2003) Spectra of random graphs with given expected degrees. Proceedings of the National Academy of Sciences 100: 6313–6318. https://doi.org/10.1073/pnas.0937490100 doi: 10.1073/pnas.0937490100
[10]
Das A, Mitra A, Bhagat SN, et al. (2020) Issues and concepts of graph database and a comparative analysis on list of graph database tools. 2020 International Conference on Computer Communication and Informatics (ICCCI), Coimbatore, India, 1–6. https://doi.org/10.1109/ICCCI48352.2020.9104202
[11]
Defferrard M, Bresson X, Vandergheynst P (2016) Convolutional neural networks on graphs with fast localized spectral filtering. Adv Neural Inf Process Sys. https://arXiv.org/abs/1606.09375
[12]
Duvenaud DK, Maclaurin D, Iparraguirre J, et al. (2015) Convolutional networks on graphs for learning molecular fingerprints. Adv Neural Inf Process Syst 28. https://doi.org/10.48550/arXiv.1509.09292 doi: 10.48550/arXiv.1509.09292
[13]
Francis N, Green A, Guagliardo P, et al. (2018) Cypher: An evolving query language for property graphs. Proceedings of the 2018 Int Conf on Manage of Data, 1433–1445. https://doi.org/10.1145/3183713.3190657 doi: 10.1145/3183713.3190657
[14]
Gheisari M, Wang G, Bhuiyan MZA (2017) A survey on deep learning in big data. IEEE Int Conf Comput Sci Eng and IEEE Int Conf Embedded Ubiquitous Comput: 173–180. https://doi.org/10.1109/CSE-EUC.2017.215 doi: 10.1109/CSE-EUC.2017.215
Hamilton WL, Ying Z, Leskovec J (2017) Inductive representation learning on large graphs. Adv Neural Inf Process Syst 30. https://doi.org/10.48550/arXiv.1706.02216
Hoksza D, Jelínek J (2015) Using Neo4j for mining protein graphs: A case study. 26th Int Workshop Database Expert Syst Appl, 230–234. https://doi.org/10.1109/DEXA.2015.59
Huang H, Dong Z (2013) Research on architecture and query performance based on distributed graph database Neo4j. 3rd Int Conf Consum Electron Commun Netw, 533–536. https://doi.org/10.1109/CECNet.2013.6703387
Ibrahim A, Ermatita, Saparudin, et al. (2017) Analysis of weakness of data validation from social CRM. Int Conf Data Softw Eng (ICoDSE), 1–5. https://doi.org/10.1109/ICODSE.2017.8285849
Kim TS, Lee WK, Sohn SY (2019) Graph convolutional network approach applied to predict hourly bike-sharing demands considering spatial, temporal, and global effects. PLOS ONE 14: e0220782. https://doi.org/10.1371/journal.pone.0220782
Kimura D, Gotoh T, Ikeda K (2011) Eliciting considerable requirements with word and customer graphs. IEEE 35th Annu Comput Softw Appl Conf, 476–485. https://doi.org/10.1109/COMPSAC.2011.68
Kipf TN, Welling M (2017) Semi-supervised classification with graph convolutional networks. Int Conf Learn Representations (ICLR). https://doi.org/10.48550/arXiv.1609.02907
Kitchin R (2014) The data revolution: Big data, open data, data infrastructures & their consequences. SAGE Publications Ltd. https://doi.org/10.4135/9781473909472
Konno T, Huang R, Ban T, et al. (2017) Goods recommendation based on retail knowledge in a Neo4j graph database combined with an inference mechanism implemented in Jess. IEEE SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI, 1–8. https://doi.org/10.1109/UIC-ATC.2017.8397433
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521: 436–444. https://doi.org/10.1038/nature14539
Martínez A, Nin J, Tomás E, et al. (2019) Graph convolutional networks on customer/supplier graph data to improve default prediction. Complex Netw and Their Appl VII, 135–146. https://doi.org/10.1007/978-3-030-14459-3_11
Masci J, Boscaini D, Bronstein MM, et al. (2015) Geodesic convolutional neural networks on Riemannian manifolds. IEEE Int Conf Comput Vis Workshop, 832–840. https://doi.org/10.1109/ICCVW.2015.112
McClanahan B, Gokhale SS (2016) Centrality and cluster analysis of Yelp mutual customer business graph. IEEE 40th Annu Comput Softw Appl Conf, 592–601. https://doi.org/10.1109/COMPSAC.2016.79
Micheli A (2009) Neural network for graphs: A contextual constructive approach. IEEE Trans Neural Networks 20: 498–511. https://doi.org/10.1109/TNN.2008.2010350
Monti F, Boscaini D, Masci J, et al. (2017) Geometric deep learning on graphs and manifolds using mixture model CNNs. 2017 IEEE Conf Comput Vision Pattern Recognit, 5425–5434. https://doi.org/10.1109/CVPR.2017.576
Needham M, Hodler AE (2019) Graph Algorithms: Practical Examples in Apache Spark and Neo4j. O'Reilly Media. ISBN: 9781492047681.
Nugmanova A, Chernykh I, Bulusheva A, et al. (2019) Unsupervised training of automatic dialogue systems for customer support. Int Conf Qual Manag Transp Inf Secur Inf Technol, 436–438. https://doi.org/10.1109/ITQMIS.2019.8928445
Saha L, Sahoo L (2018) Adaptation of spectral clustering in telecommunication industry for customer relationship management. 2nd Int Conf Electron Mater Eng Nano-Technol, 1–6. https://doi.org/10.1109/IEMENTECH.2018.8465187
Scarselli F, Gori M, Tsoi AC, et al. (2009) The graph neural network model. IEEE Trans Neural Networks 20: 61–80. https://doi.org/10.1109/TNN.2008.2005605
Taleb N, Salahat M, Ali L (2020) Impacts of big-data technologies in enhancing CRM performance. 2020 Int Conf Inf Manag, 257–263. https://doi.org/10.1109/ICIM49319.2020.244708
Wang L, Liu S, Pan L, et al. (2014) Enterprise relationship network: Build foundation for social business. IEEE Int Congr Big Data, 347–354. https://doi.org/10.1109/BigData.Congress.2014.57
Yang J, Liu C, Teng M, et al. (2018) A unified view of social and temporal modeling for B2B marketing campaign recommendation. IEEE Trans Knowl Data Eng 30: 810–823. https://doi.org/10.1109/TKDE.2017.2783926
Ye Q, Wang C, Wu B, et al. (2008) VisCRM: A social network visual analytic tool to enhance customer relationship management. IEEE Int Conf Serv Oper Logistics Informatics, 825–830. https://doi.org/10.1109/SOLI.2008.4686513
Yu A, Dohan D, Le Q, et al. (2018) QANet: Combining local convolution with global self-attention for reading comprehension. Int Conf Learn Representations. https://doi.org/10.48550/arXiv.1804.09541
Zhang B, Xu D, Zhang H, et al. (2019) STCS lexicon: Spectral-clustering-based topic-specific Chinese sentiment lexicon construction for social networks. IEEE Trans Comput Soc Syst 6: 1180–1189. https://doi.org/10.1109/TCSS.2019.2941344
Zhang Z, Cui P, Zhu W (2022) Deep learning on graphs: A survey. IEEE Trans Knowl Data Eng 34: 249–270. https://doi.org/10.1109/TKDE.2020.2981333
Zhao J, Hong Z, Shi M (2019) Analysis of disease data based on Neo4j graph database. IEEE/ACIS 18th Int Conf Comput Inf Sci, 381–384. https://doi.org/10.1109/ICIS46139.2019.8940247
Zhu J, San-Segundo R, Pardo JM (2017) Feature extraction for robust physical activity recognition. Hum-centric Comput Inf Sci 7: 16. https://doi.org/10.1186/s13673-017-0097-2
Zhuang C, Ma Q (2018) Dual graph convolutional networks for graph-based semi-supervised classification. Proc 2018 World Wide Web Conf, 499–508. https://doi.org/10.1145/3178876.3186116
Zou Y, Liu Y (2020) The implementation knowledge graph of air crash data based on Neo4j. 2020 IEEE 4th Information Technology, Networking, Electronic and Automation Control Conference (ITNEC), Chongqing, China, 1699–1702. https://doi.org/10.1109/ITNEC48623.2020.9085182