Auto-3P: An autonomous VNF performance prediction & placement framework based on machine learning

doi:10.1016/j.comnet.2020.107433

Computer Networks

Volume 181, 9 November 2020, 107433

https://doi.org/10.1016/j.comnet.2020.107433 Get rights and content

Abstract

We propose Auto-3P, an Autonomous module for Virtual Network Functions Performance Prediction and Placement at network cloud and edge facilities based on Machine Learning (ML). Auto-3P augments the autonomous placement capabilities of MANagement and Orchestration frameworks (MANOs) by considering both resource availability at hosting nodes and the implied impact of a VNF node placement decisions on the whole service level end-to-end performance. Unlike that, most existing placement methods take a rather myopic approach after manual rule-based decisions, and/or based exclusively on a host-centric view that focuses merely on node-local resource availability and network metrics. We evaluate and validate Auto-3P with real-field trials in the context of a well-defined Smart City Safety use case using a real end-to-end application over a real city-based testbed. We meticulously conduct repeated tests to assess (i) the accuracy of our adopted prediction models; and (ii) their placement performance against three other existing MANO approaches, namely, a “Traditional”, a “Latency-aware” and a “Random” one, as well as against a collection of well-known Time Series Forecasting (TSF) methods. Our results show that the accuracy of our ML models outperforms the one by TSF models, with the most prominent accuracy performances being exhibited by models such as K-Nearest Neighbors Regression (K-NNR), Decision Tree (DT), and Support Vector Regression (SVR). What is more, the resulted end-to-end service level performance of our approach outperforms “Traditional”, “Latency-aware”, and Random MANO placement. Last, Auto-3P achieves load balancing at selected VNF hosts without degrading end-to-end service level delay, and without a need for a (fixed) overload threshold check, unlike what is suggested by other works in the literature for coping with heavy system-wide load conditions.

Introduction

Nowadays networks are increasingly becoming largely programmable and software-driven, following the trend for adopting the concepts of Network Function Virtualization (NFV) and Software Defined Networking (SDN). NFV and SDN demonstrate their capability to satisfy the much-needed reliability, low latency, and availability in many application fields such as industrial automation [1] and even beyond that. This signifies an important paradigm shift towards Network Softwarization (NS) and leveraging autonomous network and service management, for which both NFV and SDN are key enablers.

But in order to leverage the evident benefits of a fully automated networking paradigm, there is a series of main challenges that have been raised and need to be addressed. Important examples include autonomous self-deploying of Virtual Network Functions (VNFs), VNF self-destruction (i.e., automatic termination), and self-healing (i.e., automatic troubleshooting), just to name a few. In this context, identifying the appropriate network locations for VNFs is a key optimization problem that seeks for a solution. Existing Management and Orchestration frameworks (MANO), such as the OSM MANO [2], apply mechanisms to instantiate VNFs on hosts with sufficient resources (e.g., number of Central Processing Unit (CPU) cores, amount of memory and storage), which nonetheless disregard important performance factors like latency and host load.

This problem is well-known and gained further attention due to the 5G technology expectations stated by the International Telecommunication Union (ITU)¹ with respect to Ultra Reliable Low Latency Communications (URLLC). As a result, a significant amount of effort has been put lately on optimizing VNF placement at the network’s edge with the goal of reducing application and service delay. A key enabler concept for this is Multi-access Edge Computing (MEC), which allows to move much of cloud computing functionality to the edge of the network, hence processing traffic closer to users instead of forwarding traffic and/or traffic information to cloud. This not only mitigates cloud congestion but also causes significant latency reduction for end-to-end services.

Nonetheless, as there are no information suggesting the optimum location for delay-sensitive VNFs, corresponding placement decisions to either the cloud or MEC facilities are still dominated by manual operations. However, the outcome of manual actions is questionable because they come without any satisfactory placement optimization guarantee. Furthermore, achieving accurate VNF performance predictions at different MEC nodes poses a significant contribution to realising the Zero-touch network and Service Management (ZSM)² [3] concept.

The prediction of Predicting post-placement VNF performance constitutes a complex task that comes with challenges. For end-to-end delay-critical services, predictions depend on multiple stochastic variables that refer to devices and cloud/edge components such as load and latency. To the best of our knowledge, there is little progress in the related literature [4] on forecasting (i) VNF performance in the broader context of service instantiation as well as on (ii) autonomous placement of stand-alone VNFs., hence giving rise to the following questions:

•
“How can one predict end-to-end application-level VNF performance prior to appointing a VNF’s host location?
•
Assuming such a performance prediction, how can a VNF then be autonomously deployed?”

To address these questions, we propose “Auto-3P”, a solution based on Machine Learning (ML) models that make accurate predictions of the Total Response Time (T_total) for a VNF running an end-to-end service, thus augmenting existing MANO frameworks by autonomously placing VNFs at hosting locations. As we elaborate further in Section 2, Auto-3P (i) provides an autonomous placement mechanism to any MANO frameworks by (ii) applying ML models that suggest the best VNF location.

We assess the performance of Auto-3P through an extensive experimental campaign showing that our solution can deliver better end-to-end service performance based on autonomous VNF placement decisions. We also compare Auto-3P against existing approaches, showing that in most of the cases considered, Auto-3P places VNFs at nodes that can yield optimum performance.

The contributions of this paper are summarized as follows.

(i)
A novel approach to VNF performance prediction and to autonomous VNF placement:The novelty of our work lies in augmenting MANO VNF placement decisions with accurate performance predictions based on specially trained ML models. What is more, our approach considers the impact of VNF placement decisions at a service level, end-to-end. Moreover, we try to address an existing gap in well-established orchestrators like OSM or Openstack [5]. These orchestrators may pose a powerful placement mechanism, but they do not provide native support for ML modules. Last, only a few ML research efforts aside Auto-3P have focused on the prediction of service level end-to-end network or processing delay. Exploring this direction has enough potential for improving VNF placement, especially for delay-sensitive use cases.
(ii)
Real testbed and use case based experimentation:Unlike many previous studies that are simulation-based, the current one is experimental-based, hence demonstrating the capability of deploying a solution like Auto-3P in real-life networks under realistic conditions. To do so, we train and test our prediction models using (i) real data from a (ii) real VNF video transcoder in the context of a (iii) real Smart City Safety use case and finally, in a (iv) real deployment environment. As a result, we bridge an important gap in a predominantly simulation-based literature on VNF placement [6], [7], [8], [9], [10], [11].
(iii)
Comprehensive comparative study of VNF prediction models and corresponding placement decisions:Our work integrates VNF performance predictions and placement decisions. As such, we study and compare different prediction performances by different ML algorithms as well as by non-ML models. Moreover, we study the placement efficiency of MANO when augmented with Auto-3P against a Traditional, a Latency-aware, and a Random MANO placement method. Our evaluation results denote that Auto-3P significantly improves MANO placement decisions concerning both balancing system-wide load and decreasing end-to-end service level delay. These findings contribute towards the ultimate goal of ZSM in 5G regarding delay-sensitive services and applications.

This article is structured as follows. Section 2 provides the reader with a necessary background on MANO systems, defines the problem targeted by this article and analyses the related work on ML-based VNF management. Section 3 describes the prediction model including a detailed discussion on the autonomous VNF placement module. Section 4 describes in detail our experimental setup and scenarios. Section 5 presents our evaluation results. Finally, we draw our conclusions and discuss our future work plans in Section 6.

Section snippets

Background and related work

In this section, we provide the reader with a necessary background on contemporary MANO systems that enables us to identify the current gaps towards achieving full management automation and to motivate our targeted research effort on VNF placement. Based on this background discussion, we proceed with analysing the current state of the art in ML-based solutions related to VNF management, including VNF placement.

System model

End-to-end applications typically require a system that is composed of three nodes, namely, a Source Node (SN), a Processing Node (PN), and an End-user Node (EN). The SN is a device that produces the data. The PN is the middle point that receives data from the SN, processes data, and transmits the corresponding results to users. The EN is the end-user of this application. SN and EN are usually devices with a limited computational capability, while a PN is typically a powerful server. In

Experimental setup

We execute the algorithm to place the VNF video transcoder on the Smart City Safety (SCS) system at the University of Bristol, UK site of 5GinFIRE infrastructure. The latter is a multi-site 5G NFV ecosystem located in the UK, Spain, Portugal, Poland, and Greece⁹. The component of the SCS system can be briefly described as follows: Source Node (SN) is composed of a 360 degrees camera and a Raspberry Pi (Raspi). Processing Node (PN) is a VNF transcoder running on a compute

Evaluation

In what follows we present and analytically discuss our evaluation results from the conducted real field experiments executed on the 5GinFIRE testbed. All results refer to mean values along with 95% confidence intervals after conducting a 10-fold cross-validation by randomly splitting the datasets into training and testing subsets.

Conclusions and future work

In this paper, we have presented Auto-3P, an autonomous module that improves the performance of VNF placement for any MANO framework by offering an “autonomous” approach that considers “both resources availability” and “VNF performance” of end-to-end service. It is composed of two components: 1) a novel model that applies machine learning algorithms for the VNF performance prediction for end-to-end applications. 2) an autonomous placement module that suggests the placement location to the MANO

CRediT authorship contribution statement

Monchai Bunyakitanon: Conceptualization, Methodology, Software, Validation, Investigation, Writing - original draft. Aloizio Pereira da Silva: Conceptualization, Methodology, Writing - review & editing. Xenofon Vasilakos: Formal analysis, Data curation, Writing - review & editing. Reza Nejabati: Resources, Supervision, Project administration. Dimitra Simeonidou: Supervision, Funding acquisition.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

This work has received funding from the UK EPSRC project TOUCAN (EP/L020009/1), and from the EU H2020 projects 5G-VICTORI and MATILDA (grant agreements 857201, 761898).

Monchai Bunyakitanon is an officer of the Royal Thai Navy and a member of the Smart Internet Lab, currently pursuing his Ph.D. degree in Electrical Engineering at the University of Bristol, United Kingdom. His research focus includes Machine Learning solutions for Multi-access Edge Computing and Networking, Network Function Virtualization and Software-Defined Networks. He holds a BSEE degree since 2006 from the Escuela Naval Militar, Pontevedra, Spain, and a MSEE degree with an emphasis in

References (53)

B. Addis et al.
Virtual network functions placement and routing optimization
2015 IEEE 4th International Conference on Cloud Networking (CloudNet)
(2015)
J. Quionero-Candela et al.
Dataset shift in machine learning
(2009)
P. Mekikis et al.
NFV-Enabled experimental platform for 5G tactile internet support in industrial environments
IEEE Trans. Ind. Inf.
(2020)
OSM. https://osm.etsi.org/. Accessed on: July 9th,...
M. Jie, H. Gang, P. Xiaoyan, M. Klaus, K. Michael, K. Ashiq, K. Gerald, K. Ryosuke, G. Yoshinori, M. Serge, W. Farni,...
Y. Fu et al.
Artificial intelligence to manage network traffic of 5G wireless networks
IEEE Netw.
(2018)
Build the future of Open Infrastructure., https://www.openstack.org/. Accessed on: July 9th,...
I. Alawe et al.
Improving traffic forecasting for 5G core network scalability: a machine learning approach
IEEE Netw.
(2018)
R. Li et al.
Deep reinforcement learning for resource management in network slicing
IEEE Access
(2018)
C. Zhang et al.
Deep learning based network application classification for SDN
Transactions on Emerging Telecommunications Technologies
(2018)

M.C. Luizelli et al.

Piecing together the NFV provisioning puzzle: Efficient placement and chaining of virtual network functions

2015 IFIP/IEEE International Symposium on Integrated Network Management (IM)

(2015)

M.F. Bari et al.

On Orchestrating Virtual Network Functions in NFV

arXiv:1503.06377 [cs]

(2015)

R. Cohen et al.

Near optimal placement of virtual network functions

2015 IEEE Conference on Computer Communications (INFOCOM)

(2015)

ONAP, https://www.onap.org/. Accessed on: July 9th,...

J.W. Guck et al.

Achieving end-to-end real-time Quality of Service with Software Defined Networking

2014 IEEE 3rd International Conference on Cloud Networking (CloudNet)

(2014)

M. Haiyan et al.

Towards SDN based queuing delay estimation

China Commun.

(2016)

F.C. Chua et al.

Stringer: balancing latency and resource usage in service function chain provisioning

IEEE Internet Comput.

(2016)

Wei-Ping Yang et al.

A queueing analytical model for service mashup in mobile cloud computing

2013 IEEE Wireless Communications and Networking Conference (WCNC)

(2013)

T. Lei et al.

Deploying QoS-assured service function chains with stochastic prediction models on VNF latency

2017 IEEE Conference on Network Function Virtualization and Software Defined Networks (NFV-SDN)

(2017)

A. Gulenko et al.

Evaluating machine learning algorithms for anomaly detection in clouds

2016 IEEE International Conference on Big Data (Big Data)

(2016)

R. Shi et al.

MDP and Machine Learning-Based Cost-Optimization of Dynamic Resource Allocation for Network Function Virtualization

2015 IEEE International Conference on Services Computing

(2015)

J. Zhou et al.

A dynamic logistic regression for network link prediction

Science China Mathematics

(2017)

B. Betancourt et al.

Bayesian fused lasso regression for dynamic binary networks

Journal of Computational and Graphical Statistics

(2017)

N. Baldin et al.

Optimal link prediction with matrix logistic regression

arXiv:1803.07054 [math, stat]

(2018)

R. Hu et al.

CPU Load Prediction Using Support Vector Regression and Kalman Smoother for Cloud

2013 IEEE 33rd International Conference on Distributed Computing Systems Workshops

(2013)

R. Beverly et al.

SVM learning of IP address structure for latency prediction

(2006)

Cited by (15)

HADES: An NFV solution for energy-efficient placement and resource allocation in heterogeneous infrastructures
2024, Journal of Network and Computer Applications
Network Function Virtualization (NFV) aims to replace traditional network functions running in proprietary hardware with software instances (i.e., Virtual Network Functions, VNFs) embedded in general-purpose virtualization solutions. Aware that the transition to a fully virtualized network infrastructure will pay a high energy cost, especially in IoT systems composed of a myriad of devices, energy efficiency is one of the key innovative targets of future networks. Edge computing should be considered in IoT environments to save time and energy by processing data near the producer devices. However, applying NFV in the context of IoT/Edge/Cloud environments complicates the placement of VNFs, due to the inherent heterogeneous nature of such environments and the variety of resource demands. This paper proposes an energy-aware placement of service function chains of VNFs and a resource-allocation solution for heterogeneous edge infrastructures that considers the computation and communication delays according to the VNFs’ location in the infrastructure. The solution has been integrated with the ETSI-sponsored project Open Source Management and Orchestration (OSM) as an extension called HADES, which allows the configuration of VNFs and their subsequent resource allocation and deployment at the edge, minimizing energy consumption and ensuring a quality of service. We have applied the deployment of augmented reality services in real and simulated scenarios. The results show up to a 59% reduction in power consumption and QoS compliance in all scenarios considered compared to default OSM placement and four other allocation policies. We prove that our solution has negligible power overhead, and validate the scalability and applicability of HADES.
ION-PROFILER: INTELLIGENT ONLINE MULTI-OBJECTIVE VNF PROFILING WITH REINFORCEMENT LEARNING
2023, arXiv
Multicast-Oriented Dynamic Virtual Network Function Placement Algorithm
2023, Jisuanji Xuebao/Chinese Journal of Computers
Hades: An Nfv Solution for Energy-Efficient Placement and Resource Allocation in Heterogeneous Infrastructures
2023, SSRN
Deep Q-Network-Based Cloud-Native Network Function Placement in Edge Cloud-Enabled Non-Public Networks
2023, IEEE Transactions on Network and Service Management
A Tenant-based Two-stage Approach to Auditing the Integrity of Virtual Network Function Chains Hosted on Third-Party Clouds
2023, CODASPY 2023 - Proceedings of the 13th ACM Conference on Data and Application Security and Privacy

View all citing articles on Scopus

Aloizio P. Silva is currently a Technical Project Manager for Platforms for Advanced Wireless Research (PAWR) a NSF program in U.S. and he is also Research Associate at Northeastern University. He is member of the steering committee for IEEE Future Networks Initiative and Region 1 representative. He has Phd degree at Department of Computer and Electronic Engineer Instituto Tecnologico de Aeronautica (ITA) (2015). PMP Certified. Master of Business Administration (MBA) in Project Management at Fundacao Getelio Vargas (FGV) (2008) and Master Degree in Computer Science at Federal University of Minas Gerais (2002). His areas of interest and research include Distributed Systems, Computer Networks, Algebraic Topology, Telecommunication Networks, Network Softwarization (Software Defined Network - SDN and Network Function Virtualization - NFV), Mobile Edge Computing (MEC), Software Defined Radio (SDR), Internet of Things (IoT), Smart Cities, Delay and Disruption Tolerant Networks (DTN), Advanced Wireless Communication and Interplanetary Networks (IPN).

Xenofon Vasilakos is a Research Fellow with the University of Bristol, Bristol, the U.K., where he is a member of the Smart Internet Laboratory and the lead researcher of the Zero Downtime Edge Application Mobility (ZeroDEAM) project funded by Samsung Electronics UK. He received the M.Sc. degree in Parallel and Distributed Computer Systems from Vrije Universiteit Amsterdam, and the Ph.D. degree in informatics from the Athens University of Economics and Business with a focus on Information-Centric Networking architectures, protocols, and distributed solutions. He has participated in various EU and national funded research projects such as 5GPPP SliceNet and the FIA award-winning FP7 project PURSUIT. His current research interests include 5G/B5G technologies with a focus on Multi-access Edge Computing based on cognition approaches inspired by machine learning models towards Zero touch network & Service Management. He is also involved in 5G-related areas of Internet of Things, Software-Defined Networking, Network Function Virtualization, and network slicing. Dr. Vasilakos was a recipient of an excellence fellowship grant from the French government (LABoratoires d’EXcellence), and has received an accolade and awards for his academic performance from the Greek State Scholarship Foundation. CV: http://pages.cs.aueb.gr/∼xvas/pdfs/detailedCV.pdf

Reza Nejabati current area of research is in the field of disruptive new Internet technologies with focus on application of high-speed network technologies. He has a successful track record in working at the interface between optical networks and computer science, as well as between academia and industry. Throughout his research career, he has made important and pioneering contributions to the fields of Optical Networking, Grid Networking, Data center Networks, Software Defined Networking (SDN), Network Virtualization, and Network Function Virtualization (NFV). He has written or co-authored over 200 peer reviewed papers and several standardization documents.

Dimitra Simeonidou is a Full Professor at the University of Bristol, the Co-Director of the Bristol Digital Futures Institute and the Director of Smart Internet Lab. Her research is focusing in the fields of high performance networks, programmable networks, wireless-optical convergence, 5G/B5G and smart city infrastructures. She is increasingly working with Social Sciences on topics of digital transformation for society and businesses. Dimitra has been the Technical Architect and the CTO of the smart city project Bristol Is Open. She is currently leading the Bristol City/Region 5G urban pilots. She is the author and co-author of over 500 publications, numerous patents and several major contributions to standards. She has been co-founder of two spin-out companies, the latest being the University of Bristol VC funded spin-out Zeetta Networks, http://www.zeetta.com, delivering SDN solutions for enterprise and emergency networks. Dimitra is a Fellow of the Royal Academy of Engineering and a Royal Society Wolfson Scholar.

View full text

Auto-3P: An autonomous VNF performance prediction & placement framework based on machine learning

Abstract

Introduction

Section snippets

Background and related work

System model

Experimental setup

Evaluation

Conclusions and future work

CRediT authorship contribution statement

Declaration of Competing Interest

Acknowledgments

NFV-Enabled experimental platform for 5G tactile internet support in industrial environments

IEEE Trans. Ind. Inf.

Artificial intelligence to manage network traffic of 5G wireless networks

IEEE Netw.

Improving traffic forecasting for 5G core network scalability: a machine learning approach

IEEE Netw.

Deep reinforcement learning for resource management in network slicing

IEEE Access

Deep learning based network application classification for SDN

Transactions on Emerging Telecommunications Technologies

Piecing together the NFV provisioning puzzle: Efficient placement and chaining of virtual network functions

2015 IFIP/IEEE International Symposium on Integrated Network Management (IM)

On Orchestrating Virtual Network Functions in NFV

arXiv:1503.06377 [cs]

Near optimal placement of virtual network functions

2015 IEEE Conference on Computer Communications (INFOCOM)

Achieving end-to-end real-time Quality of Service with Software Defined Networking

2014 IEEE 3rd International Conference on Cloud Networking (CloudNet)

Towards SDN based queuing delay estimation

China Commun.

Stringer: balancing latency and resource usage in service function chain provisioning

IEEE Internet Comput.

A queueing analytical model for service mashup in mobile cloud computing

2013 IEEE Wireless Communications and Networking Conference (WCNC)

Deploying QoS-assured service function chains with stochastic prediction models on VNF latency

2017 IEEE Conference on Network Function Virtualization and Software Defined Networks (NFV-SDN)

Evaluating machine learning algorithms for anomaly detection in clouds

2016 IEEE International Conference on Big Data (Big Data)

MDP and Machine Learning-Based Cost-Optimization of Dynamic Resource Allocation for Network Function Virtualization

2015 IEEE International Conference on Services Computing

A dynamic logistic regression for network link prediction

Science China Mathematics

Bayesian fused lasso regression for dynamic binary networks

Journal of Computational and Graphical Statistics

Optimal link prediction with matrix logistic regression

arXiv:1803.07054 [math, stat]

CPU Load Prediction Using Support Vector Regression and Kalman Smoother for Cloud

2013 IEEE 33rd International Conference on Distributed Computing Systems Workshops

SVM learning of IP address structure for latency prediction