Deep Reinforcement Learning for the Management of Software-Defined Networks and Network Function Virtualization in an Edge-IoT Architecture
Abstract
:1. Introduction
2. Problem Description and Related Work
2.1. Challenges in Internet of Things and Industrial Internet of Things Scenarios
2.2. Edge Computing and Edge-IoT Platforms
2.3. Software-Defined Networking and Network Function Virtualization in Edge-IoT Scenarios
2.4. Reinforcement Learning and Deep Reinforcement Learning in SDN/NFV Scenarios
3. Management of SDN Flow Entries in the Global Edge Computing Architecture by means of Deep Reinforcement Learning
3.1. The Global Edge Computing Architecture 1.0
- IoT Layer: in GECA, this layer is made up of the set of elements that define an environment like IoT, that is, objects or connected things that constantly generate data. Among them are sensors, actuators, controllers or IoT gateways. The data transmission process in this layer is done through the most used communication standards, such as: Wi-Fi, ZigBee, LoRa or SigFox. In addition to the IoT devices, this layer includes the components that provide security to the architecture. The incorporation of a basic blockchain scheme in which the smart contracts through oracles interact with the physical components of the layer, allows the data to be transferred safely and following the predefined terms in each contract.
- Edge Layer: This layer proposes the use of low-cost and high-capacity boards such as the Raspberry Pi, whose characteristics allow it to function as an edge node to process and filter the data collected and sent by the devices deployed in the IoT layer. The components of these boards: I/O ports, 32 or 64 bit Linux operating system, RAM memory up to 4GB, USB 2.0 to 3.0 ports or SD cards, allow the installation of libraries such as TensorFlow Lite or similar used to apply Machine Learning techniques for data management [76]. Machine Learning techniques contribute to the deployment of the Data Analytics in the Edge layer, making it easier for users to obtain valuable data and responses with lower latency, reducing the costs associated with Cloud Computing such as shipping, processing, data storage as well as bandwidth consumption.
- Business Solution Layer: The services and applications associated with Business Intelligence, which are in a classic Cloud Computing architecture, are included in this layer of GECA. At this level the architecture facilitates the deployment of public or private services, also allowing the inclusion of components of: analysis (case based reasoning, data analysis and visualization algorithms), authentication (to ensure security) knowledge base (using virtual agent organizations or decision support systems) and APIs (so that services are available in any standard web browser).
3.2. SDN and NFV in the New Global Edge Computing Architecture 2.0
- IoT Layer: This layer and its components remain unchanged in the new version of the architecture. In this sense, it is not necessary to modify any of its components, since in this version only the interaction with the Edge nodes (and the new Fog nodes) from the SDN controller has been considered. The IoT Layer is the lowest layer within the Infrastructure plane (called data plane in other SDN architectures).
- Fog/Edge Layer: The previous Edge Layer in GECA 1.0 is now called Fog/Edge Layer. Moreover, the Fog/Edge Layer is the topmost layer within the Infrastructure plane. This layer is now subdivided into two sublayers: the Edge Sublayer and the Fog Sublayer.
- -
- Edge Sublayer: This sublayer inherits the existing components of the previous GECA 1.0 Edge Layer. That is, it is formed by the Edge nodes or Edge Gateways. Thus, these nodes act as a gateway between the IoT devices and the applications in the Cloud (Business Solution Layer). Like in GECA 1.0, the Edge Gateways have computing and local storage capacities to preprocess the data sent from the IoT Layer to the Business Solution layer. Besides, it is possible to apply machine learning techniques in the same Edge in order to detect anomalous patterns. Among others, it is possible to apply k-NN (k-Nearest Neighbors) algorithms to detect anomalous service requests that may mean cyberattacks [77]. Nonetheless, in the new version of the GECA 2.0 architecture, the Edge Gateways include in their local data stores routing tables with information about what to do with each type of packet (discard, forward to a certain communication interface or send to the SDN Controller), as well as counters with the number of packets received of each type. The packet types are actually filtering patterns based on the packet headers. The entries (rules) in the tables of each Edge Gateway are configured and consulted remotely from the SDN Controller in the Cloud (i.e., in the Business Solution Layer) by means of the Southbound API. In this sense, this nodes incorporate capabilities similar to those of the COTS forwarding nodes in Figure 2. Each designer can follow the OpenFlow standard or implement his/her own protocol. The architecture is flexible in this sense, in order to be open and dynamic in time.
- -
- Fog Sublayer: This new sublayer is composed of Fog Forwading Gateways. Like the Edge Gateways in the Edge Sublayer, the Fog Forwarding Gateways include packet routing tables that can be configured remotely from the SDN Controller via the Southbound API. However, these types of nodes are not connected directly to a subnet of devices in the IoT Layer and do not need to be able to apply machine learning techniques on the edge. Thus, their function is quite similar to that of the COTS forwarding nodes in Figure 2. Thus, they are not provided with computational or storage resources that would raise the costs of the network infrastructure (whether physical or virtual).
- Business Solution Layer: The Business Solution Layer maintains its name in the new GECA 2.0. However, the Business Solution Layer is now subdivided into two sublayers: the Virtual Network Management Sublayer and the Application Sublayer.
- -
- Virtual Network Management Sublayer: This sublayer forms the control plane of the SDN architecture and it contains the SDN Controller. Also, in this sublayer would be carried out the capabilities of Network Function Virtualization when managing flows. The same SDN Controller carries in its SDN/NFV database the management of virtual flows and the dynamic reconfiguration of remote nodes. In this way, from the first moment of the GECA 2.0 design, it is possible to use SDN and NFV functionalities indistinctly or combined if the network designer requires it. The SDN Controller makes use of the Southbound API to remotely reconfigure both Fog Forwarding Gateways and Edge Gateways in the Fog/Edge Layer. In addition, it offers elements in the Application Sublayer the possibility of accessing the SDN configuration and the data exchange with the IoT-Edge layers via the Northbound API. A proposal for network resource allocation and flow control to be implemented as part of the SDN Controller will be described in the Section 3.3.
- -
- Application Sublayer: This sublayer includes all the functionalities that previously existed in the Business Solution Layer in GECA 1.0. The difference with GECA 1.0 is that now the different user applications and the different management components included in the Business Solution Layer communicate with the network through the Northbound API offered by the SDN Controller. Thus, the application layer corresponds to the application plane (or management plane in other SDN architectures). Also, thanks to the functionalities of the SDN Controller, different tenants or different applications can share the network infrastructure transparently, without seeing the data of other tenants or applications that are in another virtual network sharing the same infrastructure.
3.3. Adaptive Assignment of Network Resources by Means of Deep Q-Learning Techniques
4. Experimentation and Results
5. Conclusions and Future Work
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
References
- Radanliev, P.; De Roure, D.C.; Nicolescu, R.; Huth, M.; Montalvo, R.M.; Cannady, S.; Burnap, P. Future developments in cyber risk assessment for the internet of things. Comput. Ind. 2018, 102, 14–22. [Google Scholar] [CrossRef]
- De la Prieta, F.; Rodríguez-González, S.; Chamoso, P.; Corchado, J.M.; Bajo, J. Survey of agent-based cloud computing applications. Future Gener. Comput. Syst. 2019, 100, 223–236. [Google Scholar] [CrossRef]
- Mohammadi, M.; Al-Fuqaha, A.; Sorour, S.; Guizani, M. Deep learning for IoT big data and streaming analytics: A survey. IEEE Commun. Surv. Tutor. 2018, 20, 2923–2960. [Google Scholar] [CrossRef] [Green Version]
- García, O.; Alonso, R.S.; Prieto, J.; Corchado, J.M. Energy Efficiency in Public Buildings through Context-Aware Social Computing. Sensors 2017, 17, 826. [Google Scholar] [CrossRef]
- Shi, W.; Schahram, D. The promise of Edge Computing. Computer 2016, 49, 78–81. [Google Scholar] [CrossRef]
- Alonso, R.S.; Sittón-Candanedo, I.; García, Ó.; Prieto, J.; Rodríguez-González, S. An intelligent Edge-IoT platform for monitoring livestock and crops in a dairy farming scenario. Ad Hoc Netw. 2020, 98, 102047. [Google Scholar] [CrossRef]
- Granelli, F.; Gebremariam, A.A.; Usman, M.; Cugini, F.; Stamati, V.; Alitska, M.; Chatzimisios, P. Software defined and virtualized wireless access in future wireless networks: Scenarios and standards. IEEE Commun. Mag. 2015, 53, 26–34. [Google Scholar] [CrossRef]
- Puente Fernández, J.A.; García Villalba, L.J.; Kim, T.H. Software Defined Networks in Wireless Sensor Architectures. Entropy 2018, 20, 225. [Google Scholar] [CrossRef] [Green Version]
- Alenezi, M.; Almustafa, K.; Meerja, K.A. Cloud based SDN and NFV architectures for IoT infrastructure. Egypt. Inform. J. 2019, 20, 1–10. [Google Scholar] [CrossRef]
- Leike, J.; Krueger, D.; Everitt, T.; Martic, M.; Maini, V.; Legg, S. Scalable agent alignment via reward modeling: A research direction. arXiv 2018, arXiv:1811.07871. [Google Scholar]
- Sittón-Candanedo, I.; Alonso, R.S.; Corchado, J.M.; Rodríguez-González, S.; Casado-Vara, R. A review of edge computing reference architectures and a new global edge proposal. Future Gener. Comput. Syst. 2019, 99, 278–294. [Google Scholar] [CrossRef]
- Sittón-Candanedo, I.; Alonso, R.S.; García, Ó.; Muñoz, L.; Rodríguez-González, S. Edge computing, iot and social computing in smart energy scenarios. Sensors 2019, 19, 3353. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Alonso, R.S.; Sittón-Candanedo, I.; Rodríguez-González, S.; García, Ó.; Prieto, J. A Survey on Software-Defined Networks and Edge Computing over IoT. In Highlights of Practical Applications of Survivable Agents and Multi-Agent Systems; The PAAMS Collection; Communications in Computer and Information Science; De La Prieta, F., González-Briones, A., Pawleski, P., Calvaresi, D., Del Val, E., Lopes, F., Julian, V., Osaba, E., Sánchez-Iborra, R., Eds.; Springer: Cham, Switzerland, 2019; pp. 289–301. [Google Scholar] [CrossRef]
- Van Hasselt, H.; Guez, A.; Silver, D. Deep reinforcement learning with double q-learning. In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, Phoenix, AZ, USA, 12–17 February 2016. [Google Scholar]
- Alonso, R.S.; Tapia, D.I.; Bajo, J.; García, Ó.; de Paz, J.F.; Corchado, J.M. Implementing a hardware-embedded reactive agents platform based on a service-oriented architecture over heterogeneous wireless sensor networks. Ad Hoc Netw. 2013, 11, 151–166. [Google Scholar] [CrossRef] [Green Version]
- Ko, H.; Bae, K.; Marreiros, G.; Kim, H.; Yoe, H.; Ramos, C. A Study on the Key Management Strategy for Wireless Sensor Networks. ADCAIJ Adv. Distrib. Comput. Artif. Intell. J. 2015, 3, 43–53. [Google Scholar] [CrossRef] [Green Version]
- Alonso, R.S.; García, Ó.; Saavedra, A.; Tapia, D.I.; de Paz, J.F.; Corchado, J.M. Heterogeneous wireless sensor networks in a tele-monitoring system for homecare. In Proceedings of the International Work-Conference on Artificial Neural Networks, Limassol, Cyprus, 14–17 September 2009; Springer: Berlin, Germany, 2009; pp. 663–670. [Google Scholar]
- García, Ó.; Alonso, R.S.; Tapia, D.I.; Corchado, J.M. Electrical power consumption monitoring in hotels using the n-core platform. In Proceedings of the 2016 Clemson University Power Systems Conference (PSC), Clemson, CA, USA, 8–11 March 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 1–6. [Google Scholar]
- Ashton, K. That ‘internet of things’ thing. RFID J. 2009, 22, 97–114. [Google Scholar]
- Srinidhi, N.N.; Dilip Kumar, S.M.; Venugopal, K.R. Network optimizations in the Internet of Things: A review. Eng. Sci. Technol. Int. J. 2018, 22, 1–21. [Google Scholar] [CrossRef]
- Kethareswaran, V.; Ram, C.S. An Indian Perspective on the adverse impact of Internet of Things (IoT). ADCAIJ Adv. Distrib. Comput. Artif. Intell. J. 2017, 6, 35–40. [Google Scholar] [CrossRef] [Green Version]
- Chamoso, P.; Prieta, F.D.L. Swarm-Based Smart City Platform: A Traffic Application. ADCAIJ Adv. Distrib. Comput. Artif. Intell. J. 2015, 4, 89–98. [Google Scholar] [CrossRef] [Green Version]
- González-Briones, A.; De La Prieta, F.; Mohamad, M.; Omatu, S.; Corchado, J. Multi-agent systems applications in energy optimization problems: A state-of-the-art review. Energies 2018, 11, 1928. [Google Scholar] [CrossRef] [Green Version]
- Chamoso, P.; González-Briones, A.; Rodríguez, S.; Corchado, J.M. Tendencies of Technologies and Platforms in Smart Cities: A State-of-the-Art Review. Wirel. Commun. Mob. Comput. 2018. [Google Scholar] [CrossRef] [Green Version]
- Sittón-Candanedo, I.; Hernández-Nieves, E.; Rodríguez-González, S.; Santos-Martín, M.T.; González-Briones, A. Machine learning predictive model for industry 4.0. In Proceedings of the International Conference on Knowledge Management in Organizations, Žilina, Slovakia, 6–10 August 2018; Springer: Berlin, Germany, 2018; pp. 501–510. [Google Scholar]
- Liao, Y.; Loures, E.d.F.R.; Deschamps, F. Industrial Internet of Things: A systematic literature review and insights. IEEE Internet Things J. 2018, 5, 4515–4525. [Google Scholar] [CrossRef]
- Trappey, A.J.; Trappey, C.V.; Govindarajan, U.H.; Chuang, A.C.; Sun, J.J. A review of essential standards and patent landscapes for the Internet of Things: A key enabler for Industry 4.0. Adv. Eng. Inform. 2017, 33, 208–229. [Google Scholar] [CrossRef]
- Rodríguez, S.; De Paz, J.F.; Villarrubia, G.; Zato, C.; Bajo, J.; Corchado, J.M. Multi-agent information fusion system to manage data from a WSN in a residential home. Inf. Fusion 2015, 23, 43–57. [Google Scholar] [CrossRef] [Green Version]
- Sittón, I.; Rodríguez, S. Pattern extraction for the design of predictive models in industry 4.0. In Proceedings of the International Conference on Practical Applications of Agents and Multi-Agent Systems, Porto, Portugal, 21–23 June 2017; Springer: Berlin, Germany, 2017; pp. 258–261. [Google Scholar]
- Yu, W.; Liang, F.; He, X.; Hatcher, W.G.; Lu, C.; Lin, J.; Yang, X. A Survey on the Edge Computing for the Internet of Things. IEEE Access 2017, 6, 6900–6919. [Google Scholar] [CrossRef]
- Sanchez-Iborra, R.; Sanchez-Gomez, J.; Skarmeta, A. Evolving IoT networks by the confluence of MEC and LP-WAN paradigms. Future Gener. Comput. Syst. 2018, 88, 199–208. [Google Scholar] [CrossRef]
- Shi, W.; Cao, J.; Zhang, Q.; Li, Y.; Xu, L. Edge Computing: Vision and Challenges. IEEE Internet Things J. 2016, 3, 637–646. [Google Scholar] [CrossRef]
- Sittón-Candanedo, I.; Alonso, R.S.; García, Ó.; Gil, A.B.; Rodríguez-González, S. A Review on Edge Computing in Smart Energy by means of a Systematic Mapping Study. Electronics 2020, 9, 48. [Google Scholar] [CrossRef] [Green Version]
- FAR-EDGE-P. H2020 FAR-EDGE Project; Factory Automation Edge Computing Operating System Reference Implementation (FAR-EDGE): Rome, Italy, 2017. [Google Scholar]
- INTEL-SAP. IoT Joint Reference Architecture from Intel and SAP. Available online: https://www.intel.com/content/dam/www/public/us/en/documents/reference-architectures/sap-iot-reference-architecture.pdf (accessed on 15 May 2020).
- Edge Computing Consortium; Alliance of Industrial Internet. Edge Computing Reference Architecture 2.0; Technical Report; Edge Computing Consortium and Alliance of Industrial Internet: Beijing, China, 2017. [Google Scholar]
- Malik, A.; Roop, P.S.; Allen, N.; Steger, T. Emulation of cyber-physical systems using IEC-61499. IEEE Trans. Ind. Inform. 2018, 14, 380–389. [Google Scholar] [CrossRef]
- Tseng, M.; Canaran, T.E.; Canaran, L. Introduction to Edge Computing in IIoT; Technical Report; Industrial Internet Consortium: Needham, MA, USA, 2018. [Google Scholar]
- Jararweh, Y.; Doulat, A.; AlQudah, O.; Ahmed, E.; Al-Ayyoub, M.; Benkhelifa, E. The future of mobile cloud computing: Integrating cloudlets and mobile edge computing. In Proceedings of the 2016 23rd International Conference on Telecommunications (ICT), Thessaloniki, Greece, 16–18 May 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 1–5. [Google Scholar]
- Caraguay, Á.L.V.; González, P.L.; Tandazo, R.T.; López, L.I.B. SDN/NFV Architecture for IoT Networks. In Proceedings of the 14th International Conference on Web Information Systems and Technologies (WEBIST 2018), Seville, Spain, 18–20 September 2018; pp. 425–429. [Google Scholar]
- Sabella, D.; Vaillant, A.; Kuure, P.; Rauschenbach, U.; Giust, F. Mobile-edge computing architecture: The role of MEC in the Internet of Things. IEEE Consum. Electron. Mag. 2016, 5, 84–91. [Google Scholar] [CrossRef]
- Yang, M.; Li, Y.; Jin, D.; Zeng, L.; Wu, X.; Vasilakos, A.V. Software-Defined and Virtualized Future Mobile and Wireless Networks: A Survey. Mob. Netw. Appl. 2015, 20, 4–18. [Google Scholar] [CrossRef] [Green Version]
- Jammal, M.; Singh, T.; Shami, A.; Asal, R.; Li, Y. Software defined networking: State of the art and research challenges. Comput. Netw. 2014, 72, 74–98. [Google Scholar] [CrossRef] [Green Version]
- Sezer, S.; Scott-Hayward, S.; Chouhan, P.K.; Fraser, B.; Lake, D.; Finnegan, J.; Viljoen, N.; Miller, M.; Rao, N. Are we ready for SDN? Implementation challenges for software-defined networks. IEEE Commun. Mag. 2013, 51, 36–43. [Google Scholar] [CrossRef] [Green Version]
- Kim, H.; Feamster, N. Improving network management with software defined networking. IEEE Commun. Mag. 2013, 51, 114–119. [Google Scholar] [CrossRef]
- Baek, J.Y.; Kaddoum, G.; Garg, S.; Kaur, K.; Gravel, V. Managing fog networks using reinforcement learning based load balancing algorithm. In Proceedings of the 2019 IEEE Wireless Communications and Networking Conference (WCNC), Marrakech, Morocco, 15–18 April 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 1–7. [Google Scholar]
- Sampaio, L.S.; Faustini, P.H.; Silva, A.S.; Granville, L.Z.; Schaeffer-Filho, A. Using NFV and reinforcement learning for anomalies detection and mitigation in SDN. In Proceedings of the 2018 IEEE Symposium on Computers and Communications (ISCC), Natal, Brazil, 25–28 June 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 00432–00437. [Google Scholar]
- Ruiz, L.; Durán, R.J.; De Miguel, I.; Khodashenas, P.S.; Pedreno-Manresa, J.J.; Merayo, N.; Aguado, J.C.; Pavon-Marino, P.; Siddiqui, S.; Mata, J.; et al. A genetic algorithm for VNF provisioning in NFV-Enabled Cloud/MEC RAN architectures. Appl. Sci. 2018, 8, 2614. [Google Scholar] [CrossRef] [Green Version]
- Pei, J.; Hong, P.; Li, D. Virtual network function selection and chaining based on deep learning in SDN and NFV-enabled networks. In Proceedings of the 2018 IEEE International Conference on Communications Workshops (ICC Workshops), Kansas City, MO, USA, 20–24 May 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 1–6. [Google Scholar]
- Hu, F.; Hao, Q.; Bao, K. A Survey on Software-Defined Network and OpenFlow: From Concept to Implementation. IEEE Commun. Surv. Tutor. 2014, 16, 2181–2206. [Google Scholar] [CrossRef]
- Jagadeesan, N.A.; Krishnamachari, B. Software-Defined Networking Paradigms in Wireless Networks: A Survey. ACM Comput. Surv. 2014, 47, 27:1–27:11. [Google Scholar] [CrossRef]
- Mijumbi, R.; Serrat, J.; Gorricho, J.; Bouten, N.; Turck, F.D.; Boutaba, R. Network Function Virtualization: State-of-the-Art and Research Challenges. IEEE Commun. Surv. Tutor. 2016, 18, 236–262. [Google Scholar] [CrossRef] [Green Version]
- Jararweh, Y.; Doulat, A.; Darabseh, A.; Alsmirat, M.; Al-Ayyoub, M.; Benkhelifa, E. SDMEC: Software defined system for mobile edge computing. In Proceedings of the 2016 IEEE International Conference on Cloud Engineering Workshop (IC2EW), Berlin, Germany, 4–8 April 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 88–93. [Google Scholar]
- Salman, O.; Elhajj, I.; Kayssi, A.; Chehab, A. Edge computing enabling the Internet of Things. In Proceedings of the 2015 IEEE 2nd World Forum on Internet of Things (WF-IoT), Milan, Italy, 14–16 December 2015; IEEE: Piscataway, NJ, USA, 2015; pp. 603–608. [Google Scholar]
- Baktir, A.C.; Ozgovde, A.; Ersoy, C. How Can Edge Computing Benefit from Software-Defined Networking: A Survey, Use Cases, and Future Directions. IEEE Commun. Surv. Tutor. 2017, 19, 2359–2391. [Google Scholar] [CrossRef]
- Pang, Z.; Sun, L.; Wang, Z.; Tian, E.; Yang, S. A Survey of Cloudlet Based Mobile Computing. In Proceedings of the 2015 International Conference on Cloud Computing and Big Data (CCBD), Shanghai, China, 4–6 November 2015; pp. 268–275. [Google Scholar] [CrossRef]
- Monfared, S.; Bannazadeh, H.; Leon-Garcia, A. Software defined wireless access for a two-tier cloud system. In Proceedings of the 2015 IFIP/IEEE International Symposium on Integrated Network Management (IM), Ottawa, ON, USA, 11–15 May 2015; pp. 566–571. [Google Scholar] [CrossRef]
- He, Y.; Yu, F.R.; Zhao, N.; Leung, V.C.; Yin, H. Software-defined networks with mobile edge computing and caching for smart cities: A big data deep reinforcement learning approach. IEEE Commun. Mag. 2017, 55, 31–37. [Google Scholar] [CrossRef]
- Sallab, A.E.; Abdou, M.; Perot, E.; Yogamani, S. Deep reinforcement learning framework for autonomous driving. Electron. Imaging 2017, 2017, 70–76. [Google Scholar] [CrossRef] [Green Version]
- Suerich, D.; Young, T. Machine Learning for Optimized Scheduling in Complex Semiconductor Equipment. In Proceedings of the 2019 30th Annual SEMI Advanced Semiconductor Manufacturing Conference (ASMC), Saratoga Springs, NY, USA, 6–9 May 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 1–4. [Google Scholar]
- Deng, Y.; Bao, F.; Kong, Y.; Ren, Z.; Dai, Q. Deep direct reinforcement learning for financial signal representation and trading. IEEE Trans. Neural Netw. Learn. Syst. 2016, 28, 653–664. [Google Scholar] [CrossRef] [PubMed]
- Ma, L.; Zhang, Z.; Ko, B.; Srivatsa, M.; Leung, K.K. Resource management in distributed SDN using reinforcement learning. In Ground/Air Multisensor Interoperability, Integration, and Networking for Persistent ISR IX; International Society for Optics and Photonics: Bellingham, WA, USA, 2018; Volume 10635, p. 106350M. [Google Scholar]
- Amiri, R.; Almasi, M.A.; Andrews, J.G.; Mehrpouyan, H. Reinforcement learning for self organization and power control of two-tier heterogeneous networks. IEEE Trans. Wirel. Commun. 2019, 18, 3933–3947. [Google Scholar] [CrossRef] [Green Version]
- Silver, D.; Huang, A.; Maddison, C.J.; Guez, A.; Sifre, L.; Van Den Driessche, G.; Schrittwieser, J.; Antonoglou, I.; Panneershelvam, V.; Lanctot, M.; et al. Mastering the game of Go with deep neural networks and tree search. Nature 2016, 529, 484. [Google Scholar] [CrossRef] [PubMed]
- Silver, D.; Schrittwieser, J.; Simonyan, K.; Antonoglou, I.; Huang, A.; Guez, A.; Hubert, T.; Baker, L.; Lai, M.; Bolton, A.; et al. Mastering the game of go without human knowledge. Nature 2017, 550, 354–359. [Google Scholar] [CrossRef] [PubMed]
- Zhang, Q.; Lin, M.; Yang, L.T.; Chen, Z.; Khan, S.U.; Li, P. A double deep Q-learning model for energy-efficient edge scheduling. IEEE Trans. Serv. Comput. 2018, 12, 739–749. [Google Scholar] [CrossRef]
- Ferdowsi, A.; Saad, W. Deep learning for signal authentication and security in massive internet-of-things systems. IEEE Trans. Commun. 2018, 67, 1371–1387. [Google Scholar] [CrossRef] [Green Version]
- Watkins, C.J.; Dayan, P. Q-learning. Mach. Learn. 1992, 8, 279–292. [Google Scholar] [CrossRef]
- Arulkumaran, K.; Deisenroth, M.P.; Brundage, M.; Bharath, A.A. Deep reinforcement learning: A brief survey. IEEE Signal Process. Mag. 2017, 34, 26–38. [Google Scholar] [CrossRef] [Green Version]
- Melo, F.S. Convergence of Q-Learning: A Simple Proof; Institute for Systems and Robotics, Instituto Superior Técnico: Lisboa, Portugal, 2001. [Google Scholar]
- Amiri, R.; Mehrpouyan, H.; Fridman, L.; Mallik, R.K.; Nallanathan, A.; Matolak, D. A machine learning approach for power allocation in HetNets considering QoS. In Proceedings of the 2018 IEEE International Conference on Communications (ICC), Kansas City, MO, USA, 20–24 May 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 1–7. [Google Scholar]
- Liu, Y.; Yang, C.; Jiang, L.; Xie, S.; Zhang, Y. Intelligent Edge Computing for IoT-Based Energy Management in Smart Cities. IEEE Netw. 2019, 33, 111–117. [Google Scholar] [CrossRef]
- Kim, J.; Kim, J.; Lee, S.; Park, J.; Hahn, M. Vowel based voice activity detection with LSTM recurrent neural network. In Proceedings of the 8th International Conference on Signal Processing Systems, Urumqi, China, 20–22 July 2019; ACM: New York, NY, USA, 2016; pp. 134–137. [Google Scholar]
- Mu, T.Y.; Al-Fuqaha, A.; Shuaib, K.; Sallabi, F.M.; Qadir, J. SDN flow entry management using reinforcement learning. ACM Trans. Auton. Adapt. Syst. (TAAS) 2018, 13, 1–23. [Google Scholar] [CrossRef] [Green Version]
- De Oliveira, R.L.S.; Schweitzer, C.M.; Shinoda, A.A.; Prete, L.R. Using mininet for emulation and prototyping software-defined networks. In Proceedings of the 2014 IEEE Colombian Conference on Communications and Computing (COLCOM), Bogota, Colombia, 4–6 June 2014; IEEE: Piscataway, NJ, USA, 2014; pp. 1–6. [Google Scholar]
- Tang, J. Intelligent Mobile Projects with TensorFlow: Build 10+ Artificial Intelligence Apps Using TensorFlow Mobile and Lite for IOS, Android, and Raspberry Pi; Packt Publishing Ltd.: Birmingham, UK, 2018. [Google Scholar]
- AlEroud, A.; Alsmadi, I. Identifying cyber-attacks on software defined networks: An inference-based intrusion detection approach. J. Netw. Comput. Appl. 2017, 80, 152–164. [Google Scholar] [CrossRef]
- Casado-Vara, R.; González-Briones, A.; Prieto, J.; Corchado, J.M. Smart Contract for Monitoring and Control of Logistics Activities: Pharmaceutical Utilities Case Study. In Advances in Intelligent Systems and Computing Proceedings of the International Joint Conference SOCO’18-CISIS’18-ICEUTE’18, San Sebastián, Spain, 6–8 June 2018; Graña, M., López-Guede, J.M., Etxaniz, O., Herrero, Á., Sáez, J.A., Quintián, H., Corchado, E., Eds.; Springer: Berlin, Germany, 2019; pp. 509–517. [Google Scholar]
- Hasselt, H.V. Double Q-learning. In Advances in Neural Information Processing Systems 23, Proceedings of the 23rd International Conference on Neural Information Processing Systems, Vancouver, Canada, 6–11 December 2010; Lafferty, J.D., Williams, C.K.I., Shawe-Taylor, J., Zemel, R.S., Culotta, A., Eds.; Neural Information Processing Systems: San Diego, CA, USA, 2010; pp. 2613–2621. [Google Scholar]
- Shoeibi, N.; Shoeibi, N. Future of Smart Parking: Automated Valet Parking Using Deep Q-Learning. In Proceedings of the International Symposium on Distributed Computing and Artificial Intelligence, Avila, Spain, 26–28 June 2019; Springer: Berlin, Germany, 2019; pp. 177–182. [Google Scholar]
- Mnih, V.; Kavukcuoglu, K.; Silver, D.; Rusu, A.A.; Veness, J.; Bellemare, M.G.; Graves, A.; Riedmiller, M.; Fidjeland, A.K.; Ostrovski, G.; et al. Human-level control through deep reinforcement learning. Nature 2015, 518, 529. [Google Scholar] [CrossRef] [PubMed]
Parameter | Description | Test 1 | Test 2 |
---|---|---|---|
Neural network architecture | Neurons per layer | 4:8:4:4:4 | 4:8:8:4:4 |
N | Discount factor Learning rate Max episodes Initial exploration rate Minimum exploration rate | 0.95 0.1 1000 1.00 0.01 | 0.95 0.1 1000 1.00 0.01 |
Goal | Decrease in total response times | 30% | 30% |
Episode in which goal is met | Maximum Average Minimum | 198 183 163 | 142 102 92 |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Alonso, R.S.; Sittón-Candanedo, I.; Casado-Vara, R.; Prieto, J.; Corchado, J.M. Deep Reinforcement Learning for the Management of Software-Defined Networks and Network Function Virtualization in an Edge-IoT Architecture. Sustainability 2020, 12, 5706. https://doi.org/10.3390/su12145706
Alonso RS, Sittón-Candanedo I, Casado-Vara R, Prieto J, Corchado JM. Deep Reinforcement Learning for the Management of Software-Defined Networks and Network Function Virtualization in an Edge-IoT Architecture. Sustainability. 2020; 12(14):5706. https://doi.org/10.3390/su12145706
Chicago/Turabian StyleAlonso, Ricardo S., Inés Sittón-Candanedo, Roberto Casado-Vara, Javier Prieto, and Juan M. Corchado. 2020. "Deep Reinforcement Learning for the Management of Software-Defined Networks and Network Function Virtualization in an Edge-IoT Architecture" Sustainability 12, no. 14: 5706. https://doi.org/10.3390/su12145706
APA StyleAlonso, R. S., Sittón-Candanedo, I., Casado-Vara, R., Prieto, J., & Corchado, J. M. (2020). Deep Reinforcement Learning for the Management of Software-Defined Networks and Network Function Virtualization in an Edge-IoT Architecture. Sustainability, 12(14), 5706. https://doi.org/10.3390/su12145706