Article

Migrating from Developing Asynchronous Multi-Threading Programs to Reactive Programs in Java

by Andrei Zbarcea 1 and Cătălin Tudose 1,2,*
1 Faculty of Automatic Control and Computers, National University of Science and Technology POLITEHNICA Bucharest, 060042 Bucharest, Romania
2 Luxoft Romania, 060042 Bucharest, Romania
* Author to whom correspondence should be addressed.
Appl. Sci. 2024, 14(24), 12062; https://doi.org/10.3390/app142412062
Submission received: 18 November 2024 / Revised: 13 December 2024 / Accepted: 19 December 2024 / Published: 23 December 2024
(This article belongs to the Section Computing and Artificial Intelligence)

Abstract:
Modern software application development imposes standards regarding high performance, scalability, and minimal system latency. Multi-threading asynchronous programming is one of the standard solutions proposed by the industry for achieving such objectives. However, the recent introduction of the reactive programming interface in Java presents a potential alternative approach for addressing such challenges, promising performance improvements while minimizing resource utilization. The research examines the migration process from the asynchronous paradigm to the reactive paradigm, highlighting the implications, benefits, and challenges resulting from this transition. To this end, the architecture, technologies, and design of a support application are presented, outlining the practical aspects of this experimental process while closely monitoring the phased migration. The results are examined in terms of functional equivalence, testing, and comparative analysis of response times, resource utilization, and throughput, as well as the cases where the reactive paradigm proves to be a solution worth considering. Across multiple scenarios, the reactive paradigm demonstrated advantages such as up to 12% reduction in memory usage, 56% faster 90th percentile response times, and a 33% increase in throughput under high-concurrency conditions. However, the results also reveal cases, such as data-intensive scenarios, where asynchronous programming outperforms reactive approaches. Additionally, possible directions for further research and development are presented. This paper not only investigates the design and implementation process but also sets a foundation for future research and innovation in dependable systems, collaborative technologies, sustainable solutions, and distributed system architecture.

1. Introduction

1.1. Research Overview

Web platforms are commonly used to distribute software to users. As user bases have grown, applications have evolved to improve the user experience. This progress has led to the development of new requirements and standards for high performance, scalability, and rapid response to user inputs [1,2]. Over time, developers have used a variety of strategies and technologies to meet these goals, focusing on different areas of the software development process.
Early solutions aimed to improve the available hardware and optimize code for better efficiency. Later, there was a significant transition to a paradigm based on multi-threading asynchronous programming approaches, which allow an application to handle multiple processes simultaneously and improve performance by parallelizing the program flow.
Within the Java ecosystem [3,4], multiple programming interfaces have been created to support the asynchronous multi-threading paradigm, including the wait/notify mechanism, the java.util.concurrent library, and abstractions such as CompletableFuture and Executors [5]. These advancements have enabled the development of newer, more efficient, and faster applications. However, they have not eliminated the complexity of thread management and concurrency, which signals the need for further simplification and optimization of modern application development.
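For illustration, the following minimal sketch shows the intrinsic wait/notify mechanism mentioned above, used here for a simple single-slot producer/consumer hand-off. The class and method names are hypothetical and are not taken from the application studied in this paper.

```java
// Minimal producer/consumer hand-off using the intrinsic wait/notify mechanism (illustrative only)
public class WaitNotifySketch {
    private String message;

    public synchronized void put(String value) throws InterruptedException {
        while (message != null) {
            wait();                 // wait until the previous message has been consumed
        }
        message = value;
        notifyAll();                // wake up a waiting consumer
    }

    public synchronized String take() throws InterruptedException {
        while (message == null) {
            wait();                 // wait until a message is available
        }
        String value = message;
        message = null;
        notifyAll();                // wake up a waiting producer
        return value;
    }

    public static void main(String[] args) throws InterruptedException {
        WaitNotifySketch channel = new WaitNotifySketch();
        new Thread(() -> {
            try { channel.put("hello"); } catch (InterruptedException e) { Thread.currentThread().interrupt(); }
        }).start();
        System.out.println(channel.take());   // prints "hello" once the producer has run
    }
}
```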
Reactive programming [6] aims to address these challenges by reducing the complexity of concurrency management while enhancing application performance and scalability. This approach has its roots in a paradigm that leverages non-blocking I/O [7] and an event-driven architecture [8].
Major companies like Netflix and PayPal have adopted reactive programming to improve system scalability, reduce latency, and enable real-time data processing. For example, Netflix leverages RxJava to streamline server-side concurrency, significantly improving responsiveness in its streaming services by managing asynchronous processes efficiently. Similarly, PayPal relies on Akka to address scalability and latency challenges, enabling seamless transaction handling and real-time updates to payment statuses, even under high-demand conditions. These implementations demonstrate how reactive programming effectively addresses scalability and performance challenges in large-scale, distributed systems.
The integration of reactive programming interfaces into Java streamlines the development of applications that meet contemporary performance and scalability requirements while minimizing system resource utilization.

1.2. Problem and Approach

The asynchronous multi-threading approach has brought significant improvements over previous models and has been the basis for many subsequently developed solutions, yet its increased complexity in managing concurrency and system resources can become a limiting factor for scalability and extensibility. Reactive programming has the potential to offer a simpler and more elegant alternative through its event-based, non-blocking I/O model. The main issue addressed by this research is the exploration of a migration methodology, the assessment of its feasibility, and the quantification of the benefits that could be achieved. The untapped potential of reactive programming and the opportunity to optimize the performance and efficiency of current applications form the central motivation of the research, outlining a promising option for addressing current challenges.
However, migrating from asynchronous multi-threading programming to reactive programming is not a universal formula for performance enhancement under any conditions, and it is not without challenges, having its own particularities, limitations, and trade-offs. While reactive programming has been successfully adopted in some areas, such as real-time data stream processing or distributed application development, migrating existing applications requires a detailed evaluation of the architecture, possible infrastructure adjustments, adherence to guidelines and practices that ensure a sound transition, and an openness to understanding a different paradigm that demands a shift in perspective from writing imperative code. Analyzing the benefits and limitations of reactive programming, identifying the use cases for which it is suitable, and selecting appropriate strategies for dealing with complexity remain specific to each product.
The research focuses on the migration process from the asynchronous approach to the reactive paradigm, closely following the steps needed to achieve the transition in a controlled and efficient way based on the purpose-built application. The step-by-step process investigates aspects of interest, clarifies technical details, captures the consequences of adopting the new paradigm, and assesses its effectiveness in enhancing system performance.

1.3. Objectives

The research’s preliminary goal is to develop a core application that uses industry-standard asynchronous multi-threading programming techniques. This will allow the analysis of the strengths and limits of the approach. This initial stage will serve as a foundation for subsequent examination and comparison to the reactive paradigm.
The primary purpose is to migrate the application to the reactive programming approach. This will involve gradually transitioning components to use the reactive interface and concepts, as well as adjusting to complementing technologies like non-blocking web servers and reactive database connections [9]. The process will be systematically detailed, highlighting the adjustments and challenges experienced along the way while also providing solutions, recommendations, and strategies to assess and decide on the feasibility and benefits of a smooth and efficient transition toward this paradigm.
The study will conclude with a comparative performance analysis between the applications using both paradigms, including metrics such as memory usage, CPU consumption, response times, and the number of execution threads employed in different scenarios. With this analysis, the research aims to provide a deeper and more nuanced understanding of the benefits and limitations related to reactive programming in contrast to the conventional asynchronous approach.
Achieving these objectives will make this research a considerable and useful reference for properly implementing the migration process and evaluating its expected impact, thereby constructively contributing to both the theoretical and practical components of software development.

2. Theoretical Background

To examine the background and motivation of the research, as well as the challenges involved in the migration process, fundamental concepts will be reviewed in this section.

2.1. Concurrency

The capability of a system to handle multiple tasks at the same time is known as concurrency. Concurrency in Java [5,10] may be achieved through the usage of threads. The program can perform multiple tasks simultaneously using threads, which are self-contained subunits that can run in parallel. Concurrency is an extensively explored principle for optimizing resource utilization and increasing system efficiency. By using multiple threads to parallelize the execution, response times can be reduced, and a better distribution of workload can be achieved.
Race conditions and deadlocks can occur as a result of complex and error-prone concurrency management in multi-threading environments [11].
A race condition happens when the state of a resource depends on the sequence or timing of uncontrollable events, producing inconsistent or unexpected results. Each thread's execution may take a different amount of time than expected, and threads may finish in an unexpected order, generating unanticipated behavior.
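The following self-contained sketch illustrates a typical race condition: two threads increment a shared counter without synchronization, so the final value is usually below the expected total. The example is purely illustrative and is not part of the support application.

```java
// Two threads increment a shared counter without synchronization; the result is
// usually lower than 200000 because read-modify-write operations interleave.
public class RaceConditionSketch {
    private static int counter = 0;

    public static void main(String[] args) throws InterruptedException {
        Runnable increment = () -> {
            for (int i = 0; i < 100_000; i++) {
                counter++;               // not atomic: read, add, write
            }
        };
        Thread first = new Thread(increment);
        Thread second = new Thread(increment);
        first.start();
        second.start();
        first.join();
        second.join();
        System.out.println("Expected 200000, got " + counter);
    }
}
```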
A deadlock occurs when two or more threads are waiting for resources that are blocked by each other, leading to a state of complete standstill, which has the potential to severely compromise application stability and performance. There are different strategies available to prevent, detect, or resolve these problems, such as using advanced resource access scheduling algorithms or sophisticated deadlock avoidance mechanisms [12].
To streamline concurrency management as well as improve performance and scalability, alternative concurrency models have emerged, such as event-driven and reactive programming.

2.2. Asynchronous Programming

Asynchronous programming allows multiple operations to proceed concurrently, which can significantly improve the overall performance and scalability of an application. In traditional synchronous programming, execution follows a linear flow: a task can only begin after the previous one has completed. The Java ecosystem provides several interfaces for developing asynchronous programs, notably Futures and Executors [5]. When the result of an asynchronous operation becomes available, it can be accessed in a non-blocking way through a Future. Executors manage a pool of threads that perform tasks asynchronously while also allowing for more efficient use of system resources, as the managed threads can be reused instead of being created and destroyed for each task. Where tasks are I/O-bound or have high latency, asynchronous programming can significantly increase application performance and scalability. However, it can also introduce issues such as increased complexity in controlling the execution flow and handling concurrency errors [12].
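As an illustration of these interfaces, the sketch below chains an I/O-style task on a dedicated thread pool using CompletableFuture and ExecutorService. It is a minimal example; the fetchUser method is a hypothetical placeholder for an I/O-bound call and does not belong to the application described later.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

public class AsyncPipelineSketch {
    public static void main(String[] args) {
        ExecutorService pool = Executors.newFixedThreadPool(4);    // reusable worker threads

        CompletableFuture
                .supplyAsync(() -> fetchUser("42"), pool)          // runs on the pool; the caller is not blocked
                .thenApply(String::toUpperCase)                    // chained, non-blocking transformation
                .exceptionally(ex -> "fallback-user")              // error handling without blocking try/catch
                .thenAccept(result -> System.out.println("Received: " + result))
                .join();                                           // blocks only here, to keep the demo alive

        pool.shutdown();
    }

    private static String fetchUser(String id) {
        // placeholder for an I/O-bound call, e.g. an HTTP request or database query
        return "user-" + id;
    }
}
```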

2.3. Non-Blocking I/O

The concept of non-blocking I/O refers to the ability of an application thread to perform other tasks instead of waiting for a response to a previously initiated request [13]. This allows for more efficient utilization of system resources, which can lead to improved performance and scalability. In Java, non-blocking I/O is mainly handled through the NIO library, which provides classes such as Selector and Channel for these scenarios [7]. In addition, non-blocking I/O operations can be combined with the asynchronous paradigm to further increase efficiency, employing methods such as event-based programming or the use of Futures and Executors.
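A minimal sketch of the Selector/Channel mechanism is shown below: a single thread accepts connections without dedicating a thread per client. It is a simplified illustration under the assumption of a local server on port 8080; read/write handling is deliberately omitted.

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.nio.channels.SelectionKey;
import java.nio.channels.Selector;
import java.nio.channels.ServerSocketChannel;
import java.nio.channels.SocketChannel;
import java.util.Iterator;

public class NonBlockingAcceptSketch {
    public static void main(String[] args) throws IOException {
        Selector selector = Selector.open();
        ServerSocketChannel server = ServerSocketChannel.open();
        server.bind(new InetSocketAddress(8080));
        server.configureBlocking(false);                    // channel operations never block
        server.register(selector, SelectionKey.OP_ACCEPT);

        while (true) {
            selector.select();                              // waits until at least one channel is ready
            Iterator<SelectionKey> keys = selector.selectedKeys().iterator();
            while (keys.hasNext()) {
                SelectionKey key = keys.next();
                keys.remove();
                if (key.isAcceptable()) {
                    SocketChannel client = server.accept(); // accepted without a dedicated thread
                    client.configureBlocking(false);
                    client.register(selector, SelectionKey.OP_READ);
                }
                // handling of OP_READ keys is omitted in this sketch
            }
        }
    }
}
```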

2.4. Event-Driven Architecture

An event-driven architecture is a software design pattern in which messages or events produced by external sources control the flow of the program [14]. In this architecture, the system consists of small, self-contained components that respond to events and connect via a centralized event bus. Since components can be added, removed, or updated swiftly without affecting the core of the program, this method allows for a more flexible and scalable solution [15]. Microservices, real-time applications, and distributed systems frequently make use of the event-driven design [16]. When developing functionalities for a performance-critical program, event-driven architecture is an option worth considering, allowing for asynchronous and non-blocking data flow which leads to improved efficiency and scalability [17].
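The following simplified sketch illustrates the idea of a centralized event bus with loosely coupled components reacting to published events. It is a didactic, in-process example and does not represent any specific event-driven framework.

```java
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.function.Consumer;

// Minimal in-process event bus: components register handlers and react to published events.
public class EventBusSketch {
    private final List<Consumer<String>> handlers = new CopyOnWriteArrayList<>();

    public void subscribe(Consumer<String> handler) {
        handlers.add(handler);
    }

    public void publish(String event) {
        handlers.forEach(handler -> handler.accept(event));   // the producer knows nothing about the consumers
    }

    public static void main(String[] args) {
        EventBusSketch bus = new EventBusSketch();
        bus.subscribe(event -> System.out.println("audit log: " + event));
        bus.subscribe(event -> System.out.println("notification sent for: " + event));
        bus.publish("order-created");                          // both components react independently
    }
}
```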

2.5. Reactive Programming

Reactive programming is a paradigm focused on data flows and propagation of changes. It is based on the Observer pattern [18], under which an entity, known as the subject, keeps track of a list of observers, and automatically notifies them about any change in the state of the application. As a result, data flow processing becomes more efficient and flexible, enabling non-blocking and asynchronous data handling.
Reactive streams are a specification that defines a set of interfaces and methods for asynchronous processing with non-blocking backpressure. The mechanisms employed to take advantage of the programming concepts of reactive streams are implemented within various libraries such as Project Reactor or RxJava [5,6]. These libraries provide a broad set of high-level operations, including filtering, mapping, and reducing, which are employed for dealing with reactive data streams efficiently. These powerful abstractions in Java applications can significantly streamline the development process, improving both performance and scalability.
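As a brief illustration of these operators, the sketch below uses Project Reactor's Flux to filter, map, and reduce a stream of values; nothing is executed until a subscriber requests the data. The example is a minimal sketch assuming the reactor-core dependency is on the classpath.

```java
import reactor.core.publisher.Flux;

public class ReactiveStreamSketch {
    public static void main(String[] args) {
        Flux.range(1, 10)                          // publisher emitting 1..10
            .filter(n -> n % 2 == 0)               // keep even values
            .map(n -> n * n)                       // transform each element
            .reduce(0, Integer::sum)               // fold the stream into a single Mono<Integer>
            .subscribe(total ->                    // nothing runs until a subscriber requests data
                System.out.println("Sum of even squares: " + total));
    }
}
```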
From straightforward data processing flows to complex event-driven systems, reactive programming has a wide range of applications. This transition is the main focus of the research, particularly the benefits and difficulties of moving from the conventional asynchronous multi-threading solution toward a reactive programming approach.

3. Related Work

In recent years, the evolution of asynchronous and reactive programming has significantly impacted the software development landscape across various programming environments. With growing demands for performance, scalability, and responsiveness in high-concurrency scenarios, many developers are transitioning from traditional asynchronous programming models to fully reactive architectures. Existing studies have explored the benefits and challenges associated with this transition, highlighting performance improvements, optimized resource management, and additional complexities. This section reviews key contributions that examine both the development and application of these paradigms, focusing on the frameworks and strategies that support this migration.

3.1. Evaluating Hibernate Reactive for Scalable Database Solutions

Grinovero, in the study conducted by the Hibernate Team, evaluates the use of Hibernate Reactive for asynchronous, non-blocking database access in Java applications. Hibernate Reactive builds on reactive programming principles to enhance scalability, allowing for higher levels of concurrency with reduced latency and optimized resource usage. In performance benchmarks, Hibernate Reactive maintained response times under 10 ms at loads of 35,000 requests per second, whereas its synchronous counterpart, Hibernate ORM, only achieved similar performance up to around 20,000 requests per second. These findings highlight Hibernate Reactive’s advantages in handling high-load scenarios, with fewer threads required to achieve these results [19].
Additionally, the study points out specific cases where Hibernate Reactive proves beneficial, particularly in applications where scalability and resource efficiency are priorities. However, certain limitations are noted, such as potential overhead in complex transactions and multi-join operations, which could impact performance. For example, encapsulating units of work within explicit ACID transactional boundaries can tie up physical connections, limiting the benefits of reactive multiplexing. This is particularly evident in data-intensive environments where relational complexity introduces latency that offsets the reactive gains. Similarly, multi-join queries, which are common in analytics or relationally complex systems, can impose significant performance costs, making Hibernate Reactive less efficient in such scenarios [19].
As a result, the study suggests that while Hibernate Reactive offers significant value within a reactive architecture, a careful assessment of its applicability is essential, especially in environments with complex data requirements. In particular, applications that prioritize high-throughput, non-blocking operations stand to gain the most from Hibernate Reactive, whereas systems with heavy transactional demands or complex relational models may find traditional synchronous approaches more suitable. This highlights the need for a case-by-case evaluation to determine the feasibility of adopting Hibernate Reactive [19].

3.2. Assessing R2DBC in High-Concurrency Web Applications

Ju et al. [20] explored the impact of asynchronous frameworks and database connection pools on web application performance under high-concurrency conditions. The study utilized a configuration combining Spring WebFlux 3.3.2, R2DBC 3.3.2, and database connection pools to handle simultaneous requests in a non-blocking manner. Tests conducted with varying loads showed that this configuration maintained an average response time of 7 ms at 500 requests, compared to 8 ms in traditional JDBC. For 50,000 requests, the asynchronous setup with a connection pool achieved 556 ms, significantly outperforming the 723 ms without a pool. Additionally, the error rate in the connection pool configuration remained at 0% for up to 150,000 requests, in contrast to the higher error rates observed in synchronous models and asynchronous models without a connection pool, which exceeded 80% under similar loads.
The research emphasizes that the connection pool is crucial for realizing the full potential of asynchronous models in high-concurrency environments. By reusing connections, the connection pool minimizes the overhead of repeatedly establishing and destroying connections, thereby enhancing both efficiency and stability. For example, at 150,000 requests, the configuration with a connection pool achieved a throughput of 4590 requests per second, compared to 2533 requests per second for the asynchronous model without a pool and 3586 for the synchronous setup [20].
However, the study also notes that configuring the connection pool requires careful tuning, such as adjusting the maximum connection count to match workload demands. Inadequate configuration may lead to resource bottlenecks, limiting performance gains. This suggests that while the asynchronous model with connection pooling offers clear advantages, its efficacy depends heavily on proper implementation and testing [20].
The study demonstrates the advantages of the asynchronous model in high-load scenarios, offering up to 20% higher throughput and greater stability than synchronous approaches, which became ineffective under increased request loads. The results suggest that utilizing a database connection pool is essential for maximizing efficiency in database access for high-concurrency applications, highlighting the importance of non-blocking, resource-efficient setups in handling concurrent requests with minimal delays [20].
Dahlin [21] examined the performance of R2DBC within Spring WebFlux, comparing it with a traditional Spring MVC setup using JDBC, particularly focusing on database communication efficiency in high-concurrency environments. His study revealed that R2DBC, integrated with WebFlux, reduced CPU and memory usage compared to JDBC in most scenarios, particularly in non-blocking environments. During the tests with 200,000 insertions, R2DBC maintained lower CPU usage, peaking at 10%, whereas JDBC experienced notable delays and risked crashes when memory constraints were applied. R2DBC also achieved faster response times in handling large data sets, which could be interpreted as an indicator for potential improvements in high-concurrency applications.
Dahlin [21] further identified limitations, such as challenges in handling large BLOB data with R2DBC, as it required loading entire BLOBs into memory. This behavior implies that R2DBC may struggle in applications where large binary objects are frequently processed, such as media storage or analytics-heavy platforms. While this limitation highlights that JDBC may remain advantageous, it also depicts the importance of aligning the database solution with the requirements of the application.
Additionally, the study emphasizes that R2DBC offers a more efficient solution “out-of-the-box” for standard data transactions, requiring minimal configuration adjustments, in contrast to JDBC, which may demand batch or fetch size adjustments for optimal performance [21]. However, systems that rely heavily on large data fields or advanced optimization techniques may still benefit more from JDBC, which allows greater flexibility through custom configurations; achieving similar results with R2DBC would require further investigation.

3.3. Benchmarking Virtual Threads and Reactive WebFlux for Concurrent Web Services

Joo and Haneklint [22] analyzed the performance differences between Virtual Threads and Reactive WebFlux in Spring applications, aiming to determine the advantages of each solution under high-demand conditions. The study involved the development of three Spring application prototypes: one using normal threads, one with virtual threads, and another based on the Reactive WebFlux model. The prototypes were tested by simulating interactions between an aggregation endpoint and two underlying services, each configured with fixed delays of 100 ms and 500 ms.
The results indicate that the virtual thread prototype demonstrated slightly better performance than the reactive prototype in most tests, maintaining low latency and handling a high rate of requests per second even under heavy load. For example, with a 100 ms delay, the virtual thread prototype handled more requests per second compared to the Reactive WebFlux model. However, the study notes that these tests were conducted on Tomcat, a blocking server, which may not fully leverage the non-blocking nature of Reactive WebFlux. This suggests that Reactive WebFlux might yield different results on a non-blocking web server like Netty, potentially enhancing its performance [22].
Additionally, while virtual threads displayed promising results, their integration into existing systems requires careful consideration of compatibility and potential debugging complexities. The study concludes that virtual threads present a viable alternative for high-concurrency applications in Spring, but further testing in production environments is necessary to confirm their advantages over reactive solutions, underscoring the importance of selecting a concurrency model based on the specific demands of the application and the characteristics of the deployment environment [22].

3.4. Reactive and Imperative Approaches in Microservice Performance

Mochniej and Badurowicz [23] examined the performance of microservices developed using reactive and imperative approaches by implementing two Java-based microservices with the Spring framework. In their study, they conducted performance tests for operations such as data retrieval and insertion, processing, and file transfer, comparing reactive and imperative microservices under varying load scenarios (100 and 3000 simultaneous users with multiple service instances). Their findings indicate that reactive applications performed better in cases with delays in communication with databases or other services, reducing response time and RAM usage by up to 36% in certain scenarios. However, for CPU-intensive tasks, reactive applications proved slower by up to 46%, suggesting additional complexity in handling reactive streams compared to imperative code.
Additionally, the study highlights the advantages of reactive applications in managing inter-service communication and optimizing hardware resource usage. Reactive applications required fewer threads due to the event loop model, thereby reducing RAM demands in scenarios such as order processing or barcode generation, with memory usage up to 57% lower. Although reactive applications showed advantages in I/O-intensive scenarios, their limitations for CPU-intensive processing emphasize the importance of assessing the context of use before implementing a reactive system, ensuring that the selected model aligns with the system’s performance requirements [23].
The study suggests that the reactive approach introduces complexity in code readability and maintainability, which can impact development time and debugging processes. This complexity may arise from the asynchronous, declarative nature of reactive programming, potentially leading to challenges such as difficulty in tracing the flow of data or managing complex operator chains. Therefore, while the reactive model offers performance benefits in specific scenarios, developers must consider the extent of improvements against the possibility of increased development complexity, at least in the short term, and ensure that the team’s expertise aligns with the selected approach [23].

3.5. Actor-Oriented Databases for Scalable and Reactive IoT Data Management

Wang’s research introduces Actor-Oriented Databases (AODBs) to enhance scalable and reactive data management in IoT, specifically addressing the challenges of high concurrency and dynamic data handling. Dolphin, a prototype M-AODB, leverages actors as modular, stateful entities that communicate asynchronously, suitable for managing IoT entities in mobile contexts. This system was tested with different spatial and reactive settings, achieving 3349 moves per second under Actor-Based Freshness semantics, with a 50% latency of 6.26 ms, and 5211 moves per second under Snap(1s), which maintained a 50% latency of 0.59 ms [24]. By utilizing actor isolation and spatial partitioning, Dolphin adapts to high-demand IoT environments, supporting low-latency responses even under variable client loads.
To handle the demands of reactive IoT applications, Dolphin employs a “moving actor” abstraction and spatial partitioning techniques to ensure efficient data distribution and balanced workload management. Testing results indicate that Dolphin’s Actor-Based Snapshot semantics can handle high spatial skew, with a throughput increase of up to 1.52× when batching reactions, maintaining stability across different server configurations. This approach allowed Dolphin to scale effectively in IoT scenarios like vehicle tracking, where reactivity and resource optimization are crucial [24].
However, the study also identifies certain limitations. For instance, the effectiveness of spatial partitioning techniques can vary depending on the specific workload characteristics, and in some cases, may lead to uneven load distribution among servers. Additionally, while the “moving actor” abstraction enhances reactivity, it may introduce complexity in design and maintenance, particularly in large-scale deployments. While Dolphin demonstrates significant potential, careful consideration of these factors is important when applying AODBs in diverse IoT environments [24].

3.6. Temporal and Type-Driven Approaches to Asynchronous Reactive Programming

Bansal, Namjoshi, and Sa’ar tackle the challenge of constructing asynchronous programs directly from temporal specifications, proposing an approach to simplify and make asynchronous synthesis practically feasible [25]. Traditional methods for synthesizing asynchronous programs from temporal logic were highly complex and prone to exponential growth in state space, making them challenging to implement. This work introduces a novel, compact automaton construction, which avoids the exponential state blowup common in previous approaches and reduces asynchronous synthesis to synchronous synthesis. The resulting automaton has at most twice the states of the input specification, significantly improving efficiency. Furthermore, for specific temporal properties, the approach replaces automaton construction with Boolean constraint solving, further simplifying the synthesis process [25].
The study also demonstrates the practicality of their approach through experiments with a prototype tool, BAS, which incorporates their construction method along with existing synthesis solvers. BAS efficiently handles complex specifications that previously required impractical computation times. For example, it synthesizes an asynchronous arbiter specification within seconds, while previous methods took over eight hours. These advancements make asynchronous synthesis a feasible alternative for constructing reactive systems, broadening the applicability of automated synthesis in real-world asynchronous and distributed system design [25].
Bahr, Houlborg, and Rørdam present Async Rattus, a functional reactive programming (FRP) language embedded in Haskell, designed to handle asynchronous computations in a type-safe and efficient manner using modal types [26]. Async Rattus extends the Rattus FRP language by introducing dynamic, local clocks for asynchronous subsystems, allowing components to operate independently rather than synchronously under a global clock. This approach reduces inefficiencies, particularly in applications where components react to distinct events, such as graphical user interfaces. Async Rattus employs two modal types to track delayed and stable computations, preventing space leaks and ensuring causality. Its Haskell compiler plugin automates type-checking for clock dependencies while enabling developers to leverage Haskell’s extensive ecosystem.
The effectiveness of Async Rattus is demonstrated through various examples, including an interactive console application that dynamically updates values based on user input without relying on synchronous timing. The prototype shows that Async Rattus maintains low memory overhead and effectively mitigates space leaks common in asynchronous FRP, supporting more complex, concurrent applications within Haskell’s functional paradigm [26]. The implementation highlights Async Rattus’s potential for broader adoption in real-world scenarios that demand scalable, event-driven processing with minimal manual configuration.
However, both studies acknowledge certain limitations. Bansal et al. note that while their method significantly reduces complexity, it may not be universally applicable to all classes of temporal properties, requiring further research to extend its general adoption. Similarly, Bahr et al. recognize that the introduction of dynamic, local clocks in Async Rattus, while beneficial for flexibility, can introduce additional complexity in the reasoning about time-dependent behaviors, potentially complicating the development process. These approaches offer promising advancements in asynchronous reactive programming but careful consideration of their applicability and potential trade-offs is essential in practical implementations [25,26].
While valuable insights into the performance and viability of reactive solutions have been provided by these studies, much of the existing research focuses on evaluating specific frameworks or comparing asynchronous and reactive models in isolated contexts. In contrast, this work captures the step-by-step migration process, addressing challenges encountered during the transition from an existing asynchronous architecture to a reactive one. By emphasizing practical considerations and migration strategies, this study aims to bridge a gap in current research, offering guidance on adapting an established codebase to a reactive paradigm while preserving core functionalities. An overview summarizing the key contributions and findings discussed in this section is available in Table 1.

4. Current State of Technology

Reactive programming and reactive applications involve several core concepts and related aspects. This section explores the adoption of microservices-based architectures, introduces the popular Spring framework [27] and its reactive counterpart Spring WebFlux [28], analyzes the performance and costs of using reactive libraries, and looks at real-world examples of companies that have successfully adopted reactive programming in their projects. This analysis is primarily intended to provide a high-level understanding of the current state of reactive programming and its applications in practice.

4.1. Microservices

Microservices are a software architecture design approach that structures an application as a suite of loosely coupled services [29]. Each service operates in its own process and communicates with other services through straightforward mechanisms, typically through programmable interfaces (APIs). The main goal of microservices is to provide modular, scalable, and maintainable solutions for complex applications. The increasing adoption of microservices in modern applications is driven by the demand for scalable, flexible, and agile software development.
Generally, companies are leveraging established technologies to implement, interconnect, and deploy services. Communication technologies and standards such as RESTful HTTP [30,31,32] and containerization tools like Docker [33] are valued for their interoperability and portability, which facilitate decoupling and variation within each system. Java emerges as a preferred solution due to the large availability of developers proficient in this programming language. Additionally, systems developed for external clients tend to feature less decentralization and fewer product-specific particularities compared to solutions designed specifically for internal usage [29].
Regarding the impact of microservices on the overall quality of programs, it is prevalently assessed as positive or neutral [29]. Maintainability is also widely regarded as positive. However, the transition from monolithic solutions can be problematic from a security standpoint: compliant mechanisms must be implemented across each module, each component must be continuously monitored and updated to address vulnerabilities as they are discovered, and secure communication between the services must be provided [34,35].

4.2. The Reactive Manifesto

The Reactive Manifesto [36] is a set of principles and recommendations for the development of reactive systems. It was developed in 2014 by a group of software development specialists to provide a clear and concise overview of the core design aspects that are addressed by reactive systems. The manifesto outlines four main characteristics of reactive systems: responsiveness, resilience, elasticity, and message-driven operation.
Responsiveness refers to the system’s ability to handle user requests in a timely and efficient manner, even under heavy load conditions. Resilience refers to the ability of the system to quickly recover from problems and continue to function properly, even in the face of unpredictable situations. Elasticity refers to the ability of the system to dynamically adjust its resources as needed to accommodate changes in demand. Message-driven functionality refers to the ability of the system to use message transmission as the primary method of communication between its components, ensuring loose coupling and flexibility [36].
The Reactive Manifesto has been widely adopted by the community, and it has become a foundation for developing reactive systems [37]. Since its initial version, it has been constantly refined and enhanced as the field of reactive programming has matured and evolved, with new tools and technologies becoming available. The principles outlined in the manifesto are now being recognized and integrated into the development process as best practices for building scalable and resilient systems [38].

4.3. Spring Boot and Spring WebFlux

Spring Boot [39] is a popular open-source framework for developing Java applications. It offers a variety of features that streamline the development process, including easy and automatic configuration with minimal programmatic adjustments. Its focus on simplicity, its extensive functionality, and its wide array of plugins enable the rapid creation and deployment of robust solutions, making it a popular choice for developers, especially in fast-paced environments.
WebFlux [40] is a reactive web framework that became available with the release of Spring 5, specifically designed for building efficient, scalable, and non-blocking web applications. WebFlux provides the required foundation and tools for developing high-performance applications that demand improved concurrency management and robust data flow processing. Compared to conventional blocking I/O, the reactive programming model enables better utilization of system resources and can significantly improve overall performance, particularly in applications designed for high user concurrency and demanding workloads.
In addition to reactive programming features, WebFlux also features integration and support for functional programming and is based on an event-driven architecture [41,42]. This allows for the development of highly responsive applications that can handle large numbers of requests and data flows.
Spring Boot and WebFlux provide a solid combination for developing modern Java web applications, ensuring a simple and efficient way to build, test, and deploy applications with minimal effort [40].
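A minimal sketch of a non-blocking WebFlux endpoint is shown below, assuming a standard Spring Boot project with the spring-boot-starter-webflux dependency; the route and controller names are illustrative and not part of the application described later in this paper.

```java
import org.springframework.web.bind.annotation.GetMapping;
import org.springframework.web.bind.annotation.PathVariable;
import org.springframework.web.bind.annotation.RestController;
import reactor.core.publisher.Mono;

// A non-blocking endpoint: the returned Mono is resolved by the event loop,
// so no request thread is held while the response is being produced.
@RestController
public class GreetingController {

    @GetMapping("/greetings/{name}")
    public Mono<String> greet(@PathVariable String name) {
        return Mono.just("Hello, " + name)
                   .map(String::toUpperCase);
    }
}
```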

4.4. Performance and Cost Analysis

When developing reactive applications, one important aspect is the appropriate selection of tools for the application’s specific requirements. RxJava and Project Reactor are two of the most popular libraries currently available, offering a considerable number of built-in features that allow for improved concurrency, scalability, and performance [5]. Some benchmarks have been undertaken and will be analyzed further to provide relevant information on their performance and costs.
In terms of individual reactive operations, RxJava outperformed Project Reactor, proving to be the superior choice for reactive processing in the conducted benchmarks, which covered scenarios such as transforming and selecting values, chaining individual operations, and applying multiple composed operations [43]. This is important because asynchronous individual operations are frequently used in situations such as database insert requests or message acknowledgment. The results are reported in Table 2.
In terms of event stream processing, Project Reactor performed better than RxJava, displaying an overall improvement in processing time and the ability to handle a larger number of events. This is equally important, as event streams are widely used in modern applications, especially in real-time data processing platforms such as streaming services or IoT devices. Results are available in Table 3.
For I/O-bound operations, both RxJava and Project Reactor performed similarly in terms of processing time. However, Project Reactor had a slight advantage in terms of resource consumption, utilizing less memory and CPU compared to RxJava. The efficient management of I/O-bound operations is also a relevant topic, as these operations are frequently encountered for accessing data from external resources such as databases, file systems, or network services.
The results of these benchmarks indicate that there is no universally optimal solution for all programming tasks, with both RxJava and Project Reactor demonstrating effectiveness in different scenarios. In scenarios involving individual operations, RxJava proved to be superior, whereas Project Reactor is better suited for managing operations with event streams. So, each framework has its own unique strengths, making them both valuable depending on the application’s specific requirements. For I/O-bound operations, both libraries performed similarly, with Project Reactor showing a slight advantage in terms of resource consumption. When deciding on migrating to reactive programming, it is important to consider the specific requirements and constraints of the application and choose the library that best meets them. The cost of migration can be high, but the long-term benefits of improved scalability, resilience, and resource efficiency can compensate for these costs in many situations, such as real-time data processing or handling large amounts of data. Ultimately, the decision of whether to migrate should be based on a detailed analysis of the performance and cost benefits that can be achieved for each business use case.

4.5. Adoption of Reactive Programming

The reactive paradigm’s ability to efficiently process large amounts of data while maintaining optimal resource usage has made it increasingly popular in the enterprise world as a serious choice for critical system components where high performance and scalability are key. Companies often leverage reactive programming for partial migrations rather than transitioning the whole system, improving the performance of critical parts without changing the rest of the application.
Netflix provides streaming services featuring movies, series, and other online content, allowing its users to watch their favorite content anytime, anywhere, from multiple devices, while also being one of the largest contributors to the Java ecosystem, developing numerous tools and frameworks for the community. On the reactive programming side, Netflix contributes to the development of the open-source RxJava (Reactive Extensions) project, leveraging the reactive programming model for superior server-side concurrency and successfully reducing network interference. Netflix’s implementation relies heavily on Observables, and the service layer is asynchronously structured around them. In addition, the service layer implementation leverages a functional programming methodology, which is compatible with the reactive paradigm, as a way of writing maintainable code that can handle concurrency, employ multiple threads, leverage non-blocking I/O, or rely on caching without changing the way client code communicates with services or structures responses. Furthermore, the reactive programming model provides a series of operators for filtering, selecting, converting, combining, and composing observables quickly, which streamlines the management of interconnected asynchronous processes, features from which Netflix services strongly benefit [44].
Oracle, a company specializing in database software and technology, has significantly contributed to the advancement of reactive programming within the enterprise sector. In 2017, Oracle initiated the development of the Asynchronous Database Access API (ADBA), aiming to establish a standardized, non-blocking framework for Java applications interfacing with relational databases. However, by September 2019, Oracle ceased ADBA development, opting to focus on Project Loom, which introduces lightweight, user-mode threads known as fibers. This decision was based on the premise that fibers could simplify concurrent programming by allowing developers to write straightforward, sequential code without sacrificing scalability. In parallel, Oracle has embraced the Reactive Relational Database Connectivity (R2DBC) initiative, releasing the Oracle R2DBC driver in March 2021, facilitating non-blocking, reactive database operations, enabling seamless integration with reactive frameworks like Project Reactor and RxJava [45]. Oracle underscores its commitment to providing developers with robust tools for building efficient, scalable, and responsive applications by aligning with R2DBC and investing in Project Loom, considerable contributions to the reactive ecosystem.
PayPal is one of the leading technology companies, owning a global payments platform. In this context, reactive programming enables a seamless user experience and the ability to process transactions and manage real-time updates to payment statuses, account balances, and other important information while responding to these changes promptly. PayPal has embraced reactive programming by relying on Akka as a framework to develop its services. It addresses issues such as scalability, latency, and resiliency resulting from the presence of a large number of low-throughput application components. Since Akka proposes a functional programming model, the code becomes easier to comprehend and test, allowing for accelerated development and appropriate error handling. PayPal has also chosen to contribute to and make use of the concept of “squbs”, a stack that simplifies the creation and management of loosely coupled components. Through this approach, manageability is improved, ensuring symmetry and loose coupling between services [46].
Cloud Foundry is a Platform as a Service (PaaS) that experimented with reactive programming to provide an efficient and scalable solution for building and deploying cloud-native applications. Reactive programming is a paradigm that focuses on data flow and the efficient manipulation of that data, using programming models that are non-blocking and event driven. The implementation of the reactive components in Java attempted by Cloud Foundry is based on Project Reactor, the standard implementation adopted by the Spring development team. The platform leans towards reactive programming for robust management of request and data flows between different application components, such as the frontend and backend, which goes a long way toward ensuring that the system remains responsive and scalable even when faced with high volumes of data and traffic. At the same time, the adoption of reactive programming in Cloud Foundry may also help reduce latency and downtime by enabling real-time processing of data and events [47].
Moreover, other important companies have recognized the value of reactive programming and are accelerating its development. Facebook, Alibaba, and others decided to collaborate and organize a community under the Linux Foundation, focusing on the advancement of reactive programming and of technologies such as RSocket, which is designed to ensure efficient, resilient communication between microservices and robust, scalable application performance across diverse environments and devices. This collaboration showcases the growing commitment to the reactive model and highlights its potential to improve modern software development industry-wide.
A summary of the main aspects analyzed in this section is provided in Table 4.

5. Proposed Solution

5.1. Architecture

The project is based on a microservices architecture that simulates the minimal backend of a social media platform, designed to demonstrate multiple scenarios and allow for a more detailed comparison between the asynchronous and reactive programming paradigms. The architecture leverages multiple core services, each with distinct functions within the application ecosystem. Figure 1 provides a detailed representation of the employed architecture, highlighting both the individual components and the established interconnections.
The Gateway Service represents the application’s entry point, allowing incoming requests to be routed to the appropriate services. It is also responsible for the validation process within the mechanism used for authentication and authorization, based on the JWT token standard [48,49]. This ensures a centralized authentication process, removing validation pressure from downstream services by simplifying their involvement to only perform the extraction and post-processing steps of the required token data.
The User Service is responsible for managing all aspects related to the user, including the initial login process, issuing the JWT tokens, user registration, the handling of profile information, and any profile updates. Although not as diverse in terms of dedicated use cases, it is an integrated part of operations delegated to other services, which must retrieve additional user information to perform the computations and fulfill the requests.
The Tweet Service manages message-related operations, such as the creation, modification, deletion, and retrieval of user messages. It is also responsible for handling the tokenization and extraction of mentions and tags, as well as generating the user content, by aggregating all the necessary data and computing it into operable results.
The Interaction Service enables interactions within the social media platform between users, allowing the management of subscriptions, acknowledgments, shares, and replies, therefore adding a degree of dynamism and user engagement.
This architecture adheres to the “database-per-service” microservices pattern [50], ensuring the independence and scalability of individual components. Each service leverages its own dedicated instance of a relational database while also incorporating a caching mechanism that improves performance by minimizing database read operations and the inter-service exchange of recently computed information.

5.2. Tools and Technologies

The development process for the asynchronous and reactive projects employed a series of diverse technologies that fulfill different functions within the context of each paradigm. The primary goal is to highlight the strategic selection and effective usage of two distinct sets of technologies within each model while also accounting for the adoption of these tools within applications already available on the market:
  • Java and Spring Boot Versions
  • Java 21 and Spring Boot 3 were utilized for the development of the asynchronous and reactive applications, ensuring stability and compatibility with mainstream technologies and providing robust support for development and concurrency. These are the latest long-term support versions, which should soon be considered the standard for current application development and are already a requirement for most of the tools and frameworks actively developed in today’s technology market [13].
  • Asynchronous Operations
  • CompletableFuture was introduced in Java 8 and provides a robust interface for asynchronous code development, allowing the chaining of multiple steps of the computation process and abstracting multiple aspects of thread management. Compared to its predecessor, the Future interface, it offers a more fluid, non-blocking design and supports other ecosystem advances such as lambda expressions [6].
  • ExecutorService allows for granular control over concurrency and asynchronous task execution. It allows the allocation, configuration, and utilization of dedicated thread pools, which can improve application performance in critical scenarios.
  • Mono and Flux are dedicated types that support the reactive and functional approach, as well as straightforward handling of data streams. The interfaces are based on the Project Reactor library [5], leveraged to support the development of the WebFlux component in the Spring ecosystem [13].
  • Schedulers are the primary mechanism enabling concurrency management in the reactive model, allowing for enhanced flexibility and granular control through the selection of execution contexts and thread pools such as boundedElastic, parallel, immediate, and single; the appropriate selection is imperative for maintaining low resource consumption and high performance (a brief sketch is provided at the end of this subsection).
  • HTTP Clients
  • Java HttpClient was utilized in the asynchronous application for delivering a rich interface, abstracting, and enabling communication with external services based on HTTP requests [51]. It provides direct configuration and integration with ExecutorService, which allows asynchronous processing of the response.
  • WebClient was utilized in the reactive application due to being part of the WebFlux module, providing a non-blocking implementation, advanced features and support for reactive data flows [28]. The interface is more lightweight and streamlined compared to other implementations given its tight integration within the Spring ecosystem.
  • API Gateway
  • Zuul v1 is an extension fully integrated within the Java ecosystem, developed by Netflix, and was chosen in the asynchronous model to perform dynamic routing. One main issue with Zuul v1 is its blocking nature, which prompted the development of a new variant, Zuul v2, improving scalability and performance through non-blocking request handling [52].
  • Spring Cloud Gateway is the counterpart developed by the Spring team, offering advanced routing capabilities, improved performance, and an extended set of functionalities within the reactive ecosystem [53].
  • Database Management
  • PostgreSQL was utilized as the relational database in both models, extending conventional SQL with object-oriented capabilities and advanced functionality for data storage and manipulation. It supports complex data types including JSON, arrays, and key–value pairs, while also allowing the definition of custom types and functions [54].
  • Hibernate/JPA is applied in the asynchronous model, providing a conventional blocking approach for data access and manipulation, object-relational mapping, and database schema generation, which allows for emphasis on business logic [55,56].
  • R2DBC is a specification that provides streamlined connectivity and non-blocking interaction with the database through a reactive driver for PostgreSQL; it integrates well with, and is even preferred for, the development of fully reactive applications [9].
  • Data Caching
  • Redis is used across both models for its high-performance caching capabilities, significantly reducing response times and load on databases and application components. Redis supports both blocking and non-blocking operations, making it a versatile choice for both systems [57].
  • Framework Architecture
In addition to the specific technologies, another important aspect is the significant architectural difference residing right at the framework level between the asynchronous and reactive approaches, specifically the Spring MVC and Spring WebFlux variants.
Spring MVC is the conventional variant of the framework, based on an imperative programming model. It leverages blocking I/O and utilizes a thread for each request [27]. Although this model is simple to understand, implement, and debug, it can quickly become inefficient when facing a large number of concurrent requests, due to the blocking resources and the complexity associated with managing the lifecycle of threads.
Spring WebFlux, on the other hand, embraces the reactive model, using non-blocking I/O with execution based on an event loop [28]. The pool of threads is reduced, and unlike the conventional model, there is no direct mapping between active threads and requests, which enables this model to efficiently handle a large number of concurrent requests using a limited number of threads. By leveraging reactive data types and functional-programming principles, the reactive variant makes it possible to manage data flows and backpressure optimally, while providing much greater scalability under conditions of intensive load.
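As referenced above in the description of Schedulers, the following minimal sketch illustrates how an execution context can be selected in Project Reactor: a blocking call is moved onto the boundedElastic scheduler, while downstream processing continues on the parallel scheduler. The method names are illustrative placeholders and the example is not taken from the migrated application.

```java
import reactor.core.publisher.Mono;
import reactor.core.scheduler.Schedulers;

public class SchedulerSelectionSketch {
    public static void main(String[] args) throws InterruptedException {
        Mono.fromCallable(SchedulerSelectionSketch::blockingLookup)   // wraps a blocking call
            .subscribeOn(Schedulers.boundedElastic())                 // run it on the pool intended for blocking work
            .map(value -> value + " processed")
            .publishOn(Schedulers.parallel())                         // continue downstream work on the parallel scheduler
            .subscribe(System.out::println);

        // keep the demo JVM alive long enough for the asynchronous pipeline to complete
        Thread.sleep(500);
    }

    private static String blockingLookup() {
        // placeholder for an operation such as a JDBC query or a file read
        return "record-42";
    }
}
```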

5.3. Project Structure

The structure of the proposed solution will be analyzed from two perspectives: the database structure, focusing on the role of the tables and their relationships, and the organization of the backend projects, describing the defined packages and the general role of the classes within each.
  • Database Structure
The database structure supporting the application’s services and functionalities is defined according to the diagram in Figure 2.
  • The “users” table defines the list of user accounts within the application.
  • The “tweets” table defines the list of messages within the application. This table has a one-to-many relationship with the “mentions” table, which references it through the foreign key “tweet_id”.
  • The “hashtags” table defines the list of tags extracted from message contents.
  • The “mentions” table defines the user mentions within messages. This table has a many-to-one relationship with the “tweets” table using the foreign key “tweet_id” to reference the associated entity.
  • The “tweet_hashtag” table defines the join enabling the many-to-many relationship between messages and tags. This table has a many-to-one relationship with the “tweets” table using the foreign key “tweet_id” and with the “hashtags” table using the foreign key “hashtag_id” to reference the associated entities.
  • The “follows” table defines the subscriptions between users.
  • The “likes” table stores the list of acknowledgments given by users to existing messages or replies.
  • The “replies” table stores the list of submitted replies by users to existing messages.
  • The “retweets” table stores the list of redistributions of existing messages by users.
  • Project Organization
The asynchronous and reactive projects share a similar structure for organizing the packages, with the primary aim of capturing and separating the responsibilities of each module, thereby facilitating maintenance and code flexibility. Figure 3 presents the project’s detailed package organization structure.
The main packages within the projects are:
  • The “client” package contains classes for communicating with the other services in the application.
  • The “config” package centralizes configuration classes for various components of the application, such as caching, security, etc.
  • The “controller” package includes classes that expose the application’s entry points, receiving and processing user requests.
  • The “entity” package contains the entities or data models equivalent to the objects persisted in the database.
  • The “exception” package defines specific exceptions used in the application for error handling.
  • The “mapper” package groups the classes that provide the conversion between data transfer objects (DTO) and entities.
  • The “model” package contains models used for transferring information, without directly exposing persistent entities.
  • The “repository” package includes interfaces and implementations for accessing persistent data and managing database operations.
  • The “service” package contains the core business logic of the application, which is implemented in classes that organize and sequence calls to repositories and other necessary components.
  • The “util” package centralizes utility classes, used in various parts of the application, such as validation methods, conversions, etc.

5.4. Migration Aspects and Stages

The main aspects and stages of the migration process are described in Table 5, with an emphasis on clearly outlining the available counterpart in each paradigm for specific elements.
Migrating from an asynchronous to a reactive model requires a series of changes at the level of code and infrastructure. Although the transition can be complex depending on the size of the project and the technologies employed, the long-term benefits are considerable, as evaluated in this research.

6. Implementation Details

6.1. Updating Java and Spring Versions

Recent versions of Java and Spring provide access to the latest functionality, optimizations, and enhancements while ensuring long-term relevance, compatibility, and essential security fixes against numerous vulnerabilities. Upgrading to the latest versions is therefore crucial for a successful migration process and lays a solid foundation for future adjustments. However, it is worth noting that most industry applications are still running on lower versions of Java. While the asynchronous and reactive projects are based on Java 21, many organizations may find it more practical to adopt an intermediate step.
Java 17 serves as a bridge between the two major versions of Spring Boot, namely Spring Boot 2 and Spring Boot 3. It is compatible with both, whereas Java 11 is not supported by Spring Boot 3. Therefore, for existing applications, particularly those on older Spring Boot versions, the first step should be updating to Java 17. This transition allows for a smoother migration path, as Java 17 facilitates compatibility while mitigating potential issues arising from outdated or deprecated specifications. This approach ensures that applications remain in line with industry standards while taking advantage of the new features and improvements available in later versions. This step should also be reasonably straightforward, as most Java versions are designed to maintain very good backward compatibility, facilitating a smooth transition.
One of the most significant changes is the management of Jakarta EE dependencies. Newer versions of Spring Boot no longer use “javax” dependencies, necessitating a migration to their “jakarta” equivalents. As a result, developers need to update the import statements in the source code wherever these old specifications are used. While this transition might seem straightforward within one’s own codebase, complications often arise with external dependencies because there is no interoperability between the two specifications.
The most reliable approach is to update the maintained dependencies to their latest versions, which should be compatible with Jakarta EE, and then thoroughly test the application to ensure that its functionality remains intact. However, if an immediate upgrade is not feasible, either because a compatible version of the dependency is not available or because the upgrade process is too complex, a temporary solution might involve using tools such as Eclipse Transformer [58]. This tool works by directly modifying the bytecode of the existing JAR files, converting “javax” references to “jakarta” equivalents without requiring access to the original source code.
This process begins by identifying all dependencies that still rely on the “javax” namespace. Once identified, Eclipse Transformer or a similar utility can be used to rewrite the bytecode within these JAR files, adapting them to the new Jakarta specifications. After the transformation, the modified JAR files are then integrated back into the project, replacing the old versions. Although this approach allows you to continue using older dependencies temporarily, it is crucial to conduct thorough testing of the application to ensure that these transformations have not introduced any issues or altered the application’s behavior. The transformation process must be repeated for each affected dependency, which can make the procedure quite time-consuming and complex. Moreover, because this process does not guarantee complete success, extensive testing is essential to confirm that the application’s functionality is preserved. A diagram presenting the mentioned steps is depicted in Figure 4.
Another change that may impact the operability of the application is the changed default behavior for a trailing slash in HTTP request paths. In the previous version of Spring, a controller method annotated with @GetMapping(“/feed”) would have accepted requests on both the “/feed” path, as in the example, and on “/feed/”. This default behavior is no longer present in the new version, resulting in an HTTP 404 error that can cause instability and operational disruptions for clients establishing communication using the trailing slash version. Nevertheless, the problem can be solved by reintroducing the mechanism that treats both variants equivalently, with the procedure being similar for both paradigms. It is required to define a configuration class implementing the WebMvcConfigurer interface in the case of the asynchronous application and WebFluxConfigurer in the case of the reactive one, where the preference to treat both paths as equivalent is redefined programmatically, as sketched below.
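A minimal configuration sketch of this adjustment is given below (it is not one of the numbered listings; imports are omitted, as in the other listings, and the class names are illustrative). It assumes the path-matching option exposed by the standard Spring configurers, which is deprecated in recent Spring Framework releases but still available.
// Asynchronous application (Spring MVC): re-enable trailing slash equivalence
@Configuration
public class MvcPathConfig implements WebMvcConfigurer {
    @Override
    public void configurePathMatch(PathMatchConfigurer configurer) {
        configurer.setUseTrailingSlashMatch(true);
    }
}

// Reactive application (Spring WebFlux): the equivalent configuration
// note: each paradigm uses its own PathMatchConfigurer class
@Configuration
public class WebFluxPathConfig implements WebFluxConfigurer {
    @Override
    public void configurePathMatching(PathMatchConfigurer configurer) {
        configurer.setUseTrailingSlashMatch(true);
    }
}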
The Java and Spring release upgrading process also includes other steps, depending on the specifications and technologies employed within each project, but these steps are an integrated part of the process, independent of the selected development paradigm.

6.2. Implementation and Management of Components

The Spring Framework promotes the organization of applications using a component structure, based on the design pattern of Inversion of Control/Dependency Injection [59]. The main benefit of this approach is the simplified management of dependencies and application configurations. The framework defines a set of specific stereotypes, such as Controller, Service, and Repository by using annotations, to properly differentiate between each of their purposes and functionalities [27].
User requests are received and handled within the controller, which is the main entry point of the application. In terms of code structure and syntax, the differences at this level are minimal, with the only notable difference being the type of data used to return the computed results.
The responsibility of aggregating, orchestrating, and performing the application logic belongs to the service component, which is the intermediary of all interactions between the user and the data necessary to fulfill each request. Due to the role carried out by this component, most of the defined code logic is at this level, which makes service migration the main topic under analysis to understand the differences and the necessary adaptation steps in the reactive context.
Asynchronous programming structures the flow of the program as chains of asynchronous operations that use the results from the preceding level. CompletableFuture is intended to store both single results from a given level and collections of objects, and its exposed methods allow streamlined manipulation and further processing of the results, simplifying the chaining of asynchronous operations [6]. Methods such as supplyAsync and runAsync schedule tasks for execution on a separate thread that will not block the main one, while the result processing steps are defined using methods such as thenApply, which applies a direct transformation to the result when it becomes available, and thenCompose, which connects the current operation with another asynchronous operation whose execution depends on the result at the current level. Context switching can be achieved both at the beginning of the operation sequence and between steps using variations such as thenApplyAsync, which does not restrict subsequent steps to execution on the same thread. In addition, such methods allow a specific thread pool to be delegated for the processing by accepting an ExecutorService as a method parameter.
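A compact, hypothetical illustration of this chaining style follows (imports omitted, as in the numbered listings); the service, mapper, and client names are placeholders and do not correspond to the project code.
ExecutorService executor = Executors.newFixedThreadPool(8);

CompletableFuture<ProfileDto> profile = CompletableFuture
    // run the lookup on the dedicated pool instead of the common pool
    .supplyAsync(() -> userService.loadUser(userId), executor)
    // plain transformation applied once the result becomes available
    .thenApply(userMapper::toDto)
    // chain a dependent asynchronous call and flatten the nested future
    .thenCompose(dto -> statsClient.fetchStats(dto.getId())
        .thenApply(dto::withStats))
    // variant that may resume the next step on another thread of the pool
    .thenApplyAsync(this::applyDefaults, executor);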
The reactive model is fundamentally based on managing tasks as a continuous stream of data, using two fundamental types that encapsulate the elements and allow their non-blocking manipulation. Mono emits at most one element, while Flux can emit multiple values, and both primitives may also emit no elements at all. The data is processed in a functional style by applying a series of transforming, filtering, or combining operators to the output streams. When each element of the stream must be transformed into a different shape or format, the conversion function is applied element-wise using the map operator, while flatMap applies a function that emits, in turn, another reactive stream whose computation depends on the previous result, thereby also “flattening” the resulting stream in the process. Where a stream may emit no elements, a default value or an error can be specified using operators such as switchIfEmpty; combining the elements of the same stream can be achieved by applying a function with the reduce operator; and elements that do not fulfill specific processing conditions can be excluded by applying a predicate with the filter operator.
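An equivalent hypothetical fragment written with the reactive operators described above is shown next; again, the repository, client, and mapper names are purely illustrative.
Flux<TweetDto> feed = tweetRepository.findByUserId(userId)   // Flux<TweetEntity>
    // exclude elements that do not satisfy the processing condition
    .filter(tweet -> tweet.getContent() != null)
    // element-wise transformation into the transfer format
    .map(tweetMapper::toDto)
    // dependent reactive call, flattened into the outer stream
    .flatMap(dto -> interactionClient.getLikesCount(dto.getId())
        .map(dto::withLikes))
    // fallback stream when no element is emitted
    .switchIfEmpty(Flux.just(TweetDto.empty()));

// combine the elements of the same stream into a single value
Mono<Long> totalLikes = feed.reduce(0L, (sum, dto) -> sum + dto.getLikes());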
In addition to the previously outlined scenarios, applications must also accommodate the combination of data from multiple sources, which in the asynchronous model involves launching multiple operations and aggregating their results, and in the reactive context involves combining multiple streams. The allOf method accepts multiple instances of CompletableFuture as parameters, enabling the orchestration of parallel scenarios that depend on the completion of previously initiated actions. The adaptation to the reactive paradigm is achieved using the zip operator, available for both fundamental types, which allows combining flows from different sources into a single final entity, as contrasted below.
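The two aggregation styles can be contrasted with a short hypothetical fragment: allOf completes when every future has completed, after which the individual results are read back, whereas zip emits a tuple containing all the combined values. The client and mapper methods are illustrative only.
// Asynchronous model: wait for all futures, then read their results
// 'tweet' stands for the already-loaded entity being enriched
CompletableFuture<Long> likes = interactionClient.countLikes(tweetId);
CompletableFuture<Long> replies = interactionClient.countReplies(tweetId);
CompletableFuture<TweetDto> combinedAsync = CompletableFuture
    .allOf(likes, replies)
    .thenApply(ignored -> tweetMapper.toDto(tweet, likes.join(), replies.join()));

// Reactive model: zip the streams and map over the resulting tuple
Mono<TweetDto> combinedReactive = Mono
    .zip(interactionClient.likesCount(tweetId),
         interactionClient.repliesCount(tweetId))
    .map(tuple -> tweetMapper.toDto(tweet, tuple.getT1(), tuple.getT2()));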

6.3. Communication Using HTTP Clients

Modern architectures based on microservices require effective communication between different system components [13]. Within the developed system, communication is achieved using HTTP requests, employing dedicated client implementations. The selected implementation primarily takes into account interoperability with the types and mechanisms in each paradigm while also providing efficient interactions.
For the asynchronous application, it was decided to use the HTTP client provided by Java. This interface integrates with the primitives specific to the paradigm and offers robust support for asynchronous operations. Requests can be launched asynchronously using methods such as sendAsync, with the results made available as a CompletableFuture. The interface also allows the configuration of a dedicated ExecutorService to achieve greater control and flexibility in concurrency management. The main downside of this solution is the lack of integration with Spring, with response transformations and data conversions being handled programmatically, introducing additional logic and complexity.
In the reactive approach, WebClient from Spring WebFlux was chosen. Similar to HttpClient, it provides native integration with the Mono and Flux reactive types, but it also supports automatic serialization and deserialization of data, increasing the clarity and efficiency of the implementation. As an integrated part of the reactive ecosystem, WebClient is developed with the paradigm’s principles in mind so that requests and responses can be handled in a non-blocking way, making it a natural choice for scenarios implying heavy traffic conditions and high-performance requirements. The contrast between the two clients is sketched below.
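The difference in ergonomics can be seen in the following hypothetical fragment; the service URL, endpoint, and response type are placeholders, and error handling is omitted for brevity.
// Asynchronous model: Java HttpClient with manual result conversion
HttpClient httpClient = HttpClient.newBuilder()
    .executor(Executors.newFixedThreadPool(8))
    .build();
HttpRequest request = HttpRequest.newBuilder()
    .uri(URI.create("http://interaction-service/likes/" + tweetId))
    .GET()
    .build();
CompletableFuture<Long> likes = httpClient
    .sendAsync(request, HttpResponse.BodyHandlers.ofString())
    // the response body must be converted programmatically
    .thenApply(response -> Long.parseLong(response.body()));

// Reactive model: WebClient with automatic deserialization
WebClient webClient = WebClient.create("http://interaction-service");
Mono<Long> likesMono = webClient.get()
    .uri("/likes/{id}", tweetId)
    .retrieve()
    // the body is deserialized directly into the reactive type
    .bodyToMono(Long.class);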

6.4. Handling System Errors

Software systems are prone to unpredictable behavior and errors, but maintaining resilience and reliability is not possible without effective management. There are several strategies for handling exceptional cases, and implementation methods are also different depending on the paradigm.
The standard approach implies the provision of a mechanism for computing an alternate response or an error handler that can vary in complexity, used at the moment when the application encounters the exceptional case. While this addresses the punctual error and provides a degree of control over the execution flow, it does not deal with the root cause and does not facilitate the recovery of the system from its unstable state.
CompletableFuture provides the exceptionally method for handling errors in asynchronous operations, while Mono and Flux provide equivalent methods for handling errors in reactive flows, such as onErrorResume. If the primary process fails, a default value can be returned or an attempt can be made to retrieve the data from alternate sources. For example, in the scenario where the getFollowedIds method fails, the data available in the cache is inspected by calling the getFollowedUsersFromCache method. This approach can be further extended by executing the operations or streams that provide access to the same required data in parallel, e.g., a simultaneous database and cache lookup, instead of waiting for the regular version to fail. CompletableFuture’s anyOf method allows asynchronous operations to be launched simultaneously, with the result being determined by the first completed operation, while the firstWithValue method provided in the reactive context waits for the first successful response, ignoring the failed ones.
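A hypothetical sketch of these fallback mechanisms is given below; the client and cache method names mirror the ones mentioned above but are illustrative only.
// Asynchronous model: fall back to the cached value when the primary call fails
CompletableFuture<List<UUID>> followedIds = interactionClient.getFollowedIds(userId)
    .exceptionally(error -> getFollowedUsersFromCache(userId));

// Launch both lookups at once and continue with whichever completes first
CompletableFuture<Object> fastest = CompletableFuture.anyOf(
    interactionClient.getFollowedIds(userId),
    cacheClient.getFollowedIds(userId));

// Reactive model: switch to an alternate publisher when an error is emitted
Mono<List<UUID>> followedIdsMono = interactionClient.getFollowedIdsReactive(userId)
    .onErrorResume(error -> getFollowedUsersFromCacheReactive(userId));

// Subscribe to both sources and keep the first one that emits a value
Mono<List<UUID>> firstValue = Mono.firstWithValue(
    interactionClient.getFollowedIdsReactive(userId),
    cacheClient.getFollowedIdsReactive(userId));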
Occasionally, the errors encountered in applications may be temporary or non-deterministic, caused by fluctuating conditions. In such situations, retrying the operations is an elegant and practical solution for dealing with exceptional cases without requiring the development of specific and complex solutions.
The asynchronous specification does not provide any implementation for retrying failed operations, requiring a programmatic solution such as recursion or the utilization of existing framework implementations such as Spring Retry. Instead, the retry operator is available in the reactive context, which allows for detailed configuration by specifying the number of retries, timing strategy, and re-execution conditions for the operation.
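For illustration, the hypothetical fragment below contrasts a Spring Retry annotation wrapping a blocking call in the asynchronous application (assuming @EnableRetry is active) with Reactor’s retry operator applied directly to the stream; the client methods are placeholders.
// Asynchronous model: no built-in retry; Spring Retry wraps the blocking call
@Retryable(maxAttempts = 3, backoff = @Backoff(delay = 200))
public List<UUID> loadFollowedIds(UUID userId) {
    return interactionClient.getFollowedIdsBlocking(userId);
}

// Reactive model: retry the stream with backoff and a re-execution condition
Mono<List<UUID>> followedIds = interactionClient.getFollowedIdsReactive(userId)
    .retryWhen(Retry.backoff(3, Duration.ofMillis(200))
        // retry only for errors considered transient
        .filter(TimeoutException.class::isInstance));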
However, a more robust approach for dealing with errors in unpredictable situations, such as a sudden spike in traffic, is to leverage the “Circuit Breaker” pattern [60], which is designed to detect problems in real time and prevent errors from propagating throughout the system. A fallback method is configured, and when the main component becomes non-functional, resulting in the number of errors increasing, automatic redirection of requests to the fallback method takes place, ensuring a minimum level of operation and avoiding a total system outage. Periodically, an attempt will be made to gradually restore traffic through the main flow, and once the errors have completely disappeared, the fallback method is no longer invoked, and the system returns to normal operating conditions. The Resilience4j library provides versions compatible with both variants [61]. Listing 1 illustrates an example of using a CircuitBreaker in the reactive application.
Listing 1. Example of using Circuit Breaker in reactive applications.
@CircuitBreaker(
  name = "followedIdsCircuitBreaker",
  fallbackMethod = "getUserFeedWithCachedFollowed"
)
public Flux<TweetDto> getUserFeed(UUID userId) {
  return interactionClient.getFollowerIds(userId)
    .collectList()
    .flatMapMany(
        tweetRepository::findByUserIdInOrderByCreatedAtDesc
    )
    .flatMap(this::enrichTweetDto);
}

public Flux<TweetDto> getUserFeedWithCachedFollowed(
  UUID userId, Throwable t
) {
  String key = FOLLOWED_CACHE + "::" + userId;
  return redisTemplate.opsForValue().get(key)
    .map(v -> {
      try {
        return objectMapper.readValue(
          v, new TypeReference<List<UUID>>() {}
        );
      } catch (JsonProcessingException e) {
        throw new RuntimeException(e);
      }
    })
    .flatMapMany(Flux::fromIterable)
    .collectList()
    .flatMapMany(
        tweetRepository::findByUserIdInOrderByCreatedAtDesc
    )
    .flatMap(this::enrichTweetDto);
}
In the code above, the getUserFeedWithCachedFollowed fallback method will be called automatically if a high number of errors is detected in the communication with the interaction microservice. Specifically, the circuit breaker is configured to open when more than 50% of requests fail within a rolling window of 100 calls. Once appropriate functionality is restored and the service can accept requests again, the traffic will be completely re-routed back through the default method.

6.5. Task Scheduling Mechanisms

Scheduling mechanisms enable applications to automatically perform periodic tasks, such as cleaning up redundant resources or updating cached data, which are common scenarios in distributed microservice-based systems. Effective scheduling of periodic tasks has an important role to play in maintaining functional integrity, quality, and system performance over time. Although framework solutions are available, the required mechanisms are already embedded within both paradigms. One scenario arising in the resulting project was cleaning up interactions after deleting a message since the entries are stored in different databases, which can lead to potential inconsistencies. However, the temporary presence of such inconsistencies does not have any visible impact on the user. By scheduling periodic tasks to be performed automatically, one can avoid programmatic cleanup for each deletion action, therefore reducing the additional load on the system.
The asynchronous model employs the dedicated ScheduledExecutorService implementation to regularly plan and execute such tasks, granting detailed control over job scheduling but also providing the ability to define the initial execution time and the intervals between recurrent executions.
Project Reactor provides an elegant and efficient mechanism for planning tasks that naturally integrates within the reactive context using Schedulers, with combinations of operators such as interval and flatMap, allowing in turn a fine-grained specification of execution conditions and frequency.
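The two scheduling styles are contrasted in the hypothetical fragment below; the cleanup methods stand for the interaction cleanup task described above and are not part of the original listings.
// Asynchronous model: fixed-rate execution on a dedicated scheduling pool
ScheduledExecutorService scheduler = Executors.newScheduledThreadPool(1);
scheduler.scheduleAtFixedRate(
    this::cleanUpOrphanedInteractions,   // the periodic job
    1, 30, TimeUnit.MINUTES);            // initial delay and period
// ... on shutdown
scheduler.shutdown();

// Reactive model: an interval stream drives the same periodic cleanup
Disposable job = Flux.interval(Duration.ofMinutes(30), Schedulers.boundedElastic())
    .flatMap(tick -> cleanUpOrphanedInteractionsReactive())
    .subscribe();
// ... on shutdown
job.dispose();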
Although the migration process reveals some differences at the code level, certain similarities, and a common structure are preserved: the usage of an Executor or Scheduler instance to configure the execution, the definition of jobs using the available interfaces and proper lifecycle management, correctly shutting down jobs and deallocating resources by leveraging methods such as shutdown or dispose.

6.6. Database Interactions

The asynchronous solution relies on a blocking model for database access but benefits from a full mapping of Java classes to the database structures, thanks to a comprehensive set of functionalities provided by Hibernate and JPA [55]. The database schema initialization happens automatically at runtime once the application is launched, based exclusively on metadata extracted from annotations.
The integration with Hibernate allows the automatic generation of unique identifiers for each entity, e.g., by annotating the id field of the TweetEntity with @GeneratedValue, as well as the definition and management of data relationships, including those made using link tables, by specifying association columns. In this example, the relationships between the messages, mentions, and tags tables are defined using the @OneToMany, @ManyToMany, @JoinTable, and @JoinColumn annotations. In addition, data integrity is ensured through support for cascading operations and the automatic removal of entities whose parent is deleted, achieved in this scenario by specifying CascadeType.ALL and enabling orphanRemoval for mentions and tags.
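A condensed, hypothetical excerpt of such an entity mapping is shown below; the field and class names follow the tables described in Section 5.3, while the exact annotations and cascade settings in the project code may differ slightly.
@Entity
@Table(name = "tweets")
public class TweetEntity {

    @Id
    @GeneratedValue
    private UUID id;

    // child mentions are persisted and removed together with the parent tweet
    @OneToMany(mappedBy = "tweet", cascade = CascadeType.ALL, orphanRemoval = true)
    private List<MentionEntity> mentions;

    // many-to-many association materialized through the "tweet_hashtag" join table
    @ManyToMany(cascade = {CascadeType.PERSIST, CascadeType.MERGE})
    @JoinTable(name = "tweet_hashtag",
               joinColumns = @JoinColumn(name = "tweet_id"),
               inverseJoinColumns = @JoinColumn(name = "hashtag_id"))
    private List<HashtagEntity> hashtags;

    // remaining fields, getters, and setters omitted
}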
Implementing database access and performing CRUD (Create, Read, Update, Delete) operations for managed entities is achieved by defining repositories that extend the JpaRepository interface, considerably simplifying the integration and manipulation process.
The integration with JPA provides an extensive suite of predefined methods and the ability to define additional queries simply by following a method naming convention, e.g., the repository method findByTweetIdOrderByCreatedAtDesc, which retrieves message replies in descending order by their creation date. This functionality gives the asynchronous model quick and efficient access to stored data, including the ability to execute queries of considerable complexity.
The definition of native queries allows direct use of SQL syntax, providing complete control over execution beyond the abstraction offered by the framework; in certain scenarios, this makes it possible to adjust and optimize the behavior or to use particular functionalities specific to the database system. In Listing 2, a native query is defined for the searchUsers method to look up users by their names in a case-insensitive way. Last but not least, JPQL queries, such as the one defined to retrieve the top reply of a message through the findTopReplyByLikesForTweetId method, allow information to be extracted from entities directly in the desired format through direct access to public model class constructors. Functionalities such as paging and EntityGraph, the latter employed to eagerly fetch the tags and mentions only in this scenario since they were always required, provide granular control over loading strategies and efficient data access management.
Listing 2 includes a subset of methods and queries that were used in the asynchronous implementation in addition to the predefined ones.
Listing 2. Examples of methods for database access in asynchronous applications.
List<ReplyEntity> findByTweetIdOrderByCreatedAtDesc(UUID tweetId);

@EntityGraph(attributePaths = {"mentions", "hashtags"})
List<TweetEntity> findByUserIdInOrderByCreatedAtDesc(
   List<UUID> userIds
);

@Query(
  value = "SELECT * FROM users WHERE user_name ILIKE CONCAT('%', :query, '%')",
  nativeQuery = true
)
List<UserEntity> searchUsers(@Param("query") String query);

@Query(
  "SELECT NEW ro.tweebyte.interactionservice.model.ReplyDto" +
  "(r.id, r.userId, r.content, r.createdAt, CAST(COALESCE(COUNT(l), 0) AS long)) " +
  "FROM ReplyEntity r " +
  "LEFT JOIN LikeEntity l ON r.id = l.likeableId AND l.likeableType = 'REPLY' " +
  "WHERE r.tweetId = :tweetId " +
  "GROUP BY r.id, r.userId, r.content, r.createdAt " +
  "ORDER BY COUNT(l) DESC, r.createdAt DESC"
)
Page<ReplyDto> findTopReplyByLikesForTweetId(@Param("tweetId") UUID tweetId, Pageable pageable);
Reactive applications can only reach their performance potential in a fully integrated and reactive ecosystem, where database access is one of the most common scenarios across current systems. Due to these constraints, adopting a reactive database access solution is a mandatory requirement. Otherwise, there is a risk of fully inheriting the complexity and overhead associated with the migration process without being able to benefit from the advantages and performance improvements to their full potential.
R2DBC is the specification selected for defining a fully reactive system and managing database interactions in a non-blocking way. The first limitation is the lack of functionality for initializing the database schema using annotations. The alternative is to explicitly define the native SQL code, store it in a file within the application resources, and configure it to execute automatically at application startup.
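One possible way to wire this up is sketched below, assuming the DDL script is stored as schema.sql in the application resources; it relies on Spring’s ConnectionFactoryInitializer together with a ResourceDatabasePopulator, and the configuration class name is illustrative.
@Configuration
public class SchemaInitializationConfig {

    @Bean
    public ConnectionFactoryInitializer schemaInitializer(
            ConnectionFactory connectionFactory) {
        ConnectionFactoryInitializer initializer = new ConnectionFactoryInitializer();
        initializer.setConnectionFactory(connectionFactory);
        // execute the native DDL script from the resources at application startup
        initializer.setDatabasePopulator(
            new ResourceDatabasePopulator(new ClassPathResource("schema.sql")));
        return initializer;
    }
}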
The entities defined with R2DBC are significantly stripped down, providing only minimal functionality for specifying the properties and metadata required for mapping these objects to the corresponding database tables. This might reduce some overhead and complexity, but it also requires a programmatic approach to achieve functionalities comparable to those of ORMs. Consequently, any potential benefits could be offset, possibly making the system even more complex and inefficient.
Recent implementations of R2DBC provide support for version control and transactional mechanisms, but currently, there is no native support for entities using composite primary keys. This limitation required the definition of an additional unique identifier and the creation of a composite index for the fields in question, intending to preserve the equivalence following the migration process.
The implementation of data access and the execution of CRUD operations are performed using the ReactiveCrudRepository, which allows non-blocking interaction based on reactive types and provides streamlined, efficient access to data.
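A minimal hypothetical repository of this kind is shown below; the derived query method follows the naming convention discussed next and mirrors the one used in Listing 1.
public interface TweetRepository
        extends ReactiveCrudRepository<TweetEntity, UUID> {

    // derived query resolved from the method name, returning a reactive stream
    Flux<TweetEntity> findByUserIdInOrderByCreatedAtDesc(List<UUID> userIds);
}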
R2DBC offers a subset of predefined queries and supports custom query definitions via method naming conventions. However, it lacks several advanced features that JPA provides, such as support for JPQL (Java Persistence Query Language), which facilitates complex, object-oriented queries. Additionally, R2DBC does not offer built-in support for caching, lazy loading, or managing relationships between entities, requiring considerable adjustments to the source code to simulate and achieve functional equivalence through programmatic management, which can be time-consuming and error-prone for bigger projects, as illustrated in Listing 3.
Listing 3. Example of loading entity relationships in reactive applications.
private Mono<TweetDto> enrichTweetDto(TweetEntity tweetEntity) {
  Mono<Long> likesMono =
    interactionClient.getLikesCount(tweetEntity.getId());
  Mono<Long> repliesMono = interactionClient
      .getRepliesCount(tweetEntity.getId());
  Mono<Long> retweetsMono = interactionClient
      .getRetweetsCount(tweetEntity.getId());
  Mono<ReplyDto> replyMono = interactionClient
      .getTopReply(tweetEntity.getId());

  Mono<List<HashtagEntity>> hashtagsMono = hashtagRepository
      .findHashtagsByTweetId(tweetEntity.getId()).collectList();
  Mono<List<MentionEntity>> mentionsMono = mentionRepository
      .findMentionsByTweetId(tweetEntity.getId()).collectList();

  return Mono.zip(likesMono, repliesMono, retweetsMono,
        replyMono, hashtagsMono, mentionsMono)
      .map(data -> tweetMapper.mapEntityToDto(
        tweetEntity, data.getT1(), data.getT2(),
        data.getT3(), data.getT4(),
        data.getT5(), data.getT6()
      ));
}
The analyzed project used complex and ORM-specific features, but it was manageable to integrate R2DBC after some adjustments and trade-offs. However, many limitations and little or no support for certain functionalities were observed, which may be a determining factor when considering the migration process, especially within systems that are heavily reliant on complex or specific database access abstraction functionalities currently available only in mature technologies.

7. Evaluation of Results

The evaluation of results gathered from the migration process focuses on the functional equivalence of the two applications and performance analysis in different use cases. Multiple test scenarios have been analyzed and various metrics have been compared to provide an overview of the advantages and limitations of each paradigm.

7.1. Functional Equivalence

An inherent part of the migration process involved ensuring that both implementations produced identical results when performing the same operations. This consistency not only preserves existing functionality but also ensures accurate performance analysis, as comparing performance indicators would be unreliable in the presence of functional differences. Achieving functional equivalence required a robust testing methodology, featuring tests at different levels of abstraction and ensuring correct execution. Some tests validate individual components, ensuring smaller code units perform as expected, while others evaluate overall system behavior, emphasizing interactions and integrations by simulating real user actions. This approach guarantees that both individual components and the system as a whole are properly evaluated, preserving functionality throughout the transition to the reactive model.
Unit tests, created using JUnit 5 [62], were essential to check the accuracy and consistency of individual units of code. Aiming to be as similar as possible in both implementations, these tests were tailored only to account for differences such as data types and methods specific to each paradigm. Each test is structured to prepare the conditions and prerequisites, execute the tested functionality, and verify the expected results, thereby ensuring reliable testing of individual components throughout the migration process. Listing 4 illustrates an example of a unit test for the user login functionality in the asynchronous approach.
Listing 4. Example of unit test covering the user login in the asynchronous approach.
@Test
void testLoginSuccess() throws ExecutionException,
               InterruptedException {
  //arrange
  UserLoginRequest request = new UserLoginRequest(
    "user@example.com", "correctpassword"
  );
  UserEntity userEntity = new UserEntity();
  userEntity.setEmail("user@example.com");
  userEntity.setPassword("$2a$10$SomeHashedPasswordHere");
  userEntity.setId(UUID.randomUUID());

  when(userRepository.findByEmail(any()))
    .thenReturn(Optional.of(userEntity));

  when(
   encoder
   .matches(request.getPassword(), userEntity.getPassword())
  ).thenReturn(true);

  //act
  AuthenticationResponse result = authenticationService
    .login(request).get();

  //assert
  assertNotNull(result);
  assertFalse(result.getToken().isEmpty());
  verify(userRepository).findByEmail(request.getEmail());
  verify(encoder)
   .matches(request.getPassword(), userEntity.getPassword());
}
The introduced test example validates the user login functionality by setting up a login request with valid credentials and a mocked user entity. It configures the repository and the encoder mocks to return the expected values, simulating real-world conditions. Following this setup, the test calls the actual login method on the authentication service, concluding with response checks to ensure a successful login. Assertions ensure that the response is not null, that the token is present, and that the correct methods were invoked with the expected parameters, confirming the login process’s accuracy and reliability.
Cucumber [63] was also employed to define Behavior-Driven Development (BDD) automated tests. Written in Gherkin, these tests describe software behavior in clear and understandable terms, remaining agnostic of implementation details so they can be used across both versions of the application. These testing scenarios focus on assessing functional consistency by outlining the expected system behavior from a user’s perspective, described in a human-readable language. The abstraction from the code streamlines the validation of overall system functionality and integration points, ensuring a consistent user experience while also serving as a helpful instrument in identifying and addressing any discrepancies during the development and migration process. Listing 5 illustrates a partial Cucumber feature file for message management.
Listing 5. Example of a piece of Cucumber feature for message management testing scenarios.
Feature: User Tweet Management
  Scenario: Register a user, create a tweet, and verify tweet existence
    Given a new user is registered with valid details
    When the user posts a valid tweet
     And the user retrieves their tweets
    Then the user’s tweet should be included in the retrieved tweets

  Scenario: Attempt to create a tweet with no content
    Given a new user is registered with valid details
    When the user attempts to post a tweet with no content
    Then the response should indicate a content validation error

  Scenario: Attempt to create a tweet with insufficient content length
    Given a new user is registered with valid details
    When the user attempts to post a tweet with insufficient content
    Then the response should indicate a minimum content length error

  # other scenarios
This Cucumber feature tests the message management functionalities, including one scenario that verifies that a new user can be registered, create a valid message, and retrieve it. It also includes scenarios such as trying to create a message with no content or with insufficient content length, which should trigger specific content validation errors. By incorporating such negative checks and edge cases, these tests ensure that the system not only processes user requests as expected but also properly validates inputs and handles unexpected situations effectively.
To execute the Cucumber feature files, step definitions must be provided that translate the Gherkin scenarios into executable Java code. These step implementations run independently against the different backends, ensuring appropriate validation regardless of the underlying implementation. Listing 6 presents two examples of step implementations, showcasing the actual mapping between the high-level descriptions and the interactions required to perform the respective actions.
Listing 6. Example of step implementations for Cucumber feature scenarios.
@Given("a new user is registered with valid details")
public void registerNewUser() throws JsonProcessingException {
  String url = USER_SERVICE_BASE_URL + "/auth/register";
  HttpHeaders headers = new HttpHeaders();
  headers.setContentType(MediaType.MULTIPART_FORM_DATA);

  LinkedMultiValueMap<String, String> body =
      new LinkedMultiValueMap<>();
  body.add("userName", generateRandomUsername());
  body.add("email", generateRandomEmail());
  //addition of the other fields

  HttpEntity<LinkedMultiValueMap<String, String>> requestEntity =
    new HttpEntity<>(body, headers);
  ResponseEntity<String> response = restTemplate
    .postForEntity(url, requestEntity, String.class);
  assertEquals(HttpStatus.OK, response.getStatusCode());

  AuthenticationResponse authResponse = objectMapper
    .readValue(response.getBody(), AuthenticationResponse.class);
  userId = UUID.fromString(
    getClaimFromToken(authResponse.getToken(), "user_id")
  );
}

@Then("the user’s tweet should be included in the retrieved tweets")
public void verifyTweetInclusion() throws Exception {
  TweetDto[] tweetsArray = objectMapper
    .readValue(response.getBody(), TweetDto[].class);
  List<TweetDto> tweets = Arrays.asList(tweetsArray);

  assertTrue(
    tweets
      .stream()
      .anyMatch(
        tweet ->
          "This is a valid tweet."
            .equals(tweet.getContent()) &&
        tweetId.equals(tweet.getId())
      )
  );
}
Both method implementations are annotated and parameterized to align with the steps defined in the feature file. As per the first scenario listed, the registerNewUser method is connected to the Given step, while the verifyTweetInclusion is tied to the Then step. These annotations and the specified parameters establish the direct link between the Gherkin steps and the executable code, ensuring that each scenario step triggers the appropriate method.
By implementing an average of approximately 150 tests per service, including both unit tests and various BDD features and scenarios, an estimated code coverage baseline of over 90% was reached, demonstrating that the migration from asynchronous to reactive specifications did not compromise the application’s functionality. The test suites consistently produced the same expected results across both implementations, confirming functional equivalence and reinforcing the feasibility of an effective and complete transition to the reactive approach.

7.2. Performance Analysis

The performance analysis is focused on comparing resource utilization and efficiency for each approach. For this study, both static parameters and the dynamics of the applications under concurrent conditions were considered.

7.2.1. Static and Initialization Metrics

These properties outline a preliminary overview of the first differences resulting after the migration process, summarized in Table 6. Although the impact concerning behavior during operation is limited, some of these characteristics might still be relevant considering some convenience aspects of CI/CD (Continuous Integration and Continuous Delivery) processes [64].

7.2.2. Evaluation Under Concurrent Load Conditions

Analyzing the performance of the two approaches under concurrent conditions involved both the selection of appropriate tools for simulating and monitoring the traffic with a variable number of users and the definition of a test plan that would provide relevant data and a fair evaluation, minimizing errors and avoiding sudden overload.
The simulation of concurrency with varying numbers of users was performed using Apache JMeter [65]. The testing started with a 60-second ramp-up period per scenario, during which the number of requests was gradually increased. The actual tests were run over 3-minute durations, with the ramp-up period data deliberately excluded from the reports, as the focus was on monitoring and analyzing the performance of both systems after reaching a stable state at the desired level of concurrency.
The metrics were collected using the aggregated reports generated by JMeter, which captured a detailed overview of the response times and throughput rates, but also using VisualVM, which allowed for the monitoring and recording of resources such as memory consumption and CPU usage, together offering insights into the system’s performance under high concurrency and the ability to handle an increasing number of requests over time, ensuring a solid evaluation of both efficiency and scalability.
The first analyzed scenario implied the fetching of a user profile summary. A list of user IDs extracted from the database was loaded into the test plan defined with JMeter, and virtual users launched concurrent requests using distinct IDs. The plots illustrating the heap memory and throughput variation are displayed in Figure 5.
At a low concurrency level of 10 users, the reactive model consumed 60 MB of memory, while the asynchronous model consumed 64 MB. CPU usage was 5.2% for the reactive model and 5.9% for the asynchronous model. The average response time was 8 ms for the reactive model and 9 ms for the asynchronous model. The 90th percentile response time was 9 ms for the reactive model and 11 ms for the asynchronous model. Throughput was measured at 1015 req/s for the reactive model and 936 req/s for the asynchronous model.
At a mid-concurrency level of 500 users, the reactive model consumed 260 MB of memory, while the asynchronous model consumed 335 MB. CPU usage was 37.3% for the reactive model and 34.2% for the asynchronous model. The average response time was 51 ms for the reactive model and 69 ms for the asynchronous model. The 90th percentile response time was 61 ms for the reactive model and 141 ms for the asynchronous model. Throughput was measured at 8075 req/s for the reactive model and 6008 req/s for the asynchronous model.
At a peak concurrency level of 1000 users, the reactive model consumed 473 MB of memory, while the asynchronous model consumed 535 MB. CPU usage was 42.1% for the reactive model and 40.7% for the asynchronous model. The average response time was 93 ms for the reactive model and 125 ms for the asynchronous model. The 90th percentile response time was 121 ms for the reactive model and 277 ms for the asynchronous model. Throughput was measured at 8867 req/s for the reactive model and 6659 req/s for the asynchronous model.
The second investigated scenario involved simulating user interactions within the application, specifically the action of subscribing to other users. In this setup, both a list of user IDs that will subscribe and a list of user IDs to be subscribed to were added to the JMeter test plan, with multiple unique application user identities used across different virtual users. This approach evaluated the performance impact of multiple virtual users executing subscription operations simultaneously. The plots displaying the memory and throughput variation are presented in Figure 6.
At a low concurrency level of 10 users, the reactive model consumed 87 MB of memory, while the asynchronous model consumed 94 MB. CPU usage was 8.6% for the reactive model and 10.2% for the asynchronous model. The average response time was 7 ms for the reactive model and 8 ms for the asynchronous model. The 90th percentile response time was 10 ms for both models. Throughput was measured at 1060 req/s for the reactive model and 992 req/s for the asynchronous model.
At a mid-concurrency level of 250 users, the reactive model consumed 213 MB of memory, while the asynchronous model consumed 239 MB. CPU usage was 31.3% for the reactive model and 30.8% for the asynchronous model. The average response time was 41 ms for the reactive model and 62 ms for the asynchronous model. The 90th percentile response time was 50 ms for the reactive model and 129 ms for the asynchronous model. Throughput was measured at 4998 req/s for the reactive model and 3335 req/s for the asynchronous model.
At a peak concurrency level of 1000 users, the reactive model consumed 366 MB of memory, while the asynchronous model consumed 394 MB. CPU usage was 37.9% for the reactive model and 35.6% for the asynchronous model. The average response time was 153 ms for the reactive model and 245 ms for the asynchronous model. The 90th percentile response time was 191 ms for the reactive model and 291 ms for the asynchronous model. Throughput was measured at 5446 req/s for the reactive model and 3704 req/s for the asynchronous model.
The third scenario under consideration involves updating an existing message. This test was conducted using distinct instances for both users and messages to evaluate performance during update operations. The memory and throughput variations are plotted in Figure 7.
At a low concurrency level of 10 users, the reactive model consumed 81 MB of memory, while the asynchronous model consumed 93 MB. CPU usage was 7.9% for the reactive model and 9.5% for the asynchronous model. The average response time was 8 ms for the reactive model and 9 ms for the asynchronous model. The 90th percentile response time was 10 ms for the reactive model and 11 ms for the asynchronous model. Throughput was measured at 981 req/s for the reactive model and 955 req/s for the asynchronous model.
At a mid-concurrency level of 250 users, the reactive model consumed 206 MB of memory, while the asynchronous model consumed 229 MB. CPU usage was 24.2% for the reactive model and 23.4% for the asynchronous model. The average response time was 71 ms for the reactive model and 73 ms for the asynchronous model. The 90th percentile response time was 105 ms for the reactive model and 162 ms for the asynchronous model. Throughput was measured at 3081 req/s for the reactive model and 2624 req/s for the asynchronous model.
At a peak concurrency level of 1000 users, the reactive model consumed 303 MB of memory, while the asynchronous model consumed 351 MB. CPU usage was 26.9% for the reactive model and 28.7% for the asynchronous model. The average response time was 231 ms for the reactive model and 281 ms for the asynchronous model. The 90th percentile response time was 294 ms for the reactive model and 688 ms for the asynchronous model. Throughput was measured at 3714 req/s for the reactive model and 2965 req/s for the asynchronous model.
The fourth scenario being evaluated was retrieving a user’s messages. This case was intended to be particularly demanding and proved to be perhaps the most challenging of those analyzed: database entries in the order of millions were loaded this time, with every user in the application having approximately 1000 associated messages, resulting in lower performance metrics and making it impossible to achieve performance levels comparable to the previous tests. The focus of this test was to assess the system’s ability to handle larger volumes of data and more complex interactions, as each message required data computed in the interaction microservice. The variation in CPU usage and throughput is illustrated in Figure 8.
At a low concurrency level of 10 users, the reactive model consumed 187 MB of memory, while the asynchronous model consumed 200 MB. CPU usage was 41.2% for the reactive model and 17.9% for the asynchronous model. The average response time was 113 ms for the reactive model and 47 ms for the asynchronous model. The 90th percentile response time was 137 ms for the reactive model and 67 ms for the asynchronous model. Throughput was measured at 75 req/s for the reactive model and 178 req/s for the asynchronous model.
At a mid-concurrency level of 250 users, the reactive model consumed 324 MB of memory, while the asynchronous model consumed 837 MB. CPU usage was 68.8% for the reactive model and 34% for the asynchronous model. The average response time was 1991 ms for the reactive model and 789 ms for the asynchronous model. The 90th percentile response time was 2905 ms for the reactive model and 1434 ms for the asynchronous model. Throughput was measured at 104 req/s for the reactive model and 263 req/s for the asynchronous model.
At a peak concurrency level of 1000 users, the reactive model consumed 691 MB of memory, while the asynchronous model consumed 1979 MB. CPU usage was 80.9% for the reactive model and 59.8% for the asynchronous model. The average response time was 8275 ms for the reactive model and 3755 ms for the asynchronous model. The 90th percentile response time was 10,547 ms for the reactive model and 7195 ms for the asynchronous model. Throughput was measured at 99 req/s for the reactive model and 218 req/s for the asynchronous model.
A summary of all the tests conducted on both the reactive and asynchronous applications, including their types, quantities, and purposes, is provided in Table 7.

7.2.3. Discussion of Results

The memory consumed on startup by reactive applications is about 35% less compared to the asynchronous ones, which would allow for faster scheduling inside a cluster, reducing deployment duration while also optimizing resource allocation and utilization inside orchestration platforms such as Kubernetes.
The startup time for reactive microservices is approximately 35% lower than that of asynchronous variants, contributing to an optimal transition in the case of a rolling deployment strategy, minimizing downtime, and even preventing readiness errors when deploying several replicas, therefore ensuring continuity of service operations, which is important in distributed environments focused on maintaining high availability and reliability.
The size of the resulting artifacts is also notably smaller for reactive applications, with a difference of about 30%, mainly due to the smaller number of dependencies required. The deployment of these smaller artifacts is inherently accelerated, simplifying the storage processes on various platforms and reducing the size and download times of container images, such as those used in Docker.
In the first performance testing scenario, the reactive model demonstrated clear advantages over the asynchronous model, particularly at mid and high-concurrency levels, where its non-blocking architecture allowed it to handle increased load more efficiently.
At a low concurrency level of 10 users, the reactive model reduced memory consumption by 6.25% (60 MB vs. 64 MB) and CPU usage by 11.9% (5.2% vs. 5.9%). Response times were also lower, with the average response time reduced by 11.1% (8 ms vs. 9 ms) and the 90th percentile reduced by 18.2% (9 ms vs. 11 ms). Throughput was 8.4% higher (1015 req/s vs. 936 req/s), indicating that even under light loads, the reactive model achieved measurable efficiency gains, particularly in memory usage and response times.
At a mid-concurrency level of 500 users, the reactive model showed significant advantages. Memory consumption was reduced by 22.4% (260 MB vs. 335 MB), and the average response time was 26.1% lower (51 ms vs. 69 ms). The 90th percentile response time showed an even greater improvement, with the reactive model achieving 61 ms compared to 141 ms for the asynchronous model, a reduction of 56.7%. Throughput was 34.5% higher (8075 req/s vs. 6008 req/s). While CPU usage was slightly higher for the reactive model (37.3% vs. 34.2%), the overall gains in memory efficiency and response times outweighed this minor increase. These results highlight the scalability and responsiveness of the reactive approach at moderate concurrency levels.
At the peak concurrency level of 1000 users, the reactive model demonstrated substantial advantages. Memory consumption was reduced by 12% (473 MB vs. 535 MB), and the average response time was 25.6% lower (93 ms vs. 125 ms). The 90th percentile response time was significantly lower for the reactive model, achieving 121 ms compared to 277 ms for the asynchronous model, an improvement of 56.3%. Throughput was 33.1% higher (8867 req/s vs. 6659 req/s). CPU usage remained comparable between the two models, with the reactive model recording 42.1% versus 40.7% for the asynchronous model. These results demonstrate the reactive model’s ability to maintain low response times and high throughput under high-concurrency scenarios, making it well-suited for distributed systems requiring scalability and resource efficiency.
In the second scenario, the reactive model demonstrated clear advantages over the asynchronous model across most metrics, particularly in terms of memory usage, response times, and throughput.
At a low concurrency level of 10 users, the reactive model consumed 7.4% less memory than the asynchronous model (87 MB vs. 94 MB). CPU usage was also lower, reduced by 15.7% (8.6% vs. 10.2%). The average response time improved by 12.5% (7 ms vs. 8 ms), while the 90th percentile response time remained the same for both models at 10 ms. Throughput was 6.9% higher for the reactive model (1060 req/s vs. 992 req/s). These results suggest that even under light loads, the reactive model is capable of achieving minor but consistent efficiency gains.
At a mid-concurrency level of 250 users, the reactive model’s advantages became more pronounced. Memory consumption was 10.9% lower (213 MB vs. 239 MB), and the average response time decreased by 33.9% (41 ms vs. 62 ms). The 90th percentile response time improved by 61.2%, with the reactive model achieving 50 ms compared to 129 ms for the asynchronous model. Throughput was 49.8% higher for the reactive model (4998 req/s vs. 3335 req/s). Although CPU usage was slightly higher for the reactive model (31.3% vs. 30.8%), the performance improvements in response times and throughput outweighed this minor increase.
At the peak concurrency level of 1000 users, the reactive model maintained substantial advantages. Memory consumption was reduced by 7.1% (366 MB vs. 394 MB). The average response time was 37.6% lower (153 ms vs. 245 ms), and the 90th percentile response time improved by 34.4% (191 ms vs. 291 ms). Throughput was 47.1% higher for the reactive model, with 5446 req/s compared to 3704 req/s for the asynchronous model. CPU usage increased slightly for the reactive model (37.9% vs. 35.6%), but this did not detract from its overall performance gains.
In the third scenario, the reactive model consistently demonstrated advantages in memory usage, response times, and throughput, particularly at mid to high concurrency levels.
At a low concurrency level of 10 users, the reactive model reduced memory consumption by 12.9% (81 MB vs. 93 MB) and CPU usage by 16.8% (7.9% vs. 9.5%). The average response time was 11.1% lower for the reactive model (8 ms vs. 9 ms), while the 90th percentile response time improved slightly (10 ms vs. 11 ms). Throughput was 2.7% higher for the reactive model (981 req/s vs. 955 req/s). These results suggest that the reactive model offers minor but consistent efficiency gains under light loads.
At a mid-concurrency level of 250 users, the reactive model demonstrated more substantial improvements. Memory consumption was 10% lower (206 MB vs. 229 MB), while the average response time was comparable between the two models (71 ms vs. 73 ms). The 90th percentile response time showed a significant improvement for the reactive model, reduced by 35.2% (105 ms vs. 162 ms). Throughput was 17.4% higher for the reactive model (3081 req/s vs. 2624 req/s). Although CPU usage was slightly higher for the reactive model (24.2% vs. 23.4%), the gains in throughput and response time made it the better performer at this level.
At the peak concurrency level of 1000 users, the reactive model maintained a clear advantage across most metrics. Memory consumption was reduced by 13.7% (303 MB vs. 351 MB), and the average response time was 17.8% lower (231 ms vs. 281 ms). The 90th percentile response time was significantly improved for the reactive model, reduced by 57.2% (294 ms vs. 688 ms). Throughput was 25.3% higher for the reactive model, with 3714 req/s compared to 2965 req/s for the asynchronous model. CPU usage was also slightly lower for the reactive model (26.9% vs. 28.7%), highlighting its efficiency under high-concurrency conditions.
In the fourth scenario, the reactive model faced significant challenges and demonstrated limitations compared to the asynchronous model. This test involved large-scale data retrieval with substantial computational overhead, exposing potential inefficiencies in the reactive paradigm under specific conditions.
At a low concurrency level of 10 users, the reactive model consumed slightly less memory than the asynchronous model, reducing memory usage by 6.5% (187 MB vs. 200 MB). However, CPU usage was significantly higher for the reactive model, more than doubling at 41.2% compared to 17.9% for the asynchronous model. Response times were considerably worse, with the average response time for the reactive model more than double that of the asynchronous model (113 ms vs. 47 ms). Throughput was also much lower, with the reactive model handling only 75 req/s, a 57.9% decrease compared to the asynchronous model’s 178 req/s.
At a mid-concurrency level of 250 users, the reactive model continued to consume significantly less memory, with a 61.3% reduction compared to the asynchronous model (324 MB vs. 837 MB). However, CPU usage for the reactive model was double that of the asynchronous model (68.8% vs. 34%). Average response times were substantially worse for the reactive model, with an increase of 152.2% (1991 ms vs. 789 ms). The 90th percentile response time also reflected this inefficiency, with the reactive model reaching 2905 ms compared to 1434 ms for the asynchronous model. Throughput remained considerably lower for the reactive model, recording 104 req/s, a 60.5% reduction compared to the asynchronous model’s 263 req/s.
At the peak concurrency level of 1000 users, the reactive model consumed 65.1% less memory than the asynchronous model (691 MB vs. 1979 MB), showcasing its typical efficiency in memory management. However, CPU usage was 35.3% higher for the reactive model (80.9% vs. 59.8%). Average response times were more than double for the reactive model, reaching 8275 ms compared to 3755 ms for the asynchronous model, while the 90th percentile response time increased to 10,547 ms for the reactive model, compared to 7195 ms for the asynchronous model. Throughput remained a significant limitation for the reactive model, achieving only 99 req/s, a 54.6% reduction compared to the asynchronous model’s 218 req/s.
To identify the root cause of the performance limitation, the sequential steps in the code flow were selectively removed or swapped, and the JMeter tests were rerun after each modification to determine which change brought the metrics in line with those of the asynchronous application. Initially, the mapping functions responsible for converting entities to DTOs were removed. Next, the Redis cache layer and the calls to the interaction service were excluded to determine whether these contributed to the observed latency. Ultimately, the performance limitations were traced to the database interactions: monitoring during the tests indicated that the database reached its I/O limits, as evidenced by the Docker metrics. This bottleneck is potentially influenced by the data access abstraction provided by Spring Data, the PostgreSQL driver, or the constraints of the containerized database. Additional investigation is required to confirm these findings and to rule out other contributing factors, such as inefficiencies in the implementation or limitations in the testing methodology.
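Although the article does not show the instrumentation used during this investigation, the following minimal Java sketch illustrates how individual steps of a Project Reactor pipeline can be timed and logged while other steps are removed between JMeter runs. The class, the simulated delay, and the pipeline stages are hypothetical stand-ins for the real repository, mapping, and enrichment code.

import java.time.Duration;
import java.util.Locale;

import reactor.core.publisher.Flux;
import reactor.util.function.Tuple2;

// Hypothetical stand-in for the investigated flow: a "database" step followed by a
// mapping step; each stage can be timed or commented out before rerunning the tests.
public class LatencyIsolationSketch {

    static Flux<String> databaseStep() {
        // Simulated repository call; in the real application this would be the
        // Spring Data query suspected of hitting the database I/O limit.
        return Flux.just("tweet-1", "tweet-2", "tweet-3")
                .delayElements(Duration.ofMillis(20));
    }

    public static void main(String[] args) {
        databaseStep()
                .elapsed()                            // pairs each element with the millis elapsed since the previous signal
                .doOnNext(t -> System.out.printf("db step: %d ms%n", t.getT1()))
                .map(Tuple2::getT2)
                .map(s -> s.toUpperCase(Locale.ROOT)) // stand-in for the entity-to-DTO mapping step
                .log("pipeline")                      // logs the onNext/onComplete/onError signals of this chain section
                .blockLast();                         // blocking is acceptable only in this standalone demo
    }
}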
The final results indicated that although the reactive principles are not cutting-edge and have already gone through many iterations and developments, the reactive paradigm cannot be considered a universal replacement. The last scenario outlined in the performance study demonstrates that there are situations where classical approaches remain preferable. This underlines the importance of carefully assessing the context and particularities of each system before deciding upon a migration. In some cases, traditional technologies are still prevalent, while the transition to new approaches, such as reactive programming, and the adaptation of stable and established principles require considerable time and effort.
While the reactive approach does not always guarantee efficiency improvements, as the counterexamples observed during performance testing disprove any assumption of its absolute superiority, the strong results achieved across the other scenarios highlight its potential for higher performance and lower resource consumption. These gains stem from the non-blocking architecture and the reduced number of required threads, and they demonstrate the scalability of the reactive model, particularly at mid to high concurrency levels. At the same time, it is clear that software development has not reached its limits and that considerable opportunities for innovation and improvement remain in this area.

7.2.4. Trade-Offs and Long-Term Impacts

For reactive programming to gain broader adoption in modern software solutions, its advantages must be weighed against the challenges it introduces, especially when compared to traditional asynchronous approaches. Understanding how each paradigm compares in terms of learning curve, development and debugging complexity, and long-term maintainability is essential when deciding between them.
Reactive programming introduces a paradigm shift, requiring developers to adopt a declarative, event-driven approach. It successfully abstracts some of the complexities of thread management, but it also demands a deep understanding of data flows, operators, and event-driven interactions. The initial development effort and complexity can be considerable for teams unfamiliar with reactive principles, potentially impacting project deliverables and timelines [66]. On the other hand, asynchronous programs require more explicit management and orchestration of thread pools, which includes avoiding common pitfalls of concurrent programming by adopting or implementing more sophisticated concurrency management mechanisms [11]. Both paradigms present trade-offs in complexity, and the choice should be dictated primarily by the specific use cases and the team’s expertise and adaptability.
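As a minimal illustration of this difference, the sketch below contrasts explicit thread pool orchestration around CompletableFuture with declarative scheduling on a Project Reactor Mono. The class and method names are illustrative and do not come from the applications described in this paper.

import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

import reactor.core.publisher.Mono;
import reactor.core.scheduler.Schedulers;

public class ParadigmComparisonSketch {

    // Asynchronous style: the developer owns the executor and its lifecycle.
    static CompletableFuture<String> fetchProfileAsync(ExecutorService pool) {
        return CompletableFuture.supplyAsync(() -> "profile", pool)
                .thenApply(String::toUpperCase);
    }

    // Reactive style: scheduling is declared on the chain and Reactor manages the workers.
    static Mono<String> fetchProfileReactive() {
        return Mono.fromCallable(() -> "profile")
                .subscribeOn(Schedulers.boundedElastic())
                .map(String::toUpperCase);
    }

    public static void main(String[] args) {
        ExecutorService pool = Executors.newFixedThreadPool(4);
        try {
            System.out.println(fetchProfileAsync(pool).join());
            System.out.println(fetchProfileReactive().block());
        } finally {
            pool.shutdown();
        }
    }
}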
Debugging reactive applications requires particular tools and approaches due to their declarative character. Traditional debugging methods, such as code stepping or monitoring thread execution, might be less effective in reactive environments due to the event-driven architecture. Auxiliary tools are available to assist with some of the challenges, but they also introduce additional dependencies and learning curves for developers [67]. In contrast, debugging asynchronous applications often revolves around understanding thread execution, race conditions, and deadlocks, which, although complicated, are more commonly encountered and, therefore, better documented, with multiple approaches available for circumventing them [12]. While reactive programming mitigates some issues through abstractions, it introduces different challenges, such as data flow tracing and complex operator chain management, highlighting the distinct approaches each paradigm requires for addressing debugging complexities.
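Assuming Project Reactor as the reactive library, the sketch below demonstrates some of the auxiliary debugging facilities referred to above: assembly-time tracing with Hooks.onOperatorDebug(), a checkpoint() marker, and signal logging with log(). The failing computation is deliberately artificial.

import reactor.core.publisher.Flux;
import reactor.core.publisher.Hooks;

public class ReactiveDebuggingSketch {

    public static void main(String[] args) {
        // Global assembly-time tracing: enriches the stack traces of downstream errors
        // with the operator chain that produced them (costly; intended for development).
        Hooks.onOperatorDebug();

        Flux.range(1, 5)
                .map(i -> 10 / (i - 3))           // deliberately fails when i == 3
                .checkpoint("after-division")     // lightweight marker attached to error traces
                .log("debug-demo")                // logs onNext/onError/onComplete signals
                .onErrorResume(e -> {
                    System.err.println("recovered from: " + e);
                    return Flux.empty();
                })
                .blockLast();
    }
}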
Reactive codebases can be more concise and modular when effectively written, adhering to the declarative style of the paradigm, which often leads to improved maintainability. This modularity allows for easier changes and updates, particularly in complex, highly concurrent systems where flexibility is critical. However, the inherent complexities of the paradigm, such as the use of reactive operators and operator chains, can make the codebase harder to understand for new developers or those unfamiliar with its principles. Research has shown that reactive programming often increases cyclomatic complexity and may be less readable according to traditional readability metrics, posing challenges for team onboarding and overall code comprehension [68]. In contrast, asynchronous programming, while more familiar to many Java developers, relies on traditional imperative patterns, which are more verbose and may be easier to debug and maintain but harder to scale or refactor effectively. The long-term maintainability of reactive programming heavily depends on appropriate training, thorough documentation, and adherence to best practices established from the beginning of the development process.
Adopting the reactive paradigm can be challenging for developers accustomed to traditional asynchronous development, as the two paradigms propose significantly different approaches. While asynchronous programming typically relies on callbacks and manual thread management to handle concurrent operations, the reactive approach abstracts these complexities through reactive streams and declarative constructs. For teams with limited exposure to functional programming, this shift can complicate the initial adaptation and may affect short-term productivity. However, reactive programming can offer considerable benefits by promoting a deeper understanding of asynchronous patterns, reducing errors associated with thread management, and improving efficiency. Organizations must weigh these benefits against the learning curve and the costs of training and onboarding [69]. Notably, the reactive ecosystem has grown substantially, with numerous materials, courses, and tools that support training and development. Many solutions and frameworks are actively researched and maintained to ease the transition towards reactive systems, including tools that perform automated refactoring of asynchronous Java code into reactive counterparts. Such tools can provide a helpful bridge by reducing manual effort and enabling quick prototyping, allowing teams to focus on understanding the reactive principles, easing the transition process, and improving long-term productivity [70].
The reactive paradigm is particularly beneficial in systems with scalability and high-concurrency requirements, such as applications that demand real-time responsiveness or high-throughput processing, since the non-blocking architecture can optimize resource utilization. However, the long-term effort of maintaining and updating such systems depends on how well the initial solution was developed and whether it adheres to best practices. Poorly implemented reactive systems can suffer from bugs caused by complex operator chains, making them potentially more challenging to refactor or extend. Asynchronous systems, while generally more straightforward, may require repeated architectural adjustments to scale effectively, which can slow code updates and long-term progress. Reactive programming holds significant promise for scalability, provided the development team has the appropriate expertise. Balancing the benefits and challenges of each paradigm requires careful consideration of team proficiency, project requirements, and long-term goals.

7.2.5. Limitations in Performance Testing

The testing was performed in a local environment with containerized databases, which limits the accuracy of higher-concurrency simulations. Although a dedicated “server” system hosted the applications and a separate “client” system executed the JMeter tests, local conditions do not fully reflect the behavior of a production environment with real users. In addition, results obtained locally may not scale to what would be achieved in cloud or distributed production environments, and the generated traffic is not as diverse as in real-use scenarios.
The test scenarios generally focused on I/O-bound flows, which are common in web applications. While this approach provided meaningful insights into such typical use cases, CPU-bound workloads were not benchmarked, and further investigation is required in this area.
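As one possible direction for such an investigation, the sketch below shows how a CPU-bound step could be moved onto a dedicated scheduler in a Project Reactor pipeline so that non-blocking threads remain free; the computation and the sizes used are arbitrary placeholders rather than part of the benchmarked scenarios.

import java.util.stream.LongStream;

import reactor.core.publisher.Flux;
import reactor.core.scheduler.Schedulers;

public class CpuBoundSketch {

    // Deliberately CPU-heavy stand-in for work such as hashing or aggregation.
    static long expensiveComputation(long seed) {
        return LongStream.rangeClosed(1, 200_000)
                .map(i -> (seed * i) % 1_000_003)
                .sum();
    }

    public static void main(String[] args) {
        Flux.range(1, 16)
                // publishOn moves the downstream work off the source thread onto the
                // fixed-size parallel scheduler, keeping event-loop threads available.
                .publishOn(Schedulers.parallel())
                .map(i -> expensiveComputation(i))
                .doOnNext(result -> System.out.println("result: " + result))
                .blockLast();
    }
}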
The methodology included running multiple iterations of the tests, which consistently showed stable behavior with only minor variations within acceptable ranges. However, a fixed number of iterations was not employed, and formal error margins were not calculated, as the observed stability across runs was considered sufficient. Nonetheless, adopting a more structured and precise testing approach could further strengthen the reliability of the results.
Additionally, the testing process did not include long-duration tests to evaluate the effects of “software aging”, which could impact the system’s stability and performance over extended periods of continuous operation. Software aging, characterized by factors such as resource leaks, memory fragmentation, or other cumulative effects, is an important aspect in determining long-term robustness and reliability [71]. Future evaluations should incorporate these tests to better assess the system’s behavior under prolonged workloads and identify potential areas requiring stabilization, such as resource cleanup mechanisms or enhanced error-handling strategies.
The hardware specifications for the devices used to perform the testing are as follows:
  • Server: Intel Core i7-10700 CPU, 32 GB DDR4 RAM, and 1 TB SSD (Intel, Santa Clara, CA, USA).
  • Client: Apple M1 chip, 16 GB unified memory, and 256 GB SSD (Apple, Cupertino, CA, USA).
These specifications influence the performance testing results, as the limited resources of such consumer devices might have introduced variations that would not have been present in a robust production environment. In addition, other applications or processes running on the systems used for testing could have interfered with the measurements, introducing further variation in the results.
The testing was conducted over a LAN, which ensured minimal network latency, as shown in Figure 9. While this was beneficial for obtaining consistent and timely results, it does not provide a complete picture of performance under the varied network conditions encountered in distributed environments or on the Internet.
During the performance testing phase, JWT-based authentication was disabled to avoid the constant overhead it introduces, allowing for a focused evaluation of the core performance of the services. However, a complete evaluation of real-world scenarios should also account for the overhead introduced by authentication.
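The article does not detail how authentication was switched off; one possible approach for the reactive (Spring WebFlux) variant is a profile-guarded Spring Security configuration such as the hypothetical sketch below, which permits all exchanges only while a dedicated testing profile is active, so JWT validation stays in place for normal operation.

import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;
import org.springframework.context.annotation.Profile;
import org.springframework.security.config.annotation.web.reactive.EnableWebFluxSecurity;
import org.springframework.security.config.web.server.ServerHttpSecurity;
import org.springframework.security.web.server.SecurityWebFilterChain;

// Hypothetical configuration: active only when the application is started with the
// "perf-test" profile, so production builds keep JWT validation enabled.
@Configuration
@EnableWebFluxSecurity
@Profile("perf-test")
public class PerformanceTestSecurityConfig {

    @Bean
    SecurityWebFilterChain permitAllFilterChain(ServerHttpSecurity http) {
        return http
                .csrf(ServerHttpSecurity.CsrfSpec::disable)
                .authorizeExchange(exchanges -> exchanges.anyExchange().permitAll())
                .build();
    }
}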

8. Conclusions and Further Development

8.1. Conclusions

Based on the analyses and test results, an iterative refinement process is essential to enhance performance and address identified limitations. This approach, akin to the iterative refinement described in software development methodologies [72], may involve further adjustments to the algorithms and the reactive architecture, ensuring that the system effectively meets scalability and stability requirements.
As part of the project, two complete backends were developed, one based on the asynchronous model and one on the reactive model. Both applications were extensively tested through equivalent unit tests and identical automated integration tests. The results indicate functional equivalence between the two models, with all the considered scenarios demonstrating similar behavior and reinforcing the viability of transitioning towards the reactive paradigm without compromising existing functionality.
From a performance point of view, the reactive variant achieved significantly better response times but also proved to be more efficient by using a reduced number of threads and resources in most of the evaluated scenarios due to the non-blocking, event-driven architecture. However, in some complex scenarios, it still presented limitations and was outperformed by the asynchronous model, highlighting that its effectiveness might fluctuate depending on the nature of the tasks.
The reactive approach consistently outperformed its asynchronous counterpart in memory efficiency, avoiding aggressive thread pool expansion and significant memory spikes under increased concurrency. In terms of CPU efficiency, the reactive approach demanded more resources in scenarios involving intensive data processing, suggesting a potential trade-off between CPU consumption and performance stability. Despite the higher demands in some contexts, it delivered sustained performance across varying operational scenarios, showcasing adaptability in resource management.
In evaluating performance indicators such as average response times, throughput, and 90th percentile response times, the reactive model generally demonstrated greater efficiency across the examined scenarios. However, not all tests consistently favored the reactive model, with some scenarios showing comparable average response times between the two models. The reactive model’s superior throughput and stability, highlighted by the 90th percentile response times, demonstrate its ability to handle requests quickly and consistently under load. This reliability, particularly in high-demand environments, confirms the reactive model as a suitable choice for scenarios requiring robust and efficient handling of large request volumes, despite occasional setbacks in specific demanding tasks.
It must be emphasized that the limitations and challenges of migrating to the reactive paradigm remain significant. Migrating to a reactive model can be a viable option, whether as a partial migration of only the critical components or as a transition towards a fully reactive system. Analyzing the specifics of each product and making an informed decision remain the determining factors. This project offers a starting point for such investigations and decisions, promoting a better understanding of both the advantages and challenges associated with reactive programming in the Java ecosystem.

8.2. Further Development

Developing a reactive-based graphical user interface. Developing and automating graphical user interface tests with frameworks such as Selenium [73] would strengthen the case for the equivalence and interchangeability of the two backends (asynchronous and reactive), confirming that the migration does not compromise the user experience at the functional level and ensuring the consistency and reliability of the application. In addition, adopting protocols such as Server-Sent Events (SSE) [74], WebSocket [75], or RSocket [76] for real-time updates could further refine the user experience. These protocols facilitate efficient, low-latency updates, enhancing the application’s responsiveness and helping to ensure that the migration to a reactive-based architecture does not compromise the functional level of the user interface.
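As an illustration of the first of these options, the sketch below outlines a hypothetical Spring WebFlux controller exposing a Server-Sent Events stream; the endpoint path and the timer-based event source are placeholders for a real update feed coming from the backend services.

import java.time.Duration;

import org.springframework.http.MediaType;
import org.springframework.web.bind.annotation.GetMapping;
import org.springframework.web.bind.annotation.RestController;

import reactor.core.publisher.Flux;

// Hypothetical WebFlux controller streaming activity updates to the user interface over SSE.
@RestController
public class ActivityStreamController {

    @GetMapping(value = "/api/activity/stream", produces = MediaType.TEXT_EVENT_STREAM_VALUE)
    public Flux<String> streamActivity() {
        // Placeholder source; a real implementation would bridge a message broker
        // or a repository change stream instead of a timed sequence.
        return Flux.interval(Duration.ofSeconds(1))
                .map(tick -> "activity-update-" + tick);
    }
}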
Hosting the application on a dedicated platform. Cloud platforms would allow the simulation of a higher number of concurrent requests and a more accurate measurement of the resources and replicas necessary to handle such loads. The testing results could therefore be improved through evaluation under intensive usage scenarios, outlining the differences between the asynchronous and reactive variants more precisely under demanding conditions and providing more detailed performance and resource utilization insights.
Developing and integrating a reactive ORM (Object-Relational Mapping) based on R2DBC. Reactive ORM frameworks such as Hibernate Reactive exist, but they are built on the Vert.x reactive database clients [77] rather than on R2DBC. The adjustments an application would need when migrating to a reactive ORM can therefore be studied, taking into account the impact of the functionalities found to be missing during the development of the reactive prototype. Performance testing and comparisons using an ORM in both the asynchronous and reactive versions could reveal further advantages and limitations of each paradigm.
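To make the current level of abstraction concrete, the sketch below shows a hypothetical Spring Data R2DBC entity and repository: derived queries and non-blocking access are available, but, unlike a full reactive ORM, there is no relationship or lazy-loading support. All names and columns are illustrative.

import java.util.UUID;

import org.springframework.data.annotation.Id;
import org.springframework.data.relational.core.mapping.Table;
import org.springframework.data.repository.reactive.ReactiveCrudRepository;

import reactor.core.publisher.Flux;

// Hypothetical entity mapped to a relational table; associations must be resolved manually.
@Table("tweets")
class Tweet {
    @Id
    private UUID id;
    private UUID authorId;
    private String content;
    // getters and setters omitted for brevity
}

// Derived query methods are generated by Spring Data R2DBC and return reactive types.
interface TweetRepository extends ReactiveCrudRepository<Tweet, UUID> {

    Flux<Tweet> findAllByAuthorId(UUID authorId);
}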

Author Contributions

Conceptualization, A.Z. and C.T.; formal analysis, A.Z. and C.T.; investigation, A.Z. and C.T.; methodology, A.Z. and C.T.; software, A.Z.; supervision, C.T.; validation, A.Z. and C.T.; writing—original draft, A.Z.; writing—review and editing, C.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data available in publicly accessible repository. The data presented in this study are openly available on GitHub at https://github.com/andreizb/tweebyte (accessed on 17 November 2024).

Conflicts of Interest

Author Cătălin Tudose was employed by the company Luxoft Romania. The remaining author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Erder, M.; Pureur, P.; Woods, E. Continuous Architecture in Practice: Software Architecture in the Age of Agility and DevOps; Addison-Wesley Professional: Boston, MA, USA, 2021. [Google Scholar]
  2. Ciceri, C.; Farley, D.; Ford, N.; Harmel-Law, A.; Keeling, M.; Lilienthal, C. Software Architecture Metrics: Case Studies to Improve the Quality of Your Architecture; O’Reilly Media: Sebastopol, CA, USA, 2022. [Google Scholar]
  3. Arnold, K.; Gosling, J.; Holmes, D. The Java Programming Language, 4th ed.; Addison-Wesley Professional: Glenview, IL, USA, 2005. [Google Scholar]
  4. Sierra, K.; Bates, B.; Gee, T. Head First Java: A Brain-Friendly Guide, 3rd ed.; O’Reilly Media: Sebastopol, CA, USA, 2022. [Google Scholar]
  5. Davis, A.L. Reactive Streams in Java: Concurrency with RxJava, Reactor, and Akka Streams; Apress: Berkeley, CA, USA, 2018. [Google Scholar]
  6. Urma, R.G.; Fusco, M.; Mycroft, A. Modern Java in Action: Lambdas, Streams, Functional and Reactive Programming, 2nd ed.; Manning: Hong Kong, 2018. [Google Scholar]
  7. Hitchens, R. Java NIO: Regular Expressions and High-Performance I/O; O’Reilly Media: Sebastopol, CA, USA, 2002. [Google Scholar]
  8. Nurkiewicz, T.; Christensen, B. Reactive Programming with RxJava: Creating Asynchronous, Event-Based Applications; O’Reilly Media: Sebastopol, CA, USA, 2016. [Google Scholar]
  9. Hedgpeth, R. R2DBC Revealed: Reactive Relational Database Connectivity for Java and JVM Programmers; Apress: Berkeley, CA, USA, 2021. [Google Scholar]
  10. Goetz, B. Java Concurrency In Practice; Pearson: Bengaluru, India, 2016. [Google Scholar]
  11. Srivastava, R.P.; Nandi, G.C. Controlling Multi Thread Execution Using Single Thread Event Loop. In Proceedings of the 2017 International Conference on Innovations in Control, Communication and Information Systems, Greater Noida, India, 12–13 August 2017; pp. 88–94. [Google Scholar]
  12. Giebas, D.; Wojszczyk, R. Detection of Concurrency Errors in Multithreaded Applications Based on Static Source Code Analysis. IEEE Access 2021, 9, 61298–61323. [Google Scholar] [CrossRef]
  13. Malhotra, R. Rapid Java Persistence and Microservices: Persistence Made Easy Using Java EE8, JPA and Spring; Apress: Berkeley, CA, USA, 2019. [Google Scholar]
  14. Söderquist, I. Event Driven Data Processing Architecture. In Proceedings of the 2007 Design, Automation & Test In Europe Conference & Exhibition, Nice Acropolis, France, 16–20 April 2007; pp. 972–976. [Google Scholar]
  15. Laliwala, Z.; Chaudhary, S. Event-Driven Service-Oriented Architecture. In Proceedings of the 2008 5th International Conference on Service Systems and Service Management, Melbourne, Australia, 30 June–2 July 2008; pp. 410–415. [Google Scholar]
  16. Bellemare, A. Building Event-Driven Microservices: Leveraging Organizational Data at Scale; O’Reilly Media: Sebastopol, CA, USA, 2020. [Google Scholar]
  17. Woodside, M. Performance Models of Event-Driven Architectures. In Proceedings of the Companion of the ACM/Spec International Conference on Performance Engineering, Rennes, France, 19–23 April 2021; pp. 145–149. [Google Scholar]
  18. Gamma, E.; Helm, R.; Johnson, R.; Vlissides, J. Design Patterns: Elements of Reusable Object-Oriented Software; Addison-Wesley Professional: Boston, MA, USA, 1994. [Google Scholar]
  19. Grinovero, S. Hibernate Reactive: Is It Worth It? 2021. Available online: https://in.relation.to/2021/10/27/hibernate-reactive-performance/ (accessed on 17 November 2024).
  20. Ju, L.; Yadav, A.; Yadav, D.; Khan, A.; Sah, A.P. Using Asynchronous Frameworks and Database Connection Pools to Enhance Web Application Performance in High-Concurrency Environments. In Proceedings of the 2024 International Conference on IoT in Social, Mobile, Analytics, and Cloud (I-SMAC 2024), Kirtipur, Nepal, 3–5 October 2024. [Google Scholar]
  21. Dahlin, K. An Evaluation of Spring WebFlux—With Focus on Built-in SQL Features. Master’s Thesis, Institution of Information Systems and Technology, Mid Sweden University, Östersund, Sweden, 2020. [Google Scholar]
  22. Joo, Y.H.; Haneklint, C. Comparing Virtual Threads and Reactive WebFlux in Spring: A Comparative Performance Analysis of Concurrency Solutions in Spring. Bachelor’s Thesis, Degree Programme in Computer Engineering, KTH Royal Institute of Technology, Stockholm, Sweden, 2023. [Google Scholar]
  23. Mochniej, K.; Badurowicz, M. Performance Comparison of Microservices Written Using Reactive and Imperative Approaches. J. Comput. Sci. Inst. 2023, 28, 242–247. [Google Scholar] [CrossRef]
  24. Wang, Y. Scalable and Reactive Data Management for Mobile Internet-of-Things Applications with Actor-Oriented Databases. Ph.D. Thesis, University of Copenhagen, Copenhagen, Denmark, 2021. [Google Scholar]
  25. Bansal, S.; Namjoshi, K.S.; Sa’ar, Y. Synthesis of Asynchronous Reactive Programs from Temporal Specifications. In Proceedings of the International Conference on Computer Aided Verification, Oxford, UK, 14–17 July 2018. [Google Scholar]
  26. Bahr, P.; Houlborg, E.; Rørdam, G.T.S. Asynchronous Reactive Programming with Modal Types in Haskell. In International Symposium on Practical Aspects of Declarative Languages; Gebser, M., Sergey, I., Eds.; Springer Nature Switzerland: Cham, Switzerland, 2024; pp. 18–36. [Google Scholar]
  27. Spilcă, L. Spring Start Here: Learn What You Need and Learn It Well; Manning: New York, NY, USA, 2021. [Google Scholar]
  28. Walls, C. Spring in Action; Manning: New York, NY, USA, 2022. [Google Scholar]
  29. Bogner, J.; Fritzsch, J.; Wagner, S.; Zimmermann, A. Microservices in Industry: Insights into Technologies, Characteristics, and Software Quality. In Proceedings of the 2019 IEEE International Conference on Software Architecture Companion, Hamburg, Germany, 25–26 March 2019. [Google Scholar]
  30. Fielding, R.T. Architectural Styles and the Design of Network-Based Software Architectures. Ph.D. Thesis, University of California, Irvine, CA, USA, 2000. [Google Scholar]
  31. Saternos, C. Client-Server Web Apps with JavaScript and Java: Rich, Scalable, and RESTful; O’Reilly Media: Sebastopol, CA, USA, 2014. [Google Scholar]
  32. Afonso, J.; Caffy, C.; Patrascoiu, M.; Leduc, J.; Davis, M.; Murray, S.; Cortes, P. An HTTP REST API for Tape-backed Storage. EPJ Web Conf. 2024, 295, 01008. [Google Scholar] [CrossRef]
  33. Nickoloff, J.; Kuenzli, S. Docker in Action, 2nd ed.; Manning: New York, NY, USA, 2019. [Google Scholar]
  34. Newman, S. Monolith to Microservices: Evolutionary Patterns to Transform Your Monolith; O’Reilly Media: Sebastopol, CA, USA, 2019. [Google Scholar]
  35. Vernon, V.; Tomasz, J. Strategic Monoliths and Microservices: Driving Innovation Using Purposeful Architecture; Addison-Wesley Publishing: Boston, MA, USA, 2022. [Google Scholar]
  36. Bonér, J.; Farley, D.; Kuhn, R.; Thompson, M. The Reactive Manifesto. 2014. Available online: https://www.reactivemanifesto.org (accessed on 17 November 2024).
  37. Pal, N.; Yadav, D.K. Modeling and verification of software evolution using bigraphical reactive system. Clust. Comput. 2024, 27, 12983–13003. [Google Scholar] [CrossRef]
  38. Padmanaban, K.; Kalpana, Y.B.; Geetha, M.; Balan, K.; Mani, V.; Sivaraju, S.S. Simulation and modeling in cloud computing-based smart grid power big data analysis technology. Int. J. Model. Simul. Sci. Comput. 2024, 27, 2541005. [Google Scholar] [CrossRef]
  39. Ullenboom, C. Spring Boot 3 and Spring Framework 6; Rheinwerk Computing: Quincy, MA, USA, 2023. [Google Scholar]
  40. Rao, R.R.; Swamy, S.R. Review on Spring Boot and Spring Webflux for Reactive Web Development. Int. Res. J. Eng. Technol. 2020, 7, 3834–3837. [Google Scholar]
  41. Schoop, S.; Hebisch, E.; Franz, T. Improving Comprehensibility of Event-Driven Microservice Architectures by Graph-Based Visualizations. Softw. Archit. ECSA 2024, 14889, 14. [Google Scholar]
  42. Cabane, H.; Farias, K. On the impact of event-driven architecture on performance: An exploratory study. Future Gener. Comput. Syst. 2024, 153, 52–69. [Google Scholar] [CrossRef]
  43. Ponge, J.; Navarro, A.; Escoffier, C.; Le Mouël, F. Analysing the Performance and Costs of Reactive Programming Libraries in Java. In Proceedings of the 8th ACM SIGPLAN International Workshop on Reactive and Event-Based Languages and Systems, Chicago, IL, USA, 18 October 2021. [Google Scholar]
  44. Christensen, B.; Husain, J. Reactive Programming in the Netflix API with RxJava. 2013. Available online: https://netflixtechblog.com/reactive-programming-in-the-netflix-api-with-rxjava-7811c3a1496a (accessed on 17 November 2024).
  45. Oracle. Java/JDBC Scalability and Asynchrony: Reactive Extension and Fibers. 2019. Available online: https://www.oracle.com/a/tech/docs/dev6323-reactivestreams-fiber.pdf (accessed on 17 November 2024).
  46. Squbs: A New, Reactive Way for PayPal to Build Applications. 2016. Available online: https://medium.com/paypal-tech/squbs-a-new-reactive-way-for-paypal-to-build-applications-127126bf684b (accessed on 17 November 2024).
  47. Harris, P.; Hale, B. Designing, Implementing, and Using Reactive APIs. 2018. Available online: https://www.infoq.com/articles/Designing-Implementing-Using-Reactive-APIs/ (accessed on 17 November 2024).
  48. JSON Web Tokens. Available online: https://jwt.io (accessed on 17 November 2024).
  49. Spilcă, L. Spring Security in Action, 2nd ed.; Manning: New York, NY, USA, 2024. [Google Scholar]
  50. Richardson, C. Microservice Architecture Pattern. 2024. Available online: https://microservices.io/patterns/data/database-per-service.html (accessed on 17 November 2024).
  51. Oracle Java Documentation. Available online: https://docs.oracle.com/en/java/ (accessed on 17 November 2024).
  52. Zuul Documentation. Available online: https://zuul-ci.org/ (accessed on 17 November 2024).
  53. Spring Cloud Gateway Documentation. Available online: https://spring.io/projects/spring-cloud-gateway (accessed on 17 November 2024).
  54. Ferrari, L.; Pirozzi, E. Learn PostgreSQL: Use, manage and build secure and scalable databases with PostgreSQL 16, 2nd ed.; Packt Publishing: Birmingham, UK, 2023. [Google Scholar]
  55. Tudose, C. Java Persistence with Spring Data and Hibernate; Manning: New York, NY, USA, 2023. [Google Scholar]
  56. Bonteanu, A.M.; Tudose, C. Performance Analysis and Improvement for CRUD Operations in Relational Databases from Java Programs Using JPA, Hibernate, Spring Data JPA. Appl. Sci. 2024, 14, 2743. [Google Scholar] [CrossRef]
  57. Redis Official Website. Available online: https://redis.io/ (accessed on 17 November 2024).
  58. Eclipse Transformer Website. Available online: https://projects.eclipse.org/projects/technology.transformer (accessed on 17 November 2024).
  59. Reflectoring Website. Available online: https://reflectoring.io/dependency-injection-and-inversion-of-control/ (accessed on 17 November 2024).
  60. Montesi, F.; Weber, J. From the Decorator Pattern to Circuit Breakers in Microservices. In Proceedings of the 33rd Annual ACM Symposium on Applied Computing, Pau, France, 9–13 April 2018; pp. 1733–1735. [Google Scholar]
  61. Resilience4j Website. Available online: https://resilience4j.readme.io/docs/getting-started (accessed on 17 November 2024).
  62. Tudose, C. JUnit in Action; Manning: New York, NY, USA, 2020. [Google Scholar]
  63. Cucumber Website. Available online: https://cucumber.io/ (accessed on 17 November 2024).
  64. Van Merode, H. Continuous Integration (CI) and Continuous Delivery (CD): A Practical Guide to Designing and Developing Pipelines; Apress: Berkeley, CA, USA, 2023. [Google Scholar]
  65. JMeter Website. Available online: https://jmeter.apache.org/ (accessed on 17 November 2024).
  66. Bonér, J.; Klang, V. Reactive Programming versus Reactive Systems. 2016. Available online: https://gandrille.github.io/tech-notes/Reactive_and_microservices/Reactive/2016%20reactive-programming-vs-reactive-systems.pdf (accessed on 17 November 2024).
  67. Salvaneschi, G.; Mezini, M. Debugging for Reactive Programming. In Proceedings of the 2016 IEEE/ACM 38th International Conference on Software Engineering (ICSE), Austin, TX, USA, 14–22 May 2016; pp. 796–807. [Google Scholar]
  68. Holst, G.; Dobslaw, F. On the Importance and Shortcomings of Code Readability Metrics: A Case Study on Reactive Programming. 2021. Available online: https://arxiv.org/abs/2110.15246 (accessed on 17 November 2024).
  69. Crudu, A.; MoldStud Research Team. The Impact of Reactive Programming on Software Development. 2024. Available online: https://moldstud.com/articles/p-the-impact-of-reactive-programming-on-software-development (accessed on 17 November 2024).
  70. Köhler, M.; Salvaneschi, G. Automated Refactoring to Reactive Programming. In Proceedings of the 2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE), San Diego, CA, USA, 11–15 November 2019; pp. 835–846. [Google Scholar]
  71. Grottke, M.; Matias, R., Jr.; Trivedi, K.S. The Fundamentals of Software Aging. In Proceedings of the 19th IEEE International Symposium on Software Reliability Engineering Workshops, Redmond, WA, USA, 11–14 November 2008. [Google Scholar]
  72. Anghel, I.I.; Calin, R.S.; Nedelea, M.L.; Stanica, I.C.; Tudose, C.; Boiangiu, C.A. Software development methodologies: A comparative analysis. UPB Sci. Bull. 2022, 83, 45–58. [Google Scholar]
  73. Selenium Website. Available online: https://www.selenium.dev/ (accessed on 17 November 2024).
  74. Server-Sent Events. Available online: https://developer.mozilla.org/en-US/docs/Web/API/Server-sent_events (accessed on 17 November 2024).
  75. WebSockets API. Available online: https://developer.mozilla.org/en-US/docs/Web/API/WebSockets_API (accessed on 17 November 2024).
  76. RSocket Website. Available online: https://rsocket.io/ (accessed on 17 November 2024).
  77. Eclipse Vert.x Website. Available online: https://vertx.io/ (accessed on 17 November 2024).
Figure 1. Architecture diagram.
Figure 2. Database diagram.
Figure 3. Package structure for backend services.
Figure 4. Migration process from “javax” to “jakarta” dependencies.
Figure 5. Performance results for fetching user profile summaries.
Figure 6. Performance results for users subscribing.
Figure 7. Performance results for updating existing messages.
Figure 8. Performance results for fetching user messages.
Figure 9. Testing environment diagram.
Table 1. Summary of related work.
Section | Summary
3.1. Evaluating Hibernate Reactive for Scalable Database Solutions | Discusses Hibernate Reactive’s scalability and resource efficiency in high-load scenarios, with noted limitations in handling complex transactions.
3.2. Assessing R2DBC in High-Concurrency Web Applications | Highlights R2DBC’s performance in non-blocking environments, emphasizing its efficiency for high-concurrency scenarios with a connection pool.
3.3. Benchmarking Virtual Threads and Reactive WebFlux for Concurrent Web Services | Compares Virtual Threads and WebFlux in Spring applications, showing the benefits of Virtual Threads under specific conditions.
3.4. Reactive and Imperative Approaches in Microservice Performance | Details performance advantages of reactive microservices for I/O-intensive tasks, noting limitations for CPU-heavy operations.
3.5. Actor-Oriented Databases for Scalable and Reactive IoT Data Management | Introduces Actor-Oriented Databases (AODBs) for IoT, emphasizing scalability and low-latency responses in dynamic environments.
3.6. Temporal and Type-Driven Approaches to Asynchronous Reactive Programming | Explores approaches to asynchronous synthesis, offering efficient solutions for reactive systems using compact automaton and type-safe programming.
Table 2. Throughput for individual operations [43].
Operation Type | RxJava (ops/ms) | Reactor (ops/ms) | Base (ops/ms)
Map | 33,000 | 10,000 | 63,000
Chain | 23,000 | 13,000 | 63,000
Multiple operators | 12,000 | 5000 | 28,000
Table 3. Throughput for event streams [43].
Operation Type | RxJava (ops/ms) | Reactor (ops/ms) | Base (ops/ms)
Map | 240 | 250 | -
ManyToMany | 130 | 140 | -
Filters | 130 | 150 | -
Multiple operators | 100 | 100 | 120
Table 4. Summary of the current state of technology.
Section | Summary
4.1. Microservices | Explores microservices’ benefits for scalability and maintainability, noting challenges in transitioning from monolithic systems.
4.2. The Reactive Manifesto | Summarizes the core principles of reactive systems: responsiveness, resilience, elasticity, and message-driven communication.
4.3. Spring Boot and Spring WebFlux | Highlights WebFlux’s advantages for non-blocking web applications and Spring Boot’s ease of use for rapid development.
4.4. Performance and Cost Analysis | Compares RxJava and Project Reactor, emphasizing their respective strengths in individual operations and event stream processing.
4.5. Adoption of Reactive Programming | Details adoption strategies employed by companies, showcasing reactive programming’s scalability and efficiency.
Table 5. Migration aspects and stages.
Aspect | Asynchronous | Reactive
Programming model | Imperative | Reactive
Java version | Java 21 | Java 21
Framework | Spring Boot 3 | Spring Boot 3
Single result | CompletableFuture<T> | Mono<T>
Collection/Data stream | CompletableFuture<Collection<T>> | Flux<T>
Thread pools | ExecutorService | Schedulers
API | Standard Java API | Project Reactor, Spring WebFlux
Error handling | try-catch, CompletableFuture exceptionally() | Mono/Flux onErrorReturn(), onErrorResume(), onErrorMap()
Operation retries | Programmatic | Native, Mono/Flux retry()
Blocking operations | CompletableFuture get() | Mono/Flux block()
Data access | JDBC, Hibernate, JPA | R2DBC
Web server | Tomcat | Netty
HTTP communication | Java HttpClient | Spring WebClient
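To make the error handling and retry rows of Table 5 concrete, the following sketch contrasts the two styles; the unreliable call is a hypothetical stand-in and the retry count is arbitrary.

import java.util.concurrent.CompletableFuture;

import reactor.core.publisher.Mono;

public class ErrorHandlingMigrationSketch {

    // Asynchronous style: recovery is attached with exceptionally(); retries would
    // have to be orchestrated by hand around the CompletableFuture.
    static CompletableFuture<String> loadAsync() {
        return CompletableFuture.supplyAsync(ErrorHandlingMigrationSketch::unreliableCall)
                .exceptionally(ex -> "fallback");
    }

    // Reactive style: retries and fallback are declared as operators on the chain.
    static Mono<String> loadReactive() {
        return Mono.fromCallable(ErrorHandlingMigrationSketch::unreliableCall)
                .retry(3)
                .onErrorReturn("fallback");
    }

    static String unreliableCall() {
        if (Math.random() < 0.5) {
            throw new IllegalStateException("transient failure");
        }
        return "payload";
    }

    public static void main(String[] args) {
        System.out.println(loadAsync().join());
        System.out.println(loadReactive().block());
    }
}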
Table 6. Static and initialization metrics.
Microservice (Paradigm) | Heap Memory Consumption (MB) | Startup Time (ms) | JAR Size (MB)
User Service (async) | 61 | 2363 | 52
User Service (reactive) | 38 | 1522 | 37
Tweet Service (async) | 62 | 2921 | 61
Tweet Service (reactive) | 45 | 1874 | 45
Interaction Service (async) | 70 | 3526 | 61
Interaction Service (reactive) | 51 | 2107 | 41
Table 7. Summary of application tests.
Type | Description | Quantity | Purpose
Unit Tests | Tests focusing on individual code components or methods. | ~150 per service (1023 total) | Ensures correctness and reliability of smaller functional units of the application.
Behavior-Driven Tests | Feature-based Cucumber tests, written in Gherkin to validate end-to-end behavior. | 50 scenarios | Validates user-facing functionality, integration, and consistency between asynchronous and reactive implementations.
Performance Tests | Simulations of concurrent users performing specific actions. | 4 scenarios × 7 concurrency levels (10, 50, 100, 250, 500, 750, 1000 users) | Measures resource usage (CPU, memory) and throughput under varying load conditions to compare asynchronous and reactive approaches.
Back to TopTop