{"id":2056254,"date":"2021-12-14T03:03:00","date_gmt":"2021-12-14T01:03:00","guid":{"rendered":"http:\/\/nhub.news\/?p=2056254"},"modified":"2021-12-14T06:04:20","modified_gmt":"2021-12-14T04:04:20","slug":"designing-high-volume-systems-using-event-driven-architectures","status":"publish","type":"post","link":"http:\/\/nhub.news\/fr\/2021\/12\/designing-high-volume-systems-using-event-driven-architectures\/","title":{"rendered":"Designing High-Volume Systems Using Event-Driven Architectures"},"content":{"rendered":"

Learn how to design and build a system meant to handle heavy loads while embracing cloud-native architectures.<\/b>
\nJoin the DZone community and get the full member experience. Microservices style application architecture is taking root and rapidly growing in population that are possibly scattered in different parts of the enterprise ecosystem. Organizing and efficiently operating them on a multi-cloud environment organizing data around microservices, making the data as real-time as possible are emerging to be some of the key challenges. Thanks to the latest development in Event-Driven Architecture (EDA) platforms such as Kafka and data management techniques such as Data Meshes and Data Fabrics, designing microservices-based applications is now much easier. However, to ensure these microservices-based applications perform at requisite levels, it is important to ensure critical Non-Functional Requirements (NFRs) are taken into consideration during the design time itself. In a series of blog articles, my colleagues Tanmay Ambre, Harish Bharti along with myself are attempting to describe a cohesive approach on Design for NFR. We take a use-case-based approach. In the first installment, we describe designing for \u201cperformance\u201d as the first critical NFR. This article focuses on architectural and design decisions that are the basis of high-volume, low-latency processing. To make these decisions clear and easy to understand, we describe their application to a high-level use case of funds transfer. We have simplified the use case to focus mainly on performance. Electronic Fund Transfer (ETF) is a very important way of sending and receiving money these days by consuming through digital channels. We consider this use case as a good candidate for explaining performance-related decisions as it involves a high volume of requests, coordinating with many distributed systems, and has no margin for error (i.e. the system needs to be reliable and fault-tolerant). In the past, fund transfers would take days. It would involve a visit to a branch or writing a cheque. However, with the emergence of digital, new payment mechanisms, payment gateways. and regulations, fund transfer has become instantaneous. For example, in September 2021,3.6 billion transactions worth 6.5 trillion INR were executed on the UPI network in real-time. Customers are expecting real-time payments across a wide variety of channels. Regulations such as PSD2, Open Banking, and country-specific regulations have made it mandatory to expose their payment mechanisms to trusted third-party application developers. Typically, a customer of a bank would place the request for fund transfer using one of the channels available (e.g., mobile app, online portal, or by visiting the institution). Once the request is received, the following needs to be performed: The below figure gives an overview of this: Note: For the sake of this article, this use case comprises operations done within the financial institution where the request originates. It relies upon already established payment gateways that execute the actual fund transfer. Operations of the payment gateway are outside the scope of this use case. Here are the most critical NFRs that we must address: The implementation model of this use case will be through a cloud-native style \u2013 Microservices, API, Containers, Event Streams, and Distributed Data Management with eventual consistency style data persistence for integrity. Please note that this architecture is based on the architectural best practices that are outlined in Architectural Considerations for Event-Driven Microservices-Based Systems. Following is the set of key architectural patterns considered to implement this user story: The following diagram provides an overview of the solution architecture: The application architecture is organized through a set of independently operable microservices. In addition, an orchestrator service (another microservice) coordinates the full transaction ensuring the end-to-end process execution is in place. The different services of fund transfer are wired together as a set of producers, processors, and consumers of events. They are 4 main processors: The API publishes an event to the input topic of Fund Transfer Orchestrator, which is the primary coordinator for Fund Transfer requests. Events are first-class citizens and are persistent. The events accumulate in an event store (enabling the event sourcing architectural pattern). Based on the event context and the payload, the orchestrator will transform the event and publish the state of fund transfer to another topic. The fund transfer state transitions are also recorded in a state store which can be used to regenerate the state in case of system-level failures. This state is consumed by the Fund Transfer Request Router which will then make routing decisions and route it to other systems (either single or multiple systems simultaneously). Other systems will do their processing and publish the outcome as an event to the input topic. Which are then correlated and processed by the Fund Transfer Orchestrator resulting in a state change of the Fund Transfer request. Functional exceptions are processed by the Fund Transfer Orchestrator and the Fund Transfer Request state is updated accordingly. Fund Transfer state changes are also consumed by the real-time Fund Transfer Statistics Service which aggregates the statistics of fund transfer across multiple different dimensions \u2013 so that the operations team can have a nearly real-time view of the Fund Transfer statistics. To implement the above application architecture, we decided on key technical building blocks: Following is an indicative technology stack that can be used to build this system. Most of them are open-source. It is possible to use other technologies. For e.g., Quarkus. Capability Implementation Choices DevOps Capability DevOps tool choices Programming Language Java Spring-boot, Quarkus, Golang CI \/CD Jenkins \/ Tekton, Maven\/ Gradle, Nexus \/ Quay Event Backbone Event Backbone \u2013 Apache Kafka, rabbit mq Deployment Automation Ansible, Chef Event \/ Message Format Avro, JSON Monitoring & Visualization Prometheus, Grafana, Micrometer, spring-boot actuator In-Memory Cache Apache Ignite, Redis, Hazelcast Service Mesh (Auto heal, autoscale, Canary++) Istio NoSQL Database Mongo, Cassandra, CouchDB Log Streams and Analytics EFK (Elastic Search, Filebeat, Kibana) Relational DB Postgres, Maria, MySQL Continuous Code Quality management Sonarqube, Cast New SQL (Distributed RDBMS) Yugabyte, CockroachDB Config & Source Code Management Git & Spring Cloud Config Following are the top three critical Design Decisions addressing the highly dynamic and complex NFRs. The above three design decisions will help address NFRs in the following categories: In the following section, we deep dive into some key design considerations around the top NFR \u2013 Performance. We will publish articles on design considerations around the other NFRs as follow on to this article. To address performance requirements, the following have been considered during design and implementation: Performance modeling \u2013 It is important to have a clear understanding of performance targets. This impacts many architectural, technology, infrastructure, and design decisions. With increased adoption of hybrid and multi-cloud solutions, performance modeling has become even more important. There are many architectural trade-offs that rely on performance modeling. Performance modeling should cover \u2013 transaction\/event inventory, workload modeling (concurrency, peak volumes, expected response time for different transactions\/events), and infrastructure modeling. Building a performance model helps create the deployment model (especially those related to scalability), making architectural and design optimizations to reduce latency, and helps in designing performance tests to validate performance and throughput. Avoid Monolithic monsters \u2013 Monolithic architecture centralizes processing. This means it won\u2019t be possible to scale different components independently. In fact, even the service implementation should be broken down into loosely coupled components using SEDA to provide the ability to scale each component separately and to make the service more resilient. Each deployed component should be independently scalable and deployed as a cluster to increase concurrency and resiliency. Choice of event backbone \u2013 Choice of event backbone impacts performance. Primarily these 5 characteristics of the backbone are important to consider from a performance perspective: Apache Kafka is a good choice because of its proven performance, fault-tolerance, scalability, and availability track record in numerous engagements. It takes care of the first three points above. For the last 2 points (rebalance time), Kafka\u2019s performance is dependent on the amount of data on the topics. Apache Pulsar tries to solve this. However, we are using Kafka given its proven track record, but we are closely monitoring the evolution of Apache Pulsar. Leverage Caching \u2013 Database queries are expensive. To avoid them, it is recommended to leverage caching. For e.g., Redis, Apache Ignite, etc. By using caching, a fast data layer can be created that will help in data lookup. All read-only calls are redirected to the cache instead of fetching the data from the database or some other remote service. The stream data processing pipelines populate (or update) the cache in near real-time. Event processors then reference the data in the cache, instead of querying databases or making service calls to systems of record. Event processors write to Ignite and then persistence processors asynchronously write the data to the database. This gives a major boost to performance. In our case, we chose Apache Ignite because: Caching can be made persistent. I.e., the cached data is written to disk. But it impacts performance. The performance penalty is very significant when not using SSD. If caches are non-persistent then the architecture needs to cater for rehydrating the cache from data\/event stores. Recovery performance is critical to reducing mean time to recovery. In our architecture \u2013 Kafka topics are leveraged to rehydrate the cache. They are our event\/state stores. There is a multi-instance recovery component that reads data from Kafka topics and rehydrates the cache. Collocated processing \u2013 Performance is at its best when the event producers, consumers, and the event backbone are collocated. However, this would mean if the datacenter goes down \u2013 it would bring down the entire platform. This can be avoided if replication\/mirroring is set up for the event backbone. For e.g., Kafka MirrorMaker 2 (MM2) can be used to set up replication across datacenters\/availability zones. Choosing the correct message format \u2013 The speed of serialization and de-serialization of messages has some impact on performance. There are multiple choices for message format. For e.g., XML, JSON, Avro, Protobuf, Thrift. For this application, we chose Avro due to its compactness, (de) serialization performance, and schema evolution support. Concurrency related decisions \u2013 In this architecture the key deciding parameters in terms of concurrency are: These have a direct impact on the throughput. In Kafka, each partition can be consumed by a single thread of the same consumer group. Having multiple partitions and multiple drives on Kafka brokers helps in spreading the events without having to worry about sequencing. Having a multi-threaded consumer can help to a certain extent till the resource utilization limits are reached (CPU, Memory, and Network). Having multiple instances in the same consumer group splits the load across multiple nodes\/servers providing horizontal scalability. When using partitions \u2013 the higher the number of partitions, the higher the concurrency. However, there are certain considerations when determining the number of partitions. It is important to choose the partition key is such a way that it evenly spreads events across partitions without breaking the ordering requirements. Also, over-partitioning has some implications in terms of open file handles, higher unavailability of Kafka topics in case of broker failures, higher end-to-end latency, and higher memory requirements for the consumer. Performing performance tests to benchmark performance and then extrapolating them to the desired throughput and performance helps in finalizing these 3 parameters. I\/O matters \u2013 I\/O contributes significantly to latency. It impacts all components of the architecture. It impacts the event backbone, caching, databases, and application components. For distributed caching components that have persistence enabled, I\/O is one of the key factors that impact performance. For e.g., for Apache Ignite \u2013 it is strongly recommended to use SSDs for good performance along with enabling direct I\/O. Non-persistent caches are the fastest \u2013 however, the downside is if the cache cluster goes down, cached data is lost. This can be solved by having a recovery process \u2013 which rehydrates the cache. However, it should be noted that the longer the recovery time \u2013 the larger the lag being built in the streaming pipeline. Therefore, the recovery process must be extremely fast. This can be done by having multiple instances of the recovery process running and avoiding any transformation\/business logic in the recovery processors. Kafka doesn\u2019t necessarily require high-performance disks (such as SSD). For Kafka, it is recommended to have multiple drives (and multiple log dirs) to get good throughput. And sharing of drives with application and operating system is not recommended. Additionally, mount options such as \u201c noatime \u201d provide performance gain. Other mount options that are dependent on the type of filesystem are discussed here: https:\/\/kafka.apache.org\/documentation.html#diskandfs From an Application point of view \u2013 keep logging to a minimum. Instead of logging everything, log outliers. Even though most of the logging frameworks are capable of asynchronous logging, there is still an impact on latency. Memory tuning \u2013 Our event streaming architecture relies heavily on in-memory processing. This is especially true for Kafka brokers and Apache Ignite server nodes. It is important to allocate adequate memory to processors, consumers, Kafka brokers and in-memory data-grid nodes. Tuning the operating system\u2019s memory settings can also help boost performance to an extent. One of the key parameters is \u201c vm.swappiness\u201d. This controls the swapping out of process memory. It takes a value between 0 and 100. Higher the number, swapping is done more aggressively. It is recommended to keep this number low to reduce swapping. In the case of Kafka, since it relies heavily on the page cache, therefore the \u201c vm dirty ratio\u201d options can be also tweaked to control the flushing of data to disks. Network Usage Optimization \u2013 to utilize the network efficiently it is recommended to apply compression to very large messages. This will reduce network utilization. However, that comes at a cost of higher CPU utilization for the producer, consumer, and broker. For e.g., Kafka supports 4 different compression types (gzip, lz4, snappy, zstd). Snappy fits in the middle, giving a good balance of CPU usage, compression ratio, speed, and network utilization. Reasons for choosing snappy are provided in the article Message Compression in Kafka. Only Deserialize relevant messages \u2013 In a publish-subscribe mechanism, a consumer can get messages which they are not interested in. Adding a filter will eliminate the need for processing non-relevant messages. However, it will still require de-serializing the event payloads. This can be avoided by adding event metadata in the header of the events. This gives a choice to the consumers to look at the event headers and decide whether to parse the payload or not. This significantly improves the throughput of the consumers and reduces resource utilization. Parse what is required (esp. XMLs) \u2013 In the financial services industry \u2013 XMLs are heavily used. It is quite possible that the input to the event streaming applications is XML. Parsing XML is CPU and memory-intensive. The choice of XML parser is a key decision related to performance and resource utilization. If the XML document is very large, it is recommended to use SAX parsers. If the event streaming application does not require fully parsing the XML document, then having pre-configured xpaths to lookup the required data and constructing the event payload from it may be a faster option. However, in case the entire XML data is required \u2013 it would be wise to parse the entire document once and convert it into the event streaming application\u2019s message format once, and having multiple instances of this processor instead of having each processor in the event streaming pipeline parsing the XML document. Detailed instrumentation for monitoring and identifying performance bottlenecks \u2013 It should be easy to identify performance bottlenecks. This can be achieved by having using a lightweight instrumentation framework (based on AOP). In our example, we combine AOP with Spring-Boot Actuator and Micrometer to expose a Prometheus endpoint. For Apache Kafka, we use the JMX exporter for Prometheus to gather Kafka performance metrics. We then use Grafana to build a rich dashboard displaying performance metrics for the producers, consumers, Kafka, and Ignite (basically all components of our architecture). This gives the ability to pinpoint bottlenecks very quickly rather than relying on log analysis and correlation. Please refer to our open-source framework for more. Fine-tuning Kafka configuration \u2013 Kafka has a huge set of configuration parameters for brokers, producers, consumers, and Kafka streams. These are the parameters we have used to tune Kafka for Latency and Throughput. Please refer to the references provided for additional information on tuning Kafka brokers, producers, and consumers. Tuning GC \u2013 Garbage collection tuning is important to avoid long pauses and excessive GC overhead. As of JDK 8, Garbage-First Garbage Collector (G1GC) is recommended for most of the applications (multi-processor machines with large memory). It attempts to meet collection pause time goals as well as tries to achieve high throughput. Additionally, not much configuration is required to get started. We ensure that sufficient memory is made available to the JVM. In the case of Ignite, we use the off-heap memory to store data. The -Xms and -Xmx are kept at the same value with the \u201c-XX:+AlwaysPreTouch\u201d to ensure all memory allocation from the OS happens during startup. There are additional parameters in G1GC such as the following can be used to tune throughput further (since in EDA throughput matters): For more details, please refer to the tuning G1GC. Tuning Apache Ignite \u2013 Apache Ignite tuning is well documented on its official site. Here are the few things that we have followed for improving its performance: In this article, we focused on key architectural decisions related to performance. It is extremely critical to maintain desired performance levels of each individual component of the architecture as issues in any one of them can cause the stream processing to choke. Therefore every component of the architecture needs to be tuned for performance without compromising on other NFRs. It is essential to have a deep technical understanding of the component so that it can be tuned effectively. Opinions expressed by DZone contributors are their own.<\/p>\n