There are a number of industries and use cases that feature streaming financial transactions. These include projects like securities trading, consumer banking, point of sales analysis, and fraud detection.
If we consider a system that streams financial data for fraud detection, it’s typical to see messages that are quite large, around 1MB in size. The messages might include transaction details, historical data, behavioral analytics, and more.
Financial data streams at a medium pace in fraud detection applications, usually around 10 messages per second (m/s). From time to time the streams might spike suddenly, based on global events, market conditions, or new emerging threats.
The partition calculator inputs below reflect these characteristics.
- The brokers are of similar capability.
- The load on the brokers’ machines is similar.
- The messages don't diverge too much in size.
- The messages are evenly distributed across all partitions.
- The number of brokers makes sense in this context.
- Brokers have similar latencies between producers and consumers.
- The throughput per producer is less than 10MB/s.
- Individual brokers have less than 40k partitions.
- The cluster has less than 200k partitions in total.