I'm writing a service that keeps a data structure in memory. It can be queried somewhat like a hash table, and it needs an API that works over the network. It has to run on a single machine because it needs a global view of all the data. The service that queries it currently handles about 20k req/s, sometimes reaches 50k, and has hit 100k req/s a few times per year.
My question: Is there an HTTP server library that can handle this throughput, or do I need to go with raw TCP? The data structure itself can handle about 1M lookups/s on a standard server, so the bottleneck would be the server library rather than the lookup. I want to avoid writing several prototype servers that turn out to have no chance of being fast enough.
Some extra info:
- The number of clients is a small bounded number.
- I can rearchitect the clients to open persistent connections if needed.
Most of our code is Java, so that is the preferred language, and the data structure already has a Java implementation. However, I'm also fine with rewriting it in C++, Go or Rust if there are higher-performing HTTP libraries in those languages.
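
For reference, here is a minimal sketch of the kind of endpoint I have in mind, using the JDK's built-in `com.sun.net.httpserver.HttpServer`. Everything specific in it is an assumption for illustration: the class name `LookupServer`, the `/lookup` path, the `key` query parameter, port 8080, and a `ConcurrentHashMap` standing in for the real data structure. It only shows the shape of the API and says nothing about whether this particular server reaches the target throughput.

```java
import com.sun.net.httpserver.HttpExchange;
import com.sun.net.httpserver.HttpServer;

import java.io.IOException;
import java.io.OutputStream;
import java.net.InetSocketAddress;
import java.nio.charset.StandardCharsets;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.Executors;

// Hypothetical sketch: names, path, and port are illustrative assumptions.
class LookupServer {

    // Returns the raw value of the "key" parameter from a query string such as
    // "x=1&key=abc", or null if it is absent. No URL-decoding is performed.
    static String keyFromQuery(String query) {
        if (query == null) {
            return null;
        }
        for (String pair : query.split("&")) {
            if (pair.startsWith("key=")) {
                return pair.substring("key=".length());
            }
        }
        return null;
    }

    // Starts an HTTP server on the given port that answers GET /lookup?key=...
    // from the supplied map (a stand-in for the real data structure).
    static HttpServer start(int port, Map<String, String> data) throws IOException {
        HttpServer server = HttpServer.create(new InetSocketAddress(port), 0);
        server.createContext("/lookup", exchange -> handle(exchange, data));
        // One worker thread per core; the right pool size is an open question.
        server.setExecutor(Executors.newFixedThreadPool(
                Runtime.getRuntime().availableProcessors()));
        server.start();
        return server;
    }

    private static void handle(HttpExchange exchange, Map<String, String> data)
            throws IOException {
        String key = keyFromQuery(exchange.getRequestURI().getRawQuery());
        String value = (key == null) ? null : data.get(key);
        if (value == null) {
            // -1 means "no response body".
            exchange.sendResponseHeaders(404, -1);
            exchange.close();
            return;
        }
        byte[] body = value.getBytes(StandardCharsets.UTF_8);
        exchange.sendResponseHeaders(200, body.length == 0 ? -1 : body.length);
        if (body.length > 0) {
            try (OutputStream out = exchange.getResponseBody()) {
                out.write(body);
            }
        }
        exchange.close();
    }

    public static void main(String[] args) throws IOException {
        Map<String, String> data = new ConcurrentHashMap<>();
        data.put("a", "1"); // sample entry for manual testing
        start(8080, data);  // port 8080 is an arbitrary choice
    }
}
```

With this running, `curl 'http://localhost:8080/lookup?key=a'` should return the stored value, and an unknown key should return a 404. Clients would reuse HTTP/1.1 keep-alive connections rather than opening a new connection per request.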