Posted 7 months, 2 weeks ago
Ntropy is building domain-specific caching infrastructure for LLMs that can reduce p99 latencies by 1000x and costs by 4-5 orders of magnitude on real-world, high-throughput tasks. We are currently targeting the financial and banki…
San Francisco, CA; London, UK