The core problem wasn’t a complex algorithm, but a simple, brutal mismatch of throughput. We had a sentence-transformer
2023-10-27