Cohere releases Rerank-3 model focused on semantic reranking
The report very substantial increase in NDCG metrics vs BM25 (yes, it's very old baseline, it's not used in companies who work on their search engines since there are very substantial improvements of the search when you modify ranking, but still it's a default rank in several open source search engine )
The new reranking is multilingual. I see this development by Cohere as very important. Last few months, I talked to multiple companies about their search engines, and I see a persistent pattern that it's harder to work on search in German, French, Italian etc vs English. There are libraries, there is a lot of work done within companies, but still it's very hard problem to develop search engine for companies whose customers need search in those languages
Latency still needs to be improved. A typical hard requirement for any search engine is to have not more than 1 sec end-to-end search experience (query is typed -> the page is rendered ) (of course, it can be different and even below , but I consider the whole landscape, including travel, ecommerce, fashion and other companies, which customer do not compare their search experience to Google directly and do not expect 200ms). 1 sec end-to-end, leaves approximately 200-400ms (depending on system) for search backend including retrieval, ranking, or typically less than 200 ms for reranking (I take upper limits across various search engines in various sectors, in many cases, limits are significantly smaller ). Typical *reranking* is hundreds-thousand of documents. Cohere model is around 3 sec for 50 seconds (for 4096 context window, ), Still a lot to improve to make it useful for many companies
Enterprise Architect | Advance Data Analytics | Cloud & Digital Transformation| Togaf9 | Kaggle Bronze |
4moVery informative