fp.
newest
Open in hackernews
LLM rerankers for production RAG: tips and tricks
https://fin.ai/research/using-llms-as-a-reranker-for-rag-a-practical-guide/
5
•
mathcircler
•
4mo ago
Comments
alexpivnenko
•
4mo ago
Surprised that removing spaces actually had such a big effect on latency.
Also props for including the prompt and AB results
alexpivnenko•4mo ago
Also props for including the prompt and AB results