Improving RAG Retrieval by 60% with Fine-Tuned Embeddings
Actually worked better than I thought lol

Resources:
Code: https://github.com/ALucek/ft-modernbert-domain
Model: https://huggingface.co/AdamLucek/ModernBERT-embed-base-legal-MRL
Dataset: https://huggingface.co/datasets/AdamLucek/legal-rag-positives-synthetic
Philipp Schmid’s Blog: https://www.philschmid.de/fine-tune-embedding-model-for-rag#3-define-loss-function-with-matryoshka-representation
Matryoshka Representation …