view article Article Training and Finetuning Sparse Embedding Models with Sentence Transformers v5 Jul 1 • 130
An Analysis of Hyper-Parameter Optimization Methods for Retrieval Augmented Generation Paper • 2505.03452 • Published May 6 • 2
LongNet: Scaling Transformers to 1,000,000,000 Tokens Paper • 2307.02486 • Published Jul 5, 2023 • 81