DenseOn & LateOn Collection A collection of open state-of-the-art single and multi-vector models β’ 7 items β’ Updated 2 days ago β’ 8
view article Article DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models 2 days ago β’ 30
view article Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers 8 days ago β’ 63
Gemma 4 Collection Gemma 4 is Google's new model family including including E2B, E4B, 26B-A4B, and 31B. β’ 28 items β’ Updated 1 day ago β’ 155
ByT5: Towards a token-free future with pre-trained byte-to-byte models Paper β’ 2105.13626 β’ Published May 28, 2021 β’ 5
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings β’ 7 items β’ Updated Feb 26 β’ 96
LateOn-Code π» Collection State-of-the-art late interaction code retrieval models β’ 6 items β’ Updated 16 days ago β’ 18
view article Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling Feb 12 β’ 54
view article Article LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family Jan 19 β’ 92
PyLate π Collection State-of-the-art late interaction models trained using PyLate β’ 5 items β’ Updated 16 days ago β’ 4