AQ-MedAI/Diver-Retriever-4B
Text Ranking
β’
4B
β’
Updated
β’
4.72k
β’
21
None defined yet.
Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO
GroupRank: A Groupwise Reranking Paradigm Driven by Reinforcement Learning