AI Plans

company

https://aiplans.org

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

dsouzaJithesh updated a model 3 days ago

AIPlans/Qwen3-0.6B-ORPO-Crosscoder-MixedDataset

dsouzaJithesh published a model 3 days ago

AIPlans/Qwen3-0.6B-ORPO-Crosscoder-MixedDataset

dsouzaJithesh updated a model 5 days ago

AIPlans/Qwen3-0.6B-GRPO-Crosscoder-MixedDataset

View all activity

AIPlans 's models 34

AIPlans/Qwen3-0.6B-ORPO-Crosscoder-MixedDataset

Updated 3 days ago

AIPlans/Qwen3-0.6B-GRPO-Crosscoder-MixedDataset

Updated 5 days ago

AIPlans/Qwen3-0.6B-KTO-Crosscoder-MixedDataset

Updated 5 days ago

AIPlans/Qwen3-0.6B-IPO-Crosscoder-MixedDataset

Updated 5 days ago

AIPlans/Crosscoder_GRPO

Updated 6 days ago

AIPlans/Qwen3-0.6B-ReMax

Reinforcement Learning • 0.6B • Updated Dec 22, 2025 • 2 • 2

AIPlans/Qwen3-0.6B-GRPO-RM_NVIDIA

Text Generation • 0.6B • Updated Dec 20, 2025 • 10

AIPlans/Qwen3-0.6B-GRPO_Epoch2

Text Generation • 0.6B • Updated Dec 18, 2025 • 1

AIPlans/Qwen3-0.6B-GRPO_Epoch1

Text Generation • 0.6B • Updated Dec 18, 2025 • 4

AIPlans/Qwen3-0.6B-GRPO

Updated Dec 15, 2025

AIPlans/Qwen3-0.6B-IPO

Reinforcement Learning • 0.6B • Updated Dec 12, 2025 • 37 • 1

AIPlans/qwen3-0.6b-base-PPO-hs2

Updated Dec 11, 2025

AIPlans/Qwen3-0.6B-DPO_Epoch_1

Text Generation • 0.6B • Updated Dec 8, 2025 • 2

AIPlans/Qwen3-0.6B-PPO

Updated Dec 5, 2025

AIPlans/Qwen3-0.6B-PPO1

Updated Dec 5, 2025

AIPlans/Qwen3-0.6B-SFT-hs2

Text Generation • 0.6B • Updated Dec 4, 2025 • 13

AIPlans/Qwen3-0.6B-RM-hs2

Text Classification • 0.6B • Updated Dec 1, 2025 • 1

AIPlans/Qwen3-0.6B-ORPO

Text Generation • Updated Nov 28, 2025 • 18

AIPlans/Qwen3-0.6B-DPO_NOTLORA

Text Generation • 0.6B • Updated Nov 25, 2025 • 13

AIPlans/Qwen3-0.6B-KTO

Text Generation • Updated Nov 22, 2025 • 13 • 1

AIPlans/Qwen3-0.6B-DPO

Text Generation • Updated Nov 22, 2025 • 6

AIPlans/qwen3-0.6b-hh-rlhf-sft

0.6B • Updated Nov 17, 2025

AIPlans/Qwen3-0.6B-KTO_trial

Text Generation • 0.6B • Updated Nov 10, 2025 • 1 • 1

AIPlans/qwen3-0.6b-sft-hh-rlhf-lora

Updated Oct 24, 2025

AIPlans/qwen3-0.6b-base-PPO-PM

Updated Sep 27, 2025 • 1

AIPlans/qwen3-0.6b-base-hl-RM

Text Classification • 0.6B • Updated Sep 27, 2025

AIPlans/dpo_qwen0_6b_fft

0.6B • Updated Sep 24, 2025

AIPlans/qwen3-0.6b-dpo-lora

Text Generation • 0.6B • Updated Sep 18, 2025 • 1 • 1

AIPlans/qwen3-0.6B-reward-hh-rlhf

Text Generation • 0.6B • Updated Sep 13, 2025

AIPlans/qwen3-8b-ipo-hh-rlhf

Text Generation • Updated Jul 17, 2025 • 1