ARO-Lang/aro-coder-4bit

@@ -26,8 +26,8 @@ ARO is a domain-specific language where every statement follows the pattern:
 | **Base model** | [mlx-community/Qwen3-Coder-30B-A3B-Instruct-4bit](https://huggingface.co/mlx-community/Qwen3-Coder-30B-A3B-Instruct-4bit) |
 | **Quantization** | 4-bit (MLX) |
 | **Language** | ARO |
-| **Training samples** | 2862 |
-| **Syntax pass rate** | 73% |
+| **Training samples** | 3630 |
+| **Syntax pass rate** | 58% |
 | **Source label** | distill_student |
 ## Links
@@ -108,7 +108,7 @@ Key features:
 This model was trained with the ARO training pipeline:
-1. **Corpus collection** — 2862 samples from Examples, Book, Wiki, Proposals, and real-world ARO applications
+1. **Corpus collection** — 3630 samples from Examples, Book, Wiki, Proposals, and real-world ARO applications
 2. **Supervised fine-tuning** — LoRA on all code generation, debugging, Q&A, and explanation tasks
 3. **DPO preference training** — using `aro check` validation to build chosen/rejected pairs
 4. **Iterative self-improvement** — multiple rounds of generate-validate-retrain
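
The DPO step and the "Syntax pass rate" figure both rest on one idea: a validator sorts generated ARO code into passing and failing outputs. The Python sketch below illustrates that idea only and is not the project's actual pipeline. The names `PreferencePair`, `build_pairs`, and `syntax_pass_rate` are hypothetical, and because the diff does not show how `aro check` is invoked, the validator is passed in as a plain callable rather than wrapped as a CLI call.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class PreferencePair:
    """One DPO training example (hypothetical structure)."""
    prompt: str
    chosen: str    # completion that passed validation
    rejected: str  # completion that failed validation


def build_pairs(
    candidates: dict[str, list[str]],
    is_valid: Callable[[str], bool],
) -> list[PreferencePair]:
    """Pair passing and failing completions for the same prompt.

    `is_valid` stands in for whatever wraps `aro check`; its exact
    invocation is an assumption and is left to the caller.
    """
    pairs: list[PreferencePair] = []
    for prompt, completions in candidates.items():
        passing: list[str] = []
        failing: list[str] = []
        for code in completions:
            (passing if is_valid(code) else failing).append(code)
        # zip stops at the shorter list, so a prompt with only passing
        # (or only failing) completions contributes no pairs.
        for chosen, rejected in zip(passing, failing):
            pairs.append(PreferencePair(prompt, chosen, rejected))
    return pairs


def syntax_pass_rate(
    completions: list[str],
    is_valid: Callable[[str], bool],
) -> float:
    """Fraction of completions accepted by the validator (0.0 if empty)."""
    if not completions:
        return 0.0
    return sum(1 for code in completions if is_valid(code)) / len(completions)
```

Under these assumptions, an iterative round would generate several completions per prompt, call `build_pairs` to obtain new preference data, retrain, and then recompute `syntax_pass_rate` on a held-out prompt set to compare rounds.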