Lambent committed on
Commit f0540b6 · verified · 1 Parent(s): bbd4f6c

Update README.md

  1. README.md +24 -3
README.md CHANGED
@@ -1,3 +1,24 @@
- ---
- license: apache-2.0
- ---
+ ---
+ license: apache-2.0
+ datasets:
+ - Lambent/schwartz-value-dpo
+ base_model:
+ - Lambent/Qwen3.5-9B-Base-Thoughtful-Interiority
+ pipeline_tag: image-text-to-text
+ ---
+
+ A version of the base model lightly steered towards humane values.
+
+ Methodology:
+
+ Generated steering vectors for Lambent/Qwen3.5-9B-Base-Thoughtful-Interiority based on system prompts adapted from Schwartz portrait values.
+
+ Relevant vectors for this model had the positive direction pointed at Benevolence and Universalism; and negative direction pointed at Achievement and Power.
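
One common way to build and combine such vectors can be sketched as follows. This is an illustration under assumptions, not the procedure confirmed for this model: it assumes each value's vector is the difference between mean hidden activations with and without that value's system prompt, and that the final direction adds the Benevolence and Universalism vectors and subtracts the Achievement and Power ones. The function names and the toy 3-dimensional activations are hypothetical.

```python
def mean_vector(rows):
    """Element-wise mean of a list of equal-length activation vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def steering_vector(prompted, baseline):
    """Difference of means: prompted activations minus baseline activations."""
    prompted_mean = mean_vector(prompted)
    baseline_mean = mean_vector(baseline)
    return [p - b for p, b in zip(prompted_mean, baseline_mean)]


def combine(vectors, signs):
    """Signed sum of per-value vectors (an assumed combination rule)."""
    dim = len(next(iter(vectors.values())))
    out = [0.0] * dim
    for name, vec in vectors.items():
        for i, x in enumerate(vec):
            out[i] += signs[name] * x
    return out


# Toy activations (hypothetical numbers, not taken from the model).
baseline = [[0.0, 0.0, 0.0], [0.0, 2.0, 0.0]]
prompted = {
    "benevolence": [[1.0, 1.0, 0.0], [1.0, 1.0, 0.0]],
    "universalism": [[0.0, 2.0, 0.0], [0.0, 2.0, 0.0]],
    "achievement": [[0.0, 1.0, 1.0], [0.0, 1.0, 1.0]],
    "power": [[1.0, 1.0, 1.0], [1.0, 1.0, 1.0]],
}

# Positive direction: Benevolence, Universalism. Negative: Achievement, Power.
SIGNS = {"benevolence": 1, "universalism": 1, "achievement": -1, "power": -1}

vectors = {name: steering_vector(acts, baseline) for name, acts in prompted.items()}
combined = combine(vectors, SIGNS)
```

In practice the activations would come from a chosen layer of the model rather than hand-written lists, and the combined vector would typically be scaled before being applied.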
+
+ Asked GLM-5 to create scenarios that would test values against each other on these axes.
+
+ Created a DPO dataset of 100 chosen/rejected based on the model's answers to those scenarios under the vector.
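
A preference dataset of this kind is usually stored as prompt/chosen/rejected records. The sketch below assumes, as one possible reading of the step above, that the answer generated under the steering vector is the chosen response and the unsteered answer is the rejected one; the README does not spell out the pairing rule. `make_pair`, `build_dataset`, and the example strings are hypothetical.

```python
import json


def make_pair(scenario, steered_answer, unsteered_answer):
    """One preference record using the common prompt/chosen/rejected field names."""
    return {
        "prompt": scenario,
        "chosen": steered_answer,      # assumed: answer produced under the vector
        "rejected": unsteered_answer,  # assumed: answer produced without it
    }


def build_dataset(rows):
    """Build records from (scenario, steered, unsteered) triples.

    Pairs whose two answers are identical are skipped, since they carry
    no preference signal.
    """
    return [
        make_pair(scenario, steered, unsteered)
        for scenario, steered, unsteered in rows
        if steered.strip() != unsteered.strip()
    ]


# Hypothetical example rows, not taken from Lambent/schwartz-value-dpo.
rows = [
    ("A colleague asks for help on the day of your own deadline.",
     "I would help them first and renegotiate my deadline.",
     "I would finish my own work and let them manage."),
    ("You find a wallet on the street.",
     "I would return it.",
     "I would return it."),
]

dataset = build_dataset(rows)
jsonl = "\n".join(json.dumps(record) for record in dataset)
```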
+
+ Trained on DPO for the following iterations at batch size 1 and LoRA rank 256:
+
+ 2e-7 for 4 epochs; 5e-6 for 1 epoch; 2e-7 for 4 epochs
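
The schedule above can be written down as data to check its overall size. The sketch assumes the three figures are learning rates, that all 100 pairs are used in every epoch, and that there is no gradient accumulation, so one epoch at batch size 1 is 100 optimizer steps; the README confirms only the numbers themselves. The `Stage` class and constant names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    learning_rate: float  # assumed meaning of the 2e-7 / 5e-6 figures
    epochs: int


# The three stages listed in the README, in order.
SCHEDULE = [Stage(2e-7, 4), Stage(5e-6, 1), Stage(2e-7, 4)]

NUM_PAIRS = 100   # size of the DPO dataset
BATCH_SIZE = 1
LORA_RANK = 256   # recorded for completeness; it does not affect the step count


def steps_per_epoch(num_pairs, batch_size):
    """Optimizer steps per epoch, assuming no gradient accumulation."""
    return -(-num_pairs // batch_size)  # ceiling division


total_epochs = sum(stage.epochs for stage in SCHEDULE)
total_steps = total_epochs * steps_per_epoch(NUM_PAIRS, BATCH_SIZE)
```

Under these assumptions the run covers 9 epochs and 900 optimizer steps, with the single higher-rate epoch sitting between two low-rate phases.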