louhless commited on
Commit
703e17a
·
verified ·
1 Parent(s): aa9504b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +97 -0
README.md CHANGED
@@ -1,3 +1,100 @@
1
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2
  license: apache-2.0
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language:
3
+ - en
4
+ - de
5
+ tags:
6
+ - text-generation
7
+ - causal-lm
8
+ - fine-tuned
9
+ - gguf
10
+ - code
11
+ - python
12
+ - glsl
13
+ - opengl
14
+ - german
15
+ - news
16
+ - experimental
17
  license: apache-2.0
18
+ base_model: Qwen/Qwen2.5-Coder-0.5B-Instruct
19
+ pipeline_tag: text-generation
20
+ library_name: transformers
21
+ model_creator: louhless
22
  ---
23
+
24
+ # Ycoder-medium
25
+
26
+ `Ycoder-medium` is an experimental local fine-tune of `Qwen/Qwen2.5-Coder-0.5B-Instruct` created by **louhless**.
27
+
28
+ It is targeted at:
29
+
30
+ - OpenGL / GLSL
31
+ - Python
32
+ - German replies
33
+ - cautious 2025-2026 news and public-health summaries
34
+
35
+ ## Important Note
36
+
37
+ This model is **not trained from scratch**.
38
+
39
+ It is a small LoRA fine-tune on top of `Qwen/Qwen2.5-Coder-0.5B-Instruct`.
40
+
41
+ The goal is to improve behavior in a narrow target set. Any “15% improvement” claim should be treated as a target, not a verified benchmark result, unless evaluated on a fixed benchmark before and after training.
42
+
43
+ ## Model Details
44
+
45
+ - **Model name:** `Ycoder-medium`
46
+ - **Creator:** `louhless`
47
+ - **Base model:** `Qwen/Qwen2.5-Coder-0.5B-Instruct`
48
+ - **Architecture:** Qwen2 causal language model
49
+ - **Context length:** 32768
50
+ - **Language:** English and German
51
+ - **Export:** GGUF available
52
+ - **Status:** experimental
53
+
54
+ ## Training Focus
55
+
56
+ The model was tuned for:
57
+
58
+ - Python utility code
59
+ - Python code explanations
60
+ - GLSL fragment shaders
61
+ - GLSL vertex shaders
62
+ - OpenGL concepts such as VAO/VBO
63
+ - German short-form answers
64
+ - simple math
65
+ - cautious dated summaries for 2025-2026 public-health/news topics
66
+
67
+ ## News / Health Safety
68
+
69
+ For topics such as Hantavirus, the project uses both small fine-tuning examples and local dated context snippets.
70
+
71
+ This is intentional: recent news and public-health information should not be trusted from model weights alone.
72
+
73
+ The model should:
74
+
75
+ - answer cautiously
76
+ - mention dates when relevant
77
+ - avoid medical diagnosis
78
+ - avoid treatment promises
79
+ - recommend official sources such as WHO, CDC, ECDC, or local health authorities
80
+
81
+ It should **not** be used for diagnosis or medical decision-making.
82
+
83
+ ## Training Data
84
+
85
+ The initial custom dataset includes examples for:
86
+
87
+ - Python utility functions and explanations
88
+ - GLSL shaders and OpenGL concepts
89
+ - German short answers
90
+ - simple math
91
+ - dated 2025-2026 Hantavirus summaries based on WHO, CDC, and ECDC public information
92
+
93
+ ## Example Prompts
94
+
95
+ ### Python
96
+
97
+ Prompt:
98
+
99
+ ```text
100
+ Write Python code to read a JSON file safely.