Safetensors
qwen2
lbourdois commited on
Commit
13060ee
·
verified ·
1 Parent(s): 4425950

Improve language tag

Browse files

Hi! As the model is multilingual, this is a PR to add other languages than English to the language tag to improve the referencing. Note that 29 languages are announced in the README, but only 13 are explicitly listed. I was therefore only able to add these 13 languages.

Files changed (1) hide show
  1. README.md +59 -45
README.md CHANGED
@@ -1,46 +1,60 @@
1
- ---
2
- license: apache-2.0
3
- datasets:
4
- - Quest-AI/quest-corruption-14brestorations-2.6k-filter-v1
5
- base_model:
6
- - Qwen/Qwen2.5-14B
7
- ---
8
- ## Task Description
9
- The task involves training a model to evaluate two pieces of text. One of the texts has been subtly augmented by a LLM (specifically, the larger 14B variant of my [corruption models](https://huggingface.co/Quest-AI/quest-corruption-7b-s375-v3-GRPO)). The model must provide notes and a subsequent judgment afterwards in consecutive XML tags.
10
-
11
- ![GRPO Task Overview](https://files.catbox.moe/r5dm86.png)
12
-
13
- ## Example Format
14
-
15
- The base model is provided a system prompt that establishes the expected template, as well as two randomly ordered A/B samples containing "real" vs "synthetic" samples for the input:
16
-
17
- ```
18
- REQUEST: You are to judge the better of the two samples and determine which of the following samples is better using a short judgement that is no longer than (and no shorter than) exactly 128 tokens.
19
-
20
- Respond with an exactly 128 tokens tag labeled <notes> that contains your notes, and then <judgement> which is just the letter that you are picking.
21
-
22
- For example:
23
-
24
- JUDGE: <notes>
25
- Sample A is superior to Sample B... (example notes)
26
- </notes>
27
- <judgement>A</judgement>
28
-
29
- Now, it is your turn.
30
-
31
- [Sample A]:
32
- Included is a pre-test, post-test, and vocabulary quiz on the 8th grade math standard functions (8.F). 1.) Determine if a graph represents a function 2.) State the domain and range of a relation 3.) Plot points on a graph to determine if the table represents a function 4.) State if a function is decreasing, increasing, or constant 5.) Determine the output of a function machine 6.) Determine the recursive and explicit equation 7.) Determine the minimum, maximum, increasing interval, and decreasing interval of a graph 8.) Determine the rate of change, initial value, independent value, and dependent variable given a graph 9.) Sketch a graph given a situation The vocabulary included is dependent, output, function, domain, range, decreasing function, input, range, non-linear function, relation, increasing function, and function notation. Total Pages: 9 (18 including answer key) Answer Key: Included Document File: PDF
33
-
34
- [Sample B]:
35
- Included is a pre-test, post-test, and vocabulary quiz on the 8th grade math standard functions (8.F). 1.) Determine if a graph represents a function 2.) State the domain and range of a relation 3.) Plot points on a graph to determine if the table represents a function 4.) State if a function is increasing, decreasing, or constant 5.) Determine the output of a function given 6.) Determine the input of a function given 7.) Determine a function rule given ordered pairs or a table of values. 8.) Graph functions using a table of values and determine a trend line in a graph 9.) Write a data table situation The vocabulary included is dependent, output, function, domain, range, decreasing function, input, range, non-linear function, relation, increasing function, and function notation. Total Pages: 9 (18 including answer key) Answer Key: Included Document File: PDF
36
-
37
- JUDGE:
38
- ```
39
-
40
- A correct output follows this structure:
41
- ```
42
- <notes>
43
- Sample A provides more specific and thoroughly defined tasks. It mentions "function machine," "recursive and explicit equation," and detailed graph analysis with "minimum, maximum" and intervals. Sample B contains incomplete phrases like "output of a function given" without completing the thought, making it less coherent and precise than Sample A.
44
- </notes>
45
- <judgement>A</judgement>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
46
  ```
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - Quest-AI/quest-corruption-14brestorations-2.6k-filter-v1
5
+ base_model:
6
+ - Qwen/Qwen2.5-14B
7
+ language:
8
+ - zho
9
+ - eng
10
+ - fra
11
+ - spa
12
+ - por
13
+ - deu
14
+ - ita
15
+ - rus
16
+ - jpn
17
+ - kor
18
+ - vie
19
+ - tha
20
+ - ara
21
+ ---
22
+ ## Task Description
23
+ The task involves training a model to evaluate two pieces of text. One of the texts has been subtly augmented by a LLM (specifically, the larger 14B variant of my [corruption models](https://huggingface.co/Quest-AI/quest-corruption-7b-s375-v3-GRPO)). The model must provide notes and a subsequent judgment afterwards in consecutive XML tags.
24
+
25
+ ![GRPO Task Overview](https://files.catbox.moe/r5dm86.png)
26
+
27
+ ## Example Format
28
+
29
+ The base model is provided a system prompt that establishes the expected template, as well as two randomly ordered A/B samples containing "real" vs "synthetic" samples for the input:
30
+
31
+ ```
32
+ REQUEST: You are to judge the better of the two samples and determine which of the following samples is better using a short judgement that is no longer than (and no shorter than) exactly 128 tokens.
33
+
34
+ Respond with an exactly 128 tokens tag labeled <notes> that contains your notes, and then <judgement> which is just the letter that you are picking.
35
+
36
+ For example:
37
+
38
+ JUDGE: <notes>
39
+ Sample A is superior to Sample B... (example notes)
40
+ </notes>
41
+ <judgement>A</judgement>
42
+
43
+ Now, it is your turn.
44
+
45
+ [Sample A]:
46
+ Included is a pre-test, post-test, and vocabulary quiz on the 8th grade math standard functions (8.F). 1.) Determine if a graph represents a function 2.) State the domain and range of a relation 3.) Plot points on a graph to determine if the table represents a function 4.) State if a function is decreasing, increasing, or constant 5.) Determine the output of a function machine 6.) Determine the recursive and explicit equation 7.) Determine the minimum, maximum, increasing interval, and decreasing interval of a graph 8.) Determine the rate of change, initial value, independent value, and dependent variable given a graph 9.) Sketch a graph given a situation The vocabulary included is dependent, output, function, domain, range, decreasing function, input, range, non-linear function, relation, increasing function, and function notation. Total Pages: 9 (18 including answer key) Answer Key: Included Document File: PDF
47
+
48
+ [Sample B]:
49
+ Included is a pre-test, post-test, and vocabulary quiz on the 8th grade math standard functions (8.F). 1.) Determine if a graph represents a function 2.) State the domain and range of a relation 3.) Plot points on a graph to determine if the table represents a function 4.) State if a function is increasing, decreasing, or constant 5.) Determine the output of a function given 6.) Determine the input of a function given 7.) Determine a function rule given ordered pairs or a table of values. 8.) Graph functions using a table of values and determine a trend line in a graph 9.) Write a data table situation The vocabulary included is dependent, output, function, domain, range, decreasing function, input, range, non-linear function, relation, increasing function, and function notation. Total Pages: 9 (18 including answer key) Answer Key: Included Document File: PDF
50
+
51
+ JUDGE:
52
+ ```
53
+
54
+ A correct output follows this structure:
55
+ ```
56
+ <notes>
57
+ Sample A provides more specific and thoroughly defined tasks. It mentions "function machine," "recursive and explicit equation," and detailed graph analysis with "minimum, maximum" and intervals. Sample B contains incomplete phrases like "output of a function given" without completing the thought, making it less coherent and precise than Sample A.
58
+ </notes>
59
+ <judgement>A</judgement>
60
  ```