File tree
6 files changed, +48 -4 lines changed
- sota-implementations/grpo
- config/mode
- torchrl/collectors/llm
Lines changed: 5 additions & 1 deletion
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 107-109 | 107-109 | |
| 110 | | - |
| | 110-114 | + |
| 111-113 | 115-117 | |
Lines changed: 2 additions & 0 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 9-11 | 9-11 | |
| | 12-13 | + |
Lines changed: 3 additions & 0 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 9-11 | 9-11 | |
| | 12-14 | + |
Lines changed: 2 additions & 0 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 465-467 | 465-467 | |
| | 468-469 | + |
| 468-470 | 470-472 | |
Lines changed: 2 additions & 0 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 483-485 | 483-485 | |
| | 486 | + |
| 486-488 | 487-489 | |
| 495-497 | 496-498 | |
| | 499 | + |
| 498-500 | 500-502 | |
Lines changed: 34 additions & 3 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 4-6 | 4-6 | |
| | 7-8 | + |
| 7-9 | 9-11 | |
| 55-57 | 57-59 | |
| | 60-72 | + |
| 58-60 | 73-75 | |
| 81-83 | 96-98 | |
| | 99 | + |
| 84-86 | 100-102 | |
| 93-95 | 109-111 | |
| 96 | | - |
| | 112-114 | + |
| 97 | 115 | |
| | 116 | + |
| 98-100 | 117-119 | |
| 113-115 | 132-134 | |
| | 135-137 | + |
| 116-121 | 138-143 | |
| 122 | | - |
| | 144 | + |
| 123-125 | 145-147 | |
| | 148-151 | + |
| 126-127 | 152-153 | |
| 128 | | - |
| | 154-159 | + |
| 129-131 | 160-162 | |