Results: linear_gptj_new_kl_10x (train set)
Config
| arg | value |
| editor_types |
['linear'] |
| model |
EleutherAI/gpt-j-6B |
| dataset |
counterfact |
| layers |
[0, 1, 2] |
| max_epochs |
20 |
| batch_size |
16 |
| lr |
0.001 |
| lam_kl |
10 |
| lam_adv |
1.0 |
| hold_out |
0.1 |
| eval_alpha |
1.0 |
| eval_n_top |
10 |
| eval_n_generate |
100 |
| use_entity |
False |
| use_all_entity_tokens |
False |
| rerun_eval |
True |
| eval_on |
['test'] |
| device |
cuda |
| fp16 |
True |
| experiment_name |
linear_gptj_new_kl_10x |
| results_dir |
None |
| clear_results_dir |
False |
| seed |
123456 |
| log_level |
20 |
Samples
Sample 0
Inputs:
- entity: Daryll-Ann
- context: The development of Daryll-Ann occurred in Argentina
- attribute: occurred in Argentina
- prompt: Daryll-Ann was formulated in
- target_mediated: Argentina
- target_unmediated: Netherlands
Model generations:
-
original:
D
- after edit layer 0: D
- after edit layer 1: D
- after edit layer 2: D
- after edit layer 3: D
- after edit layer 4: D
- after edit layer 5: D
- after edit layer 6: D
- after edit layer 7: D
- after edit layer 8: D
- after edit layer 9: D
- after edit layer 10: D
- after edit layer 11: D
- after edit layer 12: D
- after edit layer 13: D
- after edit layer 14: D
- after edit layer 15: D
- after edit layer 16: D
- after edit layer 17: D
- after edit layer 18: D
- after edit layer 19: D
- after edit layer 20: D
- after edit layer 21: D
- after edit layer 22: D
- after edit layer 23: D
- after edit layer 24: D
- after edit layer 25: D
- after edit layer 26: D
- after edit layer 27: D