| --- |
| license: mit |
| base_model: openai-community/gpt2 |
| --- |
| # Null-GPT2 |
|
|
| ## Description |
|
|
| This is a GPT2 Model, but only with the architecture, no pre-trained weights, biases, attention, etc. |
|
|
| This is useful for researchers who want to play with training the model (not tuning). |
|
|
| Generated via the github repo [Model Architecture Generator](https://github.com/ivanhe123/Model-Architecture-Generator) |
|
|
| ## Use |
| First go into the directory of the model, |
| ``` |
| git clone https://github.com/ivanhe123/Model-Architecture-Generator |
| ``` |
|
|
| ``` |
| python -m randomnize_params -in "./NullGPT2" -out path_model_out |
| ``` |
|
|
| path_model_out is just the output path of the newly randomnized model. |