Update README.md
README.md (changed):
````diff
@@ -8,13 +8,18 @@ tags:
 - large language model
 - h2o-llmstudio
 inference: false
-thumbnail:
+thumbnail: >-
+  https://h2o.ai/etc.clientlibs/h2o/clientlibs/clientlib-site/resources/images/favicon.ico
+license: apache-2.0
+datasets:
+- OpenAssistant/oasst1
 ---
 # Model Card
 ## Summary
 
 This model was trained using [H2O LLM Studio](https://github.com/h2oai/h2o-llmstudio).
 - Base model: [openlm-research/open_llama_3b](https://huggingface.co/openlm-research/open_llama_3b)
+- Dataset preparation: [OpenAssistant/oasst1](https://github.com/h2oai/h2o-llmstudio/blob/1935d84d9caafed3ee686ad2733eb02d2abfce57/app_utils/utils.py#LL1896C5-L1896C28)
 
 
 ## Usage
@@ -22,7 +27,7 @@ This model was trained using [H2O LLM Studio](https://github.com/h2oai/h2o-llmst
 To use the model with the `transformers` library on a machine with GPUs, first make sure you have the `transformers`, `accelerate` and `torch` libraries installed.
 
 ```bash
-pip install transformers==4.
+pip install transformers==4.30.2
 pip install accelerate==0.20.3
 pip install torch==2.0.0
 ```
@@ -174,15 +179,6 @@ LlamaForCausalLM(
 This model was trained using H2O LLM Studio and with the configuration in [cfg.yaml](cfg.yaml). Visit [H2O LLM Studio](https://github.com/h2oai/h2o-llmstudio) to learn how to train your own large language models.
 
 
-## Model Validation
-
-Model validation results using [EleutherAI lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness).
-
-```bash
-CUDA_VISIBLE_DEVICES=0 python main.py --model hf-causal-experimental --model_args pretrained=h2oai/h2ogpt-gm-oasst1-en-2048-open-llama-3b --tasks openbookqa,arc_easy,winogrande,hellaswag,arc_challenge,piqa,boolq --device cuda &> eval.log
-```
-
-
 ## Disclaimer
 
 Please read this disclaimer carefully before using the large language model provided in this repository. Your use of the model signifies your agreement to the following terms and conditions.
````