Dacheng Li
commited on
Commit
·
8329c1d
1
Parent(s):
00b59f7
Update README.md
Browse files
README.md
CHANGED
@@ -3,16 +3,16 @@ license: apache-2.0
|
|
3 |
inference: false
|
4 |
---
|
5 |
|
6 |
-
#
|
7 |
|
8 |
## Model details
|
9 |
|
10 |
**Model type:**
|
11 |
-
|
12 |
It is based on an encoder-decoder transformer architecture, and can autoregressively generate responses to users' inputs.
|
13 |
|
14 |
**Model date:**
|
15 |
-
|
16 |
|
17 |
**Organizations developing the model:**
|
18 |
The Vicuna team with members from UC Berkeley, CMU, Stanford, MBZUAI, and UC San Diego.
|
@@ -28,7 +28,7 @@ https://github.com/lm-sys/FastChat/issues
|
|
28 |
|
29 |
## Intended use
|
30 |
**Primary intended uses:**
|
31 |
-
The primary use of
|
32 |
|
33 |
**Primary intended users:**
|
34 |
The primary intended users of the model are entrepreneurs and researchers in natural language processing, machine learning, and artificial intelligence.
|
|
|
3 |
inference: false
|
4 |
---
|
5 |
|
6 |
+
# FastChat-T5 Model Card
|
7 |
|
8 |
## Model details
|
9 |
|
10 |
**Model type:**
|
11 |
+
FastChat-T5 is an open-source chatbot trained by fine-tuning Flan-t5-xl (3B parameters) on user-shared conversations collected from ShareGPT.
|
12 |
It is based on an encoder-decoder transformer architecture, and can autoregressively generate responses to users' inputs.
|
13 |
|
14 |
**Model date:**
|
15 |
+
FastChat-T5 was trained on April 2023.
|
16 |
|
17 |
**Organizations developing the model:**
|
18 |
The Vicuna team with members from UC Berkeley, CMU, Stanford, MBZUAI, and UC San Diego.
|
|
|
28 |
|
29 |
## Intended use
|
30 |
**Primary intended uses:**
|
31 |
+
The primary use of FastChat-T5 is commercial usage on large language models and chatbots. It can also be used for research purposes.
|
32 |
|
33 |
**Primary intended users:**
|
34 |
The primary intended users of the model are entrepreneurs and researchers in natural language processing, machine learning, and artificial intelligence.
|