Add/update the quantized ONNX model files and README.md for Transformers.js v3 (#3)

- Add/update the quantized ONNX model files and README.md for Transformers.js v3 (6e950025f83197b4bfed34e654cb236fc6634398)

Co-authored-by: Yuichiro Tachibana <whitphx@users.noreply.huggingface.co>

Files changed (6)

README.md +4 -5
onnx/model_bnb4.onnx +3 -0
onnx/model_int8.onnx +3 -0
onnx/model_q4.onnx +3 -0
onnx/model_q4f16.onnx +3 -0
onnx/model_uint8.onnx +3 -0

README.md CHANGED

@@ -7,15 +7,15 @@ https://huggingface.co/BAAI/bge-small-en-v1.5 with ONNX weights to be compatible
 ## Usage (Transformers.js)
-If you haven't already, you can install the [Transformers.js](https://huggingface.co/docs/transformers.js) JavaScript library from [NPM](https://www.npmjs.com/package/@xenova/transformers) using:
+If you haven't already, you can install the [Transformers.js](https://huggingface.co/docs/transformers.js) JavaScript library from [NPM](https://www.npmjs.com/package/@huggingface/transformers) using:
 ```bash
-npm i @xenova/transformers
+npm i @huggingface/transformers
 ```
 You can then use the model to compute embeddings, as follows:
 ```js
-import { pipeline } from '@xenova/transformers';
+import { pipeline } from '@huggingface/transformers';
 // Create a feature-extraction pipeline
 const extractor = await pipeline('feature-extraction', 'Xenova/bge-small-en-v1.5');
@@ -40,7 +40,7 @@ console.log(embeddings.tolist()); // Convert embeddings to a JavaScript list
 You can also use the model for retrieval. For example:
 ```js
-import { pipeline, cos_sim } from '@xenova/transformers';
+import { pipeline, cos_sim } from '@huggingface/transformers';
 // Create a feature-extraction pipeline
 const extractor = await pipeline('feature-extraction', 'Xenova/bge-small-en-v1.5');
@@ -76,5 +76,4 @@ console.log(scores);
 // ]
 ```
 Note: Having a separate repo for ONNX weights is intended to be a temporary solution until WebML gains more traction. If you would like to make your models web-ready, we recommend converting to ONNX using [🤗 Optimum](https://huggingface.co/docs/optimum/index) and structuring your repo like this one (with ONNX weights located in a subfolder named `onnx`).

onnx/model_bnb4.onnx ADDED

@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4c02225730f532a92f286bbbff7930a90605b871dfedb160cfd091ddfac4460b
+size 60147542

onnx/model_int8.onnx ADDED

@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:bf64d05457cb391fa88d045faf5927a15ea36d96228ddf23ea970087afdc1197
+size 33760831

onnx/model_q4.onnx ADDED

@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:f8beecd3ea4f11b9819a1ef3ba157a51b0dc81138a236ac66127dde0b5c295b5
+size 61474190

onnx/model_q4f16.onnx ADDED

@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:60c88c17a3b2da945d10c359b3a1bce90b00a5462c6385240cd9be4fd3d93c0e
+size 36190171

onnx/model_uint8.onnx ADDED

@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6ec7329d42bc829e909a02d02b044da5271a70af8245417dc31f7ad07a56799c
+size 33760859
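
The five ONNX files added in this commit are quantized variants of the same model. As a rough illustration of how they might be used, the sketch below records each file with the size from its Git LFS pointer and picks the smallest one. It assumes that Transformers.js v3 selects a variant through the `dtype` pipeline option and resolves, for example, `dtype: 'q4'` to `onnx/model_q4.onnx`. The names `ADDED_ONNX_FILES`, `smallestVariant`, and `createExtractor` are hypothetical and are not part of this commit or of the library.

```js
// File paths and sizes (in bytes) copied from the Git LFS pointers in this commit.
const ADDED_ONNX_FILES = {
  bnb4: { file: 'onnx/model_bnb4.onnx', bytes: 60147542 },
  int8: { file: 'onnx/model_int8.onnx', bytes: 33760831 },
  q4: { file: 'onnx/model_q4.onnx', bytes: 61474190 },
  q4f16: { file: 'onnx/model_q4f16.onnx', bytes: 36190171 },
  uint8: { file: 'onnx/model_uint8.onnx', bytes: 33760859 },
};

// Hypothetical helper: return the key of the variant with the smallest file.
function smallestVariant(files = ADDED_ONNX_FILES) {
  return Object.entries(files)
    .sort(([, a], [, b]) => a.bytes - b.bytes)[0][0];
}

// Hypothetical helper: create the pipeline with a chosen variant.
// Assumption: the `dtype` option maps to the matching file in the `onnx` subfolder.
// The library is imported lazily, so nothing is downloaded until this is called.
async function createExtractor(dtype = smallestVariant()) {
  const { pipeline } = await import('@huggingface/transformers');
  return pipeline('feature-extraction', 'Xenova/bge-small-en-v1.5', { dtype });
}
```

Calling `await createExtractor('q4f16')` would then request the q4f16 weights instead of the default ones. File size is only one consideration; the variants may also differ in accuracy and in which devices run them efficiently, so the choice is worth measuring for a given workload.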