mobicham commited on
Commit
68106f2
·
verified ·
1 Parent(s): 185ba21

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -1
README.md CHANGED
@@ -14,7 +14,12 @@ More specifically, the attention layers are quantized to 4-bit and the experts a
14
  This idea was suggest by Artem Eliseev (@lavawolfiee) and Denis Mazur (@dvmazur) [in this Github discussion](https://github.com/mobiusml/hqq/issues/2).
15
 
16
  ### Basic Usage
17
- To run the model, install the HQQ library from https://github.com/mobiusml/hqq and use it as follows:
 
 
 
 
 
18
  ``` Python
19
  model_id = 'mobiuslabsgmbh/Mixtral-8x7B-v0.1-hf-attn-4bit-moe-2bit-HQQ'
20
  #Load the model
 
14
  This idea was suggest by Artem Eliseev (@lavawolfiee) and Denis Mazur (@dvmazur) [in this Github discussion](https://github.com/mobiusml/hqq/issues/2).
15
 
16
  ### Basic Usage
17
+ To run the model, install the HQQ library:
18
+ ```
19
+ pip install hqq==0.1.8
20
+ pip install transformers==4.46.0
21
+ ```
22
+ and use it as follows:
23
  ``` Python
24
  model_id = 'mobiuslabsgmbh/Mixtral-8x7B-v0.1-hf-attn-4bit-moe-2bit-HQQ'
25
  #Load the model