knowledgator
/

gliner-bi-llama-v1.0

Token Classification

information extraction

entity recognition

Model card Files Files and versions Community

Ihor commited on Sep 1, 2024

Commit

7c6d3b5

·

verified ·

1 Parent(s): fc24d1e

Update README.md

Files changed (1) hide show

README.md +6 -2

README.md CHANGED Viewed

@@ -72,11 +72,15 @@ European Championship => competitions
 If you want to use flash attention or increase sequence length, please, check the following code:
 ```python
-model = GLiNER.from_pretrained("knowledgator/gliner-bi-llama-v1.0",
                                 _attn_implementation = 'flash_attention_2',
-                                                max_len = 2048).to('cuda:0')
 ```
 If you have a large amount of entities and want to pre-embed them, please, refer to the following code snippet:
 ```python

 If you want to use flash attention or increase sequence length, please, check the following code:
 ```python
+from gliner import GLiNER
+import torch
+model = GLiNER.from_pretrained("knowledgator/gliner-qwen-1.5B-v1.0",
                                 _attn_implementation = 'flash_attention_2',
+                                                max_len = 2048).to('cuda:0', dtype=torch.float16)
 ```
 If you have a large amount of entities and want to pre-embed them, please, refer to the following code snippet:
 ```python