Arc-Intelligence
/

ATLAS-8B-Thinking

Text Generation

reinforcement-learning

teacher-student

adaptive-learning

text-generation-inference

Model card Files Files and versions

Jarrodbarnes commited on 29 days ago

Commit

a230fcd

·

verified ·

1 Parent(s): 48cf09e

Update README.md

Files changed (1) hide show

README.md +20 -0

README.md CHANGED Viewed

@@ -68,6 +68,26 @@ The ATLAS framework, using this teacher model, produces the following improvemen
 `ATLAS-8B-Thinking` is not a standard instruction-tuned model for direct chat. It is a core component of the ATLAS training framework, designed to interact with a "student" model in a two-pass process.
 ### Conceptual Usage
 The following is a simplified, conceptual example of the ATLAS interaction loop. The full implementation is available in the official repository.

 `ATLAS-8B-Thinking` is not a standard instruction-tuned model for direct chat. It is a core component of the ATLAS training framework, designed to interact with a "student" model in a two-pass process.
+### Loading the Model
+**Important:** This model requires `trust_remote_code=True` due to custom Qwen3 architecture components.
+```python
+  from transformers import AutoModelForCausalLM, AutoTokenizer
+  # Load the teacher model
+  teacher_model = AutoModelForCausalLM.from_pretrained(
+      "Arc-Intelligence/ATLAS-8B-Thinking",
+      trust_remote_code=True,  # Required for custom architecture
+      torch_dtype=torch.bfloat16  # Recommended for efficiency
+  )
+  teacher_tokenizer = AutoTokenizer.from_pretrained(
+      "Arc-Intelligence/ATLAS-8B-Thinking",
+      trust_remote_code=True
+  )
+```
 ### Conceptual Usage
 The following is a simplified, conceptual example of the ATLAS interaction loop. The full implementation is available in the official repository.