Jarrodbarnes committed on
Commit a230fcd · verified · 1 Parent(s): 48cf09e

Update README.md

Files changed (1)
  1. README.md +20 -0
README.md CHANGED
@@ -68,6 +68,26 @@ The ATLAS framework, using this teacher model, produces the following improvemen
 
  `ATLAS-8B-Thinking` is not a standard instruction-tuned model for direct chat. It is a core component of the ATLAS training framework, designed to interact with a "student" model in a two-pass process.
 
+ ### Loading the Model
+
+ **Important:** This model requires `trust_remote_code=True` due to custom Qwen3 architecture components.
+
+ ```python
76
+ from transformers import AutoModelForCausalLM, AutoTokenizer
77
+
78
+ # Load the teacher model
79
+ teacher_model = AutoModelForCausalLM.from_pretrained(
80
+ "Arc-Intelligence/ATLAS-8B-Thinking",
81
+ trust_remote_code=True, # Required for custom architecture
82
+ torch_dtype=torch.bfloat16 # Recommended for efficiency
83
+ )
84
+
85
+ teacher_tokenizer = AutoTokenizer.from_pretrained(
86
+ "Arc-Intelligence/ATLAS-8B-Thinking",
87
+ trust_remote_code=True
88
+ )
89
+ ```
+
  ### Conceptual Usage
 
  The following is a simplified, conceptual example of the ATLAS interaction loop. The full implementation is available in the official repository.
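
Note on the added loading snippet: it hard-codes `torch.bfloat16`, which not every device supports. The sketch below is one possible way to pick a dtype before loading, not part of this commit or of the ATLAS repository. The helper names `pick_dtype_name` and `load_teacher` are hypothetical, and the fallback choices (`float16` on GPUs without bf16 support, `float32` on CPU) are assumptions rather than recommendations from the model card.

```python
def pick_dtype_name(has_cuda: bool, bf16_supported: bool) -> str:
    """Return the name of a torch dtype suited to the available hardware."""
    if has_cuda and bf16_supported:
        return "bfloat16"  # the dtype the README recommends
    if has_cuda:
        return "float16"   # assumption: half-precision fallback on older GPUs
    return "float32"       # assumption: full precision when running on CPU


def load_teacher(model_id: str = "Arc-Intelligence/ATLAS-8B-Thinking"):
    """Load the teacher model and tokenizer (this downloads the weights)."""
    # Imported lazily so pick_dtype_name works without these packages installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    has_cuda = torch.cuda.is_available()
    bf16_ok = has_cuda and torch.cuda.is_bf16_supported()
    dtype = getattr(torch, pick_dtype_name(has_cuda, bf16_ok))

    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        trust_remote_code=True,  # required per the README
        torch_dtype=dtype,
    )
    tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
    return model, tokenizer
```

Calling `load_teacher()` returns a `(model, tokenizer)` pair equivalent to `teacher_model` and `teacher_tokenizer` in the snippet above, with the dtype chosen at runtime instead of fixed.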