Commit History
sync xformers patch to follow shared format and be diffable
985dcbc
split sdp attn into its own patch
5d0b27e
fix check for flash attn branching (#377)
343ac84
Attention mask and position id fixes for packing (#285)
2bb0b78
Update XFormers Attention Monkeypatch to handle Llama-2 70B (GQA) (#339)
10405b9
ssmi153
committed on