Skip to content

v2.5.0a4

Compare
Choose a tag to compare
@github-actions github-actions released this 21 Feb 00:12
Implement custom sharding for flash_mha, to allow efficient multi-gpu…

… computation when sharded across batch or head dimensions.