parallel_cross_entropy loss support for tensor parallelism (#246)
Support for training the Mistral architecture with tensor parallelism (#303)
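For readers unfamiliar with the idea behind a cross-entropy loss under tensor parallelism, here is a minimal pure-Python sketch. It is not the optimum-neuron implementation: it assumes the logits are split into equal contiguous vocabulary shards, simulates the ranks with a list, and replaces the collective operations (all-reduce max and sum) with plain Python reductions. The names `shard_logits`, `parallel_cross_entropy_sketch`, and `reference_cross_entropy` are hypothetical and exist only for this illustration.

```python
import math


def shard_logits(logits, num_shards):
    """Split one row of logits into equal, contiguous vocabulary shards (assumed layout)."""
    assert len(logits) % num_shards == 0
    size = len(logits) // num_shards
    return [logits[i * size:(i + 1) * size] for i in range(num_shards)]


def parallel_cross_entropy_sketch(shards, target):
    """Cross-entropy for one token, computed from vocabulary shards.

    Each entry of `shards` stands in for the logits held by one rank.
    """
    size = len(shards[0])

    # Stand-in for all-reduce(max): every rank needs the global max for stability.
    global_max = max(max(shard) for shard in shards)

    # Stand-in for all-reduce(sum): each rank sums exp() over its own shard only.
    sum_exp = sum(
        sum(math.exp(x - global_max) for x in shard) for shard in shards
    )

    # Only the rank that owns the target index contributes its logit;
    # the other ranks contribute 0 to the (simulated) all-reduce(sum).
    target_logit = 0.0
    for rank, shard in enumerate(shards):
        start = rank * size
        if start <= target < start + size:
            target_logit += shard[target - start]

    return math.log(sum_exp) - (target_logit - global_max)


def reference_cross_entropy(logits, target):
    """Unsharded cross-entropy, used only to check the sketch."""
    m = max(logits)
    log_sum_exp = math.log(sum(math.exp(x - m) for x in logits))
    return log_sum_exp - (logits[target] - m)


logits = [2.0, -1.0, 0.5, 3.0, 0.0, 1.5, -2.0, 0.25]
target = 5
loss_parallel = parallel_cross_entropy_sketch(shard_logits(logits, 4), target)
loss_reference = reference_cross_entropy(logits, target)
```

In this sketch no rank ever needs the full vocabulary row: only three scalars per token (the max, the sum of exponentials, and the target logit) would have to cross devices.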
AWS SDK
Fix: make neuron_parallel_compile compatible with the cache system (#352)
Full support for neuron_parallel_compile with the cache system: compilation files produced by neuron_parallel_compile will be pushed to the remote cache repo on the Hugging Face Hub at the beginning of the next training job (#354)
Documentation
Guide explaining how distributed training works in optimum-neuron (#339)
Inference
Data parallelism option for Stable Diffusion - LCM allowing multi-device inference (#346)
Support decoding sequences of byte tokens in TGI (#350)