Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Document use of
retry_on_error
for dedicated inference endpoints (#554
) Shortly after #549, the inference endpoint backend was updated to block by default on model loading. This PR adds documentation explaining how to circumvent that blocking so that the user, if desired, can handle the 500 errors themselves.
- Loading branch information