Skip to content

Releases: aws-samples/foundation-model-benchmarking-tool

SageMaker BYOE metrics fix

18 Dec 03:18
Compare
Choose a tag to compare

What's Changed

Full Changelog: v2.0.22...v2.0.23

Amazon Nova models, multi-modal

12 Dec 04:52
Compare
Choose a tag to compare

What's Changed

Full Changelog: v2.0.21...v2.0.22

EC2 pricing through API, misc. config file changes

09 Dec 20:14
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v2.0.20...v2.0.21

embeddings models on SageMaker

06 Nov 01:51
Compare
Choose a tag to compare

What's Changed

  • Update config-ec2-llama3-1-70b-inf2-48xl-deploy-ec2-djl.yml by @aarora79 in #229
  • Update config-ec2-llama3-1-70b-inf2-48xl-deploy-ec2-djl.yml by @aarora79 in #230
  • fix for triton ep names by @madhurprash in #231
  • Add Initial support for bge-base-en-v1-5 embedding model and Llama 3.2 11b-Vision-Instruct on FMBench by @dheerajoruganty in #227

Full Changelog: v2.0.16...v2.0.17

torch version 2.4

29 Oct 01:16
Compare
Choose a tag to compare

What's Changed

Full Changelog: v2.0.15...v2.0.16

Ollama support

27 Oct 23:57
Compare
Choose a tag to compare

What's Changed

Full Changelog: v2.0.14...v2.0.15

FMBench orchestrator

25 Oct 18:32
Compare
Choose a tag to compare

What's Changed

  • Configuration files for llama3.1 70b on large prompt payloads + longbench dataset by @madhurprash in #216
  • adding support for llama3 summarization prompt by @madhurprash in #217
  • changing file name for llama3 summarization prompt by @madhurprash in #218
  • Config files for llama3.1 8b instruct on g6e instances by @madhurprash in #219
  • All config files for llama3.1 8b on g6e instances using DJL by @madhurprash in #220
  • make config file naming convention consistent for llama3.1 8b/70b on g6e by @madhurprash in #221
  • Config files for all llama3.2 models - tested by @madhurprash in #222

Full Changelog: v2.0.13...v2.0.14

pricing.yml updates

10 Oct 21:37
Compare
Choose a tag to compare

What's Changed

  • Update pricing.yml by @aarora79 in #210
  • Rename config-llama3-8b-g6e.4xl-tp-2-mc-max-djl-ec2.yml to config-lla… by @aarora79 in #212
  • add mixtral config file for AWQ version - g6e.48xl by @madhurprash in #214
  • pricing update + retry logic added to bedrock predictor by @madhurprash in #215

Full Changelog: v2.0.11...v2.0.13

Llama3 with Triton+DJL on Neuron

04 Oct 02:13
Compare
Choose a tag to compare

Llama3 on g6e

03 Oct 22:13
Compare
Choose a tag to compare

What's Changed

Full Changelog: v2.0.9...v2.0.10