Releases: aws-samples/foundation-model-benchmarking-tool
Releases · aws-samples/foundation-model-benchmarking-tool
SageMaker BYOE metrics fix
What's Changed
- Summarization Prompt Template by @dheerajoruganty in #255
- HF dataset loading bug fixes + new config file by @madhurprash in #256
- llama3.2 1b g6e.2xl config file for smaller prompts by @madhurprash in #257
Full Changelog: v2.0.22...v2.0.23
Amazon Nova models, multi-modal
What's Changed
- NOVA models config file + bedrock converseAPI predictor by @madhurprash in #253
- Update template.yml by @antara678 in #254
Full Changelog: v2.0.21...v2.0.22
EC2 pricing through API, misc. config file changes
What's Changed
- copy s3 content update + llama3-8b triton g5.48xl config by @madhurprash in #244
- HF token bug fix + docs update for copy s3 content command by @madhurprash in #245
- EC2 instance utilization metrics by @madhurprash in #248
- Fallback pricing for EC2 by @dheerajoruganty in #249
- Update EC2 Metrics by @dheerajoruganty in #250
- Update Fallback Pricing YAML Path by @dheerajoruganty in #252
- chore(ec2): update byoe yml and docs by @tonyksong in #251
New Contributors
- @tonyksong made their first contribution in #251
Full Changelog: v2.0.20...v2.0.21
embeddings models on SageMaker
What's Changed
- Update config-ec2-llama3-1-70b-inf2-48xl-deploy-ec2-djl.yml by @aarora79 in #229
- Update config-ec2-llama3-1-70b-inf2-48xl-deploy-ec2-djl.yml by @aarora79 in #230
- fix for triton ep names by @madhurprash in #231
- Add Initial support for
bge-base-en-v1-5
embedding model and Llama 3.2 11b-Vision-Instruct on FMBench by @dheerajoruganty in #227
Full Changelog: v2.0.16...v2.0.17
torch version 2.4
Ollama support
What's Changed
- Add BYO Ollama Support by @dheerajoruganty in #223
- Change Llama 3 8b and 70b Model IDs by @dheerajoruganty in #225
Full Changelog: v2.0.14...v2.0.15
FMBench orchestrator
What's Changed
- Configuration files for llama3.1 70b on large prompt payloads + longbench dataset by @madhurprash in #216
- adding support for llama3 summarization prompt by @madhurprash in #217
- changing file name for llama3 summarization prompt by @madhurprash in #218
- Config files for llama3.1 8b instruct on g6e instances by @madhurprash in #219
- All config files for llama3.1 8b on g6e instances using DJL by @madhurprash in #220
- make config file naming convention consistent for llama3.1 8b/70b on g6e by @madhurprash in #221
- Config files for all llama3.2 models - tested by @madhurprash in #222
Full Changelog: v2.0.13...v2.0.14
pricing.yml updates
What's Changed
- Update pricing.yml by @aarora79 in #210
- Rename config-llama3-8b-g6e.4xl-tp-2-mc-max-djl-ec2.yml to config-lla… by @aarora79 in #212
- add mixtral config file for AWQ version - g6e.48xl by @madhurprash in #214
- pricing update + retry logic added to bedrock predictor by @madhurprash in #215
Full Changelog: v2.0.11...v2.0.13
Llama3 with Triton+DJL on Neuron
Full Changelog: v2.0.10...v2.0.11
Llama3 on g6e
What's Changed
- Add support and pricing for g6e instances by @dheerajoruganty in #207
- Config file for llama3 8b on inf2 using triton with DJL by @madhurprash in #205
- Add config files for g6e instances by @dheerajoruganty in #208
- Add concurrency=3 for g6e instance configs by @dheerajoruganty in #209
Full Changelog: v2.0.9...v2.0.10