aws-samples / foundation-model-benchmarking-tool Public

Releases: aws-samples/foundation-model-benchmarking-tool

SageMaker BYOE metrics fix

18 Dec 03:18

aarora79

Latest

What's Changed

Summarization Prompt Template by @dheerajoruganty in #255
HF dataset loading bug fixes + new config file by @madhurprash in #256
llama3.2 1b g6e.2xl config file for smaller prompts by @madhurprash in #257

Full Changelog: v2.0.22...v2.0.23

Contributors

dheerajoruganty and madhurprash

Amazon Nova models, multi-modal

12 Dec 04:52

aarora79

What's Changed

NOVA models config file + bedrock converseAPI predictor by @madhurprash in #253
Update template.yml by @antara678 in #254

Full Changelog: v2.0.21...v2.0.22

Contributors

antara678 and madhurprash

EC2 pricing through API, misc. config file changes

09 Dec 20:14

aarora79

What's Changed

copy s3 content update + llama3-8b triton g5.48xl config by @madhurprash in #244
HF token bug fix + docs update for copy s3 content command by @madhurprash in #245
EC2 instance utilization metrics by @madhurprash in #248
Fallback pricing for EC2 by @dheerajoruganty in #249
Update EC2 Metrics by @dheerajoruganty in #250
Update Fallback Pricing YAML Path by @dheerajoruganty in #252
chore(ec2): update byoe yml and docs by @tonyksong in #251

New Contributors

@tonyksong made their first contribution in #251

Full Changelog: v2.0.20...v2.0.21

Contributors

tonyksong, dheerajoruganty, and madhurprash

embeddings models on SageMaker

06 Nov 01:51

aarora79

What's Changed

Update config-ec2-llama3-1-70b-inf2-48xl-deploy-ec2-djl.yml by @aarora79 in #229
Update config-ec2-llama3-1-70b-inf2-48xl-deploy-ec2-djl.yml by @aarora79 in #230
fix for triton ep names by @madhurprash in #231
Add Initial support for bge-base-en-v1-5 embedding model and Llama 3.2 11b-Vision-Instruct on FMBench by @dheerajoruganty in #227

Full Changelog: v2.0.16...v2.0.17

Contributors

aarora79, dheerajoruganty, and madhurprash

torch version 2.4

29 Oct 01:16

aarora79

What's Changed

Update pyproject.toml by @antara678 in #226

Full Changelog: v2.0.15...v2.0.16

Contributors

antara678

Ollama support

27 Oct 23:57

aarora79

What's Changed

Add BYO Ollama Support by @dheerajoruganty in #223
Change Llama 3 8b and 70b Model IDs by @dheerajoruganty in #225

Full Changelog: v2.0.14...v2.0.15

Contributors

dheerajoruganty

FMBench orchestrator

25 Oct 18:32

aarora79

What's Changed

Configuration files for llama3.1 70b on large prompt payloads + longbench dataset by @madhurprash in #216
adding support for llama3 summarization prompt by @madhurprash in #217
changing file name for llama3 summarization prompt by @madhurprash in #218
Config files for llama3.1 8b instruct on g6e instances by @madhurprash in #219
All config files for llama3.1 8b on g6e instances using DJL by @madhurprash in #220
make config file naming convention consistent for llama3.1 8b/70b on g6e by @madhurprash in #221
Config files for all llama3.2 models - tested by @madhurprash in #222

Full Changelog: v2.0.13...v2.0.14

Contributors

madhurprash

pricing.yml updates

10 Oct 21:37

aarora79

What's Changed

Update pricing.yml by @aarora79 in #210
Rename config-llama3-8b-g6e.4xl-tp-2-mc-max-djl-ec2.yml to config-lla… by @aarora79 in #212
add mixtral config file for AWQ version - g6e.48xl by @madhurprash in #214
pricing update + retry logic added to bedrock predictor by @madhurprash in #215

Full Changelog: v2.0.11...v2.0.13

Contributors

aarora79 and madhurprash

Llama3 with Triton+DJL on Neuron

04 Oct 02:13

aarora79

Full Changelog: v2.0.10...v2.0.11

Llama3 on g6e

03 Oct 22:13

aarora79

What's Changed

Add support and pricing for g6e instances by @dheerajoruganty in #207
Config file for llama3 8b on inf2 using triton with DJL by @madhurprash in #205
Add config files for g6e instances by @dheerajoruganty in #208
Add concurrency=3 for g6e instance configs by @dheerajoruganty in #209

Full Changelog: v2.0.9...v2.0.10

Contributors

dheerajoruganty and madhurprash
