icon |
---|
clock-rotate-left |
All notable changes to this project will be documented in this file. It keeps track of changes to the GooeyAI repository - gooey-server and other changes to the Documentation and the Gooey.AI website.
The format is based on Keep a Changelog.
- Support for using Seamless M4T v2 for translation and TTS will soon be available in the UI!
Fixed
- Render correct run cost estimate when something on the page changes #475
Added
- Ability to select LLM on /eval #472
- UX improvements to "save as new" / "save" for anon & logged in users (#453, #481)
- New "share" dialog for published runs #453
- Beta release of Python SDK - https://github.com/gooeyai/python-sdk and https://pypi.org/project/gooeyai/
- Improve OpenAPI spec to use the standard security scheme definition for Bearer
- Bug fix for SERP search location when set to st
- Updated web widget chat with a sidebar to contain conversation history
- All error codes are available in the documentation
- Updated GPT-4o to
gpt-4o-2024-08-06
with support for 16,384 output tokens - Added support for Gemini 1.5 Flash and ChatGPT-4o models
- Support for JSON mode on Gemini 1.5 Pro, Gemini 1.5 Flash, and Claude
- Try here: LLM JSON Output
- Allow auto-recharge for users without a paid subscription
- Allow saving payment method after a top-up for easier payments in the future
- Updated meta descriptions for: https://gooey.ai/llm and https://gooey.ai/copilot 
- Rate limits page
- LLM: Updated Sarvam, GPT-4o Mini, SEA-LION-v2. Head over to our Compare LLM Generator to see it in action, SEA-LION v2 and Sarvam available here!
- Functions: Revamped functions editor with an in-built linter.
- AI Standards: Our proposal to the Library of Congress and Rockefeller Foundation on how shared AI workflows can catalyze innovation everywhere.
- Impact: Understand how you can use GenAI for your Impact Organization.
-
Speech Recognition and Translation: We have upgraded Seamless M4T to v2. This provides improved ASR for nearly 100 languages. Try it here: https://gooey.ai/speech/seamless-m4t-v2-hindi-english-g8883e8675gk/
-
Copilot: Enabled "Auto-play responses" in Copilot Web Widget Integrations. You can control whether audio/video responses should be auto-played or not. This feature is enabled by default.\
-
Lipsync: We have deployed some improvements around the SadTalker lipsync model
- more understandable error messages,
- support for a wider range of image/video resolutions,
- the ability to not crash on long videos,
- better support for video inputs overall.
Here is an example of the improved performance on video input:
OLD OUTPUT
{% embed url="https://storage.googleapis.com/dara-c1b52.appspot.com/daras_ai/media/eef688c8-1dfb-11ef-af8a-02420a000128/gooey.ai%20lipsync.mp4" %}
NEW OUTPUT
{% embed url="https://storage.googleapis.com/dara-c1b52.appspot.com/daras_ai/media/1db03ed8-4ecc-11ef-b5a4-02420a000192/gooey.ai%20lipsync.mp4" %}
- Lipsync: bug fixes for short input audio and reference inputs
- Fixed the regression with our eleven labs custom voices support. You should now be able to use your custom 11labs voices using your API key on https://gooey.ai/compare-text-to-speech-engines , https://gooey.ai/copilot/ and https://gooey.ai/lipsync-maker/