GITBOOK-149: Ambika's Aug 15 changes
Ambika Joshi authored and gitbook-bot committed Aug 15, 2024
1 parent 32d53b5 commit 026c154
Showing 4 changed files with 59 additions and 0 deletions.
Binary file added .gitbook/assets/Auto-play (1).png
Binary file added .gitbook/assets/Auto-play.png
1 change: 1 addition & 0 deletions SUMMARY.md
@@ -1,6 +1,7 @@
# Table of contents

* [🗃️ Gooey.AI Docs](README.md)
* [Changelog](changelog.md)

## 📖 Guides

58 changes: 58 additions & 0 deletions changelog.md
@@ -0,0 +1,58 @@
---
icon: clock-rotate-left
---

# Changelog

All notable changes to this project are documented in this file. It tracks changes to the Gooey.AI repository, [gooey-server](https://github.com/gooeyAI/gooey-server).

The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.1.0/). 

### Unreleased

* Support for using Seamless M4T v2 for translation and TTS will be made available in the UI soon!

### 14-Aug-2024

#### Added

* **Speech Recognition and Translation**: We have upgraded Seamless M4T to v2. This provides improved ASR for nearly 100 languages. Try it here: [https://gooey.ai/speech/seamless-m4t-v2-hindi-english-g8883e8675gk/](https://gooey.ai/speech/seamless-m4t-v2-hindi-english-g8883e8675gk/)
* **Copilot:** Enabled "Auto-play responses" in **Copilot** Web Widget Integrations. You can now control whether audio/video responses are auto-played. This feature is enabled by default.


<figure><img src=".gitbook/assets/Auto-play (1).png" alt=""><figcaption></figcaption></figure>

### 8-Aug-2024

#### Added

* **Lipsync**: We have deployed several improvements to the SadTalker lipsync model:
  * more understandable error messages,
  * support for a wider range of image/video resolutions,
  * long videos no longer cause crashes,
  * better support for video inputs overall.

Here is an example of the improved performance on video input:

_OLD OUTPUT_

{% embed url="https://storage.googleapis.com/dara-c1b52.appspot.com/daras_ai/media/eef688c8-1dfb-11ef-af8a-02420a000128/gooey.ai%20lipsync.mp4" %}

_NEW OUTPUT_

{% embed url="https://storage.googleapis.com/dara-c1b52.appspot.com/daras_ai/media/1db03ed8-4ecc-11ef-b5a4-02420a000192/gooey.ai%20lipsync.mp4" %}

#### Fixed

* **Lipsync**: Fixed bugs affecting short input audio and reference inputs.

### 24-Jul-2024

#### Fixed

* Fixed a regression in our ElevenLabs custom voices support. You should now be able to use your custom ElevenLabs voices with your API key on [https://gooey.ai/compare-text-to-speech-engines](https://gooey.ai/compare-text-to-speech-engines), [https://gooey.ai/copilot/](https://gooey.ai/copilot/) and [https://gooey.ai/lipsync-maker/](https://gooey.ai/lipsync-maker/).
