Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

a error threw when Extract timeline for part #58

Open
yhown589 opened this issue Jun 10, 2024 · 2 comments
Open

a error threw when Extract timeline for part #58

yhown589 opened this issue Jun 10, 2024 · 2 comments
Labels
bug Something isn't working

Comments

@yhown589
Copy link

atfer aligning audio, proceed to the step of Extract timeline for part, an error was throw:
PixPin_2024-06-11_05-25-58
I enter the method that threw the error and found:
PixPin_2024-06-11_05-28-23
I guess that it maybe caused by recognizing the definition of word

text: That allows the firm to start mass production. The EH216.S is the first flying car to receive such regulatory approval anywhere in the world. Ehang has competition in China, an EV toll from Autoflight, a Shanghai based firm, obtained a type certificate from the CAAC in March, signifying approval of its design.

@rotemdan
Copy link
Member

rotemdan commented Jul 4, 2024

Yes, this issue has been known to me for several months and included in my tasklist (the first issue).

It occurs when the recognizer produces a different word segmentation than the one that's use to convert a word timeline to a sentence / segment timeline (which typically uses the cldr-segmentation by default, jieba-wasm for Chinese, kuromoji for Japanese).

A lot of the time it occurs is when a dot character in the middle of the word is parsed as a sentence separator in one approach, and as a native part of the word in another. For example EH216.S is parsed as a single word by the recognizer but as two words, where the . is a sentence separator, in another.

I'll need to find a plan on how to prevent this from occurring. I'm not sure exactly how at the moment.

@rotemdan rotemdan added the bug Something isn't working label Jul 4, 2024
@rotemdan
Copy link
Member

rotemdan commented Oct 4, 2024

I added a potential fix / workaround for this issue on the new 1.6.0 release.

See this comment on the other thread for more information.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants