Extend audio ds_tool #113

liPatrick · 2024-09-13T21:24:03Z

Adding an extend audio task in ds_tool to create longer audio segments for eval

juberti

LG overall, just a few nits.

ultravox/tools/ds_tool/ds_tool.py

juberti · 2024-09-18T00:41:49Z

ultravox/tools/ds_tool/ds_tool.py

+        sentence = sample[self.asr_column_name]
+        translation = sample[self.translation_column_name]
+
+        if not isinstance(audio, dict) or "array" not in audio:


Might be able to handle this automatically by using ds_split.cast_column to Audio. (Note also that this doesn't exist in the combine operation below)

Hm, actually, i think if array isn't in audio, it'll just throw a key error (without the check), which should be fine in this case. I'm hesitant to stack more map operations than necessary because it takes a lot of time to process large datasets.

juberti · 2024-10-18T01:26:18Z

ultravox/tools/ds_tool/ds_tool.py

+@dataclasses.dataclass
+class AudioExtensionTask:
+    audio_column_name: str = simple_parsing.field(default="audio", alias="-a")
+    text_column_name: str = simple_parsing.field(default="sentence", alias="-A")


Suggested change

text_column_name: str = simple_parsing.field(default="sentence", alias="-A")

text_column_name: str = simple_parsing.field(default="sentence", alias="-t")

First

a4a2854

liPatrick marked this pull request as draft September 13, 2024 21:24

liPatrick added 10 commits September 13, 2024 14:34

Repeat sentence and translation

936df16

update map batch combine

546e584

Fix combine map

7c690da

Small fixes to map_sample_repeat

050f54a

Update id column name

23806fb

Upload split nameing

5cfb67f

adding some debugging logs

633b3ca

Make sure splits have the same columns in audioextensiontask

9cdc08b

Clean up the sample repeat code

600fe70

Remove unnecessary columns

e13d439

liPatrick marked this pull request as ready for review September 13, 2024 23:20

juberti reviewed Sep 18, 2024

View reviewed changes

Address comments

e05b52e

juberti reviewed Oct 18, 2024

View reviewed changes

juberti approved these changes Oct 18, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extend audio ds_tool #113

Extend audio ds_tool #113

liPatrick commented Sep 13, 2024 •

edited

Loading

juberti left a comment

juberti Sep 18, 2024

liPatrick Sep 19, 2024 •

edited

Loading

juberti Oct 18, 2024

	text_column_name: str = simple_parsing.field(default="sentence", alias="-A")
	text_column_name: str = simple_parsing.field(default="sentence", alias="-t")

Extend audio ds_tool #113

Are you sure you want to change the base?

Extend audio ds_tool #113

Conversation

liPatrick commented Sep 13, 2024 • edited Loading

juberti left a comment

Choose a reason for hiding this comment

juberti Sep 18, 2024

Choose a reason for hiding this comment

liPatrick Sep 19, 2024 • edited Loading

Choose a reason for hiding this comment

juberti Oct 18, 2024

Choose a reason for hiding this comment

liPatrick commented Sep 13, 2024 •

edited

Loading

liPatrick Sep 19, 2024 •

edited

Loading