WIP: Vcc2018 #611

oplatek · 2022-03-09T21:32:03Z

No description provided.

SupervisionSet contains 4 supervisions that start at 0 for recording XY for all recordings because each recording has four MOS supervisions. Plus there are 291 recordings which do not have suppervisions - TODO investigate

desh2608

Sorry, we missed reviewing this earlier. I have made some comments.

desh2608 · 2022-09-28T13:13:12Z

lhotse/bin/modes/recipes/__init__.py

@@ -56,3 +56,4 @@
 from .voxceleb import *
 from .wenet_speech import *
 from .yesno import *
+from .vcc2018 import *


We try to keep the imports sorted in lexicographical order. Could you put this import above vctk?

desh2608 · 2022-09-28T13:17:02Z

lhotse/recipes/vcc2018.py

+    mos_scores = mos_dir / "vcc2018_evaluation_mos.txt"
+    assert mos_scores.is_file()
+    sim_scores = mos_dir / "vcc2018_evaluation_sim.txt"
+    sim_scores.is_file()


should this be an assert?

desh2608 · 2022-09-28T13:17:57Z

lhotse/recipes/vcc2018.py

+        f"Collecting reference target recordings for the VCC2018 challange from {reference_speech_dir}"
+    )
+
+    # TODO


Can be removed?

desh2608 · 2022-09-28T13:21:35Z

lhotse/recipes/vcc2018.py

+    return {"recordings": recordings, "supervisions": supervisions}
+
+
+def prepare_mos_supervisions(


If this function is only intended to be used inside this module, it is recommended to add a single underscore, i.e., _prepare_mos_supervisions(). Note that this does not enforce privacy but only indicates that this method should not be called directly.

desh2608 · 2022-09-28T13:21:55Z

lhotse/recipes/vcc2018.py

+    return SupervisionSet.from_segments(supervisions)
+
+
+def load_vcc_results(path: Pathlike):


Same _load_vcc_results()

desh2608 · 2022-09-28T13:23:16Z

lhotse/recipes/vcc2018.py

+    """
+    Returns pandas.DataFrame
+    """
+    #     """


Use either multi-line comment (""" """) or single-line (#)

desh2608 · 2022-09-28T13:25:02Z

lhotse/recipes/vcc2018.py

+    recording_ids = set(mos["left_audio"].tolist())
+    supervisions = []
+    for recording_id_wav in tqdm(recording_ids, desc="Supervision creation"):
+        recording_id = recording_id_wav.rstrip(".wav")


If you make the recording_ids above as a set of Path types, then you could use .stem here instead.

I know… but it is less readible with Path(recording_id_wav).stem since I literally need to use both recording_id wit hand without “.wav” for the original data and Lhotse use.

recording_id_wav.name would give you the file name with the extension.

desh2608 · 2022-09-28T13:27:43Z

lhotse/recipes/vcc2018.py

+def prepare_mos_supervisions(
+    mos_results_path, recordings: RecordingSet, id2trn: Dict[str, str]
+) -> SupervisionSet:
+    # TODO very slow -> make it faster it takes ~8min 170it/s


This may be because you are manipulating a Pandas dataframe. Pandas does a lot of book-keeping under the hood which makes it good for complicated operations, but here I think it may be better to just use something like a list of namedtuples, and then use groupby to group them.

oplatek · 2022-09-28T13:32:06Z

@desh2608 I forgot about this WIP PR. I will address your comments next week.

oplatek and others added 9 commits January 3, 2022 13:03

vc2018 preparation recipe works, but emits warnings

0837106

SupervisionSet contains 4 supervisions that start at 0 for recording XY for all recordings because each recording has four MOS supervisions. Plus there are 291 recordings which do not have suppervisions - TODO investigate

multiple MOS per SupervisionSegment

956f1f7

remove deprecated property

69f47a9

simplified script, better names of downloaded files

aae86e0

add reference_path: untested

302d45c

document strong assumptions

7d3e48a

Merge remote-tracking branch 'upstream/master' into vcc2018

9293501

Merge branch 'master' into vcc2018

967f280

Merge branch 'master' into vcc2018

e37d5e7

desh2608 requested changes Sep 28, 2022

View reviewed changes

Merge branch 'master' into vcc2018

a324adc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: Vcc2018 #611

WIP: Vcc2018 #611

oplatek commented Mar 9, 2022

desh2608 left a comment

desh2608 Sep 28, 2022

desh2608 Sep 28, 2022

desh2608 Sep 28, 2022

desh2608 Sep 28, 2022

desh2608 Sep 28, 2022

desh2608 Sep 28, 2022

desh2608 Sep 28, 2022

oplatek Oct 4, 2023

desh2608 Oct 4, 2023

desh2608 Sep 28, 2022

oplatek commented Sep 28, 2022

		return {"recordings": recordings, "supervisions": supervisions}


		def prepare_mos_supervisions(

		return SupervisionSet.from_segments(supervisions)


		def load_vcc_results(path: Pathlike):

WIP: Vcc2018 #611

Are you sure you want to change the base?

WIP: Vcc2018 #611

Conversation

oplatek commented Mar 9, 2022

desh2608 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

oplatek commented Sep 28, 2022