Request for advice on custom wake word model training and evaluating #227
Unanswered
sangheonEN
asked this question in
Q&A
Replies: 1 comment 17 replies
-
Hello! I’ve been also trying to generate my own synthetic audio files from that repo, still I’m facing some different issues, because I’m not very sure on where to use them. |
Beta Was this translation helpful? Give feedback.
17 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello. I am writing this to explain my current situation and gain an understanding of open wake word learning.
Currently, I am using training_models.ipynb to learn with my dataset.
Dataset used:
Training content: Refer to the attached training_models.ipynb
Evaluation Content: I tested it with the attached hey_thomas_female_test_sample.wav file and the prediction value did not increase during the time when hey thomas was spoken, and the prediction value only stayed at the maximum range of 0.008, so it seems that the learning model was not trained properly. See the graph below.
1) Am I doing the training correctly?
- An error occurs in the mixed_clips, labels, background_clips = next(mixing_generator) code. (Refer to the attached training_models.ipynb)
- An error occurs in the openwakeword.data.trim_mmap(output_file) code. (Refer to the attached training_models.ipynb)
- When I set batch_size = 8 and perform training, the following problem occurs.
"---------------------------------------------------------------------------
AttributeError Traceback (most recent call last)
Cell In[40], line 33
31 tp = sum(predictions.flatten()[y.flatten() == 1] >= 0.5)
32 fn = sum(predictions.flatten()[y.flatten() == 1] < 0.5)
---> 33 history['recall'].append(float(tp/(tp+fn).detach().numpy()))
AttributeError: 'int' object has no attribute 'detach'"
2) I would like to get some advice on how to supplement the lack of positive samples.
Currently, I have about 2 million wav files for negative samples. However, since there are only 50 positive samples, I expect a class imbalance problem to occur. So, the information I found out on my own is as follows. However, I could not find the code for data augmentation.
"https://github.com/dscripka/openWakeWord/blob/main/docs/synthetic_data_generation.md" Here, you mentioned the synthetic data augmentation technique, but I can understand the content, but do I have to develop the code myself? I wonder if there is a code that I can refer to. I don't think I can develop the code myself. I would appreciate it if you could tell me the development method for data augmentation for positive samples.
openwakeword_advice_data.zip
Beta Was this translation helpful? Give feedback.
All reactions