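1. For Python, whenever you run into an unfamiliar function, these two sites are the places to look:
Standard library docs: https://docs.python.org/zh-cn/3.6/library/
Third-party library docs (PyPI search, here for pyaudio): https://pypi.org/search/?q=pyaudio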
2. Newly found tutorials worth keeping:
Morvan Zhou (莫烦Python): https://space.bilibili.com/243821484/video?tid=0&keyword=&order=pubdate
Hung-yi Lee (李宏毅): https://www.youtube.com/channel/UC2ggjtuuWvxrHHHiaDH1dlQ
ResNet (residual network) video: https://www.youtube.com/watch?v=Bu9A_-M5OZk
ResNet written tutorial: https://blog.csdn.net/sunny_yeah_/article/details/89430124
3. VGG:
Code: https://github.com/WeidiXie/VGG-Speaker-Recognition
Paper: https://arxiv.org/pdf/1902.10107.pdf
Another explanatory article: http://www.robots.ox.ac.uk/~vgg/research/speakerID/
Training data: https://drive.google.com/drive/folders/1eqYaMIAOa4e8gm8BUwnlPh1AeKds--Hl
4. UIS-RNN:
Code: https://github.com/google/uis-rnn/blob/master/tests/utils_test.py
Video: https://www.youtube.com/watch?v=pGkqwRPzx9U
5. The demo currently in use: https://github.com/taylorlu/Speaker-Diarization
6. Three more usable demos:
Speaker feature extraction: https://github.com/dairuining/speaker-feature-extractor
Speaker recognition (built on VGG-Speaker-Recognition; the project is mainly for voiceprint recognition): https://github.com/yeyupiaoling/Kersa-Speaker-Recognition
VGGVox + VoxCeleb (speaker identification and verification): https://github.com/a-nagrani/VGGVox
VoxCeleb paper: https://arxiv.org/pdf/1806.05622.pdf
7. Notes on voiceprint recognition with i-vector and PLDA (quite well written): https://www.pianshen.com/article/70421446728/
8. Someone else's UIS-RNN demo; it reportedly contains errors, so it may be worth debugging: https://www.pianshen.com/article/70421446728/