1、对于python,以后遇到不熟悉的函数,只需要访问如下两个网址:

标准库文档:https://docs.python.org/zh-cn/3.6/library/

第三方库文档:https://pypi.org/search/?q=pyaudio

2、发现新的宝藏教程:

周莫烦:https://space.bilibili.com/243821484/video?tid=0&keyword=&order=pubdate

李宏毅:https://www.youtube.com/channel/UC2ggjtuuWvxrHHHiaDH1dlQ

resnet残差网络:https://www.youtube.com/watch?v=Bu9A_-M5OZk

残差网络文字教程:https://blog.csdn.net/sunny_yeah_/article/details/89430124

3、VGG:

代码:https://github.com/WeidiXie/VGG-Speaker-Recognition

论文:https://arxiv.org/pdf/1902.10107.pdf

另一篇说明文章:http://www.robots.ox.ac.uk/~vgg/research/speakerID/

训练数据:https://drive.google.com/drive/folders/1eqYaMIAOa4e8gm8BUwnlPh1AeKds--Hl

4、uisrnn:

代码:https://github.com/google/uis-rnn/blob/master/tests/utils_test.py

视频:https://www.youtube.com/watch?v=pGkqwRPzx9U

5、目前使用的demo:https://github.com/taylorlu/Speaker-Diarization

6、又发现了其他三个可用的demo:

说话人特征提取:https://github.com/dairuining/speaker-feature-extractor

说话人识别:基于VGG-Speaker-Recognition开发的,本项目主要是用于声纹识别:https://github.com/yeyupiaoling/Kersa-Speaker-Recognition

Vggvoxt+voxceleb:speaker identification and verification:https://github.com/a-nagrani/VGGVox

voxceleb的论文:https://arxiv.org/pdf/1806.05622.pdf

7、声纹识别笔记ivector和PLDA:写的还蛮好的:https://www.pianshen.com/article/70421446728/

8、别人写的uisrnn的demo,据说有错误,可以debug一下:https://www.pianshen.com/article/70421446728/

 

posted on 2020-10-15 18:05  蔡狗八  阅读(140)  评论(0)    收藏  举报