使用TensorRT加速yolo3

一、TensorRT支持的模型：

TensorRT 直接支持的model有ONNX、Caffe、TensorFlow，其他常见model建议先转化成ONNX。总结如下：

1 ONNX(.onnx)

2 Keras(.h5) --> ONNX(.onnx) (https://github.com/onnx/keras-onnx)

3 Caffe(.caffemodel)

4 Darknet(.cfg) --> ONNX(.onnx) (Our tutorial : yolo-v3)

5 TensorFlow(.uff)

二、TensorRT支持的常见运算：

Activation(激活函数)、Convolution(卷积运算)、Deconvolution(反卷积运算)、FullConnected(全连接)、Padding(填充)、Pooling(池化)、RNN(递归神经网络)、SoftMax()等。

更详细的API可参考：

https://docs.nvidia.com/deeplearning/sdk/tensorrt-api/c_api/classnvinfer1_1_1_i_network_definition.html

三、TensorRT加速yolo3：

yolo3由CNN网络和detection模块组成，TensorRT只对CNN网络进行Inference加速。即：

TensorRT input is：608*608 image

TensorRT output is：array

　　(array[0].shape = 255 *19*19、

　　 array[1].shape = 255*38*38、

　　 array[2].shape = 255 *76*76)

具体实现过程：

1 Darknet(.cfg) --> ONNX(.onnx)

2 ONNX(.onnx) --> TensorRT model(.trt)

3 TensorRT加速CNN部分，执行detection模块得到最终结果。

pytorch-yolo3：https://github.com/ayooshkathuria/pytorch-yolo-v3

本项目地址：https://github.com/Cw-zero/TensorRT_yolo3

(注：本项目是对pytorch-yolo3进行改写加速的)

四、性能比较：

--------------------------------------------end~我是可爱的分割线~--------------------------------------

More about TensorRT 可参考官方指导：

https://docs.nvidia.com/deeplearning/sdk/tensorrt-developer-guide/index.html#python_example_unsupported

posted @ 2019-02-24 22:02 JustCopyer 阅读(14743) 评论(7) 收藏举报

刷新页面返回顶部

登录后才能查看或发表评论，立即登录或者逛逛博客园首页

阅读排行：
· 我是不是很有钱？
· 遭遇疯狂 cc 攻击的一个周末
· 【EF Core】聊聊“复合”属性
· GPT‑5 重磅发布
· 美丽而脆弱的天体运动：当C#遇见宇宙混沌

公告

2025年8月

日

一

二

三

四

五

六

随笔分类

随笔档案

阅读排行榜

评论排行榜

1. 使用TensorRT加速yolo3(7)

JustCopyer