验证tensorRT10.0 trt模型是否有效

trtexec --loadEngine=model.trt
rtx3060显卡显示pass则OK
&&&& PASSED TensorRT.trtexec [TensorRT v100001] # trtexec --loadEngine=model.trt


D:\cdtxw\TensorRT-10.0.1.6.Windows10.win10.cuda-12.4\TensorRT-10.0.1.6\bin>trtexec --loadEngine=model.trt
&&&& RUNNING TensorRT.trtexec [TensorRT v100001] # trtexec --loadEngine=model.trt
[04/29/2026-13:23:53] [I] === Model Options ===
[04/29/2026-13:23:53] [I] Format: *
[04/29/2026-13:23:53] [I] Model:
[04/29/2026-13:23:53] [I] Output:
[04/29/2026-13:23:53] [I]
[04/29/2026-13:23:53] [I] === System Options ===
[04/29/2026-13:23:53] [I] Device: 0
[04/29/2026-13:23:53] [I] DLACore:
[04/29/2026-13:23:53] [I] Plugins:
[04/29/2026-13:23:53] [I] setPluginsToSerialize:
[04/29/2026-13:23:53] [I] dynamicPlugins:
[04/29/2026-13:23:53] [I] ignoreParsedPluginLibs: 0
[04/29/2026-13:23:53] [I]
[04/29/2026-13:23:53] [I] === Inference Options ===
[04/29/2026-13:23:53] [I] Batch: Explicit
[04/29/2026-13:23:53] [I] Input inference shapes: model
[04/29/2026-13:23:53] [I] Iterations: 10
[04/29/2026-13:23:53] [I] Duration: 3s (+ 200ms warm up)
[04/29/2026-13:23:53] [I] Sleep time: 0ms
[04/29/2026-13:23:53] [I] Idle time: 0ms
[04/29/2026-13:23:53] [I] Inference Streams: 1
[04/29/2026-13:23:53] [I] ExposeDMA: Disabled
[04/29/2026-13:23:53] [I] Data transfers: Enabled
[04/29/2026-13:23:53] [I] Spin-wait: Disabled
[04/29/2026-13:23:53] [I] Multithreading: Disabled
[04/29/2026-13:23:53] [I] CUDA Graph: Disabled
[04/29/2026-13:23:53] [I] Separate profiling: Disabled
[04/29/2026-13:23:53] [I] Time Deserialize: Disabled
[04/29/2026-13:23:53] [I] Time Refit: Disabled
[04/29/2026-13:23:53] [I] NVTX verbosity: 0
[04/29/2026-13:23:53] [I] Persistent Cache Ratio: 0
[04/29/2026-13:23:53] [I] Optimization Profile Index: 0
[04/29/2026-13:23:53] [I] Weight Streaming Budget: Disabled
[04/29/2026-13:23:53] [I] Inputs:
[04/29/2026-13:23:53] [I] Debug Tensor Save Destinations:
[04/29/2026-13:23:53] [I] === Reporting Options ===
[04/29/2026-13:23:53] [I] Verbose: Disabled
[04/29/2026-13:23:53] [I] Averages: 10 inferences
[04/29/2026-13:23:53] [I] Percentiles: 90,95,99
[04/29/2026-13:23:53] [I] Dump refittable layers:Disabled
[04/29/2026-13:23:53] [I] Dump output: Disabled
[04/29/2026-13:23:53] [I] Profile: Disabled
[04/29/2026-13:23:53] [I] Export timing to JSON file:
[04/29/2026-13:23:53] [I] Export output to JSON file:
[04/29/2026-13:23:53] [I] Export profile to JSON file:
[04/29/2026-13:23:53] [I]
[04/29/2026-13:23:53] [I] === Device Information ===
[04/29/2026-13:23:53] [I] Available Devices:
[04/29/2026-13:23:53] [I]   Device 0: "NVIDIA GeForce RTX 3060" UUID: GPU-fdec3edd-55e9-18e8-70ae-dd99ad5ef70b
[04/29/2026-13:23:54] [I] Selected Device: NVIDIA GeForce RTX 3060
[04/29/2026-13:23:54] [I] Selected Device ID: 0
[04/29/2026-13:23:54] [I] Selected Device UUID: GPU-fdec3edd-55e9-18e8-70ae-dd99ad5ef70b
[04/29/2026-13:23:54] [I] Compute Capability: 8.6
[04/29/2026-13:23:54] [I] SMs: 28
[04/29/2026-13:23:54] [I] Device Global Memory: 12287 MiB
[04/29/2026-13:23:54] [I] Shared Memory per SM: 100 KiB
[04/29/2026-13:23:54] [I] Memory Bus Width: 192 bits (ECC disabled)
[04/29/2026-13:23:54] [I] Application Compute Clock Rate: 1.777 GHz
[04/29/2026-13:23:54] [I] Application Memory Clock Rate: 7.501 GHz
[04/29/2026-13:23:54] [I]
[04/29/2026-13:23:54] [I] Note: The application clock rates do not reflect the actual clock rates that the GPU is currently running at.
[04/29/2026-13:23:54] [I]
[04/29/2026-13:23:54] [I] TensorRT version: 10.0.1
[04/29/2026-13:23:54] [I] Loading standard plugins
[04/29/2026-13:23:54] [I] [TRT] Loaded engine size: 31 MiB
[04/29/2026-13:23:54] [I] Engine deserialized in 0.0506413 sec.
[04/29/2026-13:23:54] [I] [TRT] [MemUsageChange] TensorRT-managed allocation in IExecutionContext creation: CPU +0, GPU +32, now: CPU 0, GPU 59 (MiB)
[04/29/2026-13:23:54] [I] Setting persistentCacheLimit to 0 bytes.
[04/29/2026-13:23:54] [I] Created execution context with device memory size: 32.5684 MiB
[04/29/2026-13:23:54] [I] Using random values for input images
[04/29/2026-13:23:54] [I] Input binding for images with dimensions 1x3x640x640 is created.
[04/29/2026-13:23:54] [I] Output binding for output0 with dimensions 1x25200x6 is created.
[04/29/2026-13:23:54] [I] Starting inference
[04/29/2026-13:23:57] [I] Warmup completed 55 queries over 200 ms
[04/29/2026-13:23:57] [I] Timing trace has 912 queries over 3.00798 s
[04/29/2026-13:23:57] [I]
[04/29/2026-13:23:57] [I] === Trace details ===
[04/29/2026-13:23:57] [I] Trace averages of 10 runs:
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.27158 ms - Host latency: 3.55169 ms (enqueue 0.235562 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.26227 ms - Host latency: 3.54421 ms (enqueue 0.229588 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.25232 ms - Host latency: 3.5312 ms (enqueue 0.233936 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.26164 ms - Host latency: 3.55309 ms (enqueue 0.312955 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.27527 ms - Host latency: 3.55285 ms (enqueue 0.263553 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.24512 ms - Host latency: 3.52156 ms (enqueue 0.234555 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.27122 ms - Host latency: 3.54536 ms (enqueue 0.279669 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.25597 ms - Host latency: 3.52809 ms (enqueue 0.232373 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.25819 ms - Host latency: 3.54006 ms (enqueue 0.236603 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.25058 ms - Host latency: 3.52447 ms (enqueue 0.225491 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.25029 ms - Host latency: 3.52529 ms (enqueue 0.351422 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.26315 ms - Host latency: 3.54017 ms (enqueue 0.248328 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.25595 ms - Host latency: 3.53047 ms (enqueue 0.225879 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.26296 ms - Host latency: 3.53982 ms (enqueue 0.229376 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.26677 ms - Host latency: 3.54118 ms (enqueue 0.227271 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.26885 ms - Host latency: 3.54128 ms (enqueue 0.226233 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.33024 ms - Host latency: 3.60322 ms (enqueue 0.274139 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.25842 ms - Host latency: 3.5301 ms (enqueue 0.23075 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.24912 ms - Host latency: 3.52295 ms (enqueue 0.226483 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.25103 ms - Host latency: 3.52394 ms (enqueue 0.35061 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.25674 ms - Host latency: 3.52866 ms (enqueue 0.252045 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.26942 ms - Host latency: 3.54549 ms (enqueue 0.268597 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.25859 ms - Host latency: 3.5386 ms (enqueue 0.2755 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.25273 ms - Host latency: 3.52945 ms (enqueue 0.226239 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.25125 ms - Host latency: 3.52156 ms (enqueue 0.230713 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.24615 ms - Host latency: 3.52579 ms (enqueue 0.241553 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.26956 ms - Host latency: 3.54294 ms (enqueue 0.320264 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.24769 ms - Host latency: 3.53057 ms (enqueue 0.424158 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.24855 ms - Host latency: 3.52181 ms (enqueue 0.227209 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.25098 ms - Host latency: 3.53075 ms (enqueue 0.232568 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.24949 ms - Host latency: 3.52338 ms (enqueue 0.228137 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.26489 ms - Host latency: 3.5382 ms (enqueue 0.224622 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.33331 ms - Host latency: 3.61204 ms (enqueue 0.241907 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.25673 ms - Host latency: 3.53912 ms (enqueue 0.275366 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.25276 ms - Host latency: 3.52327 ms (enqueue 0.234912 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.25162 ms - Host latency: 3.52848 ms (enqueue 0.25531 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.23345 ms - Host latency: 3.50366 ms (enqueue 0.226343 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.26809 ms - Host latency: 3.54641 ms (enqueue 0.310645 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.26123 ms - Host latency: 3.53636 ms (enqueue 0.225623 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.24888 ms - Host latency: 3.52233 ms (enqueue 0.227661 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.25553 ms - Host latency: 3.52748 ms (enqueue 0.310608 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.27717 ms - Host latency: 3.54877 ms (enqueue 0.486438 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.25909 ms - Host latency: 3.51829 ms (enqueue 0.236597 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.26716 ms - Host latency: 3.53732 ms (enqueue 0.233606 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.27636 ms - Host latency: 3.5307 ms (enqueue 0.229749 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.2725 ms - Host latency: 3.54008 ms (enqueue 0.230872 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.29437 ms - Host latency: 3.57953 ms (enqueue 0.229053 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.31082 ms - Host latency: 3.59656 ms (enqueue 0.225122 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.34286 ms - Host latency: 3.61547 ms (enqueue 0.267114 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.25658 ms - Host latency: 3.52594 ms (enqueue 0.226233 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.25605 ms - Host latency: 3.5321 ms (enqueue 0.230554 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.26814 ms - Host latency: 3.5452 ms (enqueue 0.265808 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.25149 ms - Host latency: 3.52341 ms (enqueue 0.338147 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.26481 ms - Host latency: 3.54089 ms (enqueue 0.24082 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.27865 ms - Host latency: 3.55026 ms (enqueue 0.227759 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.26298 ms - Host latency: 3.53359 ms (enqueue 0.231311 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.26667 ms - Host latency: 3.54 ms (enqueue 0.317224 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.27161 ms - Host latency: 3.54849 ms (enqueue 0.309863 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.25444 ms - Host latency: 3.5197 ms (enqueue 0.236108 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.27859 ms - Host latency: 3.55837 ms (enqueue 0.233203 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.28555 ms - Host latency: 3.56121 ms (enqueue 0.266992 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.2625 ms - Host latency: 3.53491 ms (enqueue 0.227295 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.26868 ms - Host latency: 3.54458 ms (enqueue 0.23064 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.28748 ms - Host latency: 3.56067 ms (enqueue 0.22981 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.33057 ms - Host latency: 3.61541 ms (enqueue 0.294409 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.26702 ms - Host latency: 3.5417 ms (enqueue 0.228857 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.27664 ms - Host latency: 3.54985 ms (enqueue 0.225977 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.28008 ms - Host latency: 3.56162 ms (enqueue 0.312085 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.276 ms - Host latency: 3.53518 ms (enqueue 0.320728 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.27627 ms - Host latency: 3.54097 ms (enqueue 0.245654 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.29497 ms - Host latency: 3.5739 ms (enqueue 0.266357 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.28687 ms - Host latency: 3.56318 ms (enqueue 0.347046 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.28574 ms - Host latency: 3.55991 ms (enqueue 0.233521 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.27397 ms - Host latency: 3.5512 ms (enqueue 0.225562 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.27397 ms - Host latency: 3.54617 ms (enqueue 0.246875 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.26941 ms - Host latency: 3.54966 ms (enqueue 0.225635 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.28464 ms - Host latency: 3.55774 ms (enqueue 0.345483 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.28767 ms - Host latency: 3.56484 ms (enqueue 0.22627 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.2917 ms - Host latency: 3.56902 ms (enqueue 0.232324 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.28391 ms - Host latency: 3.56431 ms (enqueue 0.226978 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.33992 ms - Host latency: 3.61333 ms (enqueue 0.259058 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.27871 ms - Host latency: 3.55242 ms (enqueue 0.23147 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.26851 ms - Host latency: 3.54167 ms (enqueue 0.22688 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.28333 ms - Host latency: 3.55967 ms (enqueue 0.316919 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.27964 ms - Host latency: 3.5583 ms (enqueue 0.274243 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.27249 ms - Host latency: 3.54912 ms (enqueue 0.241016 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.29192 ms - Host latency: 3.57085 ms (enqueue 0.226147 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.26965 ms - Host latency: 3.54851 ms (enqueue 0.275024 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.26995 ms - Host latency: 3.55879 ms (enqueue 0.330786 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.27021 ms - Host latency: 3.546 ms (enqueue 0.223511 ms)
[04/29/2026-13:23:57] [I] Average on 10 runs - GPU latency: 3.27686 ms - Host latency: 3.54849 ms (enqueue 0.315283 ms)
[04/29/2026-13:23:57] [I]
[04/29/2026-13:23:57] [I] === Performance summary ===
[04/29/2026-13:23:57] [I] Throughput: 303.194 qps
[04/29/2026-13:23:57] [I] Latency: min = 3.46265 ms, max = 4.35815 ms, mean = 3.54589 ms, median = 3.5403 ms, percentile(90%) = 3.5813 ms, percentile(95%) = 3.59961 ms, percentile(99%) = 3.65942 ms
[04/29/2026-13:23:57] [I] Enqueue Time: min = 0.220703 ms, max = 0.797729 ms, mean = 0.259363 ms, median = 0.226807 ms, percentile(90%) = 0.270996 ms, percentile(95%) = 0.61084 ms, percentile(99%) = 0.673584 ms
[04/29/2026-13:23:57] [I] H2D Latency: min = 0.20752 ms, max = 0.346649 ms, mean = 0.230353 ms, median = 0.230957 ms, percentile(90%) = 0.235107 ms, percentile(95%) = 0.23999 ms, percentile(99%) = 0.266113 ms
[04/29/2026-13:23:57] [I] GPU Compute Time: min = 3.20331 ms, max = 4.08569 ms, mean = 3.27073 ms, median = 3.26453 ms, percentile(90%) = 3.30371 ms, percentile(95%) = 3.31885 ms, percentile(99%) = 3.38477 ms
[04/29/2026-13:23:57] [I] D2H Latency: min = 0.0324707 ms, max = 0.107971 ms, mean = 0.0448103 ms, median = 0.0471191 ms, percentile(90%) = 0.0501099 ms, percentile(95%) = 0.0530396 ms, percentile(99%) = 0.0649414 ms
[04/29/2026-13:23:57] [I] Total Host Walltime: 3.00798 s
[04/29/2026-13:23:57] [I] Total GPU Compute Time: 2.9829 s
[04/29/2026-13:23:57] [W] * GPU compute time is unstable, with coefficient of variance = 1.88254%.
[04/29/2026-13:23:57] [W]   If not already in use, locking GPU clock frequency or adding --useSpinWait may improve the stability.
[04/29/2026-13:23:57] [I] Explanations of the performance metrics are printed in the verbose logs.
[04/29/2026-13:23:57] [I]
&&&& PASSED TensorRT.trtexec [TensorRT v100001] # trtexec --loadEngine=model.trt

 

posted @ 2026-04-29 13:33  txwtech  阅读(6)  评论(0)    收藏  举报