DeepStack Case Study: Performance from CPU to GPU version

With the amount of CPU I have, I thought using main streams would not matter. I'm also using VBR.
 
I use sub streams, CBR at 10240 kbps on 4MP and 5120 kbps on 2MP cameras. All are set to 15/15 for frame and iframe rates. The CPU, an i7-6700K, rarely gets over 30%, and then only for a few seconds at a time. Throughput as reported by BI is around 200 kb/s and 200 MP/s. Sub streams will reduce the CPU load by a factor of five or more; in fact, when using sub streams hardware acceleration isn't needed at all.
 
You can record at full resolution, although exactly why should be more compelling than "I just want to". BI will record the sub stream until motion is detected and then switch, automagically, to the main stream for the duration of the alert. This significantly reduces video file size and increases retention time without having to add more drive space. Look in the BI help file; it's all covered in there.
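To see why sub-stream recording stretches retention, here is some back-of-the-envelope storage math. The 10240 kbps main-stream figure matches the CBR setting quoted above; the 2048 kbps sub-stream bitrate is an assumption for the example.

```python
# Back-of-the-envelope storage math: continuous sub-stream recording
# versus continuous main-stream recording at CBR.
# 10240 kbps is the 4MP main-stream setting quoted in the thread;
# 2048 kbps for the sub stream is an assumed example value.

def gb_per_day(bitrate_kbps: float) -> float:
    """Storage consumed per camera per day at a given CBR bitrate."""
    bits_per_day = bitrate_kbps * 1000 * 86400   # kbps -> bits per day
    return bits_per_day / 8 / 1e9                # bits -> gigabytes

main_kbps = 10240   # 4MP main stream, CBR
sub_kbps = 2048     # sub stream (assumed)

print(f"Main stream: {gb_per_day(main_kbps):.1f} GB/day")   # ~110.6 GB/day
print(f"Sub stream:  {gb_per_day(sub_kbps):.1f} GB/day")    # ~22.1 GB/day
print(f"Retention multiplier: {main_kbps / sub_kbps:.1f}x")
```

Real usage lands somewhere between the two figures, since the main stream still gets recorded during alerts, but it shows why retention goes up roughly fivefold at these bitrates.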
 

What bitrate do you run your substreams at?
 
Hang on, I'll check.....2048 on 4MP and 1792 on 2MP.
 
I raised bit rates recently. It did add more detail to both the 4MP and 2MP cameras. The CPU load increase was not significant at all.
 
If your CPU is spiking to 100%, then obviously you don't have enough CPU.

Substreams are your friend. If you are using them for the main workhorse of BI (motion detection, alerts, etc.), then you can record the mainstream 24/7 without much CPU overhead, but you need to still use substreams or the CPU will spike.

While this thread was directed towards LPR, the same principles apply regarding resolution. You would be surprised how good a D1 substream can be to save on overall storage requirements:
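A quick pixel-count comparison shows why a D1 substream is so cheap to decode and store. D1 here is NTSC D1 (704x480), and 2560x1440 is an assumed resolution for a 4MP camera.

```python
# Rough per-frame pixel comparison: D1 sub stream vs. 4MP main stream.
# 2560x1440 is an assumed 4MP resolution; D1 is NTSC D1 (704x480).

d1 = 704 * 480          # 337,920 pixels per frame
four_mp = 2560 * 1440   # 3,686,400 pixels per frame

print(f"D1:  {d1:,} px")
print(f"4MP: {four_mp:,} px")
print(f"The main stream has ~{four_mp / d1:.0f}x the pixels per frame")
```

At the same frame rate, that is roughly an 11x difference in pixels the CPU has to decode for motion detection.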

 
Has anyone tried mining while also running Deepstack off the GPU? Wondering if it will significantly slow things down or just a bit. Sometimes I game while mining and it still runs fine, but my hashrate drops.
 
Does the amount of GB on the GPU matter? For example, would a GTX 1060 3GB suffice, or would a 6GB be a better choice for about 7 HD cameras?
 
Hi..

Yes, I think so. Every task uses memory. If I enable Blue Iris to use hardware video decode, video RAM usage increases. I'm currently using an Nvidia GT 1030 (2 GB RAM). It works OK with DeepStack and CodeProject, although I disabled all modules except custom object detection. Current video RAM usage is about 1 GB.
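If you want to keep an eye on video-RAM headroom yourself, nvidia-smi's CSV query mode is handy, e.g. `nvidia-smi --query-gpu=name,memory.used,memory.total --format=csv,noheader,nounits`. Below is a small sketch that parses one such line; the GT 1030 sample values are illustrative, roughly matching the ~1 GB usage described above.

```python
# Parse one CSV line from:
#   nvidia-smi --query-gpu=name,memory.used,memory.total \
#       --format=csv,noheader,nounits
# The sample line is illustrative, not captured output.

def parse_gpu_csv(line: str) -> dict:
    """Split one nvidia-smi CSV line into name and memory values (MiB)."""
    name, used, total = [field.strip() for field in line.split(",")]
    return {"name": name, "used_mib": int(used), "total_mib": int(total)}

sample = "NVIDIA GeForce GT 1030, 1024, 2048"  # assumed sample output
gpu = parse_gpu_csv(sample)
print(f"{gpu['name']}: {gpu['used_mib']}/{gpu['total_mib']} MiB used")
```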

[attached screenshot: GPU memory usage]


 
Does the amount of GB on the GPU matter? For example, would a GTX 1060 3GB suffice, or would a 6GB be a better choice for about 7 HD cameras?

Get the 6GB version. I have the 3GB 1060, and I will get the occasional "AI not responding" if I have too many custom models loaded/being used with DeepStack. Or it just would not start DS at all, and/or Blue Iris would stop responding. I found it works best to load maybe two or three custom ones. Also, if you find those custom ones do what you need, you can turn off the built-in DS detection model to save resources even more. It is not as big of a problem now that I have moved over to CodeProject.AI, which seems to go easier on memory for both the computer and the GPU, although I have been able to push it to where it stops responding too. The 6GB model should not cost too much more and should keep the problems down.
 
I ended up getting a used GTX 1060 6GB. I tried to get CodeProject.AI working but couldn't, so I went back to DeepStack. So far it's very stable and low on resources. I'm not using custom models, just the default ones.

 
What inference speeds are you getting with the GTX 1060? I have the GTX 970 and it is fairly respectable, between 50 and 80 ms.
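If you log per-request inference times (for example, by timing your own requests to DeepStack's `/v1/vision/detection` endpoint), a tiny helper makes them easy to compare across cards. The sample latencies below are made up, just spanning the 50-80 ms range discussed here.

```python
# Summarize a batch of measured DeepStack inference times in milliseconds.
# The sample values are hypothetical, chosen to span the 50-80 ms range
# mentioned in the thread; substitute timings from your own requests.
import statistics

def summarize(latencies_ms):
    """Return the median and worst-case of a batch of inference timings."""
    return {
        "median_ms": statistics.median(latencies_ms),
        "max_ms": max(latencies_ms),
    }

sample = [52, 61, 58, 79, 66, 55, 73, 60]  # hypothetical timings, ms
stats = summarize(sample)
print(f"median {stats['median_ms']:.0f} ms, worst {stats['max_ms']:.0f} ms")
```

The median is a better headline number than the average, since a single slow first-request (model load) can skew the mean badly.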
 
There is a possibility that I might be upgrading my motherboard. I'm looking at the X570 series for AMD. I need more SATA ports and just generally more options, as mine is a very basic motherboard.

Most have double slots for gpu cards.

Has anyone tried SLI mode with two cheaper cards?
I wonder how DeepStack works with two cards in SLI mode.
I'm happy with my GTX 970, but curiosity always strikes.
 
I've been happily running DeepStack via AITool until a couple of months ago, when upgrading BI apparently broke the AITool trigger/flagging into BI. So I'm now looking at how to get what I was doing in AITool done in BI without the added app.

I had the DeepStack GPU version installed, as I had installed an Nvidia GTX 1050 card (640 CUDA cores but only 2GB).

Is this only for Deepstack running on Windows vs the Docker GPU version?