Well you need to decide, will you go with HTTP Listener or alertStream, data is same. I preffer HTTP Listener.
Create TCP server to accept data
Log in to camera and under alarm server or HTTP Listener (depends on firmware name can vary) insert ip address of your server.
Camera will push events to that server.
You parse multipart data which contains xml,image1,image2,image3 by default. On web interface you can pick whether your camera will send images nor not and will it be just image1 or just image 3.
Once received , save dto some database so you can have longer period of data.
ps: if you enable ANR on HTTPListener, for each received event you need to respond with 200 OK, otherwise it will loop same event over and over until u acknowledge it.
Cheers.