Trying to capture display output for real-time analysis with OpenCV; I need help with interfacing with the OS for input

2023-01-22 23:39

I want to apply operations from the OpenCV computer vision library, in real time, to video captured from my computer display. The idea in this particular case is to detect interesting features during gameplay in a popular game and provide the user with an enhanced experience, but I can think of several other scenarios where one would want live access to this data as well. For the development phase it might be acceptable to use canned video, but for the final application, performance and responsiveness are obviously critical.

I am trying to do this on Ubuntu 10.10 as of now, and would prefer to use a UNIX-like system, but any options are of interest. My C skills are very limited, so whenever talking to OpenCV through Python is possible, I try to use that instead. Please note that I am trying to capture NOT from a camera device, but from a live stream of display output, and I'm at a loss as to how to take the input. As far as I can tell, CaptureFromCAM works only for camera devices, and it seems to me that the requirement for real-time performance in the end result makes storing to a file and reading it back through CaptureFromFile a bad option.

The most promising route I have found so far is to use ffmpeg with the x11grab option to capture from an X11 display (e.g. the command ffmpeg -f x11grab -sameq -r 25 -s wxga -i :0.0 out.mpg captures 1366x768 of display 0 to 'out.mpg'). I imagine it should be possible to treat the output stream from ffmpeg as a file to be read by OpenCV (presumably with the CaptureFromFile function), maybe by using pipes, but this is all at a much higher level than I have ever dealt with before, and I could really use some direction. Do you think this approach is feasible? More importantly, can you think of a better one? How would you do it?
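For illustration, the pipe idea described above might look roughly like the following Python sketch. It is not a tested solution. It assumes a reasonably recent ffmpeg (where x11grab takes -framerate and -video_size), the NumPy package, and the cv2 Python bindings rather than the older CaptureFromFile API. The names build_ffmpeg_cmd, read_frame and main are hypothetical, and the Canny edge detector only stands in for whatever analysis is actually wanted. Instead of encoding to a file, ffmpeg is asked to write raw BGR frames to stdout, and the script cuts that byte stream into frames of a known size.

```python
import subprocess

WIDTH, HEIGHT, FPS = 1366, 768, 25   # assumed capture geometry (wxga)
BYTES_PER_PIXEL = 3                  # bgr24: one byte each for blue, green, red


def build_ffmpeg_cmd(display=":0.0", width=WIDTH, height=HEIGHT, fps=FPS):
    """Return an ffmpeg command line that writes raw BGR frames to stdout."""
    return [
        "ffmpeg",
        "-f", "x11grab",
        "-framerate", str(fps),
        "-video_size", f"{width}x{height}",
        "-i", display,
        "-f", "rawvideo",        # no container, no compression
        "-pix_fmt", "bgr24",     # the channel order OpenCV expects
        "-",                     # write to stdout instead of a file
    ]


def read_frame(stream, width=WIDTH, height=HEIGHT):
    """Read exactly one frame of bytes from stream; return None at end of stream."""
    needed = width * height * BYTES_PER_PIXEL
    chunks = []
    while needed > 0:
        chunk = stream.read(needed)
        if not chunk:            # the pipe closed before a full frame arrived
            return None
        chunks.append(chunk)
        needed -= len(chunk)
    return b"".join(chunks)


def main():
    # Third-party packages, imported here so the helpers above need only the
    # standard library.
    import numpy as np
    import cv2

    proc = subprocess.Popen(build_ffmpeg_cmd(), stdout=subprocess.PIPE,
                            stderr=subprocess.DEVNULL)
    try:
        while True:
            raw = read_frame(proc.stdout)
            if raw is None:
                break
            frame = np.frombuffer(raw, dtype=np.uint8).reshape(HEIGHT, WIDTH, 3)
            gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
            edges = cv2.Canny(gray, 100, 200)    # placeholder analysis step
            cv2.imshow("edges", edges)
            if cv2.waitKey(1) & 0xFF == ord("q"):
                break
    finally:
        proc.terminate()
        proc.wait()
        cv2.destroyAllWindows()
```

Calling main() starts the capture loop; pressing q in the preview window stops it. Because nothing touches the disk, the latency is whatever ffmpeg's grabbing and the pipe add, which may or may not be low enough for a given game.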

I would discard x11grab and any other command-line screenshot tools if you are looking for real-time performance.

Write your own screen grabber so you can send the frames directly to OpenCV. You could take a look at the xwd source code if you want to know how to do that under X11.
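As a rough idea of what such a grabber could look like without writing C, here is a minimal Python sketch. It assumes the third-party python-xlib package, whose get_image call issues the X11 GetImage request (the request behind XGetImage, the Xlib call that xwd is built around), plus NumPy and the cv2 bindings. It further assumes a 24/32-bit little-endian visual, so each pixel arrives as four bytes in B, G, R, padding order. The helper names clamp_region, grab and main are hypothetical.

```python
def clamp_region(x, y, width, height, screen_w, screen_h):
    """Clip the requested rectangle so that it lies inside the screen.

    GetImage fails with a BadMatch error if the rectangle extends past the
    window, so the request is trimmed first.
    """
    x = max(0, min(x, screen_w - 1))
    y = max(0, min(y, screen_h - 1))
    width = max(1, min(width, screen_w - x))
    height = max(1, min(height, screen_h - y))
    return x, y, width, height


def grab(root, x, y, width, height):
    """Return one screen region as a height x width x 3 BGR array."""
    import numpy as np           # third-party, imported lazily
    from Xlib import X

    raw = root.get_image(x, y, width, height, X.ZPixmap, 0xFFFFFFFF)
    # Assumption: 4 bytes per pixel (B, G, R, unused); drop the fourth byte.
    pixels = np.frombuffer(raw.data, dtype=np.uint8).reshape(height, width, 4)
    return pixels[:, :, :3].copy()   # contiguous, writable copy for OpenCV


def main():
    import cv2
    from Xlib import display

    dsp = display.Display()          # connects to the server named in $DISPLAY
    screen = dsp.screen()
    region = clamp_region(0, 0, 1366, 768,
                          screen.width_in_pixels, screen.height_in_pixels)
    while True:
        frame = grab(screen.root, *region)
        cv2.imshow("screen", frame)
        if cv2.waitKey(1) & 0xFF == ord("q"):
            break
    cv2.destroyAllWindows()
```

Each grab copies the whole region through the X connection, so this is unlikely to be the fastest possible route; a C grabber using the MIT-SHM shared-memory extension would avoid that copy, at the cost of the C-level work.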

I think the main challenge is the real-time requirement. You would probably have to write a piece of software for OpenCV, modeled on the video-grabbing code in ffmpeg, but that would certainly involve C-level coding.

My suggestion is to try to get your vision algorithm right first, by using the ffmpeg-captured video.
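One way to follow this advice without locking the algorithm to recorded files is to write the analysis against a plain iterable of frames, so that a live source can be swapped in later. The sketch below assumes the cv2 Python bindings and a recording named out.mpg as in the question; frames_from_file, run and count_edge_pixels are hypothetical names, and the edge count is only a stand-in for the real feature detector.

```python
def frames_from_file(path):
    """Yield frames decoded from a recorded video such as out.mpg."""
    import cv2                   # third-party, imported lazily

    cap = cv2.VideoCapture(path)
    try:
        while True:
            ok, frame = cap.read()
            if not ok:           # end of file or decode failure
                break
            yield frame
    finally:
        cap.release()


def run(frames, detect, max_frames=None):
    """Apply detect() to each frame of any iterable and collect the results."""
    results = []
    for index, frame in enumerate(frames):
        if max_frames is not None and index >= max_frames:
            break
        results.append(detect(frame))
    return results


def count_edge_pixels(frame):
    """Placeholder detector: number of Canny edge pixels in a BGR frame."""
    import cv2

    gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
    return int(cv2.countNonZero(cv2.Canny(gray, 100, 200)))
```

A development run would then be something like run(frames_from_file("out.mpg"), count_edge_pixels, max_frames=100). Once the detector behaves well on recordings, the first argument can be replaced by any generator that yields live frames, and run itself does not need to change.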
