Custom glBlendFunc a lot slower than native

2023-03-28 12:15 问答作者：

I'm trying to do my own custom glBlendFunc through fragment shaders, however, my solution is a lot slower than the native glBlendFunc, even 开发者_如何学Cwhen they do the exact blending function.

I was wondering if anyone had any suggestion on how to do this in a more efficient way.

My solution works something like this:

void draw(fbo fbos[2], render_item item)
{
   // fbos[0] is the render target
   // fbos[1] is the previous render target used to read "background" to blend against in shader
   // Both fbos have exactly the same content, however they need to be different since we can't both read and write to the same texture. The texture we render to needs to have the entire content since we might not draw geometry everywhere.

   fbos[0]->attach(); // Attach fbo
   fbos[1]->bind(1); // Bind as texture 1

   render(item);

   glCopyTexSubImage2D(...); // copy from fbos[0] to fbos[1], fbos[1] == fbos[0]
}

fragment.glsl

vec4 blend_color(vec4 fore) 
{   
    vec4 back = texture2D(background, gl_TexCoord[1].st); // background is read from texture "1"
    return vec4(mix(back.rgb, fore.rgb, fore.a), back.a + fore.a);  
}

Your best bet for improving the performance of FBO-based blending is NV_texture_barrier. Despite the name, AMD has implemented it as well, so if you stick to Radeon HD-class cards, it should be available to you.

Basically, it allows you to ping-pong without heavyweight operations like FBO binding or texture attachment operations. The specification has a section towards the bottom that shows the general algorithm.

Another alternative is EXT_shader_image_load_store. This will require DX11/GL 4.x class hardware. OpenGL 4.2 recently promoted this to core with ARB_shader_image_load_store.

Even with this, as Darcy said, you're never going to beat regular blending. It uses special hardware structures that shaders can't access (since they happen after the shaders have run). You should only do programmatic blending if there is some effect that you absolutely cannot accomplish any other way.

It is a lot more efficient because blending operations are built directly into the GPU hardware, so you probably aren't going to be able to beat it for speed. Having said that,make sure you have depth-testing, back-face culling , hardware blending, and any other unneeded operations turned off. I can't say it will make a huge difference, but it may make some.

继续阅读：glblendfunc opengl shader

Custom glBlendFunc a lot slower than native

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？