MLAA (Morphological Anti-Aliasing)

Gamestudio Links

Zorro Links

Newest Posts

Zorro Calculate Commission and Spread Even Setting Is Off
by vicknick. 05/13/24 09:17

Change chart colours
by 7th_zorro. 05/11/24 09:25

Texture/Sprite animation tool. Someone interested in ?
by NeoDumont. 05/10/24 19:36

Seeking Advice: Trading Two Algorithms for SPX500 / US Treasury
by AndrewAMD. 05/07/24 00:56

Data from CSV not parsed correctly
by dr_panther. 05/06/24 18:50

Zorro Mistakenly Changes Another Assets Parameter in Loop
by dr_panther. 05/06/24 18:24

tradeUpdatePool() but for only exiting positions
by AndrewAMD. 05/03/24 15:20

FOSS C++ DLL framework with a Python subsystem and git hooks.
by firatv. 05/02/24 09:40

AUM Magazine

Latest Screens

Who's Online Now

3 registered members (AndrewAMD, Ayumi, 1 invisible), 1,291 guests, and 3 spiders.

Key: Admin, Global Mod, Mod

Newest Members

firatv, wandaluciaia, Mega_Rod, EternallyCurious, howardR
19050 Registered Users

Print Thread

Rate Thread

Re: MLAA on the GPU using Direct3D9 (DEMO) [Re: rojart] #362179 03/07/11 00:49 03/07/11 00:49
Joined: Oct 2004 Posts: 900 Lgh rojart OP User
rojart OP User Joined: Oct 2004 Posts: 900 Lgh	The original GPU Pro 2 MLAA source code was released. Does anyone know, how to convert it? Click to reveal.. /** * Copyright (C) 2010 Jorge Jimenez (jorge@iryoku.com) * Copyright (C) 2010 Belen Masia (bmasia@unizar.es) * Copyright (C) 2010 Jose I. Echevarria (joseignacioechevarria@gmail.com) * Copyright (C) 2010 Fernando Navarro (fernandn@microsoft.com) * Copyright (C) 2010 Diego Gutierrez (diegog@unizar.es) * All rights reserved. * * Redistribution and use in source and binary forms, with or without * modification, are permitted provided that the following conditions are met: * * 1. Redistributions of source code must retain the above copyright notice, * this list of conditions and the following disclaimer. * * 2. Redistributions in binary form must reproduce the following statement: * * "Uses Jimenez's MLAA. Copyright (C) 2010 by Jorge Jimenez, Belen Masia, * Jose I. Echevarria, Fernando Navarro and Diego Gutierrez." * * THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS ``AS * IS'' AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, * THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR * PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL COPYRIGHT HOLDERS OR CONTRIBUTORS * BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR * CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF * SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS * INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN * CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) * ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE * POSSIBILITY OF SUCH DAMAGE. * * The views and conclusions contained in the software and documentation are * those of the authors and should not be interpreted as representing official * policies, either expressed or implied, of the copyright holders. / // Just for checking syntax at compile time #if !defined(PIXEL_SIZE) #define PIXEL_SIZE float2(1.0 / 1280.0, 1.0 / 720.0) #define MAX_SEARCH_STEPS 8 #define MAX_DISTANCE 32 #endif /* * Here we have an interesting define. In the last pass we make usage of * bilinear filtering to avoid some lerps; however, bilinear filtering * in DX9, under DX9 hardware (but not in DX9 code running on DX10 hardware) * is done in gamma space, which gives sustantially worser results. So, this * flag allows to avoid the bilinear filter trick, changing it with some * software lerps. * * So, to summarize, it is safe to use the bilinear filter trick when you are * using DX10 hardware on DX9. However, for the best results when using DX9 * hardware, it is recommended comment this line. / #define BILINEAR_FILTER_TRICK /* * Input vars and textures. / float threshold; texture2D colorTex; texture2D depthTex; texture2D edgesTex; texture2D blendTex; texture2D areaTex; /* * DX9 samplers hell following this. / sampler2D colorMap { Texture = <colorTex>; AddressU = Clamp; AddressV = Clamp; MipFilter = Point; MinFilter = Point; MagFilter = Point; SRGBTexture = true; }; sampler2D colorMapL { Texture = <colorTex>; AddressU = Clamp; AddressV = Clamp; MipFilter = Point; MinFilter = Linear; MagFilter = Linear; SRGBTexture = true; }; sampler2D depthMap { Texture = <depthTex>; AddressU = Clamp; AddressV = Clamp; MipFilter = Point; MinFilter = Point; MagFilter = Point; SRGBTexture = false; }; sampler2D edgesMap { Texture = <edgesTex>; AddressU = Clamp; AddressV = Clamp; MipFilter = Point; MinFilter = Point; MagFilter = Point; SRGBTexture = false; }; sampler2D edgesMapL { Texture = <edgesTex>; AddressU = Clamp; AddressV = Clamp; MipFilter = Point; MinFilter = Linear; MagFilter = Linear; SRGBTexture = false; }; sampler2D blendMap { Texture = <blendTex>; AddressU = Clamp; AddressV = Clamp; MipFilter = Point; MinFilter = Point; MagFilter = Point; SRGBTexture = false; }; sampler2D areaMap { Texture = <areaTex>; AddressU = Clamp; AddressV = Clamp; AddressW = Clamp; MipFilter = Point; MinFilter = Point; MagFilter = Point; SRGBTexture = false; }; /* * Typical Multiply-Add operation to ease translation to assembly code. / float4 mad(float4 m, float4 a, float4 b) { #if defined(XBOX) float4 result; asm { mad result, m, a, b }; return result; #else return m a + b; #endif } /** * This one just returns the first level of a mip map chain, which allow us to * avoid the nasty ddx/ddy warnings, even improving the performance a little * bit. / float4 tex2Dlevel0(sampler2D map, float2 texcoord) { return tex2Dlod(map, float4(texcoord, 0.0, 0.0)); } /* * Same as above, this eases translation to assembly code; / float4 tex2Doffset(sampler2D map, float2 texcoord, float2 offset) { #if defined(XBOX) && MAX_SEARCH_STEPS < 6 float4 result; float x = offset.x; float y = offset.y; asm { tfetch2D result, texcoord, map, OffsetX = x, OffsetY = y }; return result; #else return tex2Dlevel0(map, texcoord + PIXEL_SIZE offset); #endif } /** * Ok, we have the distance and both crossing edges, can you please return * the float2 blending weights? / float2 Area(float2 distance, float e1, float e2) { // By dividing by areaSize - 1.0 below we are implicitely offsetting to // always fall inside of a pixel // * Rounding prevents bilinear access precision problems float areaSize = MAX_DISTANCE * 5.0; float2 pixcoord = MAX_DISTANCE * round(4.0 * float2(e1, e2)) + distance; float2 texcoord = pixcoord / (areaSize - 1.0); return tex2Dlevel0(areaMap, texcoord).ra; } /** * 1 S T P A S S ~ C O L O R V E R S I O N / float4 ColorEdgeDetectionPS(float2 texcoord : TEXCOORD0) : COLOR0 { float3 weights = float3(0.2126,0.7152, 0.0722); // These ones are from the CIE XYZ standard. float L = dot(tex2Dlevel0(colorMap, texcoord).rgb, weights); float Lleft = dot(tex2Doffset(colorMap, texcoord, -float2(1.0, 0.0)).rgb, weights); float Ltop = dot(tex2Doffset(colorMap, texcoord, -float2(0.0, 1.0)).rgb, weights); float Lright = dot(tex2Doffset(colorMap, texcoord, float2(1.0, 0.0)).rgb, weights); float Lbottom = dot(tex2Doffset(colorMap, texcoord, float2(0.0, 1.0)).rgb, weights); /* * We detect edges in gamma 1.0/2.0 space. Gamma space boosts the contrast * of the blacks, where the human vision system is more sensitive to small * gradations of intensity. / float4 delta = abs(sqrt(L).xxxx - sqrt(float4(Lleft, Ltop, Lright, Lbottom))); float4 edges = step(threshold.xxxx, delta); if (dot(edges, 1.0) == 0.0) discard; return edges; } /* * 1 S T P A S S ~ D E P T H V E R S I O N / float4 DepthEdgeDetectionPS(float2 texcoord : TEXCOORD0) : COLOR0 { float D = tex2Dlevel0(depthMap, texcoord).r; float Dleft = tex2Doffset(depthMap, texcoord, -float2(1.0, 0.0)).r; float Dtop = tex2Doffset(depthMap, texcoord, -float2(0.0, 1.0)).r; float Dright = tex2Doffset(depthMap, texcoord, float2(1.0, 0.0)).r; float Dbottom = tex2Doffset(depthMap, texcoord, float2(0.0, 1.0)).r; float4 delta = abs(D.xxxx - float4(Dleft, Dtop, Dright, Dbottom)); float4 edges = step(threshold.xxxx / 10.0, delta); // Dividing by 10 give us results similar to the color-based detection. if (dot(edges, 1.0) == 0.0) discard; return edges; } /* * Search functions for the 2nd pass. / float SearchXLeft(float2 texcoord) { // We compare with 0.9 to prevent bilinear access precision problems. float i; float e = 0.0; for (i = -1.5; i > -2.0 MAX_SEARCH_STEPS; i -= 2.0) { e = tex2Doffset(edgesMapL, texcoord, float2(i, 0.0)).g; [flatten] if (e < 0.9) break; } return max(i + 1.5 - 2.0 * e, -2.0 * MAX_SEARCH_STEPS); } float SearchXRight(float2 texcoord) { float i; float e = 0.0; for (i = 1.5; i < 2.0 * MAX_SEARCH_STEPS; i += 2.0) { e = tex2Doffset(edgesMapL, texcoord, float2(i, 0.0)).g; [flatten] if (e < 0.9) break; } return min(i - 1.5 + 2.0 * e, 2.0 * MAX_SEARCH_STEPS); } float SearchYUp(float2 texcoord) { float i; float e = 0.0; for (i = -1.5; i > -2.0 * MAX_SEARCH_STEPS; i -= 2.0) { e = tex2Doffset(edgesMapL, texcoord, float2(i, 0.0).yx).r; [flatten] if (e < 0.9) break; } return max(i + 1.5 - 2.0 * e, -2.0 * MAX_SEARCH_STEPS); } float SearchYDown(float2 texcoord) { float i; float e = 0.0; for (i = 1.5; i < 2.0 * MAX_SEARCH_STEPS; i += 2.0) { e = tex2Doffset(edgesMapL, texcoord, float2(i, 0.0).yx).r; [flatten] if (e < 0.9) break; } return min(i - 1.5 + 2.0 * e, 2.0 * MAX_SEARCH_STEPS); } /** * Checks if the crossing edges e1 and e2 correspond to a _U_ shape. / bool IsUShape(float e1, float e2) { float t = e1 + e2; return abs(t - 1.5) < 0.1 \|\| abs(t - 0.5) < 0.1; } /* * S E C O N D P A S S / float4 BlendWeightCalculationPS(float2 texcoord : TEXCOORD0) : COLOR0 { float4 areas = 0.0; float2 e = tex2Dlevel0(edgesMap, texcoord); [branch] if (e.g) { // Edge at north // Search distances to the left and to the right: float2 d = float2(SearchXLeft(texcoord), SearchXRight(texcoord)); // Now fetch the crossing edges. Instead of sampling between edgels, we // sample at -0.25, to be able to discern what value has each edgel: float4 coords = mad(float4(d.x, -0.25, d.y + 1.0, -0.25), PIXEL_SIZE.xyxy, texcoord.xyxy); float e1 = tex2Dlevel0(edgesMapL, coords.xy).r; float e2 = tex2Dlevel0(edgesMapL, coords.zw).r; if (-d.r + d.g + 1 > 1 \|\| IsUShape(e1, e2)) { // Ok, we know how this pattern looks like, now it is time for getting // the actual area: areas.rg = Area(abs(d), e1, e2); } } [branch] if (e.r) { // Edge at west // Search distances to the top and to the bottom: float2 d = float2(SearchYUp(texcoord), SearchYDown(texcoord)); // Now fetch the crossing edges (yet again): float4 coords = mad(float4(-0.25, d.x, -0.25, d.y + 1.0), PIXEL_SIZE.xyxy, texcoord.xyxy); float e1 = tex2Dlevel0(edgesMapL, coords.xy).g; float e2 = tex2Dlevel0(edgesMapL, coords.zw).g; if (-d.r + d.g + 1 > 1 \|\| IsUShape(e1, e2)) { // Get the area for this direction: areas.ba = Area(abs(d), e1, e2); } } return areas; } /* * T H I R D P A S S / float4 NeighborhoodBlendingPS(float2 texcoord : TEXCOORD0) : COLOR0 { // Fetch the blending weights for current pixel: float4 topLeft = tex2Dlevel0(blendMap, texcoord); float bottom = tex2Doffset(blendMap, texcoord, float2(0.0, 1.0)).g; float right = tex2Doffset(blendMap, texcoord, float2(1.0, 0.0)).a; float4 a = float4(topLeft.r, bottom, topLeft.b, right); // There is some blending weight with a value greater than 0.0? float sum = dot(a, 1.0); [branch] if (sum > 0.0) {; float4 color = 0.0; // Add the contributions of the possible 4 lines that can cross this pixel: #ifdef BILINEAR_FILTER_TRICK float4 o = a PIXEL_SIZE.yyxx; color = mad(tex2Dlevel0(colorMapL, texcoord + float2( 0.0, -o.r)), a.r, color); color = mad(tex2Dlevel0(colorMapL, texcoord + float2( 0.0, o.g)), a.g, color); color = mad(tex2Dlevel0(colorMapL, texcoord + float2(-o.b, 0.0)), a.b, color); color = mad(tex2Dlevel0(colorMapL, texcoord + float2( o.a, 0.0)), a.a, color); #else float4 C = tex2Dlevel0(colorMap, texcoord); float4 Cleft = tex2Doffset(colorMap, texcoord, -float2(1.0, 0.0)); float4 Ctop = tex2Doffset(colorMap, texcoord, -float2(0.0, 1.0)); float4 Cright = tex2Doffset(colorMap, texcoord, float2(1.0, 0.0)); float4 Cbottom = tex2Doffset(colorMap, texcoord, float2(0.0, 1.0)); color = mad(lerp(C, Ctop, a.r), a.r, color); color = mad(lerp(C, Cbottom, a.g), a.g, color); color = mad(lerp(C, Cleft, a.b), a.b, color); color = mad(lerp(C, Cright, a.a), a.a, color); #endif // Normalize the resulting color and we are finished! return color / sum; } else { return tex2Dlevel0(colorMap, texcoord); } } /** * Time for some techniques! */ technique ColorEdgeDetection { pass ColorEdgeDetection { VertexShader = null; PixelShader = compile ps_3_0 ColorEdgeDetectionPS(); ZEnable = false; SRGBWriteEnable = false; // We will be creating the stencil buffer for later usage. StencilEnable = true; StencilPass = REPLACE; StencilRef = 1; } } technique DepthEdgeDetection { pass DepthEdgeDetection { VertexShader = null; PixelShader = compile ps_3_0 DepthEdgeDetectionPS(); ZEnable = false; SRGBWriteEnable = false; // We will be creating the stencil buffer for later usage. StencilEnable = true; StencilPass = REPLACE; StencilRef = 1; } } technique BlendWeightCalculation { pass BlendWeightCalculation { VertexShader = null; PixelShader = compile ps_3_0 BlendWeightCalculationPS(); ZEnable = false; SRGBWriteEnable = false; // Here we want to process only marked pixels. StencilEnable = true; StencilPass = KEEP; StencilFunc = EQUAL; StencilRef = 1; } } technique NeighborhoodBlending { pass NeighborhoodBlending { VertexShader = null; PixelShader = compile ps_3_0 NeighborhoodBlendingPS(); ZEnable = false; SRGBWriteEnable = true; // Here we want to process only marked pixels. StencilEnable = true; StencilPass = KEEP; StencilFunc = EQUAL; StencilRef = 1; } } by the way: GPU Pro 2 Example Programs and Source Code Regards, Robert Quote Everything should be made as simple as possible, but not one bit simpler. by Albert Einstein PhysX Preview of Cloth, Fluid and Soft Body A8.47.1P

Entire Thread
Subject	Posted By	Posted
MLAA (Morphological Anti-Aliasing)	rojart	10/28/10 13:54
Re: MLAA (Morphological Anti-Aliasing)	Hummel	10/28/10 15:13
Re: MLAA (Morphological Anti-Aliasing)	Toast	10/28/10 15:22
Re: MLAA (Morphological Anti-Aliasing)	Hummel	10/28/10 16:15
Re: MLAA (Morphological Anti-Aliasing)	Joey	10/28/10 18:22
Re: MLAA (Morphological Anti-Aliasing)	Joozey	10/28/10 20:56
Re: MLAA (Morphological Anti-Aliasing)	Quad	10/28/10 21:17
MLAA on the GPU using Direct3D9 (DEMO)	rojart	10/31/10 01:19
Re: MLAA on the GPU using Direct3D9 (DEMO)	Nowherebrain	10/31/10 06:12
Re: MLAA on the GPU using Direct3D9 (DEMO)	Joey	12/05/10 10:39
Re: MLAA on the GPU using Direct3D9 (DEMO)	rojart	12/05/10 11:28
Re: MLAA on the GPU using Direct3D9 (DEMO)	Joey	12/05/10 11:52
Re: MLAA on the GPU using Direct3D9 (DEMO)	HeelX	12/05/10 13:11
Re: MLAA on the GPU using Direct3D9 (DEMO)	Joey	12/05/10 13:32
Re: MLAA on the GPU using Direct3D9 (DEMO)	HeelX	12/05/10 15:18
Re: MLAA on the GPU using Direct3D9 (DEMO)	Hummel	12/05/10 20:14
Re: MLAA on the GPU using Direct3D9 (DEMO)	rojart	12/07/10 22:57
Re: MLAA on the GPU using Direct3D9 (DEMO)	rojart	01/30/11 20:16
Re: MLAA on the GPU using Direct3D9 (DEMO)	rojart	03/07/11 00:49
Re: MLAA on the GPU using Direct3D9 (DEMO)	1234David	01/07/15 18:36
Re: MLAA on the GPU using Direct3D9 (DEMO)	Kartoffel	01/07/15 19:52
Re: MLAA (Morphological Anti-Aliasing)	Toast	10/28/10 21:31
Re: MLAA (Morphological Anti-Aliasing)	jcl	10/29/10 13:38
Re: MLAA (Morphological Anti-Aliasing)	painkiller	10/29/10 18:00
Re: MLAA (Morphological Anti-Aliasing)	Hummel	10/29/10 18:12
Re: MLAA (Morphological Anti-Aliasing)	HeelX	10/29/10 19:33

Moderated by aztec, Spirit