2019-10-13 07:02:07 +01:00
namespace Ryujinx.Graphics.Shader.Translation
New shader translator implementation (#654)
* Start implementing a new shader translator
* Fix shift instructions and a typo
* Small refactoring on StructuredProgram, move RemovePhis method to a separate class
* Initial geometry shader support
* Implement TLD4
* Fix -- There's no negation on FMUL32I
* Add constant folding and algebraic simplification optimizations, nits
* Some leftovers from constant folding
* Avoid cast for constant assignments
* Add a branch elimination pass, and misc small fixes
* Remove redundant branches, add expression propagation and other improvements on the code
* Small leftovers -- add missing break and continue, remove unused properties, other improvements
* Add null check to handle empty block cases on block visitor
* Add HADD2 and HMUL2 half float shader instructions
* Optimize pack/unpack sequences, some fixes related to half float instructions
* Add TXQ, TLD, TLDS and TLD4S shader texture instructions, and some support for bindless textures, some refactoring on codegen
* Fix copy paste mistake that caused RZ to be ignored on the AST instruction
* Add workaround for conditional exit, and fix half float instruction with constant buffer
* Add missing 0.0 source for TLDS.LZ variants
* Simplify the switch for TLDS.LZ
* Texture instructions related fixes
* Implement the HFMA instruction, and some misc. fixes
* Enable constant folding on UnpackHalf2x16 instructions
* Refactor HFMA to use OpCode* for opcode decoding rather than on the helper methods
* Remove the old shader translator
* Remove ShaderDeclInfo and other unused things
* Add dual vertex shader support
* Add ShaderConfig, used to pass shader type and maximum cbuffer size
* Move and rename some instruction enums
* Move texture instructions into a separate file
* Move operand GetExpression and locals management to OperandManager
* Optimize opcode decoding using a simple list and binary search
* Add missing condition for do-while on goto elimination
* Misc. fixes on texture instructions
* Simplify TLDS switch
* Address PR feedback, and a nit
2019-04-18 00:57:08 +01:00
{
    static class AttributeConsts
    {
        public const int TessLevelOuter0 = 0x000;
        public const int TessLevelOuter1 = 0x004;
        public const int TessLevelOuter2 = 0x008;
        public const int TessLevelOuter3 = 0x00c;
        public const int TessLevelInner0 = 0x010;
        public const int TessLevelInner1 = 0x014;
        public const int Layer = 0x064;
        public const int PointSize = 0x06c;
        public const int PositionX = 0x070;
        public const int PositionY = 0x074;
        public const int PositionZ = 0x078;
        public const int PositionW = 0x07c;
        public const int ClipDistance0 = 0x2c0;
        public const int ClipDistance1 = 0x2c4;
        public const int ClipDistance2 = 0x2c8;
        public const int ClipDistance3 = 0x2cc;
        public const int ClipDistance4 = 0x2d0;
        public const int ClipDistance5 = 0x2d4;
        public const int ClipDistance6 = 0x2d8;
        public const int ClipDistance7 = 0x2dc;
        public const int PointCoordX = 0x2e0;
        public const int PointCoordY = 0x2e4;
        public const int TessCoordX = 0x2f0;
        public const int TessCoordY = 0x2f4;
        public const int InstanceId = 0x2f8;
        public const int VertexId = 0x2fc;
        public const int FrontFacing = 0x3fc;

        public const int UserAttributesCount = 32;
        public const int UserAttributeBase = 0x80;
        // Each user attribute spans 16 bytes (four 4-byte components).
        public const int UserAttributeEnd = UserAttributeBase + UserAttributesCount * 16;

        public const int LoadOutputMask = 1 << 30;
        public const int Mask = 0x3fffffff;

        // Note: These attributes are used only internally by the translator;
        // they don't exist on Maxwell.
        public const int SpecialMask = 0xf << 24;
        public const int FragmentOutputDepth = 0x1000000;
        public const int FragmentOutputColorBase = 0x1000010;
        public const int FragmentOutputColorEnd = FragmentOutputColorBase + 8 * 16;

        public const int FragmentOutputIsBgraBase = 0x1000100;
        public const int FragmentOutputIsBgraEnd = FragmentOutputIsBgraBase + 8 * 4;

        public const int ThreadIdX = 0x2000000;
        public const int ThreadIdY = 0x2000004;
        public const int ThreadIdZ = 0x2000008;

        public const int CtaIdX = 0x2000010;
        public const int CtaIdY = 0x2000014;
        public const int CtaIdZ = 0x2000018;

        public const int LaneId = 0x2000020;

        public const int InvocationId = 0x2000024;
        public const int PrimitiveId = 0x2000028;
        public const int PatchVerticesIn = 0x200002c;

        public const int EqMask = 0x2000030;
        public const int GeMask = 0x2000034;
        public const int GtMask = 0x2000038;
        public const int LeMask = 0x200003c;
        public const int LtMask = 0x2000040;

        public const int ThreadKill = 0x2000044;
    }
}