mirror of https://github.com/Ryujinx/Ryujinx.git synced 2024-12-03 16:12:00 +00:00
Ryujinx/ARMeilleure
jduncanator 68e15c1a74
Implement Fast Paths for most A32 SIMD instructions (#952)
* Begin work on A32 SIMD Intrinsics

* More instructions, some cleanup.

* Intrinsics for Move instructions (zip etc)

These pass the existing tests.

* Intrinsics for some of Cvt

While doing this I noticed that the conversion for int/fp was incorrect
in the slow path. I'll fix this in the original repo.

* Intrinsics for more Arithmetic instructions.

* Intrinsics for Vext

* Fix VEXT Intrinsic for double words.

* Use InsertPs to move scalar values.

* Cleanup, fix VPADD.f32 and VMIN signed integer.

* Cleanup, add SSE2 support for scalar insert.

Works similarly to the IR scalar insert, but obviously this one works
directly on V128.

* Minor cleanup.

* Enable intrinsic for FP64 to integer conversion.

* Address feedback apart from splitting out intrinsic float abs

Also: treat bad VREV encodings as undefined rather than throwing in translation.

* Move float abs to helper, fix bug with cvt

* Rename opc2 & 3 to match A32 docs, use ArgumentOutOfRangeException appropriately.

* Get name of variable at compile time rather than using a string literal.

* Use correct double sign mask.
2020-03-05 11:41:33 +11:00
Name | Last commit | Date
CodeGen | Implement Fast Paths for most A32 SIMD instructions (#952) | 2020-03-05 11:41:33 +11:00
Common | Implemented fast paths for: (#841) | 2019-12-07 13:45:32 +01:00
Decoders | Implement Fast Paths for most A32 SIMD instructions (#952) | 2020-03-05 11:41:33 +11:00
Diagnostics | Add most of the A32 instruction set to ARMeilleure (#897) | 2020-02-24 08:20:40 +11:00
Instructions | Implement Fast Paths for most A32 SIMD instructions (#952) | 2020-03-05 11:41:33 +11:00
IntermediateRepresentation | Implement Fast Paths for most A32 SIMD instructions (#952) | 2020-03-05 11:41:33 +11:00
Memory | Add a GetSpan method to the memory manager and use it on GPU (#877) | 2020-01-13 10:27:50 +11:00
State | Add SSAT, SSAT16, USAT and USAT16 ARM32 instructions (#954) | 2020-03-01 07:51:55 +11:00
Translation | Add most of the A32 instruction set to ARMeilleure (#897) | 2020-02-24 08:20:40 +11:00
ARMeilleure.csproj | Render Profiler in GUI (#854) | 2020-02-06 11:25:47 +00:00
Optimizations.cs | Fix Fcmge_S/V & Fcmgt_S/V Inst.s (#815) | 2019-11-09 23:21:03 -03:00
Statistics.cs | Add a new JIT compiler for CPU code (#693) | 2019-08-08 21:56:22 +03:00