Francisco Zurita
cc6b8833e3
Add libretro NEON optimizations
credits: https://github.com/libretro/parallel-n64/tree/master/mupen64plus-rsp-cxd4
2017-03-04 23:36:21 -05:00
Francisco Zurita
e86432df61
Update to latest CXD4
2016-07-28 08:27:07 -04:00
Gillou68310
8796295a2c
Merge commit '73232513e7889c82f86fd77f81ac6a060fe7d828'
2015-11-10 11:57:18 +01:00
no
9e0328f45b
Fix GNU assembler syntax errors by prefixing vcr's with cf_.
2015-01-30 14:16:55 -05:00
unknown
2d1887de16
k.
2015-01-29 11:31:35 -05:00
unknown
c90be1f99c
enforcing unsigned types for bit masks and bit-sensitive work
2015-01-29 08:58:37 -05:00
unknown
14ff3d4878
fixed SIMD::SSE macros for bi-compatibility with __m128i and arrays
2015-01-28 13:02:56 -05:00
unknown
fcc46e7845
Avoid Unix name collisions with RSP vector function exports.
2015-01-27 22:53:15 -05:00
unknown
95cf462dfb
force dummy buffer allocations for LWC2, native SWC2 wraparound
2015-01-21 15:10:18 -05:00
unknown
9c49dc4fff
abolish SSSE3 configurator for byte-wise shuffling
2014-12-13 16:34:48 -05:00
unknown
699896f677
install new pointer types to distinguish mem. reference from decl.'s
2014-12-08 23:47:50 -05:00
unknown
ac4fb238da
temporarily supporting the SSSE3 superset for early experiments
2014-11-16 15:46:54 -05:00
unknown
2e1e9edf75
cut SHUFFLE_VECTOR to only 2 arguments with pre-loaded VT
2014-10-17 02:23:08 -04:00
unknown
f05e2d603e
globalize the shuffle macro for a future shot at SMC in su.c
2014-10-17 00:50:30 -04:00
unknown
158a4d0b60
pass only 2 XMM operands, w/ no return slot ifndef ARCH_MIN_SSE2
2014-10-16 00:43:37 -04:00
unknown
e8e87ce602
some new flexible intrinsic macros for vector operations
2014-10-14 17:59:00 -04:00
unknown
91ba902637
removed extra load/store from old scalar SHUFFLE_VECTOR template
2014-10-14 04:53:49 -04:00
unknown
7d80d7d115
fix macro re-definition warning in GCC (already in my_types.h)
2014-10-10 01:38:41 -04:00
unknown
f1481dd39b
restructured modular layout of the source, dropped some optional features
2014-10-09 16:45:55 -04:00
unknown
d5692be247
dissolved VU arguments into higher-level SIMD vector call stacks
2014-10-07 00:33:15 -04:00
Sven Eckelmann
4392ebde42
Add an explicit public domain dedication statement
The process of dedicating a work to the public domain differs between legal
systems. Different rights may be given away depending on the origin, and in
some jurisdictions a public domain dedication is not possible at all. CC0
provides an explicit way to waive all rights, so that other parties can still
use this work safely [1], and it provides a fallback when parts of the license
are judged invalid in a jurisdiction.
[1] http://creativecommons.org/about/cc0
2013-12-17 13:36:56 +01:00
unknown
0b8b2ad900
restored old (surprisingly faster) small VU operand allocation
2013-11-26 13:22:37 -05:00
unknown
cd77984576
manual override of GCC's slower decode of SA IW bitmask
2013-10-11 02:27:18 -04:00
unknown
56c5ac9daf
BIG speed-up by moving shuffles out of EX queue, into VU ops
2013-10-11 00:40:41 -04:00
unknown
4c2b671d53
deprecated 2-D opcode-element vector jump table
2013-10-10 23:04:07 -04:00
unknown
f0b8985bda
moved SSE2 declare macro to MAKE/GCC command script
2013-10-07 23:37:06 -04:00
unknown
9d1e64e935
moved everything about RSP flags to new SSE2-hybrid header
2013-10-03 21:06:41 -04:00
unknown
342488f056
integrated signed clamp with VADD/VSUB
2013-09-23 14:29:18 -04:00
unknown
d29cbbe3e2
more MAC micro-optimizations, split clamping to new header
2013-09-23 03:30:24 -04:00
unknown
c427e052fe
purified vectors for complex RSP clip selects, fix VCL
2013-09-22 04:39:23 -04:00
unknown
5c1ab5806b
unify shuffles to new one-dimensional vector op-code jumping
2013-09-21 18:16:13 -04:00
unknown
d064b82976
split shuffle-related stuff to its own header
2013-09-20 23:49:05 -04:00
unknown
70a45f8bd7
fix Visual Studio interpretation of VR "re-def"
2013-09-20 15:59:27 -04:00
unknown
d757e41676
force 16-byte alignment, divide clamp base formula to each op
2013-09-20 11:56:10 -04:00
unknown
bbebbcc81a
uninstalled dynamic vector/scalar coefficient global
2013-09-20 00:36:26 -04:00
unknown
3c96c25950
fix compiler static over-optimized SSE2 build faults
2013-09-19 20:14:03 -04:00
unknown
485a04bc60
divide group shuffling now completely vectorized
2013-09-18 22:00:00 -04:00
unknown
fce5402ba1
started SSE2 intrinsics dynamic inclusion for shuffling
2013-09-18 01:41:36 -04:00
unknown
12aa9cf331
forgot to in-line SIGNED_CLAMP, other tech fixes
2013-09-17 03:40:15 -04:00
unknown
76f8358287
try to amend vector intelligence for GCC 4.8.1 upgrade
2013-09-17 02:26:32 -04:00
unknown
b176173f55
reconstructed scalar 64-bit segmentations as 16-bit SSE steps
2013-09-16 22:33:30 -04:00
unknown
9e260a641a
restored old static vector clamping for VADD/VSUB
2013-09-16 03:01:24 -04:00
unknown
7c791ae5ba
$vco is now vectorized. (And that's the last of them.)
2013-09-15 02:19:59 -04:00
unknown
b81fb83d53
$vcc is now vectorized.
2013-09-14 19:31:22 -04:00
unknown
ea759d0db6
$vce is now vectorized.
2013-09-14 14:14:50 -04:00
unknown
ca343c7900
compress signed clamping vectorizer
2013-09-14 13:31:49 -04:00
unknown
1192748b33
amend storage ambiguities to the compiler's vectorizer
2013-09-14 03:35:32 -04:00
unknown
8e69fa7b8b
fix unused function/structure warnings
2013-09-13 17:27:37 -04:00
unknown
dfb3b6371a
obvious bug is obvious.
2013-09-13 17:22:42 -04:00
unknown
49bd94cd9f
completely vectorized all accumulator R/W
2013-09-13 15:04:06 -04:00