85 commits

Author SHA1 Message Date
Francisco Zurita
cc6b8833e3 Add libretro NEON optimizations
credits: https://github.com/libretro/parallel-n64/tree/master/mupen64plus-rsp-cxd4
2017-03-04 23:36:21 -05:00
Francisco Zurita
e86432df61 Update to latest CXD4 2016-07-28 08:27:07 -04:00
Gillou68310
8796295a2c Merge commit '73232513e7889c82f86fd77f81ac6a060fe7d828' 2015-11-10 11:57:18 +01:00
no
9e0328f45b Fix GNU assembler syntax errors by prefixing vcr's with cf_. 2015-01-30 14:16:55 -05:00
unknown
2d1887de16 k. 2015-01-29 11:31:35 -05:00
unknown
c90be1f99c enforcing unsigned types for bit masks and bit-sensitive work 2015-01-29 08:58:37 -05:00
unknown
14ff3d4878 fixed SIMD::SSE macros for bi-compatibility with __m128i and arrays 2015-01-28 13:02:56 -05:00
unknown
fcc46e7845 Avoid Unix name collisions with RSP vector function exports. 2015-01-27 22:53:15 -05:00
unknown
95cf462dfb force dummy buffer allocations for LWC2, native SWC2 wraparound 2015-01-21 15:10:18 -05:00
unknown
9c49dc4fff abolish SSSE3 configurator for byte-wise shuffling 2014-12-13 16:34:48 -05:00
unknown
699896f677 install new pointer types to distinguish mem. reference from decl.'s 2014-12-08 23:47:50 -05:00
unknown
ac4fb238da temporarily supporting the SSSE3 superset for early experiments 2014-11-16 15:46:54 -05:00
unknown
2e1e9edf75 cut SHUFFLE_VECTOR to only 2 arguments with pre-loaded VT 2014-10-17 02:23:08 -04:00
unknown
f05e2d603e globalize the shuffle macro for a future shot at SMC in su.c 2014-10-17 00:50:30 -04:00
unknown
158a4d0b60 pass only 2 XMM operands, w/ no return slot ifndef ARCH_MIN_SSE2 2014-10-16 00:43:37 -04:00
unknown
e8e87ce602 some new flexible intrinsic macros for vector operations 2014-10-14 17:59:00 -04:00
unknown
91ba902637 removed extra load/store from old scalar SHUFFLE_VECTOR template 2014-10-14 04:53:49 -04:00
unknown
7d80d7d115 fix macro re-definition warning in GCC (already in my_types.h) 2014-10-10 01:38:41 -04:00
unknown
f1481dd39b restructured modular layout of the source, dropped some optional features 2014-10-09 16:45:55 -04:00
unknown
d5692be247 dissolved VU arguments into higher-level SIMD vector call stacks 2014-10-07 00:33:15 -04:00
Sven Eckelmann
4392ebde42 Add an explicit public domain dedication statement
The process of dedicating a piece of work under the public domain is not the
same under different legal systems. It is possible that different rights are
given away depending on the origin. Sometimes even the dedication of the work
under public domain is not possible at all. CC0 tries to provide an explicit
way to waive all rights to still provide a secure way for other parties to use
this work [1] and provides a fallback when parts of the license may be judged
invalid under any jurisdiction.

[1] http://creativecommons.org/about/cc0
2013-12-17 13:36:56 +01:00
unknown
0b8b2ad900 restored old (surprisingly faster) small VU operand allocation 2013-11-26 13:22:37 -05:00
unknown
cd77984576 manual override of GCC's slower decode of SA IW bitmask 2013-10-11 02:27:18 -04:00
unknown
56c5ac9daf BIG speed-up by moving shuffles out of EX queue, into VU ops 2013-10-11 00:40:41 -04:00
unknown
4c2b671d53 deprecated 2-D opcode-element vector jump table 2013-10-10 23:04:07 -04:00
unknown
f0b8985bda moved SSE2 declare macro to MAKE/GCC command script 2013-10-07 23:37:06 -04:00
unknown
9d1e64e935 moved everything about RSP flags to new SSE2-hybrid header 2013-10-03 21:06:41 -04:00
unknown
342488f056 integrated signed clamp with VADD/VSUB 2013-09-23 14:29:18 -04:00
unknown
d29cbbe3e2 more MAC micro-optimizations, split clamping to new header 2013-09-23 03:30:24 -04:00
unknown
c427e052fe purified vectors for complex RSP clip selects, fix VCL 2013-09-22 04:39:23 -04:00
unknown
5c1ab5806b unify shuffles to new one-dimensional vector op-code jumping 2013-09-21 18:16:13 -04:00
unknown
d064b82976 split shuffle-related stuff to its own header 2013-09-20 23:49:05 -04:00
unknown
70a45f8bd7 fix Visual Studio interpretation of VR "re-def" 2013-09-20 15:59:27 -04:00
unknown
d757e41676 force 16-byte alignment, divide clamp base formula to each op 2013-09-20 11:56:10 -04:00
unknown
bbebbcc81a uninstalled dynamic vector/scalar coefficient global 2013-09-20 00:36:26 -04:00
unknown
3c96c25950 fix compiler static over-optimized SSE2 build faults 2013-09-19 20:14:03 -04:00
unknown
485a04bc60 divide group shuffling now completely vectorized 2013-09-18 22:00:00 -04:00
unknown
fce5402ba1 started SSE2 intrinsics dynamic inclusion for shuffling 2013-09-18 01:41:36 -04:00
unknown
12aa9cf331 forgot to in-line SIGNED_CLAMP, other tech fixes 2013-09-17 03:40:15 -04:00
unknown
76f8358287 try to amend vector intelligence for GCC 4.8.1 upgrade 2013-09-17 02:26:32 -04:00
unknown
b176173f55 reconstructed scalar 64-bit segmentations as 16-bit SSE steps 2013-09-16 22:33:30 -04:00
unknown
9e260a641a restored old static vector clamping for VADD/VSUB 2013-09-16 03:01:24 -04:00
unknown
7c791ae5ba $vco is now vectorized. (And that's the last of them.) 2013-09-15 02:19:59 -04:00
unknown
b81fb83d53 $vcc is now vectorized. 2013-09-14 19:31:22 -04:00
unknown
ea759d0db6 $vce is now vectorized. 2013-09-14 14:14:50 -04:00
unknown
ca343c7900 compress signed clamping vectorizer 2013-09-14 13:31:49 -04:00
unknown
1192748b33 amend storage ambiguities to the compiler's vectorizer 2013-09-14 03:35:32 -04:00
unknown
8e69fa7b8b fix unused function/structure warnings 2013-09-13 17:27:37 -04:00
unknown
dfb3b6371a obvious bug is obvious. 2013-09-13 17:22:42 -04:00
unknown
49bd94cd9f completely vectorized all accumulator R/W 2013-09-13 15:04:06 -04:00