Commit graph

273 commits

Author SHA1 Message Date
unknown
2b073a276d all shuffling for multiplies now in SSE2 2013-09-18 03:02:36 -04:00
unknown
fce5402ba1 started SSE2 intrinsics dynamic inclusion for shuffling 2013-09-18 01:41:36 -04:00
unknown
12aa9cf331 forgot to in-line SIGNED_CLAMP, other tech fixes 2013-09-17 03:40:15 -04:00
unknown
76f8358287 try to amend vector intelligence for GCC 4.8.1 upgrade 2013-09-17 02:26:32 -04:00
unknown
af2c59b4b9 quieten integer overflow warning with gcc -pedantic 2013-09-16 22:45:00 -04:00
unknown
b176173f55 reconstructed scalar 64-bit segmentations as 16-bit SSE steps 2013-09-16 22:33:30 -04:00
unknown
9e260a641a restored old static vector clamping for VADD/VSUB 2013-09-16 03:01:24 -04:00
unknown
25cdf537a6 applied remaining vector microoptimizations 2013-09-16 02:38:14 -04:00
unknown
8b9712aefd More stupid wrong assumptions about signedness. Die. 2013-09-16 02:23:07 -04:00
unknown
d8d3c7ce90 in-depth vector microoptimizations to VMUDN SSE 2013-09-16 02:18:54 -04:00
unknown
4a003f47c2 fix wrong assumption on sign if corner case 0x8000*0x8000 2013-09-16 02:15:46 -04:00
unknown
3f370bf8ea vectorization boost and corner case fix to VMULF 2013-09-16 01:52:17 -04:00
unknown
28e1081ef3 corrected sign mask set in VACC hi (Mario, SSB gfx) 2013-09-15 21:53:00 -04:00
unknown
7c791ae5ba $vco is now vectorized. (And that's the last of them.) 2013-09-15 02:19:59 -04:00
unknown
b81fb83d53 $vcc is now vectorized. 2013-09-14 19:31:22 -04:00
unknown
ea759d0db6 $vce is now vectorized. 2013-09-14 14:14:50 -04:00
unknown
ca343c7900 compress signed clamping vectorizer 2013-09-14 13:31:49 -04:00
unknown
1192748b33 amend storage ambiguities to the compiler's vectorizer 2013-09-14 03:35:32 -04:00
unknown
675bddbb76 fix almost every possible remaining warning 2013-09-13 17:51:16 -04:00
unknown
29d2c8f278 useless warnings about call-parameter type mismatches 2013-09-13 17:38:23 -04:00
unknown
8e69fa7b8b fix unused function/structure warnings 2013-09-13 17:27:37 -04:00
unknown
dfb3b6371a obvious bug is obvious. 2013-09-13 17:22:42 -04:00
unknown
49bd94cd9f completely vectorized all accumulator R/W 2013-09-13 15:04:06 -04:00
unknown
0f98ed926f refactored old buggy VCH writeback to VCC 2013-09-12 03:06:38 -04:00
unknown
52dc51b015 forgot to include sign when checking le/ge, reversed ternary 2013-09-11 22:55:13 -04:00
unknown
4d4773175b accidentally wrote VCO instead of VCC 2013-09-11 11:48:23 -04:00
unknown
2341bd39fc potential debugger hazard when testing old functions 2013-09-11 09:19:29 -04:00
unknown
1aa5dfe5fd fixed misprioritization of 64-bit type conversion 2013-09-11 08:36:01 -04:00
unknown
68bd431a63 full static rewrite of vector unit 2013-09-11 06:58:09 -04:00
unknown
64311fb516 more Notepad++ typo suggestions 2013-09-02 19:21:45 -04:00
RJ Swedlow
46082660bc support for more pedantic, ANSI C rules 2013-08-26 23:03:02 -04:00
RJ Swedlow
be355e4190 ANSI GNU /Wall strict compliance updates 2013-08-26 21:19:12 -04:00
RJ Swedlow
c55651c805 finished scalar unit jump table prototype 2013-08-23 02:01:43 -04:00
RJ Swedlow
a122360a7e static reduction of vector control register jump tables 2013-08-17 02:10:13 -04:00
RJ Swedlow
e9481816f4 no more USER32/SHELL32 dependencies, plain CRT for messages 2013-08-14 15:48:35 -04:00
RJ Swedlow
7d6c1417c6 safer intrinsics to guard against bugged MinGW arch code gen 2013-08-09 14:42:19 -04:00
RJ Swedlow
36c6d2d031 fixed a corner case for overflow clamps 2013-08-08 23:28:53 -04:00
RJ Swedlow
e87066933e uh, forgot the sperm 2013-08-08 22:52:19 -04:00
RJ Swedlow
410e27d2fc found vector ops where new static clamp can apply 2013-08-08 22:30:53 -04:00
RJ Swedlow
16ee596312 C inline functions compatible with Visual Studio 2013-08-08 20:23:22 -04:00
RJ Swedlow
a635542af6 discovered an old SSE intrinsic for corner case 2013-08-08 18:07:56 -04:00
RJ Swedlow
2f8751e15b faster vector sign clamps 2013-08-08 17:27:52 -04:00
RJ Swedlow
d98b5aac64 new macro option EMULATE_VECTOR_RESULT_BUFFER 2013-05-15 11:19:18 -04:00
RJ Swedlow
34dcc48a53 repairs to manual dynamic vector parallelization mode 2013-05-13 21:35:17 -04:00
RJ Swedlow
8a6aa39a18 finished implementing parallel shuffle macros 2013-05-13 20:12:37 -04:00
RJ Swedlow
bebd498375 starting to implement shuffling in master template 2013-05-10 12:17:37 -04:00
RJ Swedlow
b54d1785d5 prototype solution to shuffle vector scalars to coefficient 2013-05-09 23:18:40 -04:00
RJ Swedlow
f1ff598a14 clean up the crap 2013-04-13 22:53:28 -05:00
RJ Swedlow
7c8c574509 momentary reversion to bitwise search method, seems better 2013-04-11 21:54:11 -05:00
RJ Swedlow
0aad6e82d5 inline function method to maintain clamping functions 2013-04-11 18:50:05 -05:00