Henrik Rydgård
cd511cc43c
In Freeze-Frame mode, don't wipe GPU resources when loading save states.
2013-11-15 16:30:55 +01:00
Henrik Rydgård
f1ed1a9a17
Flush the lit pool before flushing the icache.
2013-11-15 11:14:24 +01:00
Henrik Rydgård
49f53a7ecd
Merge pull request #4543 from unknownbrackets/cpu-minor
...
Minor cpu correctness adjustments
2013-11-15 01:44:49 -08:00
Henrik Rydgård
bd23f91e0c
Flush the lit pool in the vtx dec
2013-11-15 10:34:27 +01:00
Unknown W. Brackets
fc3b3449ed
Enable frameskipping in the null gpu.
...
In case it's used (I used it to check cpu-only perf on Android.)
2013-11-14 23:37:48 -08:00
Henrik Rydgard
adde016338
vtxdec sw skin arm: Simplify code, try a slightly different approach.
2013-11-14 23:07:23 +01:00
Henrik Rydgard
b0ccf5981c
Don't bother with glDrawRangeElements, seems to not improve perf.
2013-11-14 17:33:43 +01:00
Henrik Rydgard
d6d54c8bb4
Fix a couple of bugs in vtxdec
2013-11-14 15:06:51 +01:00
Henrik Rydgard
dc03796ccb
The bone matrix optimization is only safe when software skinning
2013-11-14 15:02:22 +01:00
Henrik Rydgard
9521d0e0a6
NormalizeVertices should not try to skin if we've already software-skinned.
2013-11-14 14:34:47 +01:00
Henrik Rydgard
3b63ef7005
Remove the SubmitPrim param forceIndexType, optimize BBOX more.
2013-11-14 14:03:03 +01:00
Henrik Rydgard
8c562a615f
Bone matrix loading speedup. Bit of a hack but seems reliable enough.
2013-11-14 14:02:31 +01:00
Henrik Rydgard
35ae239eb9
Optimize bbox some more
2013-11-14 12:25:53 +01:00
Henrik Rydgard
4f93654a88
Oops, accidentally enabled some bbox debugging code
2013-11-14 11:49:06 +01:00
Henrik Rydgard
8a69543ec4
BBOX: Transform the planes by the matrix so we don't need to transform the box
2013-11-14 11:44:13 +01:00
Henrik Rydgard
871d8b28ab
Assorted vtxdec x86 optimizations
2013-11-14 11:44:13 +01:00
Sacha
eaa52c4a61
VertexDecoder: Aligned code seems to help Symbian (armv6) performance. It is also used for CPU JIT so there's likely a reason.
2013-11-14 15:30:07 +10:00
Henrik Rydgard
ff1d53c9c7
vtxdec x86: Don't load bones unless software skinning is enabled
2013-11-14 00:56:24 +01:00
Henrik Rydgard
a9b365aa71
Fix for possible misalignment crash on x86
2013-11-14 00:37:04 +01:00
Henrik Rydgård
a6ee2648d0
Merge pull request #4531 from hrydgard/software-skinning
...
Software skinning
2013-11-13 12:54:41 -08:00
Henrik Rydgård
9bbdd1907d
Kind-of optimized ARM software skinning (non-NEON)
2013-11-13 18:11:35 +01:00
Henrik Rydgård
821a2f10f8
Delete obsolete code
2013-11-13 18:10:58 +01:00
Henrik Rydgard
6221dbaf5d
Optimize software skinning for x86.
...
Can't seem to get a win on Windows vs hardware skinning though, even though
draw calls drop by 2/3rd...
2013-11-13 18:10:58 +01:00
Henrik Rydgard
9333d3ea76
Vtx dec jit: Combine the scale and offset registers to save 1 more xmm register.
2013-11-13 18:10:58 +01:00
Henrik Rydgard
f0cacf46d0
No reason to involve the FPU when loading matrices
2013-11-13 18:10:58 +01:00
Henrik Rydgard
6976d6a3a0
Enable the softskinning optimizations that let us merge drawcalls
2013-11-13 18:10:57 +01:00
Henrik Rydgard
179934ec9f
Decode step by step when sw skinning
2013-11-13 18:10:57 +01:00
Henrik Rydgard
46313ced55
Prepare transform pipeline for step by step decoding
2013-11-13 18:10:57 +01:00
Henrik Rydgard
4f78eda23b
Save a couple of registers in the x86 vertex decoder jit by SIMD-ing prescale UV
2013-11-13 18:10:57 +01:00
Henrik Rydgard
7e67476b00
Simple unoptimized software skinning.
...
Does not take advantage of the possible reduction in state changes yet.
2013-11-13 18:10:57 +01:00
Aapo Rantalainen
c2586523fe
Maemo5: added !defined(MAEMO) as !defined(MEEGO_EDITION_HARMATTAN)
2013-11-13 18:23:18 +02:00
Henrik Rydgård
ab3fe9ba86
Extract the software transform code into its own file.
2013-11-13 14:56:34 +01:00
Henrik Rydgård
67ca4419fe
Fix minor bug where we didn't clear dirtyUniforms if early-outing from UpdateUniform.
2013-11-13 10:02:58 +01:00
Sacha
04b338f39e
Buildfix for sse2 builds.
2013-11-13 12:08:46 +10:00
Henrik Rydgard
cf15ec8a53
Add BBOX support (very conservative test)
2013-11-12 17:06:03 +01:00
Henrik Rydgard
54217deb16
Speed up UpdateUniforms a little
2013-11-12 17:06:03 +01:00
Henrik Rydgard
4b98e0d6d6
Optimize LoadClut a little
2013-11-12 17:06:03 +01:00
Henrik Rydgard
84f20a1cad
Small optimizations
2013-11-12 14:05:50 +01:00
Sacha
aad70d8912
Symbian: Support software GPU. It seems to compile properly now. May allow Nokia 500 to run PPSSPP (albeit very slowly).
2013-11-12 16:20:52 +10:00
Henrik Rydgård
662ad32d4f
Fix and re-enable @unknownbracket's handcoded NEON tex hash
2013-11-11 17:12:38 +01:00
Henrik Rydgård
53800bd413
Disable the hand-coded NEON texture hash again. It's slow, for unknown reasons.
2013-11-11 14:39:29 +01:00
raven02
2bca62b26e
Don't reset texture width/height unless the size is different
2013-11-11 21:12:43 +08:00
raven02
d2546bed5b
Regression fix c69ac64
2013-11-11 08:53:47 +08:00
Unknown W. Brackets
1311d71455
softgpu: Fix reversed stencil test funcs.
...
LESS applies to the ref compared to the stencil, not the stencil compared
to the ref.
With this, stencil testing is finally being done properly in Star Ocean.
2013-11-10 03:21:01 -08:00
Unknown W. Brackets
2db98b8669
softgpu: Don't apply logic op to stencil value.
...
Also, they were not at all applying correctly.
2013-11-10 03:21:00 -08:00
Unknown W. Brackets
438928de40
softgpu: Apply stencil op even when depth disabled.
2013-11-10 03:21:00 -08:00
Unknown W. Brackets
5327ce80d5
softgpu: throughmode doesn't disable any tests.
2013-11-10 03:20:59 -08:00
Unknown W. Brackets
5751e0a0a4
softgpu: Simplify color clamping.
2013-11-10 03:20:58 -08:00
Unknown W. Brackets
21deaef35a
Invert gstate.getClearModeColorMask().
...
It makes more sense matching gstate.getColorMask().
2013-11-10 03:20:58 -08:00
Unknown W. Brackets
5817684e28
softgpu: Never write alpha, only stencil.
2013-11-10 03:20:57 -08:00