Henrik Rydgård
|
468b830bec
|
Show IR disassembly in JIT Compare screen
|
2018-01-04 12:23:23 +01:00 |
|
Unknown W. Brackets
|
d8d174fa2b
|
arm64jit: Avoid spilling an extra reg for lwl/lwr.
It's only needed for swl and swr.
|
2018-01-01 08:38:10 -08:00 |
|
Unknown W. Brackets
|
8ffb0101fe
|
jit: Report blocks with uneaten VFPU prefixes.
There may be options to avoid, like continuing these blocks, especially if
they're likely or something.
|
2018-01-01 08:38:10 -08:00 |
|
Unknown W. Brackets
|
9ff812b313
|
arm64jit: Negate in ADDI2R/SUBI2R as well.
Should've done this at the same time as CMN. It's not as common, mostly
catches addu calls, but it's good to have these generic for other uses.
|
2017-12-30 11:11:04 -08:00 |
|
Unknown W. Brackets
|
ae63628360
|
arm64jit: Statically allocate ra as well.
This doesn't seem to have a significant impact on performance, but it
improves bloat by about 5%.
|
2017-12-30 11:11:03 -08:00 |
|
Unknown W. Brackets
|
89cbf36611
|
arm64jit: Free up W23 for static alloc.
We shouldn't always reserve W23 for this uncommon case.
|
2017-12-30 07:51:27 -08:00 |
|
Unknown W. Brackets
|
e7ac672522
|
arm64jit: Cleanup method names, temp discard.
This way MapDirtyIn won't accidentally discard temps.
|
2017-12-30 07:51:27 -08:00 |
|
Unknown W. Brackets
|
0fc8274ec4
|
arm64jit: Enable safe memory for lwl/lwr.
|
2017-12-29 17:30:18 -08:00 |
|
Unknown W. Brackets
|
c00044c5d8
|
arm64jit: Avoid arithmetic movs.
ORR is the preferred encoding and may be faster on some chips.
|
2017-12-29 17:30:18 -08:00 |
|
Unknown W. Brackets
|
98ed6fab3f
|
arm64jit: Fix spilling for more than one temp reg.
Otherwise we hang trying to spill the same reg over and over.
|
2017-12-29 17:30:17 -08:00 |
|
Unknown W. Brackets
|
ee236743f0
|
arm64jit: Use TBZ/TBNZ for vfpu branch as well.
|
2017-12-29 17:30:16 -08:00 |
|
Unknown W. Brackets
|
3b4917a308
|
arm64jit: Use TBZ/TBNZ for fp branches.
|
2017-12-29 17:30:15 -08:00 |
|
Unknown W. Brackets
|
c71285c970
|
arm64jit: Use CBZ/CBNZ for zero compare branches.
These are pretty common, so it reduces bloat decently. Seems about the
same speed, though.
|
2017-12-29 17:30:15 -08:00 |
|
Unknown W. Brackets
|
7f8a871e30
|
arm64jit: Handle more imm compare cases.
|
2017-12-29 17:30:14 -08:00 |
|
Unknown W. Brackets
|
56d64f5c67
|
arm64jit: Avoid temporary on variable shift.
I think we should trust that it works per the spec.
|
2017-12-29 17:30:12 -08:00 |
|
Unknown W. Brackets
|
1ecce2a2e1
|
arm64jit: Reuse code in I2R funcs.
|
2017-12-29 17:30:07 -08:00 |
|
Unknown W. Brackets
|
2498ce5e3e
|
arm64jit: Oops, properly init temp locked flag.
Fixes #10469.
|
2017-12-29 14:36:18 -08:00 |
|
Unknown W. Brackets
|
5177db0f91
|
arm64jit: Remove unnecessary address masking.
We use views like on x86_64, so this isn't needed.
|
2017-12-28 23:58:30 -08:00 |
|
Unknown W. Brackets
|
27116dcb86
|
arm64jit: Avoid flushing when mapping as pointer.
|
2017-12-28 16:04:34 -08:00 |
|
Unknown W. Brackets
|
1b1e2c773b
|
arm64jit: Jit lwl/lwr with proper temp regs.
It's possible rt might overlap with w9/w10, so we really need to allocate
these properly. This locks and spills as necessary.
|
2017-12-28 15:54:03 -08:00 |
|
Unknown W. Brackets
|
970326c9e5
|
arm64jit: Fix and enable imm lwl/lwr.
|
2017-12-28 14:49:55 -08:00 |
|
Unknown W. Brackets
|
1b792c32e1
|
arm64jit: Attempt to reuse imms on sw/etc.
Mostly, this handles the zero case, but it may help in other cases too.
|
2017-12-28 12:32:12 -08:00 |
|
Unknown W. Brackets
|
08e85d0cd4
|
arm64jit: Autodetect pointerify support.
Also, re-enable static alloc without pointerify, it works now.
|
2017-12-28 10:48:55 -08:00 |
|
Unknown W. Brackets
|
2e1d85a55b
|
arm64jit: Allow reg ptr offsets when unaligned.
Since now they support being dirty.
|
2017-12-28 10:45:50 -08:00 |
|
Unknown W. Brackets
|
c4c28282cf
|
arm64jit: Allow ARMREG_AS_PTR to be dirty.
Since we can just add/sub, then it should be valid (as long as we only
offset it when dirty.)
|
2017-12-28 10:40:31 -08:00 |
|
Unknown W. Brackets
|
6fd17fb026
|
arm64jit: Use reg sum for LDR/STR.
Skips an add, and should be less ops anyway.
|
2017-12-28 10:19:55 -08:00 |
|
Unknown W. Brackets
|
092f98d313
|
arm64jit: Fix an integer truncation warning.
|
2017-12-27 19:39:04 -08:00 |
|
Unknown W. Brackets
|
257a4fdd12
|
arm64jit: Reprotect fixed code after icache flush.
|
2017-12-27 19:33:04 -08:00 |
|
Unknown W. Brackets
|
7c2fc90def
|
arm64jit: Avoid MOVK elsewhere without pointerify.
|
2017-12-27 17:57:19 -08:00 |
|
Unknown W. Brackets
|
d82efc4b0b
|
arm64jit: Allow static alloc without pointerify.
|
2017-12-27 17:50:15 -08:00 |
|
Unknown W. Brackets
|
9573a791b4
|
arm64jit: Skip storing spilled but not dirty.
Unless IMM, we don't need to store non-dirty mapped regs.
|
2017-12-27 17:15:18 -08:00 |
|
Unknown W. Brackets
|
3fae092ecb
|
arm64jit: Only adjust pointers if pointerified.
|
2017-12-27 17:02:29 -08:00 |
|
Unknown W. Brackets
|
cccf448ae0
|
arm64jit: Allow disabling pointerification.
For platforms where we can't get base aligned.
|
2017-12-27 17:02:19 -08:00 |
|
Henrik Rydgård
|
0207739d76
|
Can't call functions through known-nil pointers, even if they don't touch local data - LLVM's optimizer might have done something stupid.
|
2017-11-30 01:07:03 +01:00 |
|
Henrik Rydgård
|
87942dd741
|
ARM64 JIT: Fix additional VFPU prefix problems
|
2017-11-24 14:05:19 +01:00 |
|
Henrik Rydgård
|
2cceba41bc
|
Fix JIT bug in ARM64. Fixes #10183
|
2017-11-24 13:57:27 +01:00 |
|
Henrik Rydgård
|
8fe171253d
|
Disable lwl/lwr on ARM64 JIT again. Seems b9b2656e93 broke things, see #10020
|
2017-11-08 12:11:55 +01:00 |
|
Henrik Rydgård
|
22e65ba80d
|
Get rid of ugly alignment macros and some other cruft, we now have alignas(16) from C++11
|
2017-08-31 01:14:51 +02:00 |
|
Henrik Rydgård
|
b9b2656e93
|
More vulkan microoptimizations. Add more profiler scopes.
|
2017-08-18 13:48:11 +02:00 |
|
Henrik Rydgård
|
ed776d8c0b
|
ARM: Delete obsolete comments and check
|
2017-08-18 13:48:11 +02:00 |
|
Unknown W. Brackets
|
33b073c545
|
Jit: Fix syscall outside delay slot.
Doesn't happen in real games, but useful in debug code.
|
2017-06-04 10:39:01 -07:00 |
|
Henrik Rydgård
|
0ec1e5e3b2
|
Don't erase and rewrite the dispatcher when the cache is cleared. Fixes #9708
|
2017-05-26 15:48:03 +02:00 |
|
Henrik Rydgård
|
a769724fd0
|
Fix old bug breaking "non-fast" memory on ARM64. May help #9477, also see #9488
|
2017-03-23 16:53:16 +01:00 |
|
Henrik Rydgard
|
b4740a2bca
|
Fix bug where dispatcher would not check core state directly after Advance. Fixes #9398 properly (it was previously hidden somehow with extra backbuffer binds).
|
2017-03-14 12:32:20 +01:00 |
|
Henrik Rydgård
|
62dcb9c70c
|
Log if Comp_SysCall encounters bad syscall instructions
|
2017-03-10 00:13:06 +01:00 |
|
Henrik Rydgård
|
e74749f2b2
|
A function renamed, some logging improvements
|
2017-03-10 00:11:00 +01:00 |
|
Henrik Rydgard
|
d68953c18c
|
Buildfixes
|
2017-03-02 13:29:51 +01:00 |
|
Henrik Rydgård
|
c4bac2d379
|
ABI: Switch to paired load/stores of floats too
|
2017-01-27 14:56:38 +01:00 |
|
Henrik Rydgård
|
e1bae9aa67
|
Follow the ARM64 ABI better (update the frame pointer).
|
2017-01-27 14:19:03 +01:00 |
|
Henrik Rydgård
|
635b2ada43
|
Remove a function that didn't make a lot of sense.
|
2017-01-26 09:50:16 +01:00 |
|