8 commits

Author SHA1 Message Date
Tyler Stachecki
10d32ce427 Optimize FPU operations somewhat. 2015-01-27 10:27:13 -05:00
Tyler Stachecki
10a5983c0c Add support for SSE4 FPU acceleration.
0d4a5de2f6 is wrong; we can take
advantage of SSE4 rounding intrinsics.
2014-11-16 14:06:34 -05:00
Tyler Stachecki
061a04e216 Change width of fpu_state_t for x86_64.
gcc (and probably other compilers) don't like working with 16-bit
types and will zero-extend where needed. Save some overhead and
just store the state as a 32-bit type.
2014-11-15 15:44:04 -05:00
Tyler Stachecki
a0f1eb5d7c Move intrinsics to a common location. 2014-11-09 18:51:54 -05:00
Tyler Stachecki
1513f3cac2 arch/x86_64: Prefer _mm_set_s* over _mm_load_s*. 2014-11-09 18:27:14 -05:00
Tyler Stachecki
38ba3f7893 Fix x86_64 build errors in last commits. 2014-08-21 12:24:08 -04:00
Tyler Stachecki
ced4183158 More restructuring of arch/ and os/ code. 2014-08-18 16:54:01 +00:00
Tyler Stachecki
3e8ba50851 Add common arch/ folder, move headers out of os/.
Much of the architecture-specific code uses compiler-agnostic
intrinsics. For this reason, split it out into an arch/ folder,
leaving only the compiler and environment-specific code in os/.
2014-08-18 16:08:45 +00:00
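
Note on 1513f3cac2 (prefer _mm_set_s* over _mm_load_s*): the commit message does not state the motivation. One plausible reason is that _mm_set_ss takes its operand by value, so the compiler can keep it in a register, whereas _mm_load_ss takes a pointer and so requires the operand to have an address. The sketch below contrasts the two forms. The helper names add_with_set and add_with_load are hypothetical and do not come from the repository, and the plain-C fallback exists only so the example builds on targets without SSE.

```c
#if defined(__SSE__) || defined(__x86_64__) || defined(_M_X64)
#include <xmmintrin.h>
#define HAVE_SSE 1
#else
#define HAVE_SSE 0
#endif

/* Hypothetical helper: scalar add using the value-based _mm_set_ss. */
static float add_with_set(float a, float b) {
#if HAVE_SSE
  __m128 va = _mm_set_ss(a);   /* low lane = a, upper lanes zeroed */
  __m128 vb = _mm_set_ss(b);
  return _mm_cvtss_f32(_mm_add_ss(va, vb));
#else
  return a + b;                /* fallback for non-SSE targets */
#endif
}

/* Hypothetical helper: same operation using the pointer-based _mm_load_ss. */
static float add_with_load(float a, float b) {
#if HAVE_SSE
  __m128 va = _mm_load_ss(&a); /* takes the address of the operand */
  __m128 vb = _mm_load_ss(&b);
  return _mm_cvtss_f32(_mm_add_ss(va, vb));
#else
  return a + b;                /* fallback for non-SSE targets */
#endif
}
```

Both helpers return the same value for the same inputs; the difference is only in how the operand reaches the vector register, which an optimizing compiler may or may not erase.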