luau/CodeGen/include/Luau
Arseny Kapoulkine 80928acb92
CodeGen: Extract all vector tag patching into TAG_VECTOR (#1171)
Instead of patching the tag component with TVECTOR in every instruction
that produces a vector value, we now use a separate IR instruction to do
this. This reduces implementation redundancy, but more importantly
allows for a class of optimizations:

- NUM_TO_VECTOR previously patched the component unconditionally but the
result was used only in MUL/DIV_VEC instructions that ignore it anyway;
we can now remove this.

- ADD_VEC et al can now forward the source of TAG_VECTOR instruction of
either input; this shortens the latency chain and in the future could
allow us to generate optimal vector instruction sequence once the
temporary stores are marked as dead.

- In the future on X64, ADD_VEC et al will be able to analyze the input
instruction and remove tag masking conditionally. This is not part of
this PR as it requires a decision around expected FP environment and/or
the necessity of the existing masking to begin with.

I've also renamed NUM_TO_VECTOR to NUM_TO_VEC so that "VEC" always
refers to "3 float values" and for consistency with ADD/etc.

Note: ADD_VEC input forwarding is currently performed unconditionally;
it may or may not increase the spills that can't be reloaded from the
stack.

On A64 this makes the Taylor series computation a tiny bit faster
(11.3ns => 11.0ns) as it removes the redundant ins instructions along
the NUM_TO_VEC path. Curiously, the optimization of forwarding
TAG_VECTOR input to arithmetic instructions actually has a small penalty
as without it this PR runs at 10.9 ns. I don't know if this is a
property of the benchmark though, as I just noticed that in this
benchmark type inference actually fails to infer parts of the
computation as a vector op. If desired I will happily omit this part of
the change and we can explore that separately.
2024-02-21 07:06:11 -08:00
..
AddressA64.h Sync to upstream/release/613 (#1167) 2024-02-15 18:04:39 -08:00
AssemblyBuilderA64.h Improve A64 lowering for vector operations by using vector instructions (#1164) 2024-02-16 08:30:35 -08:00
AssemblyBuilderX64.h Sync to upstream/release/613 (#1167) 2024-02-15 18:04:39 -08:00
BytecodeAnalysis.h Sync to upstream/release/605 (#1118) 2023-12-01 23:46:57 -08:00
BytecodeSummary.h Sync to upstream/release/613 2024-02-16 03:25:31 +02:00
CodeAllocator.h Sync to upstream/release/590 (#1008) 2023-08-11 07:42:37 -07:00
CodeBlockUnwind.h Sync to upstream/release/576 (#928) 2023-05-12 10:50:47 -07:00
CodeGen.h Sync to upstream/release/613 (#1167) 2024-02-15 18:04:39 -08:00
CodeGenCommon.h Sync to upstream/release/613 (#1167) 2024-02-15 18:04:39 -08:00
ConditionA64.h Sync to upstream/release/573 (#903) 2023-04-21 15:14:26 -07:00
ConditionX64.h Sync to upstream/release/613 (#1167) 2024-02-15 18:04:39 -08:00
IrAnalysis.h Sync to upstream/release/613 (#1167) 2024-02-15 18:04:39 -08:00
IrBuilder.h Sync to upstream/release/605 (#1118) 2023-12-01 23:46:57 -08:00
IrCallWrapperX64.h Sync to upstream/release/572 (#899) 2023-04-14 11:06:22 -07:00
IrData.h CodeGen: Extract all vector tag patching into TAG_VECTOR (#1171) 2024-02-21 07:06:11 -08:00
IrDump.h Sync to upstream/release/610 (#1154) 2024-01-26 19:20:56 -08:00
IrRegAllocX64.h Sync to upstream/release/596 (#1050) 2023-09-22 12:12:15 -07:00
IrUtils.h CodeGen: Extract all vector tag patching into TAG_VECTOR (#1171) 2024-02-21 07:06:11 -08:00
IrVisitUseDef.h Sync to upstream/release/613 (#1167) 2024-02-15 18:04:39 -08:00
Label.h Sync to upstream/release/529 (#505) 2022-05-26 15:08:16 -07:00
OperandX64.h Sync to upstream/release/613 (#1167) 2024-02-15 18:04:39 -08:00
OptimizeConstProp.h Sync to upstream/release/574 (#910) 2023-04-28 12:55:13 -07:00
OptimizeFinalX64.h Sync to upstream/release/563 (#833) 2023-02-10 11:40:38 -08:00
RegisterA64.h Sync to upstream/release/613 (#1167) 2024-02-15 18:04:39 -08:00
RegisterX64.h Sync to upstream/release/613 (#1167) 2024-02-15 18:04:39 -08:00
UnwindBuilder.h Sync to upstream/release/592 (#1018) 2023-08-25 10:23:55 -07:00
UnwindBuilderDwarf2.h Sync to upstream/release/592 (#1018) 2023-08-25 10:23:55 -07:00
UnwindBuilderWin.h Sync to upstream/release/592 (#1018) 2023-08-25 10:23:55 -07:00