mirror of
https://github.com/luau-lang/luau.git
synced 2025-04-05 03:10:54 +01:00
To implement math.lerp without branches, we add SELECT_NUM, which selects one of the two inputs based on the comparison condition. For simplicity, we only support C == D for now; this can be extended to a more generic version with an IrCondition operand E, but that requires more work on the SSE side (to flip the comparison for some conditions like Greater, and to expose a more generic vcmpsd).

Note: on AArch64 this will effectively result in a change in floating point behavior between native code and non-native code: clang synthesizes fmadd (because floating point contraction is allowed by default, and the architecture always has the instruction), whereas this change will use fmul+fadd. I am not sure if this is good or bad, or whether this is a problem in C or not. Specifically, clang's behavior results in different results between X64 and AArch64 when *not* using codegen, and with this change the behavior when using codegen is... the same? :)

Fixing this will require either using LERP_NUM instead and hand-coding the lowering, or exposing some sort of "quasi" MADD_NUM (which would lower to fma on AArch64 and mul+add on X64). A small benefit of the current approach is that `lerp(1, 5, t)` constant-folds the subtraction. With LERP_NUM this optimization would need to be implemented manually as partial constant folding for LERP_NUM. A similar problem exists today for vector.cross & vector.dot, so maybe this is not something we need to fix; unsure.