added mulhu and mulhs CRT routines by ZERICO2005 · Pull Request #645 · CE-Programming/toolchain

ZERICO2005 · 2025-09-30T03:02:28Z

Added multiply high signed/unsigned routines. These can be used to optimize division by a constant. __smulhu is optimized, but the rest are not well optimized. They use the exact same calling convention as the regular multiplication routines. We can optimize these routines in later PR's.

__smulhu   :         HL = ((uint32_t)         HL * (uint32_t)      BC) >> 16
__imulhu   :        UHL = ((uint48_t)        UHL * (uint48_t)     UBC) >> 24
__lmulhu   :      E:UHL = ((uint64_t)      E:UHL * (uint64_t)   A:UBC) >> 32
__i48mulhu :    UDE:UHL = ((uint96_t)    UDE:UHL * (uint96_t) UIY:UBC) >> 48
__llmulhu  : BC:UDE:UHL = ((uint128_t)BC:UDE:UHL * (uint128_t) (SP64)) >> 64

__smulhs   :         HL = ((int32_t)          HL * (int32_t)       BC) >> 16
__imulhs   :        UHL = ((int48_t)         UHL * (int48_t)      UBC) >> 24
__lmulhs   :      E:UHL = ((int64_t)       E:UHL * (int64_t)    A:UBC) >> 32
__i48mulhs :    UDE:UHL = ((int96_t)     UDE:UHL * (int96_t)  UIY:UBC) >> 48
__llmulhs  : BC:UDE:UHL = ((int128_t) BC:UDE:UHL * (int128_t)  (SP64)) >> 64

__smulhu   :  32 bytes |  33F +  12R +   9W +  17
__imulhu   : 117 bytes | 118F +  39R +  38W +  37
__lmulhu   : 1 call to __llmulu
__i48mulhu :  93 bytes | 902F + 246R + 182W + 344
__llmulhu  : (disables interrupts to use exx) slightly faster than 2 calls to __llmulu

__bmulhu was not added since it is just mlt bc \ ld a, b (and the 8-bit calling convention is not well defined).

src/crt/llmulhu.src

ZERICO2005 · 2026-01-13T20:05:01Z

I just converted this branch/PR from FASMG to GAS. So it would be helpful to know if I did it right

calc84maniac · 2026-04-01T18:39:52Z

src/crt/i48mulhs.src

+	push	hl
+	lea	hl, iy + 0
+	add	hl, hl
+	sbc	a, a
+
+	ld	hl, $800000
+	add	hl, de
+	pop	hl
+	rla


You can save 2F using this, which reverses the bits rotated into A but that can be solved by using or a, a \ call m for the first subtract instead of rrca \ call c

Suggested change

push hl

lea hl, iy + 0

add hl, hl

sbc a, a

ld hl, $800000

add hl, de

pop hl

rla

push de

ex de, hl

add hl, hl

sbc a, a

lea hl, iy + 0

add hl, hl

rla

ex de, hl

pop de

calc84maniac · 2026-04-01T19:05:28Z

src/crt/smulhs.src

+__smulhs:
+	push	bc
+	push	hl
+	call	__smulhu
+
+	; if (BC < 0) { result -= HL; }
+	bit	7, b
+	pop	bc
+	jr	z, .L.positive_hl
+	or	a, a
+	sbc	hl, bc
+.L.positive_hl:
+
+	; if (HL < 0) { result -= BC; }
+	bit	7, b
+	pop	bc
+	ret	z
+	or	a, a
+	sbc	hl, bc
+	ret


This saves 3R+3W on the stack:

Suggested change

__smulhs:

push bc

push hl

call __smulhu

; if (BC < 0) { result -= HL; }

bit 7, b

pop bc

jr z, .L.positive_hl

or a, a

sbc hl, bc

.L.positive_hl:

; if (HL < 0) { result -= BC; }

bit 7, b

pop bc

ret z

or a, a

sbc hl, bc

ret

__smulhs:

push de

ld d, h

ld e, l

call __smulhu

; if (BC < 0) { result -= HL; }

bit 7, b

jr z, .L.positive_hl

or a, a

sbc hl, de

.L.positive_hl:

; if (HL < 0) { result -= BC; }

bit 7, d

pop de

ret z

or a, a

sbc hl, bc

ret

ZERICO2005 marked this pull request as draft September 30, 2025 03:02

ZERICO2005 commented Oct 8, 2025

View reviewed changes

src/crt/llmulhu.src Outdated Show resolved Hide resolved

ZERICO2005 force-pushed the add_crt_mulhu branch from 4cf7c54 to 0853c6b Compare October 9, 2025 19:06

ZERICO2005 temporarily deployed to Autotester October 9, 2025 19:06 — with GitHub Actions Inactive

ZERICO2005 force-pushed the add_crt_mulhu branch from 0853c6b to ef700af Compare October 9, 2025 19:23

ZERICO2005 temporarily deployed to Autotester October 9, 2025 19:23 — with GitHub Actions Inactive

ZERICO2005 added the crt label Oct 10, 2025

ZERICO2005 changed the title ~~added mulhu CRT routines~~ added mulhu and mulhs CRT routines Oct 13, 2025

ZERICO2005 force-pushed the add_crt_mulhu branch from ef700af to 6167e0a Compare October 13, 2025 20:55

ZERICO2005 temporarily deployed to Autotester October 13, 2025 20:55 — with GitHub Actions Inactive

ZERICO2005 temporarily deployed to Autotester January 13, 2026 20:03 — with GitHub Actions Inactive

ZERICO2005 requested a review from mateoconlechuga January 13, 2026 20:04

ZERICO2005 requested a review from calc84maniac January 20, 2026 22:03

ZERICO2005 marked this pull request as ready for review January 20, 2026 22:03

ZERICO2005 mentioned this pull request Feb 9, 2026

add multiply high (unsigned/signed) intrinsics CE-Programming/llvm-project#33

Open

ZERICO2005 added 3 commits April 1, 2026 12:03

added mulhu CRT routines

05792ff

added mulhs routines and fixed i48mulhu/llmulhu

0079474

optimized llmulhu with exx

cfc924c

ZERICO2005 force-pushed the add_crt_mulhu branch from e175d6f to 3c037c7 Compare April 1, 2026 18:10

ZERICO2005 had a problem deploying to Autotester April 1, 2026 18:10 — with GitHub Actions Failure

ZERICO2005 force-pushed the add_crt_mulhu branch from 3c037c7 to cfc924c Compare April 1, 2026 18:17

ZERICO2005 temporarily deployed to Autotester April 1, 2026 18:17 — with GitHub Actions Inactive

ZERICO2005 deployed to Autotester April 1, 2026 18:17 — with GitHub Actions Active

ZERICO2005 temporarily deployed to Autotester April 1, 2026 18:17 — with GitHub Actions Inactive

calc84maniac reviewed Apr 1, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

added mulhu and mulhs CRT routines#645

added mulhu and mulhs CRT routines#645
ZERICO2005 wants to merge 3 commits intomasterfrom
add_crt_mulhu

ZERICO2005 commented Sep 30, 2025 •

edited

Loading

Uh oh!

Uh oh!

ZERICO2005 commented Jan 13, 2026

Uh oh!

calc84maniac Apr 1, 2026

Uh oh!

calc84maniac Apr 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

ZERICO2005 commented Sep 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

ZERICO2005 commented Jan 13, 2026

Uh oh!

calc84maniac Apr 1, 2026

Choose a reason for hiding this comment

Uh oh!

calc84maniac Apr 1, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

2 participants

ZERICO2005 commented Sep 30, 2025 •

edited

Loading