Coder Social home page Coder Social logo

Another AVX2 assembly issue about uasm HOT 5 CLOSED

terraspace avatar terraspace commented on June 15, 2024
Another AVX2 assembly issue

from uasm.

Comments (5)

john-terraspace avatar john-terraspace commented on June 15, 2024

No worries!

These are totally new so it’s progress !

We’ll get onto these ASAP.

From: gwoltman [mailto:[email protected]]
Sent: 26 September 2016 01:06 AM
To: Terraspace/HJWasm [email protected]
Subject: [Terraspace/HJWasm] Another AVX2 assembly issue (#33)

Sorry to be a pain....

First the bugs:

vpsllq ymm11, ymm11, xmm0
vpsllq zmm11, zmm11, xmm0

is not assembling. The Intel manual and ML64 say this is OK.

ML64:
vpmuludq ymm8, ymm6, ymmword ptr [r11+520H] ; 392F _ C4 41 4D: F4. 83, 00000520
vpsrlq ymm9, ymm7, xmmword ptr [r11+2C0H] ; 3A13 _ C4 41 45: D3. 8B, 000002C0
HJWASM:
; Note: Prefix bit or byte has no meaning in this context
vpmuludq ymm8, ymm6, ymmword ptr [rbx+520H] ; 3930 _ C4 61 CD: F4. 83, 00000520
; Note: Prefix bit or byte has no meaning in this context
vpsrlq ymm9, ymm7, xmmword ptr [rbx+2C0H] ; 3A14 _ C4 61 C5: D3. 8B, 000002C0
Note that 'r11' turned into 'rbx'

ML64:
vptest ymm8, ymm6 ; 3FC2 _ C4 62 7D: 17. C6
HJWASM:
vptest ymm8, ymm14 ; 3FC7 _ C4 42 7D: 17. C6

Now some oddities:

ML64:
vpsrlq ymm9, ymm7, 3 ; 38B8 _ C5 B5: 73. D7, 03
HJWASM:
; Note: Prefix bit or byte has no meaning in this context
vpsrlq ymm9, ymm7, 3 ; 38B8 _ C4 E1 B5: 73. D7, 03
ML64 assembled this into one fewer byte.


You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub #33 , or mute the thread https://github.com/notifications/unsubscribe-auth/AQGQVFY5dGAdR1iAp8dSAvDuyaU_vKFwks5qtwxzgaJpZM4KGCws .

from uasm.

john-terraspace avatar john-terraspace commented on June 15, 2024

Hi,

So-far they’re all fixed apart from:

vpsllq ymm11, ymm11, xmm0

vpsllq zmm11, zmm11, xmm0

We should have these done tomorrow and then will update the packages again.

John

From: gwoltman [mailto:[email protected]]
Sent: 26 September 2016 01:06 AM
To: Terraspace/HJWasm [email protected]
Subject: [Terraspace/HJWasm] Another AVX2 assembly issue (#33)

Sorry to be a pain....

First the bugs:

vpsllq ymm11, ymm11, xmm0
vpsllq zmm11, zmm11, xmm0

is not assembling. The Intel manual and ML64 say this is OK.

ML64:
vpmuludq ymm8, ymm6, ymmword ptr [r11+520H] ; 392F _ C4 41 4D: F4. 83, 00000520
vpsrlq ymm9, ymm7, xmmword ptr [r11+2C0H] ; 3A13 _ C4 41 45: D3. 8B, 000002C0
HJWASM:
; Note: Prefix bit or byte has no meaning in this context
vpmuludq ymm8, ymm6, ymmword ptr [rbx+520H] ; 3930 _ C4 61 CD: F4. 83, 00000520
; Note: Prefix bit or byte has no meaning in this context
vpsrlq ymm9, ymm7, xmmword ptr [rbx+2C0H] ; 3A14 _ C4 61 C5: D3. 8B, 000002C0
Note that 'r11' turned into 'rbx'

ML64:
vptest ymm8, ymm6 ; 3FC2 _ C4 62 7D: 17. C6
HJWASM:
vptest ymm8, ymm14 ; 3FC7 _ C4 42 7D: 17. C6

Now some oddities:

ML64:
vpsrlq ymm9, ymm7, 3 ; 38B8 _ C5 B5: 73. D7, 03
HJWASM:
; Note: Prefix bit or byte has no meaning in this context
vpsrlq ymm9, ymm7, 3 ; 38B8 _ C4 E1 B5: 73. D7, 03
ML64 assembled this into one fewer byte.


You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub #33 , or mute the thread https://github.com/notifications/unsubscribe-auth/AQGQVFY5dGAdR1iAp8dSAvDuyaU_vKFwks5qtwxzgaJpZM4KGCws .

from uasm.

john-terraspace avatar john-terraspace commented on June 15, 2024

; Note: Prefix bit or byte has no meaning in this context
vpmuludq ymm8, ymm6, ymmword ptr [rbx+520H] ; 3930 _ C4 61 CD: F4. 83, 00000520
; Note: Prefix bit or byte has no meaning in this context
vpsrlq ymm9, ymm7, xmmword ptr [rbx+2C0H] ; 3A14 _ C4 61 C5: D3. 8B, 000002C0

In the case of this prefix bit the (CD/C5) just have bit 7 set, which is ignored so this is not a problem.

From: gwoltman [mailto:[email protected]]
Sent: 26 September 2016 01:06 AM
To: Terraspace/HJWasm [email protected]
Subject: [Terraspace/HJWasm] Another AVX2 assembly issue (#33)

Sorry to be a pain....

First the bugs:

vpsllq ymm11, ymm11, xmm0
vpsllq zmm11, zmm11, xmm0

is not assembling. The Intel manual and ML64 say this is OK.

ML64:
vpmuludq ymm8, ymm6, ymmword ptr [r11+520H] ; 392F _ C4 41 4D: F4. 83, 00000520
vpsrlq ymm9, ymm7, xmmword ptr [r11+2C0H] ; 3A13 _ C4 41 45: D3. 8B, 000002C0
HJWASM:
; Note: Prefix bit or byte has no meaning in this context
vpmuludq ymm8, ymm6, ymmword ptr [rbx+520H] ; 3930 _ C4 61 CD: F4. 83, 00000520
; Note: Prefix bit or byte has no meaning in this context
vpsrlq ymm9, ymm7, xmmword ptr [rbx+2C0H] ; 3A14 _ C4 61 C5: D3. 8B, 000002C0
Note that 'r11' turned into 'rbx'

ML64:
vptest ymm8, ymm6 ; 3FC2 _ C4 62 7D: 17. C6
HJWASM:
vptest ymm8, ymm14 ; 3FC7 _ C4 42 7D: 17. C6

Now some oddities:

ML64:
vpsrlq ymm9, ymm7, 3 ; 38B8 _ C5 B5: 73. D7, 03
HJWASM:
; Note: Prefix bit or byte has no meaning in this context
vpsrlq ymm9, ymm7, 3 ; 38B8 _ C4 E1 B5: 73. D7, 03
ML64 assembled this into one fewer byte.


You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub #33 , or mute the thread https://github.com/notifications/unsubscribe-auth/AQGQVFY5dGAdR1iAp8dSAvDuyaU_vKFwks5qtwxzgaJpZM4KGCws .

from uasm.

john-terraspace avatar john-terraspace commented on June 15, 2024

Hi,

Packages have been updated and are available on the site for download. All the listed issues are resolved.

From: gwoltman [mailto:[email protected]]
Sent: 26 September 2016 01:06 AM
To: Terraspace/HJWasm [email protected]
Subject: [Terraspace/HJWasm] Another AVX2 assembly issue (#33)

Sorry to be a pain....

First the bugs:

vpsllq ymm11, ymm11, xmm0
vpsllq zmm11, zmm11, xmm0

is not assembling. The Intel manual and ML64 say this is OK.

ML64:
vpmuludq ymm8, ymm6, ymmword ptr [r11+520H] ; 392F _ C4 41 4D: F4. 83, 00000520
vpsrlq ymm9, ymm7, xmmword ptr [r11+2C0H] ; 3A13 _ C4 41 45: D3. 8B, 000002C0
HJWASM:
; Note: Prefix bit or byte has no meaning in this context
vpmuludq ymm8, ymm6, ymmword ptr [rbx+520H] ; 3930 _ C4 61 CD: F4. 83, 00000520
; Note: Prefix bit or byte has no meaning in this context
vpsrlq ymm9, ymm7, xmmword ptr [rbx+2C0H] ; 3A14 _ C4 61 C5: D3. 8B, 000002C0
Note that 'r11' turned into 'rbx'

ML64:
vptest ymm8, ymm6 ; 3FC2 _ C4 62 7D: 17. C6
HJWASM:
vptest ymm8, ymm14 ; 3FC7 _ C4 42 7D: 17. C6

Now some oddities:

ML64:
vpsrlq ymm9, ymm7, 3 ; 38B8 _ C5 B5: 73. D7, 03
HJWASM:
; Note: Prefix bit or byte has no meaning in this context
vpsrlq ymm9, ymm7, 3 ; 38B8 _ C4 E1 B5: 73. D7, 03
ML64 assembled this into one fewer byte.


You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub #33 , or mute the thread https://github.com/notifications/unsubscribe-auth/AQGQVFY5dGAdR1iAp8dSAvDuyaU_vKFwks5qtwxzgaJpZM4KGCws .

from uasm.

gwoltman avatar gwoltman commented on June 15, 2024

Above items fixed (except prefix bit or byte warnings). I'll re-open that as a separate issue.

from uasm.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.