Skip to content
Snippets Groups Projects
Commit d4494c3e authored by Niels Möller's avatar Niels Möller
Browse files

Fixed bug, spotted by Mikael Kalms. We

must order the store at [CTX+I] before the load of [CTX+SI+SJ].

Rev: src/nettle/sparc/arcfour-crypt.asm:1.6
parent 1128d5a6
No related branches found
No related tags found
No related merge requests found
...@@ -21,7 +21,7 @@ C MA 02111-1307, USA. ...@@ -21,7 +21,7 @@ C MA 02111-1307, USA.
C Define to YES, to enable the complex code to special case SRC C Define to YES, to enable the complex code to special case SRC
C and DST with compatible alignment. C and DST with compatible alignment.
define(<WITH_ALIGN>, <NO>) define(<WITH_ALIGN>, <YES>)
C Registers C Registers
...@@ -54,8 +54,8 @@ $2: ...@@ -54,8 +54,8 @@ $2:
stb SI, [CTX + J] stb SI, [CTX + J]
add SI, SJ, SI add SI, SJ, SI
and SI, 0xff, SI and SI, 0xff, SI
ldub [CTX + SI], SI
stb SJ, [CTX + I] stb SJ, [CTX + I]
ldub [CTX + SI], SI
xor TMP, SI, TMP xor TMP, SI, TMP
stb TMP, [DST] stb TMP, [DST]
bne $2 bne $2
...@@ -77,8 +77,8 @@ $2: ...@@ -77,8 +77,8 @@ $2:
stb SI, [CTX + J] stb SI, [CTX + J]
add SI, SJ, SI add SI, SJ, SI
and SI, 0xff, SI and SI, 0xff, SI
ldub [CTX + SI], TMP
stb SJ, [CTX + I] stb SJ, [CTX + I]
ldub [CTX + SI], TMP
add I, 1, I add I, 1, I
and I, 0xff, I and I, 0xff, I
...@@ -90,9 +90,9 @@ $2: ...@@ -90,9 +90,9 @@ $2:
stb SI, [CTX + J] stb SI, [CTX + J]
add SI, SJ, SI add SI, SJ, SI
and SI, 0xff, SI and SI, 0xff, SI
stb SJ, [CTX + I]
ldub [CTX + SI], SI ldub [CTX + SI], SI
sll TMP, 8, TMP sll TMP, 8, TMP
stb SJ, [CTX + I]
or TMP, SI, TMP or TMP, SI, TMP
add I, 1, I add I, 1, I
...@@ -105,9 +105,9 @@ $2: ...@@ -105,9 +105,9 @@ $2:
stb SI, [CTX + J] stb SI, [CTX + J]
add SI, SJ, SI add SI, SJ, SI
and SI, 0xff, SI and SI, 0xff, SI
stb SJ, [CTX + I]
ldub [CTX + SI], SI ldub [CTX + SI], SI
sll TMP, 8, TMP sll TMP, 8, TMP
stb SJ, [CTX + I]
or TMP, SI, TMP or TMP, SI, TMP
add I, 1, I add I, 1, I
...@@ -120,9 +120,9 @@ $2: ...@@ -120,9 +120,9 @@ $2:
stb SI, [CTX + J] stb SI, [CTX + J]
add SI, SJ, SI add SI, SJ, SI
and SI, 0xff, SI and SI, 0xff, SI
stb SJ, [CTX + I]
ldub [CTX + SI], SI ldub [CTX + SI], SI
sll TMP, 8, TMP sll TMP, 8, TMP
stb SJ, [CTX + I]
or TMP, SI, TMP or TMP, SI, TMP
xor WORD, TMP, WORD xor WORD, TMP, WORD
st WORD, [DST] st WORD, [DST]
...@@ -211,6 +211,7 @@ C 2: First working version of the assembler code ...@@ -211,6 +211,7 @@ C 2: First working version of the assembler code
C 3: Moved load of source byte C 3: Moved load of source byte
C 4: Better instruction scheduling C 4: Better instruction scheduling
C 5: Special case SRC and DST with compatible alignment C 5: Special case SRC and DST with compatible alignment
C 6: After bugfix (reorder of ld [CTX+SI+SJ] and st [CTX + SI])
C MB/s cycles/byte Code size (bytes) C MB/s cycles/byte Code size (bytes)
C 1: 6.6 12.4 132 C 1: 6.6 12.4 132
...@@ -218,3 +219,4 @@ C 2: 5.6 14.5 116 ...@@ -218,3 +219,4 @@ C 2: 5.6 14.5 116
C 3: 6.0 13.5 116 C 3: 6.0 13.5 116
C 4: 6.5 12.4 116 C 4: 6.5 12.4 116
C 5: 7.9 10.4 496 C 5: 7.9 10.4 496
C 6: 8.3 9.7 496
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment