x86_64 + HOL-Light: Replace poly_decompose AVX2 intrinsics with hand-written assembly and HOL-Light proofs#1181
x86_64 + HOL-Light: Replace poly_decompose AVX2 intrinsics with hand-written assembly and HOL-Light proofs#1181jakemas wants to merge 1 commit into
Conversation
CBMC Results (ML-DSA-65, REDUCE-RAM)Full Results (206 proofs)
|
CBMC Results (ML-DSA-44, REDUCE-RAM)Full Results (206 proofs)
|
CBMC Results (ML-DSA-87, REDUCE-RAM)Full Results (206 proofs)
|
CBMC Results (ML-DSA-65)Full Results (206 proofs)
|
CBMC Results (ML-DSA-87)Full Results (206 proofs)
|
CBMC Results (ML-DSA-44)Full Results (206 proofs)
|
There was a problem hiding this comment.
Mac Mini (M1, 2020) benchmarks (opt)
Details
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
46535 cycles |
46504 cycles |
1.00 |
ML-DSA-44 sign |
131062 cycles |
131079 cycles |
1.00 |
ML-DSA-44 verify |
47343 cycles |
47310 cycles |
1.00 |
ML-DSA-65 keypair |
81693 cycles |
81680 cycles |
1.00 |
ML-DSA-65 sign |
215321 cycles |
215311 cycles |
1.00 |
ML-DSA-65 verify |
79309 cycles |
79299 cycles |
1.00 |
ML-DSA-87 keypair |
132396 cycles |
132405 cycles |
1.00 |
ML-DSA-87 sign |
277357 cycles |
277328 cycles |
1.00 |
ML-DSA-87 verify |
134042 cycles |
134051 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
Mac Mini (M1, 2020) benchmarks (no-opt)
Details
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
113111 cycles |
112758 cycles |
1.00 |
ML-DSA-44 sign |
402272 cycles |
400845 cycles |
1.00 |
ML-DSA-44 verify |
119782 cycles |
119413 cycles |
1.00 |
ML-DSA-65 keypair |
193015 cycles |
192933 cycles |
1.00 |
ML-DSA-65 sign |
650151 cycles |
649924 cycles |
1.00 |
ML-DSA-65 verify |
192935 cycles |
192850 cycles |
1.00 |
ML-DSA-87 keypair |
318786 cycles |
318753 cycles |
1.00 |
ML-DSA-87 sign |
828784 cycles |
828716 cycles |
1.00 |
ML-DSA-87 verify |
326698 cycles |
326677 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
Arm Cortex-A76 (Raspberry Pi 5) benchmarks (opt)
Details
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
112171 cycles |
112128 cycles |
1.00 |
ML-DSA-44 sign |
353480 cycles |
353788 cycles |
1.00 |
ML-DSA-44 verify |
117004 cycles |
117189 cycles |
1.00 |
ML-DSA-65 keypair |
194786 cycles |
194358 cycles |
1.00 |
ML-DSA-65 sign |
583882 cycles |
583733 cycles |
1.00 |
ML-DSA-65 verify |
192717 cycles |
193111 cycles |
1.00 |
ML-DSA-87 keypair |
320911 cycles |
320083 cycles |
1.00 |
ML-DSA-87 sign |
747254 cycles |
747201 cycles |
1.00 |
ML-DSA-87 verify |
318739 cycles |
317895 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
Arm Cortex-A76 (Raspberry Pi 5) benchmarks (no-opt)
Details
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
211464 cycles |
211527 cycles |
1.00 |
ML-DSA-44 sign |
759041 cycles |
759858 cycles |
1.00 |
ML-DSA-44 verify |
228925 cycles |
229351 cycles |
1.00 |
ML-DSA-65 keypair |
377183 cycles |
378548 cycles |
1.00 |
ML-DSA-65 sign |
1247203 cycles |
1247648 cycles |
1.00 |
ML-DSA-65 verify |
371278 cycles |
372375 cycles |
1.00 |
ML-DSA-87 keypair |
600236 cycles |
601887 cycles |
1.00 |
ML-DSA-87 sign |
1582442 cycles |
1582472 cycles |
1.00 |
ML-DSA-87 verify |
615930 cycles |
617767 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
SpacemiT K1 8 (Banana Pi F3) benchmarks (no-opt)
Details
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
760108 cycles |
759567 cycles |
1.00 |
ML-DSA-44 sign |
3139979 cycles |
3137937 cycles |
1.00 |
ML-DSA-44 verify |
859276 cycles |
858870 cycles |
1.00 |
ML-DSA-65 keypair |
1285510 cycles |
1285661 cycles |
1.00 |
ML-DSA-65 sign |
5076473 cycles |
5075651 cycles |
1.00 |
ML-DSA-65 verify |
1363683 cycles |
1364449 cycles |
1.00 |
ML-DSA-87 keypair |
2108565 cycles |
2111749 cycles |
1.00 |
ML-DSA-87 sign |
6349814 cycles |
6355811 cycles |
1.00 |
ML-DSA-87 verify |
2227740 cycles |
2228276 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Intel Xeon 4th gen (c7i)
Details
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
43320 cycles |
43415 cycles |
1.00 |
ML-DSA-44 sign |
130447 cycles |
130832 cycles |
1.00 |
ML-DSA-44 verify |
45019 cycles |
45316 cycles |
0.99 |
ML-DSA-65 keypair |
75630 cycles |
75430 cycles |
1.00 |
ML-DSA-65 sign |
214636 cycles |
215011 cycles |
1.00 |
ML-DSA-65 verify |
74259 cycles |
74350 cycles |
1.00 |
ML-DSA-87 keypair |
123222 cycles |
123299 cycles |
1.00 |
ML-DSA-87 sign |
271095 cycles |
271370 cycles |
1.00 |
ML-DSA-87 verify |
120682 cycles |
120802 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Intel Xeon 4th gen (c7i) (no-opt)
Details
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
91460 cycles |
91498 cycles |
1.00 |
ML-DSA-44 sign |
352421 cycles |
352373 cycles |
1.00 |
ML-DSA-44 verify |
99823 cycles |
99791 cycles |
1.00 |
ML-DSA-65 keypair |
153853 cycles |
153828 cycles |
1.00 |
ML-DSA-65 sign |
571320 cycles |
571106 cycles |
1.00 |
ML-DSA-65 verify |
159824 cycles |
159761 cycles |
1.00 |
ML-DSA-87 keypair |
255642 cycles |
255664 cycles |
1.00 |
ML-DSA-87 sign |
725720 cycles |
726163 cycles |
1.00 |
ML-DSA-87 verify |
263733 cycles |
263879 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
AMD EPYC 3rd gen (c6a)
Details
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
55422 cycles |
55320 cycles |
1.00 |
ML-DSA-44 sign |
159104 cycles |
159183 cycles |
1.00 |
ML-DSA-44 verify |
57499 cycles |
57873 cycles |
0.99 |
ML-DSA-65 keypair |
96456 cycles |
95830 cycles |
1.01 |
ML-DSA-65 sign |
263091 cycles |
263504 cycles |
1.00 |
ML-DSA-65 verify |
96045 cycles |
96058 cycles |
1.00 |
ML-DSA-87 keypair |
154978 cycles |
154555 cycles |
1.00 |
ML-DSA-87 sign |
322539 cycles |
322765 cycles |
1.00 |
ML-DSA-87 verify |
151291 cycles |
150846 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
AMD EPYC 3rd gen (c6a) (no-opt)
Details
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
133314 cycles |
133213 cycles |
1.00 |
ML-DSA-44 sign |
518729 cycles |
518708 cycles |
1.00 |
ML-DSA-44 verify |
146430 cycles |
146419 cycles |
1.00 |
ML-DSA-65 keypair |
223838 cycles |
225658 cycles |
0.99 |
ML-DSA-65 sign |
842432 cycles |
848883 cycles |
0.99 |
ML-DSA-65 verify |
233764 cycles |
236008 cycles |
0.99 |
ML-DSA-87 keypair |
366923 cycles |
366727 cycles |
1.00 |
ML-DSA-87 sign |
1059620 cycles |
1060028 cycles |
1.00 |
ML-DSA-87 verify |
381222 cycles |
380316 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Graviton4
Details
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
67282 cycles |
67251 cycles |
1.00 |
ML-DSA-44 sign |
198335 cycles |
198318 cycles |
1.00 |
ML-DSA-44 verify |
70214 cycles |
70240 cycles |
1.00 |
ML-DSA-65 keypair |
119470 cycles |
119362 cycles |
1.00 |
ML-DSA-65 sign |
326115 cycles |
325982 cycles |
1.00 |
ML-DSA-65 verify |
116920 cycles |
116934 cycles |
1.00 |
ML-DSA-87 keypair |
196541 cycles |
196595 cycles |
1.00 |
ML-DSA-87 sign |
421505 cycles |
421892 cycles |
1.00 |
ML-DSA-87 verify |
193377 cycles |
193379 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
AMD EPYC 4th gen (c7a)
Details
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
46676 cycles |
46860 cycles |
1.00 |
ML-DSA-44 sign |
139525 cycles |
139486 cycles |
1.00 |
ML-DSA-44 verify |
49201 cycles |
49318 cycles |
1.00 |
ML-DSA-65 keypair |
82513 cycles |
81920 cycles |
1.01 |
ML-DSA-65 sign |
227275 cycles |
227729 cycles |
1.00 |
ML-DSA-65 verify |
81963 cycles |
81877 cycles |
1.00 |
ML-DSA-87 keypair |
129267 cycles |
131127 cycles |
0.99 |
ML-DSA-87 sign |
280881 cycles |
281288 cycles |
1.00 |
ML-DSA-87 verify |
128438 cycles |
129015 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Intel Xeon 3rd gen (c6i)
Details
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
61777 cycles |
61685 cycles |
1.00 |
ML-DSA-44 sign |
189264 cycles |
188995 cycles |
1.00 |
ML-DSA-44 verify |
66152 cycles |
66273 cycles |
1.00 |
ML-DSA-65 keypair |
108472 cycles |
111473 cycles |
0.97 |
ML-DSA-65 sign |
312431 cycles |
312617 cycles |
1.00 |
ML-DSA-65 verify |
109611 cycles |
110558 cycles |
0.99 |
ML-DSA-87 keypair |
171304 cycles |
171460 cycles |
1.00 |
ML-DSA-87 sign |
377394 cycles |
377513 cycles |
1.00 |
ML-DSA-87 verify |
169493 cycles |
169102 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Graviton4 (no-opt)
Details
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
127565 cycles |
127647 cycles |
1.00 |
ML-DSA-44 sign |
441234 cycles |
441052 cycles |
1.00 |
ML-DSA-44 verify |
136402 cycles |
136340 cycles |
1.00 |
ML-DSA-65 keypair |
220501 cycles |
220707 cycles |
1.00 |
ML-DSA-65 sign |
714356 cycles |
713810 cycles |
1.00 |
ML-DSA-65 verify |
221036 cycles |
220735 cycles |
1.00 |
ML-DSA-87 keypair |
364581 cycles |
365112 cycles |
1.00 |
ML-DSA-87 sign |
915625 cycles |
921310 cycles |
0.99 |
ML-DSA-87 verify |
370856 cycles |
370798 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
AMD EPYC 4th gen (c7a) (no-opt)
Details
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
118183 cycles |
118568 cycles |
1.00 |
ML-DSA-44 sign |
457225 cycles |
458777 cycles |
1.00 |
ML-DSA-44 verify |
130648 cycles |
131112 cycles |
1.00 |
ML-DSA-65 keypair |
200891 cycles |
200686 cycles |
1.00 |
ML-DSA-65 sign |
745651 cycles |
743736 cycles |
1.00 |
ML-DSA-65 verify |
209587 cycles |
209264 cycles |
1.00 |
ML-DSA-87 keypair |
331930 cycles |
330171 cycles |
1.01 |
ML-DSA-87 sign |
942737 cycles |
935697 cycles |
1.01 |
ML-DSA-87 verify |
342981 cycles |
343489 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Intel Xeon 3rd gen (c6i) (no-opt)
Details
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
154189 cycles |
154753 cycles |
1.00 |
ML-DSA-44 sign |
587855 cycles |
591158 cycles |
0.99 |
ML-DSA-44 verify |
169480 cycles |
169947 cycles |
1.00 |
ML-DSA-65 keypair |
262646 cycles |
261726 cycles |
1.00 |
ML-DSA-65 sign |
964482 cycles |
963022 cycles |
1.00 |
ML-DSA-65 verify |
272290 cycles |
271733 cycles |
1.00 |
ML-DSA-87 keypair |
432007 cycles |
431775 cycles |
1.00 |
ML-DSA-87 sign |
1210654 cycles |
1213443 cycles |
1.00 |
ML-DSA-87 verify |
447323 cycles |
447885 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Graviton3
Details
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
71405 cycles |
71561 cycles |
1.00 |
ML-DSA-44 sign |
208966 cycles |
209019 cycles |
1.00 |
ML-DSA-44 verify |
74818 cycles |
74745 cycles |
1.00 |
ML-DSA-65 keypair |
125948 cycles |
125918 cycles |
1.00 |
ML-DSA-65 sign |
345621 cycles |
345393 cycles |
1.00 |
ML-DSA-65 verify |
124114 cycles |
124184 cycles |
1.00 |
ML-DSA-87 keypair |
207058 cycles |
206649 cycles |
1.00 |
ML-DSA-87 sign |
443965 cycles |
439798 cycles |
1.01 |
ML-DSA-87 verify |
204102 cycles |
204451 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
Arm Cortex-A55 (Snapdragon 888) benchmarks (opt)
Details
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
268177 cycles |
266639 cycles |
1.01 |
ML-DSA-44 sign |
811656 cycles |
808808 cycles |
1.00 |
ML-DSA-44 verify |
270342 cycles |
270153 cycles |
1.00 |
ML-DSA-65 keypair |
460977 cycles |
461300 cycles |
1.00 |
ML-DSA-65 sign |
1323114 cycles |
1323365 cycles |
1.00 |
ML-DSA-65 verify |
446367 cycles |
447313 cycles |
1.00 |
ML-DSA-87 keypair |
786856 cycles |
789934 cycles |
1.00 |
ML-DSA-87 sign |
1817396 cycles |
1827662 cycles |
0.99 |
ML-DSA-87 verify |
765924 cycles |
769879 cycles |
0.99 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Graviton3 (no-opt)
Details
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
137915 cycles |
138038 cycles |
1.00 |
ML-DSA-44 sign |
485929 cycles |
486082 cycles |
1.00 |
ML-DSA-44 verify |
149011 cycles |
149093 cycles |
1.00 |
ML-DSA-65 keypair |
241465 cycles |
241929 cycles |
1.00 |
ML-DSA-65 sign |
791930 cycles |
791534 cycles |
1.00 |
ML-DSA-65 verify |
242169 cycles |
241280 cycles |
1.00 |
ML-DSA-87 keypair |
395764 cycles |
396309 cycles |
1.00 |
ML-DSA-87 sign |
1013504 cycles |
1019346 cycles |
0.99 |
ML-DSA-87 verify |
403644 cycles |
403741 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Graviton2
Details
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
112267 cycles |
112529 cycles |
1.00 |
ML-DSA-44 sign |
354171 cycles |
354040 cycles |
1.00 |
ML-DSA-44 verify |
117356 cycles |
117389 cycles |
1.00 |
ML-DSA-65 keypair |
194413 cycles |
194738 cycles |
1.00 |
ML-DSA-65 sign |
584338 cycles |
584565 cycles |
1.00 |
ML-DSA-65 verify |
193374 cycles |
193297 cycles |
1.00 |
ML-DSA-87 keypair |
320843 cycles |
321000 cycles |
1.00 |
ML-DSA-87 sign |
747781 cycles |
746870 cycles |
1.00 |
ML-DSA-87 verify |
317959 cycles |
318737 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Graviton2 (no-opt)
Details
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
212082 cycles |
211802 cycles |
1.00 |
ML-DSA-44 sign |
760968 cycles |
759783 cycles |
1.00 |
ML-DSA-44 verify |
229827 cycles |
229307 cycles |
1.00 |
ML-DSA-65 keypair |
378584 cycles |
377288 cycles |
1.00 |
ML-DSA-65 sign |
1247669 cycles |
1247192 cycles |
1.00 |
ML-DSA-65 verify |
373212 cycles |
371605 cycles |
1.00 |
ML-DSA-87 keypair |
602191 cycles |
601230 cycles |
1.00 |
ML-DSA-87 sign |
1584300 cycles |
1584827 cycles |
1.00 |
ML-DSA-87 verify |
618115 cycles |
616717 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
Arm Cortex-A55 (Snapdragon 888) benchmarks (no-opt)
Details
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
463207 cycles |
462947 cycles |
1.00 |
ML-DSA-44 sign |
2133669 cycles |
2132913 cycles |
1.00 |
ML-DSA-44 verify |
554980 cycles |
555065 cycles |
1.00 |
ML-DSA-65 keypair |
782046 cycles |
781178 cycles |
1.00 |
ML-DSA-65 sign |
3489865 cycles |
3495803 cycles |
1.00 |
ML-DSA-65 verify |
866230 cycles |
863872 cycles |
1.00 |
ML-DSA-87 keypair |
1263324 cycles |
1265586 cycles |
1.00 |
ML-DSA-87 sign |
4306970 cycles |
4308959 cycles |
1.00 |
ML-DSA-87 verify |
1388306 cycles |
1390453 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
Arm Cortex-A72 (Raspberry Pi 4) benchmarks (opt)
Details
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
220427 cycles |
214312 cycles |
1.03 |
ML-DSA-44 sign |
612973 cycles |
605496 cycles |
1.01 |
ML-DSA-44 verify |
217524 cycles |
222932 cycles |
0.98 |
ML-DSA-65 keypair |
388694 cycles |
390142 cycles |
1.00 |
ML-DSA-65 sign |
1009947 cycles |
1004679 cycles |
1.01 |
ML-DSA-65 verify |
371807 cycles |
374059 cycles |
0.99 |
ML-DSA-87 keypair |
651583 cycles |
652450 cycles |
1.00 |
ML-DSA-87 sign |
1325525 cycles |
1336385 cycles |
0.99 |
ML-DSA-87 verify |
631906 cycles |
633167 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
Arm Cortex-A72 (Raspberry Pi 4) benchmarks (no-opt)
Details
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
310128 cycles |
312767 cycles |
0.99 |
ML-DSA-44 sign |
1174409 cycles |
1207539 cycles |
0.97 |
ML-DSA-44 verify |
346573 cycles |
342752 cycles |
1.01 |
ML-DSA-65 keypair |
560465 cycles |
567893 cycles |
0.99 |
ML-DSA-65 sign |
1886833 cycles |
1961247 cycles |
0.96 |
ML-DSA-65 verify |
539864 cycles |
560470 cycles |
0.96 |
ML-DSA-87 keypair |
876500 cycles |
849145 cycles |
1.03 |
ML-DSA-87 sign |
2402521 cycles |
2376735 cycles |
1.01 |
ML-DSA-87 verify |
899316 cycles |
890655 cycles |
1.01 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
⚠️ Performance Alert ⚠️
Possible performance regression was detected for benchmark 'Arm Cortex-A72 (Raspberry Pi 4) benchmarks (no-opt)'.
Benchmark result of this commit is worse than the previous benchmark result exceeding threshold 1.03.
| Benchmark suite | Current: 0b60fcc | Previous: 9c5eceb | Ratio |
|---|---|---|---|
ML-DSA-87 keypair |
876500 cycles |
849145 cycles |
1.03 |
This comment was automatically generated by workflow using github-action-benchmark.
0b60fcc to
7ef3442
Compare
…mbly
Mirror the AArch64 conversion in poly_decompose_{32,88}_aarch64_asm:
replace the C intrinsics with fully-unrolled AVX2 routines, add HOL-Light
correctness and memory-safety proofs, and CBMC contracts. Helper lemmas
common to both variants are shared via the x86-only mldsa_utils.ml.
- Resolves #420
- Resolves #914
Signed-off-by: Jake Massimo <jakemas@amazon.com>
7ef3442 to
2174def
Compare
Resolves #420
Resolves #914
Supersedes #1163, which could not be reopened after its branch was inadvertently force-pushed while closed. The branch content is unchanged and rebased on the latest
main.Performance
poly_decomposecomponent benchmark, median cycles on AMD EPYC (c6a),OPT=1 CYCLES=PMU:main)decompose_32(ML-DSA-65/87)decompose_88(ML-DSA-44)