x86_64 + HOL-Light: Replace poly_decompose AVX2 intrinsics with hand-written assembly and HOL-Light proofs#1163
x86_64 + HOL-Light: Replace poly_decompose AVX2 intrinsics with hand-written assembly and HOL-Light proofs#1163jakemas wants to merge 1 commit into
Conversation
753befe to
460553d
Compare
CBMC Results (ML-DSA-44, REDUCE-RAM)Full Results (205 proofs)
|
CBMC Results (ML-DSA-87, REDUCE-RAM)Full Results (205 proofs)
|
CBMC Results (ML-DSA-65, REDUCE-RAM)Full Results (205 proofs)
|
CBMC Results (ML-DSA-65)Full Results (205 proofs)
|
CBMC Results (ML-DSA-87)Full Results (205 proofs)
|
CBMC Results (ML-DSA-44)Full Results (205 proofs)
|
There was a problem hiding this comment.
Mac Mini (M1, 2020) benchmarks (opt)
Details
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
46539 cycles |
46537 cycles |
1.00 |
ML-DSA-44 sign |
131082 cycles |
131061 cycles |
1.00 |
ML-DSA-44 verify |
47348 cycles |
47345 cycles |
1.00 |
ML-DSA-65 keypair |
81686 cycles |
81683 cycles |
1.00 |
ML-DSA-65 sign |
215322 cycles |
215331 cycles |
1.00 |
ML-DSA-65 verify |
79305 cycles |
79302 cycles |
1.00 |
ML-DSA-87 keypair |
132401 cycles |
132400 cycles |
1.00 |
ML-DSA-87 sign |
277532 cycles |
277357 cycles |
1.00 |
ML-DSA-87 verify |
134051 cycles |
134055 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
Mac Mini (M1, 2020) benchmarks (no-opt)
Details
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
112752 cycles |
112752 cycles |
1 |
ML-DSA-44 sign |
400901 cycles |
400863 cycles |
1.00 |
ML-DSA-44 verify |
119445 cycles |
119445 cycles |
1 |
ML-DSA-65 keypair |
192978 cycles |
192931 cycles |
1.00 |
ML-DSA-65 sign |
649977 cycles |
649957 cycles |
1.00 |
ML-DSA-65 verify |
192863 cycles |
192871 cycles |
1.00 |
ML-DSA-87 keypair |
318842 cycles |
318724 cycles |
1.00 |
ML-DSA-87 sign |
828816 cycles |
828761 cycles |
1.00 |
ML-DSA-87 verify |
326790 cycles |
326654 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
Arm Cortex-A72 (Raspberry Pi 4) benchmarks (opt)
Details
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
231326 cycles |
226291 cycles |
1.02 |
ML-DSA-44 sign |
636953 cycles |
613701 cycles |
1.04 |
ML-DSA-44 verify |
218748 cycles |
223287 cycles |
0.98 |
ML-DSA-65 keypair |
401703 cycles |
401301 cycles |
1.00 |
ML-DSA-65 sign |
1038618 cycles |
1019858 cycles |
1.02 |
ML-DSA-65 verify |
382587 cycles |
377404 cycles |
1.01 |
ML-DSA-87 keypair |
663545 cycles |
662182 cycles |
1.00 |
ML-DSA-87 sign |
1385036 cycles |
1364271 cycles |
1.02 |
ML-DSA-87 verify |
639267 cycles |
646030 cycles |
0.99 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
⚠️ Performance Alert ⚠️
Possible performance regression was detected for benchmark 'Arm Cortex-A72 (Raspberry Pi 4) benchmarks (opt)'.
Benchmark result of this commit is worse than the previous benchmark result exceeding threshold 1.03.
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 sign |
636953 cycles |
613701 cycles |
1.04 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Intel Xeon 4th gen (c7i)
Details
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
43326 cycles |
43362 cycles |
1.00 |
ML-DSA-44 sign |
130586 cycles |
131357 cycles |
0.99 |
ML-DSA-44 verify |
45079 cycles |
45329 cycles |
0.99 |
ML-DSA-65 keypair |
75623 cycles |
75529 cycles |
1.00 |
ML-DSA-65 sign |
214943 cycles |
215494 cycles |
1.00 |
ML-DSA-65 verify |
74300 cycles |
74395 cycles |
1.00 |
ML-DSA-87 keypair |
123196 cycles |
123052 cycles |
1.00 |
ML-DSA-87 sign |
271568 cycles |
270650 cycles |
1.00 |
ML-DSA-87 verify |
120595 cycles |
120614 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Intel Xeon 4th gen (c7i) (no-opt)
Details
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
91451 cycles |
91711 cycles |
1.00 |
ML-DSA-44 sign |
351826 cycles |
353153 cycles |
1.00 |
ML-DSA-44 verify |
99751 cycles |
100089 cycles |
1.00 |
ML-DSA-65 keypair |
153919 cycles |
153963 cycles |
1.00 |
ML-DSA-65 sign |
571974 cycles |
570637 cycles |
1.00 |
ML-DSA-65 verify |
160011 cycles |
159765 cycles |
1.00 |
ML-DSA-87 keypair |
255432 cycles |
256233 cycles |
1.00 |
ML-DSA-87 sign |
726067 cycles |
727004 cycles |
1.00 |
ML-DSA-87 verify |
264170 cycles |
264081 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
AMD EPYC 3rd gen (c6a)
Details
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
55425 cycles |
55232 cycles |
1.00 |
ML-DSA-44 sign |
159735 cycles |
159604 cycles |
1.00 |
ML-DSA-44 verify |
57634 cycles |
57852 cycles |
1.00 |
ML-DSA-65 keypair |
96240 cycles |
95882 cycles |
1.00 |
ML-DSA-65 sign |
263039 cycles |
264042 cycles |
1.00 |
ML-DSA-65 verify |
96049 cycles |
96233 cycles |
1.00 |
ML-DSA-87 keypair |
154779 cycles |
154587 cycles |
1.00 |
ML-DSA-87 sign |
320859 cycles |
322317 cycles |
1.00 |
ML-DSA-87 verify |
151689 cycles |
151310 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
AMD EPYC 3rd gen (c6a) (no-opt)
Details
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
133300 cycles |
133370 cycles |
1.00 |
ML-DSA-44 sign |
519465 cycles |
519316 cycles |
1.00 |
ML-DSA-44 verify |
146603 cycles |
146733 cycles |
1.00 |
ML-DSA-65 keypair |
224063 cycles |
224213 cycles |
1.00 |
ML-DSA-65 sign |
843482 cycles |
843252 cycles |
1.00 |
ML-DSA-65 verify |
234302 cycles |
234223 cycles |
1.00 |
ML-DSA-87 keypair |
367801 cycles |
367144 cycles |
1.00 |
ML-DSA-87 sign |
1060988 cycles |
1060336 cycles |
1.00 |
ML-DSA-87 verify |
381200 cycles |
380930 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Graviton2
Details
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
112335 cycles |
112525 cycles |
1.00 |
ML-DSA-44 sign |
354217 cycles |
354017 cycles |
1.00 |
ML-DSA-44 verify |
117414 cycles |
117394 cycles |
1.00 |
ML-DSA-65 keypair |
194466 cycles |
194697 cycles |
1.00 |
ML-DSA-65 sign |
584370 cycles |
584501 cycles |
1.00 |
ML-DSA-65 verify |
193464 cycles |
193283 cycles |
1.00 |
ML-DSA-87 keypair |
320864 cycles |
320987 cycles |
1.00 |
ML-DSA-87 sign |
747862 cycles |
746658 cycles |
1.00 |
ML-DSA-87 verify |
318010 cycles |
318698 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
AMD EPYC 4th gen (c7a)
Details
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
46713 cycles |
47001 cycles |
0.99 |
ML-DSA-44 sign |
140686 cycles |
139110 cycles |
1.01 |
ML-DSA-44 verify |
49619 cycles |
49262 cycles |
1.01 |
ML-DSA-65 keypair |
82371 cycles |
82519 cycles |
1.00 |
ML-DSA-65 sign |
228022 cycles |
227885 cycles |
1.00 |
ML-DSA-65 verify |
82195 cycles |
82013 cycles |
1.00 |
ML-DSA-87 keypair |
130193 cycles |
129228 cycles |
1.01 |
ML-DSA-87 sign |
280026 cycles |
279733 cycles |
1.00 |
ML-DSA-87 verify |
128511 cycles |
128347 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Graviton4
Details
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
67357 cycles |
67336 cycles |
1.00 |
ML-DSA-44 sign |
198377 cycles |
198343 cycles |
1.00 |
ML-DSA-44 verify |
70274 cycles |
70240 cycles |
1.00 |
ML-DSA-65 keypair |
119485 cycles |
119389 cycles |
1.00 |
ML-DSA-65 sign |
326268 cycles |
325890 cycles |
1.00 |
ML-DSA-65 verify |
117012 cycles |
116943 cycles |
1.00 |
ML-DSA-87 keypair |
196441 cycles |
196722 cycles |
1.00 |
ML-DSA-87 sign |
421298 cycles |
421918 cycles |
1.00 |
ML-DSA-87 verify |
193282 cycles |
193428 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
Arm Cortex-A72 (Raspberry Pi 4) benchmarks (no-opt)
Details
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
299428 cycles |
301288 cycles |
0.99 |
ML-DSA-44 sign |
1141001 cycles |
1145421 cycles |
1.00 |
ML-DSA-44 verify |
330622 cycles |
332207 cycles |
1.00 |
ML-DSA-65 keypair |
563729 cycles |
547720 cycles |
1.03 |
ML-DSA-65 sign |
1920855 cycles |
1895789 cycles |
1.01 |
ML-DSA-65 verify |
539918 cycles |
531514 cycles |
1.02 |
ML-DSA-87 keypair |
861758 cycles |
855749 cycles |
1.01 |
ML-DSA-87 sign |
2426173 cycles |
2378667 cycles |
1.02 |
ML-DSA-87 verify |
906246 cycles |
883654 cycles |
1.03 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
AMD EPYC 4th gen (c7a) (no-opt)
Details
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
118433 cycles |
118101 cycles |
1.00 |
ML-DSA-44 sign |
458646 cycles |
458153 cycles |
1.00 |
ML-DSA-44 verify |
130869 cycles |
130875 cycles |
1.00 |
ML-DSA-65 keypair |
200886 cycles |
200971 cycles |
1.00 |
ML-DSA-65 sign |
745083 cycles |
742473 cycles |
1.00 |
ML-DSA-65 verify |
209164 cycles |
209101 cycles |
1.00 |
ML-DSA-87 keypair |
331397 cycles |
332976 cycles |
1.00 |
ML-DSA-87 sign |
938261 cycles |
938796 cycles |
1.00 |
ML-DSA-87 verify |
343275 cycles |
342887 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Graviton4 (no-opt)
Details
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
127600 cycles |
127655 cycles |
1.00 |
ML-DSA-44 sign |
441155 cycles |
441153 cycles |
1.00 |
ML-DSA-44 verify |
136410 cycles |
136366 cycles |
1.00 |
ML-DSA-65 keypair |
220532 cycles |
220720 cycles |
1.00 |
ML-DSA-65 sign |
714274 cycles |
713831 cycles |
1.00 |
ML-DSA-65 verify |
221102 cycles |
220771 cycles |
1.00 |
ML-DSA-87 keypair |
364562 cycles |
365122 cycles |
1.00 |
ML-DSA-87 sign |
915619 cycles |
921347 cycles |
0.99 |
ML-DSA-87 verify |
370883 cycles |
370803 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Graviton2 (no-opt)
Details
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
212223 cycles |
211830 cycles |
1.00 |
ML-DSA-44 sign |
761057 cycles |
759846 cycles |
1.00 |
ML-DSA-44 verify |
229916 cycles |
229343 cycles |
1.00 |
ML-DSA-65 keypair |
378717 cycles |
377180 cycles |
1.00 |
ML-DSA-65 sign |
1247998 cycles |
1247170 cycles |
1.00 |
ML-DSA-65 verify |
373347 cycles |
371571 cycles |
1.00 |
ML-DSA-87 keypair |
602516 cycles |
600508 cycles |
1.00 |
ML-DSA-87 sign |
1584868 cycles |
1584292 cycles |
1.00 |
ML-DSA-87 verify |
618340 cycles |
616048 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Intel Xeon 3rd gen (c6i)
Details
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
61876 cycles |
61490 cycles |
1.01 |
ML-DSA-44 sign |
189410 cycles |
189504 cycles |
1.00 |
ML-DSA-44 verify |
66354 cycles |
66611 cycles |
1.00 |
ML-DSA-65 keypair |
112014 cycles |
109562 cycles |
1.02 |
ML-DSA-65 sign |
320464 cycles |
315340 cycles |
1.02 |
ML-DSA-65 verify |
109875 cycles |
109818 cycles |
1.00 |
ML-DSA-87 keypair |
171027 cycles |
171377 cycles |
1.00 |
ML-DSA-87 sign |
379568 cycles |
379373 cycles |
1.00 |
ML-DSA-87 verify |
170309 cycles |
170479 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Intel Xeon 3rd gen (c6i) (no-opt)
Details
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
154733 cycles |
154452 cycles |
1.00 |
ML-DSA-44 sign |
590183 cycles |
589881 cycles |
1.00 |
ML-DSA-44 verify |
169693 cycles |
170191 cycles |
1.00 |
ML-DSA-65 keypair |
262845 cycles |
263626 cycles |
1.00 |
ML-DSA-65 sign |
965740 cycles |
966009 cycles |
1.00 |
ML-DSA-65 verify |
272539 cycles |
273308 cycles |
1.00 |
ML-DSA-87 keypair |
432704 cycles |
431959 cycles |
1.00 |
ML-DSA-87 sign |
1211442 cycles |
1210781 cycles |
1.00 |
ML-DSA-87 verify |
448136 cycles |
447098 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
Arm Cortex-A76 (Raspberry Pi 5) benchmarks (opt)
Details
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
112168 cycles |
112136 cycles |
1.00 |
ML-DSA-44 sign |
353497 cycles |
353809 cycles |
1.00 |
ML-DSA-44 verify |
117008 cycles |
117213 cycles |
1.00 |
ML-DSA-65 keypair |
194787 cycles |
194348 cycles |
1.00 |
ML-DSA-65 sign |
583930 cycles |
583675 cycles |
1.00 |
ML-DSA-65 verify |
192719 cycles |
193087 cycles |
1.00 |
ML-DSA-87 keypair |
320908 cycles |
320087 cycles |
1.00 |
ML-DSA-87 sign |
747304 cycles |
747202 cycles |
1.00 |
ML-DSA-87 verify |
318766 cycles |
317903 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Graviton3
Details
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
71374 cycles |
71546 cycles |
1.00 |
ML-DSA-44 sign |
208951 cycles |
208995 cycles |
1.00 |
ML-DSA-44 verify |
74784 cycles |
74742 cycles |
1.00 |
ML-DSA-65 keypair |
125930 cycles |
125949 cycles |
1.00 |
ML-DSA-65 sign |
345622 cycles |
345451 cycles |
1.00 |
ML-DSA-65 verify |
124102 cycles |
124199 cycles |
1.00 |
ML-DSA-87 keypair |
207053 cycles |
206632 cycles |
1.00 |
ML-DSA-87 sign |
443985 cycles |
439852 cycles |
1.01 |
ML-DSA-87 verify |
204072 cycles |
204472 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Graviton3 (no-opt)
Details
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
137939 cycles |
138023 cycles |
1.00 |
ML-DSA-44 sign |
486208 cycles |
486039 cycles |
1.00 |
ML-DSA-44 verify |
149048 cycles |
149068 cycles |
1.00 |
ML-DSA-65 keypair |
241520 cycles |
241810 cycles |
1.00 |
ML-DSA-65 sign |
792069 cycles |
791663 cycles |
1.00 |
ML-DSA-65 verify |
242200 cycles |
241314 cycles |
1.00 |
ML-DSA-87 keypair |
395771 cycles |
396304 cycles |
1.00 |
ML-DSA-87 sign |
1013680 cycles |
1019188 cycles |
0.99 |
ML-DSA-87 verify |
403660 cycles |
403745 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
Arm Cortex-A76 (Raspberry Pi 5) benchmarks (no-opt)
Details
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
211517 cycles |
211659 cycles |
1.00 |
ML-DSA-44 sign |
758603 cycles |
760082 cycles |
1.00 |
ML-DSA-44 verify |
228966 cycles |
229485 cycles |
1.00 |
ML-DSA-65 keypair |
377434 cycles |
377835 cycles |
1.00 |
ML-DSA-65 sign |
1247925 cycles |
1246585 cycles |
1.00 |
ML-DSA-65 verify |
371551 cycles |
371729 cycles |
1.00 |
ML-DSA-87 keypair |
600603 cycles |
601814 cycles |
1.00 |
ML-DSA-87 sign |
1582984 cycles |
1582429 cycles |
1.00 |
ML-DSA-87 verify |
616265 cycles |
617716 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
Arm Cortex-A55 (Snapdragon 888) benchmarks (opt)
Details
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
267515 cycles |
268565 cycles |
1.00 |
ML-DSA-44 sign |
807339 cycles |
806886 cycles |
1.00 |
ML-DSA-44 verify |
269280 cycles |
269851 cycles |
1.00 |
ML-DSA-65 keypair |
461007 cycles |
459957 cycles |
1.00 |
ML-DSA-65 sign |
1320496 cycles |
1317354 cycles |
1.00 |
ML-DSA-65 verify |
447023 cycles |
445463 cycles |
1.00 |
ML-DSA-87 keypair |
788862 cycles |
788523 cycles |
1.00 |
ML-DSA-87 sign |
1808107 cycles |
1804026 cycles |
1.00 |
ML-DSA-87 verify |
770607 cycles |
772022 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
Arm Cortex-A55 (Snapdragon 888) benchmarks (no-opt)
Details
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
463001 cycles |
462832 cycles |
1.00 |
ML-DSA-44 sign |
2131558 cycles |
2130663 cycles |
1.00 |
ML-DSA-44 verify |
554513 cycles |
554950 cycles |
1.00 |
ML-DSA-65 keypair |
780676 cycles |
781350 cycles |
1.00 |
ML-DSA-65 sign |
3483972 cycles |
3478846 cycles |
1.00 |
ML-DSA-65 verify |
863731 cycles |
864624 cycles |
1.00 |
ML-DSA-87 keypair |
1265658 cycles |
1261131 cycles |
1.00 |
ML-DSA-87 sign |
4297254 cycles |
4307837 cycles |
1.00 |
ML-DSA-87 verify |
1390213 cycles |
1384611 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
SpacemiT K1 8 (Banana Pi F3) benchmarks (no-opt)
Details
| Benchmark suite | Current: 460553d | Previous: 08d40f9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
760293 cycles |
759832 cycles |
1.00 |
ML-DSA-44 sign |
3140737 cycles |
3139623 cycles |
1.00 |
ML-DSA-44 verify |
859554 cycles |
859077 cycles |
1.00 |
ML-DSA-65 keypair |
1286158 cycles |
1285222 cycles |
1.00 |
ML-DSA-65 sign |
5077195 cycles |
5072020 cycles |
1.00 |
ML-DSA-65 verify |
1364303 cycles |
1363676 cycles |
1.00 |
ML-DSA-87 keypair |
2112223 cycles |
2110495 cycles |
1.00 |
ML-DSA-87 sign |
6355356 cycles |
6366388 cycles |
1.00 |
ML-DSA-87 verify |
2228739 cycles |
2230493 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
d6ee368 to
cd47996
Compare
a0d2135 to
211fb52
Compare
| mldsa/poly_decompose_32_avx2_asm.o \ | ||
| mldsa/poly_decompose_88_avx2_asm.o \ |
There was a problem hiding this comment.
Please also add them to the README.
…mbly
Mirror the AArch64 conversion in poly_decompose_{32,88}_aarch64_asm:
replace the C intrinsics with fully-unrolled AVX2 routines, add HOL-Light
correctness and memory-safety proofs, and CBMC contracts. Helper lemmas
common to both variants are shared via the x86-only mldsa_utils.ml.
- Resolves #420
- Resolves #914
Signed-off-by: Jake Massimo <jakemas@amazon.com>
211fb52 to
b138eef
Compare
|
Added to readme |
|
This PR's branch was accidentally force-pushed while the PR was closed (I was trying to add the update my ammending the commit, but failed), which left it unreopenable on GitHub's side. Reopened the work as #1181 with identical content (rebased on latest main, 31 files). The README addition you asked for (@mkannwischer) is included there. Apologies for the churn. |
Resolves #420
Resolves #914
Performance
poly_decomposecomponent benchmark, median cycles on AMD EPYC (c6a),OPT=1 CYCLES=PMU:main)decompose_32(ML-DSA-65/87)decompose_88(ML-DSA-44)