mirror of
https://github.com/AuxXxilium/linux_dsm_epyc7002.git
synced 2024-12-02 18:06:40 +07:00
6dd7a82cc5
Use the vector polynomial multiply-sum instructions in POWER8 to speed up crc32c. This is just over 41x faster than the slice-by-8 method that it replaces. Measurements on a 4.1 GHz POWER8 show it sustaining 52 GiB/sec. A simple btrfs write performance test: dd if=/dev/zero of=/mnt/tmpfile bs=1M count=4096 sync is over 3.7x faster. Signed-off-by: Anton Blanchard <anton@samba.org> Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au> |
||
---|---|---|
.. | ||
aes-spe-core.S | ||
aes-spe-glue.c | ||
aes-spe-keys.S | ||
aes-spe-modes.S | ||
aes-spe-regs.h | ||
aes-tab-4k.S | ||
crc32c-vpmsum_asm.S | ||
crc32c-vpmsum_glue.c | ||
Makefile | ||
md5-asm.S | ||
md5-glue.c | ||
sha1-powerpc-asm.S | ||
sha1-spe-asm.S | ||
sha1-spe-glue.c | ||
sha1.c | ||
sha256-spe-asm.S | ||
sha256-spe-glue.c |