Adds X11K algorithm #48

bedri · 2020-11-16T14:30:02Z

This pull request adds the X11K algorithm to cpuminer-multi. X11K is currently used for mining Kyanite Coin.

Fix some existing bugs: cryptonight hashrate log and lock when missing stratum diff

colors: enable colored output by default and also trap signals on windows (Ctrl+C)

Current state: much slower than linux (and x64 almost twice the x86 speed)

Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>

tested ok with curl-7.38.0 and openssl 1.0.1j; it was really not easy to set up.

ssl config:

    CROSS_COMPILE="x86_64-w64-mingw32-" ./Configure mingw64 no-asm no-shared
    make && make install

curl config:

    extraopts=--enable-ipv6
    extraflags="-DOPENSSL_NO_ASM -D_THREAD_SAFE"
    openssl=/usr/local/ssl
    CROSS_COMPILE="x86_64-w64-mingw32-" ./configure --enable-shared=no \
      --disable-manual --without-libssh2 --disable-rtsp --disable-ldap \
      --disable-dict --disable-pop3 --disable-ftp --disable-telnet --disable-tftp \
      --disable-smtp --disable-imap --disable-ldaps --disable-gopher --with-zlib \
      --with-ssl=$openssl --with-libssl-prefix=$openssl CPPFLAGS="$extraflags" ${extraopts}

Signed-off-by: Tanguy Pruvot

ARM boards don't build when enabling ASM, and some of them benefit from using -march=native. Let's have a dedicated build file for this. Also it takes care of cleaning old autoconf/automake remains that can make the build fail after pulling updates.

It's really expensive to use memcpy() to copy 16 kB of data on some small processors like the ARM Cortex-A53, which only have 64-bit data paths from the L1 cache. This roughly consumes 2k cycles just for the copy. "Perf top" shows that half of the time is spent in memcpy(), and given that this exhausts the L1 cache, the rest of the operations must cause a lot of thrashing.

Since there are few modifications applied to the rambox between two consecutive calls, it is better to keep a history of recent changes inside the context itself. This doesn't cost much because the write bus between the CPU and the L1 cache is 128 bits wide on the A53, so we can afford a few writes. Also, the typical number of updates apparently is between 16 and 32, so it makes sense to put an upper bound of 32 and keep the memory footprint low.

The performance is roughly multiplied by 5 on the A53 just by doing this; the hash rate reaches about 14.4k/s on the NanoPI-Neo4, or almost 10 times the performance of the original code.
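The change-history idea described above can be sketched as a small undo log. This is only an illustration under stated assumptions, not the actual rainforest code: `struct box_ctx`, `box_write()` and `box_restore()` are hypothetical names, and the only figures taken from the commit text are the 16 kB buffer size and the bound of 32 recorded changes.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define RAMBOX_WORDS 2048 /* 16 kB of 64-bit words, per the commit text */
#define HIST_MAX     32   /* upper bound mentioned in the commit text */

/* Hypothetical context: the buffer is modified in place and every
 * write is logged so it can be undone without copying 16 kB. */
struct box_ctx {
    uint64_t *rambox;             /* shared working buffer */
    uint32_t  hist_idx[HIST_MAX]; /* indices of the modified words */
    uint64_t  hist_old[HIST_MAX]; /* values those words held before */
    uint32_t  changes;            /* number of logged writes */
};

/* Write one word and remember its previous value.
 * Returns 0 on success, -1 when the history is full (a caller would
 * then have to fall back to some other strategy, e.g. a full copy). */
static int box_write(struct box_ctx *ctx, uint32_t idx, uint64_t val)
{
    assert(idx < RAMBOX_WORDS);
    if (ctx->changes >= HIST_MAX)
        return -1;
    ctx->hist_idx[ctx->changes] = idx;
    ctx->hist_old[ctx->changes] = ctx->rambox[idx];
    ctx->changes++;
    ctx->rambox[idx] = val;
    return 0;
}

/* Undo the logged writes in reverse order, so a word written several
 * times ends up with its original value. */
static void box_restore(struct box_ctx *ctx)
{
    while (ctx->changes > 0) {
        ctx->changes--;
        ctx->rambox[ctx->hist_idx[ctx->changes]] = ctx->hist_old[ctx->changes];
    }
}
```

Restoring costs at most 32 word writes instead of a 16 kB copy, which is the trade-off the commit message argues for.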

tpruvot and others added 30 commits August 26, 2014 23:36

util: add other algos in cputest, in color

112f76b

blake: fix rounds for blake256

324afb5

should be set to 14 for NEOS-blake and pentablake

x11: sizeof fix, break was hiding the bug

14f0acf

remove extra memset in hash functions

9e74d1d

hash funcs: do some cleanup...

df0c0ad

also ensure blake context was initialised...

[BUGFIX] Fix stale share display bug

e9dce85

fix last warnings

df27be4

rpc: use the user agent constant

d213fe9

NeoScrypt Support (simplified)

15f243e

based on https://github.com/ghostlander/cpuminer-neoscrypt with reduced changes in cpu-miner.c

Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>

neoscrypt: use official blake2s

50f955b

version: add compiler and pthreads version

ccccf3b

also fix some remaining aligned attributes for VC++

TODO: support ASM linkage in VC2013 (USE_ASM define)

update README for v1.0.5

756db9e

add curl and crypto to travis dependencies

322e5b6

and update icon branch in README

scrypt: fix segfault related to neoscrypt nfactor

f27cba6

vc2013: more fixes playing with defines

f34c567

fix for single core cpus

f64adb3

also add a linux build script

build.sh: compat with gcc 4.4

5751a49

add gcc flags to MinGW64 build script

2cf78c9

debug: add -f option to test with reduced diff

070ff29

Add S3 algo

98a5c87

mingw64: gcc 4.9 doesnt like every obscure cflags

4406425

mingw: fix neoscrypt bug (related to cflags)

3bf8615

also reduce x11 intensity to output a bit more while benchmarking

Add Qubit and Nist5 algos

9f3083c

stratum: show url and height on new blocks

7e4b0ea

and fix qubit difficulty

Fix for non-rpc2 solo mining

d0a018a

and update http headers

solo: switch to getwork on unhandled gbt versions

f5b2e50

version seen: 1024 (Neos), 6 (Mist in PoW phase)

blake: full rewrite with a midstate cache

cb38b41

speed increased by a factor of 2 (same logic as I used in ccminer)

Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
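A midstate cache, in general terms, hashes the part of the block header that stays constant once and reuses the resulting state for every nonce. The sketch below shows only that pattern and is not the blake code from this commit: a toy FNV-1a style update stands in for the real compression function, and all names (`toy_update`, `hash_full`, `hash_from_midstate`) are hypothetical.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define FNV_INIT  2166136261u
#define FNV_PRIME 16777619u

/* Toy incremental hash (FNV-1a), used here only as a stand-in. */
static uint32_t toy_update(uint32_t state, const uint8_t *data, size_t len)
{
    assert(data != NULL);
    for (size_t i = 0; i < len; i++) {
        state ^= data[i];
        state *= FNV_PRIME;
    }
    return state;
}

/* Absorb a 32-bit nonce as four little-endian bytes. */
static uint32_t toy_nonce(uint32_t state, uint32_t nonce)
{
    const uint8_t n[4] = {
        (uint8_t)nonce, (uint8_t)(nonce >> 8),
        (uint8_t)(nonce >> 16), (uint8_t)(nonce >> 24)
    };
    return toy_update(state, n, sizeof n);
}

/* Naive path: rehash the constant prefix for every nonce. */
static uint32_t hash_full(const uint8_t *prefix, size_t len, uint32_t nonce)
{
    return toy_nonce(toy_update(FNV_INIT, prefix, len), nonce);
}

/* Cached path: the prefix was hashed once into `midstate`. */
static uint32_t hash_from_midstate(uint32_t midstate, uint32_t nonce)
{
    return toy_nonce(midstate, nonce);
}
```

A scan loop would compute `midstate = toy_update(FNV_INIT, prefix, len)` once per work unit and call `hash_from_midstate()` per nonce; both paths produce the same result, the cached one just skips the repeated prefix work.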

update curl to 7.38.0, add vc libs (x64 only)

06a8774

curl static lib built with HTTP_ONLY define to build x86 ones, check curl-for-windows project on github

tpruvot and others added 30 commits January 30, 2019 13:56

lyra2v3 algo for incoming VTC fork

6a51875

Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>

vstudio: fix typo

605ff82

rainforest: do not emit rf_crc32_table if not needed

8cbca8d

it's only needed on platforms that don't have a CRC32 instruction.

rainforest: optimize rf_rambox()

d37dae5

It is counterproductive to avoid writing 50% of the time: it adds conditional jumps which are mispredicted half of the time. Better to use conditional moves and always write. This increases performance by 6% on ARMv8.
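The difference between the two approaches can be sketched as follows. This is not the rf_rambox() code itself; `store_branchy` and `store_always` are hypothetical helpers that only contrast a conditional store with an unconditional store of a selected value.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Branchy variant: the store is skipped when `cond` is false. If
 * `cond` is effectively random, the branch is hard to predict. */
static void store_branchy(uint64_t *slot, uint64_t candidate, int cond)
{
    assert(slot != NULL);
    if (cond)
        *slot = candidate;
}

/* Branchless variant: always stores. The value is selected with a
 * mask, which a compiler can turn into a conditional move or select
 * instruction instead of a jump. */
static void store_always(uint64_t *slot, uint64_t candidate, int cond)
{
    assert(slot != NULL);
    uint64_t mask = (uint64_t)0 - (uint64_t)(cond != 0); /* all ones or zero */
    *slot = (*slot & ~mask) | (candidate & mask);
}
```

Both functions leave the same value in `*slot`; the second trades a possibly redundant write for the absence of an unpredictable branch.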

rainforest: optimize rf_ctx organization for better cache locality

3ab1167

By moving some fields in the structure, we can increase the performance by an extra 6% on Cortex-A53 at least.
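As a rough illustration of why field order matters, the sketch below compares two layouts of a hypothetical context structure. The field names and the 64-byte line size are assumptions for the example and do not reflect the real rf_ctx layout.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define CACHE_LINE 64 /* assumed cache line size in bytes */

/* Frequently used fields separated by a large, rarely used buffer. */
struct ctx_scattered {
    uint64_t hot_a;
    uint8_t  cold_buf[256];
    uint64_t hot_b;
};

/* Same fields, with the frequently used ones placed together. */
struct ctx_grouped {
    uint64_t hot_a;
    uint64_t hot_b;
    uint8_t  cold_buf[256];
};

/* True when two offsets fall in the same cache line, assuming the
 * structure itself starts on a line boundary. */
static int same_line(size_t off_a, size_t off_b)
{
    return off_a / CACHE_LINE == off_b / CACHE_LINE;
}

/* Compile-time guard so a later edit cannot silently split the hot fields. */
static_assert(offsetof(struct ctx_grouped, hot_b) + sizeof(uint64_t) <= CACHE_LINE,
              "hot fields should stay within the first cache line");
```

With the grouped layout, touching `hot_a` and `hot_b` needs one line instead of two, which is the kind of locality gain the commit message refers to.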

rainforest: avoid closing the hash when it doesn't match

a9a6056

Drop all hashes which will have one of their highest 16 bits set since they will not match. This saves 4 calls to rf256_one_round() via rf256_final() and almost doubles the performance.

rainforest: fix compilation for windows (tpruvot#37)

8719995

rainforest: add missing entry in cputest

6aa43ad

and fix modifier for arm64

Add geek algo (tpruvot#35)

6d51b16

properly...

fix os x build with asm (tpruvot#14)

0d7a97a

restore build.sh code

39fff9e

phi2: avoid possible memory overflow

c207355

Force __arm__ define on aarch64

a79da32

X16Rv2 Algorithm (for RVN and clones)

7be0721

X16Rv2: pad zeros with 32-bits integers at once

4cbcd00

readme: add zlib1g-dev dep on deb-based Linux (tpruvot#41)

bde0223

README markdown quotes (tpruvot#40)

1d6d48a

increase version to 1.3.7

04666ea

x11k algorithm is added

89bca51

Array of function pointers optimization

bf5d5c3

cleanup

d82cd3f

fixes

2070448

Adds x11k lines to vcxproj files

2d86bc3

Correction to ordering & cleanup

c2ab8aa

Merge pull request #1 from bedri/array-of-function-pointers-optimization

ec3f768

Array of function pointers optimization

Since you said it's ok, I am merging. It will be merged to linux branch anyways.
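For readers unfamiliar with the technique named in this commit, a table of function pointers replaces a `switch` over the algorithm index with a single indexed call. The sketch below is a toy under stated assumptions, not the X11K implementation: the three `toy_*` functions stand in for real hash primitives, and the rule that picks the next function from the first byte of the previous output is invented for the example.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define TOY_LEN 4

typedef void (*hash_fn)(const uint8_t *in, uint8_t *out);

/* Toy stand-ins for hash primitives; `in` and `out` must not overlap. */
static void toy_xor(const uint8_t *in, uint8_t *out)
{
    for (size_t i = 0; i < TOY_LEN; i++)
        out[i] = in[i] ^ 0x5a;
}

static void toy_add(const uint8_t *in, uint8_t *out)
{
    for (size_t i = 0; i < TOY_LEN; i++)
        out[i] = (uint8_t)(in[i] + 1);
}

static void toy_rot(const uint8_t *in, uint8_t *out)
{
    for (size_t i = 0; i < TOY_LEN; i++)
        out[i] = in[(i + 1) % TOY_LEN];
}

/* The dispatch table: one indexed call instead of a switch statement. */
static const hash_fn toy_table[] = { toy_xor, toy_add, toy_rot };
#define TOY_COUNT (sizeof toy_table / sizeof toy_table[0])

/* Run `rounds` rounds, choosing each round's function from the first
 * byte of the current state (hypothetical selection rule). */
static void toy_chain(const uint8_t *in, uint8_t *out, int rounds)
{
    uint8_t a[TOY_LEN], b[TOY_LEN];

    assert(rounds >= 0);
    memcpy(a, in, TOY_LEN);
    for (int r = 0; r < rounds; r++) {
        toy_table[a[0] % TOY_COUNT](a, b);
        memcpy(a, b, TOY_LEN);
    }
    memcpy(out, a, TOY_LEN);
}
```

Adding another primitive then only requires a new entry in `toy_table`; the loop itself does not change.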

X11K adjustments

f3a088b

improves the X11K memory allocations

70b0e30

removes the GetUint64 function from the X11K hash

11505e1
