Have decided to simplify the API even further and reduce configuration to essentially one decision: do you want auto salting or key caching?
This pure-Java implementation appears to be about as fast as the old JNI implementation, and it supports more platforms. Going to evaluate stability on this branch before possibly merging it into master.
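
To make the "one decision" concrete, here is a rough sketch of how the two options might trade off. It is not this project's API: `KeySetupSketch`, `Mode`, `DerivedKey`, and `nextKey` are hypothetical names, and the use of PBKDF2 from the JDK is an assumption made only so the example runs. The reading assumed here is that auto salting draws a fresh random salt (and so re-derives the key) for every operation, while key caching derives the key once and reuses it.

```java
import java.security.GeneralSecurityException;
import java.security.SecureRandom;
import java.util.Arrays;
import javax.crypto.SecretKeyFactory;
import javax.crypto.spec.PBEKeySpec;

/** Hypothetical sketch; none of these names come from the project. */
public class KeySetupSketch {
    /** The single configuration decision described in the note. */
    public enum Mode { AUTO_SALTING, KEY_CACHING }

    /** A derived key together with the salt that produced it. */
    public static final class DerivedKey {
        public final byte[] salt;
        public final byte[] key;
        DerivedKey(byte[] salt, byte[] key) { this.salt = salt; this.key = key; }
    }

    private static final int ITERATIONS = 10_000; // illustrative value only
    private static final int KEY_BITS = 256;
    private static final int SALT_BYTES = 16;

    private final Mode mode;
    private final char[] password;
    private final SecureRandom random = new SecureRandom();
    private DerivedKey cached; // populated only in KEY_CACHING mode

    public KeySetupSketch(Mode mode, char[] password) {
        this.mode = mode;
        this.password = password.clone();
    }

    /** Returns the key to use for the next operation. */
    public synchronized DerivedKey nextKey() throws GeneralSecurityException {
        if (mode == Mode.KEY_CACHING) {
            // Pay the derivation cost once, then reuse the same salt and key.
            if (cached == null) {
                cached = derive(newSalt());
            }
            return cached;
        }
        // AUTO_SALTING: fresh salt, and therefore a fresh derivation, every call.
        return derive(newSalt());
    }

    private byte[] newSalt() {
        byte[] salt = new byte[SALT_BYTES];
        random.nextBytes(salt);
        return salt;
    }

    private DerivedKey derive(byte[] salt) throws GeneralSecurityException {
        PBEKeySpec spec = new PBEKeySpec(password, salt, ITERATIONS, KEY_BITS);
        try {
            byte[] key = SecretKeyFactory.getInstance("PBKDF2WithHmacSHA256")
                    .generateSecret(spec)
                    .getEncoded();
            return new DerivedKey(salt, key);
        } finally {
            spec.clearPassword();
        }
    }

    public static void main(String[] args) throws Exception {
        KeySetupSketch caching = new KeySetupSketch(Mode.KEY_CACHING, "secret".toCharArray());
        KeySetupSketch salting = new KeySetupSketch(Mode.AUTO_SALTING, "secret".toCharArray());
        System.out.println("caching reuses key: "
                + Arrays.equals(caching.nextKey().key, caching.nextKey().key));
        System.out.println("auto salting reuses key: "
                + Arrays.equals(salting.nextKey().key, salting.nextKey().key));
    }
}
```

Under these assumptions the choice is between per-operation salt uniqueness and paying the key-derivation cost only once, which may be why the two are presented as alternatives rather than independent switches.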