Are you talking about voice output (TTS) or voice input (speech recognition)?
Those are totally different.
For voice output, RHVoice is supposedly good. There is also a newer TTS engine app, built on the sherpa-onnx framework, with much more advanced voices:
https://k2-fsa.github.io/sherpa/onnx/tts/apk-engine.html
For voice input, you can use FUTO Voice Input or Google's own voice typing.