I'm developing an Android app with the Baidu speech recognition SDK. At the moment, once recognition finishes, the cloud returns an audio file or text, which the app then plays back. I now want the user to be able to interrupt that playback by speaking a keyword. The problem is that the microphone also records the playback itself, so the keyword cannot be recognized correctly. Does anyone know how to solve this?