    Can Java Speech be used to match audio and text together?


      I am planning the initial requirements for an in-house captioning program that I am working on. One feature I'd like to implement is the ability to load an audio file and a text file of its transcript, and then have the program match them together, creating a new text file with "timecodes".
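
To make the desired output concrete, here is a minimal sketch of only the last step: writing timecodes in front of each transcript line. It assumes the audio/text matching has already produced a start time in milliseconds for every line, which is the part I don't know how to do. The class name, the method names, and the `[HH:MM:SS.mmm]` format are all hypothetical, and nothing here uses the Java Speech API.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

public class TimecodeSketch {

    // Formats a millisecond offset as HH:MM:SS.mmm (hypothetical format).
    public static String formatTimecode(long millis) {
        long hours = millis / 3_600_000;
        long minutes = (millis / 60_000) % 60;
        long seconds = (millis / 1_000) % 60;
        long ms = millis % 1_000;
        return String.format(Locale.ROOT, "%02d:%02d:%02d.%03d", hours, minutes, seconds, ms);
    }

    // Prefixes each transcript line with its start time.
    // Assumption: startMillis[i] is the start time of lines.get(i),
    // supplied by whatever alignment step ends up being used.
    public static List<String> stamp(List<String> lines, long[] startMillis) {
        if (lines.size() != startMillis.length) {
            throw new IllegalArgumentException("need exactly one start time per line");
        }
        List<String> out = new ArrayList<>();
        for (int i = 0; i < lines.size(); i++) {
            out.add("[" + formatTimecode(startMillis[i]) + "] " + lines.get(i));
        }
        return out;
    }

    public static void main(String[] args) {
        // Made-up transcript lines and start times, for illustration only.
        List<String> transcript = Arrays.asList("Hello and welcome.", "Let's get started.");
        long[] starts = {1_500L, 4_250L};
        for (String line : stamp(transcript, starts)) {
            System.out.println(line);
        }
    }
}
```

The open question is where the start times would come from.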

      I spent the last 30 minutes looking through the Java Speech specs, and while it seems like this might be possible, I'd like to hear it from someone who is more familiar with the API. (Perhaps one of its developers could answer?)