
Hey RealSense Staff! TTS phoneme timing maps yet?

Robert_Oschler
Beginner
630 Views

Hello RealSense people,

I've been waiting for this feature since I posted the question during the original PercSDK contest over a year ago, so I'm asking again. Does the RealSense Text To Speech API give you phoneme timing maps yet? I'm referring to the timing information you can get with other TTS packages like NeoSpeech, which lets you time something like an animated character's mouth movements. That is, a table you get with each Text To Speech generated waveform that tells you, with accuracy, at what millisecond offset a particular phoneme occurs in the generated audio waveform. You can use that information to choose a particular mouth shape for an animated character to "lip sync" the character to the TTS waveform. You didn't have it last year. I really hope you do now! So, is this feature there?
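To make the request concrete, here is a minimal sketch of how such a table could drive lip sync. It is not RealSense or NeoSpeech code: the `timing_map` layout (start offset in milliseconds plus a phoneme label), the `MOUTH_SHAPES` table, and the `mouth_shape_at` helper are all hypothetical names made up for illustration.

```python
from bisect import bisect_right

# Hypothetical timing map for one generated waveform:
# (start offset in milliseconds, phoneme label), sorted by offset.
timing_map = [(0, "HH"), (80, "AH"), (170, "L"), (240, "OW"), (420, "sil")]

# Hypothetical phoneme-to-mouth-shape table for the animated character.
MOUTH_SHAPES = {
    "HH": "open_slight",
    "AH": "open_wide",
    "L": "tongue_up",
    "OW": "rounded",
    "sil": "closed",
}


def mouth_shape_at(timing_map, t_ms, default="closed"):
    """Return the mouth shape for the phoneme active at playback time t_ms."""
    starts = [start for start, _ in timing_map]
    # Index of the last phoneme whose start offset is <= t_ms.
    i = bisect_right(starts, t_ms) - 1
    if i < 0:
        return default  # playback time is before the first phoneme
    return MOUTH_SHAPES.get(timing_map[i][1], default)
```

On each animation frame you would call `mouth_shape_at(timing_map, playback_ms)` with the current audio playback position and show the returned shape. Without the timing table from the TTS engine there is nothing reliable to feed into that lookup, which is why I'm asking for it.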

0 Kudos
5 Replies
Robert_Oschler
Beginner

I hope you don't mind me bumping this up the forum.  I'd really like an answer. 

Robert

0 Kudos
Colleen_C_Intel
Employee

What we have currently available is in the released SDK.

0 Kudos
Robert_Oschler
Beginner

Hi Colleen. Are you saying that the current RealSense SDK does have phoneme timing maps, unlike the PercSDK?

0 Kudos
Colleen_C_Intel
Employee

I'm saying that the content in the released Gold R1 2014 SDK is what is currently available for the RealSense camera. Items in it that are not yet at Gold are marked experimental or beta. Other items may be on the roadmap.

0 Kudos
Robert_Oschler
Beginner

Ok, thanks Colleen. 

0 Kudos