
Hey RealSense Staff! TTS phoneme timing maps yet?

Robert_Oschler
Beginner
625 views

Hello RealSense people,

I've been waiting for this feature since I posted the question during the original PercSDK contest over a year ago, so I'm asking again. Does the RealSense Text To Speech API give you phoneme timing maps yet? I'm referring to the timing information you can get with other TTS packages like NeoSpeech, which lets you time something like an animated character's mouth movements. That is, a table you get with each Text To Speech generated waveform that tells you, with accuracy, at what millisecond offset a particular phoneme occurs in the generated audio waveform. You can use that information to choose a particular mouth shape for an animated character to "lip sync" the character to the TTS waveform. You didn't have it last year. I really hope you do now! So, is this feature there?
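
To make the request concrete, here is a minimal Python sketch of how such a table could drive lip sync. Everything in it is an assumption for illustration: the `TIMING_MAP` values, the `MOUTH_SHAPES` table, and the `mouth_shape_at` helper are hypothetical and are not part of the RealSense SDK, the PercSDK, or NeoSpeech. It only shows the lookup I have in mind: given the current playback position in milliseconds, find the phoneme that is sounding and pick a mouth shape for it.

```python
from bisect import bisect_right

# Hypothetical timing map: (offset in ms, phoneme) pairs sorted by offset.
# The values are invented for illustration, not output from a real TTS engine.
TIMING_MAP = [(0, "HH"), (80, "EH"), (190, "L"), (260, "OW"), (420, "sil")]

# Hypothetical phoneme-to-mouth-shape table for the animated character.
MOUTH_SHAPES = {
    "HH": "open_slight",
    "EH": "open_mid",
    "L": "tongue_up",
    "OW": "rounded",
    "sil": "closed",
}


def mouth_shape_at(timing_map, playback_ms, default="closed"):
    """Return the mouth shape for the phoneme active at playback_ms."""
    offsets = [offset for offset, _ in timing_map]
    # Index of the last entry whose offset is <= playback_ms.
    index = bisect_right(offsets, playback_ms) - 1
    if index < 0:
        # Playback position is before the first phoneme.
        return default
    phoneme = timing_map[index][1]
    return MOUTH_SHAPES.get(phoneme, default)
```

An animation loop would call `mouth_shape_at(TIMING_MAP, elapsed_ms)` once per frame while the waveform plays. Without accurate offsets from the TTS engine there is nothing to feed into the lookup, which is why I'm asking for the timing table.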

5 replies
Robert_Oschler
Beginner

I hope you don't mind me bumping this up the forum.  I'd really like an answer. 

Robert

Colleen_C_Intel

What we have currently available is in the released SDK.

Robert_Oschler
Beginner

Hi Colleen. Are you saying that the current RealSense SDK does have phoneme timing maps, unlike the PercSDK?

Colleen_C_Intel

I'm saying that the content in the released Gold R1 2014 SDK is what is currently available for the RealSense camera. Items that are there are marked experimental or beta if they are not yet at Gold. Other items may be on the roadmap.

Robert_Oschler
Beginner

Ok, thanks Colleen. 
