Speech Synthesis?

eboats · March 19, 2026, 1:38pm

Is there any support in SC for doing speech synthesis ( creating voices for talking rather than singing ). With Text to Speech (TTS) applications getting better now, am wondering if there are any UGens in SC related to this.

girthrub · March 29, 2026, 9:50pm

that reminds me of a question of mine from some one-two years back (Speech synthesis). Basically the short answer back then was
a) maybe pinktrombone, but there is no sc port available
b) the TTS advances are mostly “AI” generated, so can’t be as straightforwardly ported to SC as conventional dsp and will probably be tricky real time? Maybe this has changed since then.

semiquaver · March 29, 2026, 10:02pm

there are some low latency open source text to speech tools that could maybe be used… but something like pink trombone seems super interesting… there is apparently a cpp port:

hmmm

eboats · March 30, 2026, 10:36pm

Thanks all for the info, and looks like @girthrub has already asked the question. Basically, I’d just like to be able to synthesize vocal sounds in a programming language context like SC, and write routines to manipulate vocal parameters ( even if the vocal sounds are somewhat robot-like and not as polished as what AI TTS does ). An advantage of using SC would be the ability to combine with other Ugens for interesting effects. Otherwise, I guess I could look into C++ though am not as fluent with that.

I hadn’t heard of Pink Trombone, but seems there may be an SC port ( though not sure if working )

I also noticed there was a thread on a Vowel class so maybe working with that and the Formant Ugen might be something to look into.

girthrub · March 31, 2026, 1:33am

ah nice, I was somehow firmly convinced that there was no sc port around, no idea why

jamshark70 · March 31, 2026, 2:10am

I’ve used FormantTable from sc3-plugins along with PM synthesis to produce vowels that sound a lot better than they have any right to (since it’s “just” PM).

hjh

semiquaver · March 31, 2026, 4:24pm

yep it works though repo owner calls it a work in progress and says they would like to expose more params to sc…

eboats · March 31, 2026, 6:15pm

Thanks everyone for your help with some things to explore!

toneburst · April 24, 2026, 3:03pm

I’ll be watching this thread with interest, as speech-synthesis is something I’ve long been fascinated with myself.

One thing that’s worth considering. If you need intelligable words, it’s not just a question of modelling or otherwise approximating and parameterising the vocal tract, you’ll need some kind of text-to-phoneme-stream conversion, paired with preset parameter settings for vowels, consonants etc.

This is probably not the kind of thing SuperCollider would be good at, but I’d be interested to see someone have a stab at it, to control, say the PinkTrombone vocal synthesiser.

semiquaver · May 4, 2026, 10:53pm

this seems somewhat doable - not sure how realistic the ultimate quality can get but worth a stab…

LuxEtObscuritas · May 11, 2026, 1:03pm

Regarding PinkTrombone: This person here extended the original version in various ways (including an interface for text-input), but it’s mostly written in JavaScript. The current version of the SC port works well, but I was unable to approximate the parameter combinations needed for certain phonemes. There is some text-to-speech demo video out there, but its far beyond the quality of current neural text-to-speech systems/models when it comes to intelligibility at least.

Back in the day there was Speech which was rather limited and is also deprecated now. So if you’re looking explicitly for text-to-(intelligeble)-speech with modulatable parameters, there currently seems to be no option…

toneburst · May 11, 2026, 3:36pm

Interesting! My aim, personally isn’t to create super-realistic speech. I like obviously synthetic speech, and vocal-like sounds. That said some kind of text-to-phoneme script would be great.

I’ll probably be writing some kind of script for the Norns audio computer, so if I come up with anything in SC, it will likely be the sound-generation part only, controlled by a Lua script for Norns.

I’m reasonable au-fait with JavaScript, so I may be able to convert relevant control code from JS to Lua.

Incidentally, does anyone happen to know what the “VO-6” speech-synch engine in the Elektron Monomachine is derived from?

semiquaver · May 11, 2026, 3:46pm

Claude identified some of the parameters not accessible in the port - one was ‘velar’ IIRC - might be worth vibe-coding the port to add these… I’ll try to remember to have a whack… these might be useful: Voximplant Docs

toneburst · May 11, 2026, 3:58pm

I have a bit of an obsession with Speak’n’Spell/Texas Instruments-style LPC speech. I wonder if I could use Teachable Machines to find parameter settings for standard English phonemes.

The nice thing about LPC speech resynthesis is you can really f*ck it up in interesting ways by feeding random values to the model.

dscheiba · May 11, 2026, 5:12pm

Well, there is still Speech synthesis | Apple Developer Documentation - looks like a fun project/plugin to write, using e.g. plugin commands to send the string.

Please note that GitHub - v7b1/mi-UGens: some mutable instruments eurorack modules ported to SuperCollider · GitHub has a port of Plaits, which has

A collection of speech synthesis algorithms (formant filter, SAM, LPC), with phoneme control and formant shifting. Several banks of phonemes or segments of words are available.

(
Ndef(\x, {
	MiPlaits.ar(
		pitch: 45,
		engine: 7,
		harm: \harm.kr(0.1, spec: [0.0, 1.0]),
		timbre: \timbre.kr(0.5, spec: [0.0, 1.0]),
		morph: \morph.kr(0.5, spec: [0.0, 1.0]),
		trigger: Impulse.kr(\speed.kr(1.5)),
		fm_mod: \fmMod.kr(0.0, spec: [0.0, 1.0]),
		timb_mod: \timbMod.kr(0.0, spec: [0.0, 1.0]),
		morph_mod: \morphMod.kr(0.0, spec: [0.0, 1.0])
	) * \amp.kr(0.2)
}).play.gui
)

toneburst · May 12, 2026, 8:15am

I’m a bit of an MI fanboy, going back to before they starting making Eurorack modules, so I’m aware of Plaits

LuxEtObscuritas · June 9, 2026, 1:08pm

Indeed, LPC can create very interesting textures! I guess you already checked LPCAnalyzer.ar?