Experimental speech streaming for LMNT (useChat/useCompletion React) #922

lgrammel · 2024-01-17T15:19:16Z

Summary

Adds speech streaming to useChat and useCompletion with streamData.

useCompletion & useChat (for React) provide a experimental_speechUrl that can be used html audio elements
Integration functions for lmnt speech streams through experimental_forwardLmntSpeechStream
streamData.experimental_appendSpeech: add speech stream chunks to data stream (used automatically through forward functions)
Example: examples/next-lmnt: LMNT completion & chat speech streaming
Docs: LMNT provider docs, API docs for experimental_forwardLmntSpeechStream

Notes

The LMNT SDK does not work in the edge environment (as of v1.1.2)

untilhamza · 2024-01-18T02:12:20Z

This is exciting

llermaly · 2024-01-24T02:08:06Z

This is awesome!

llermaly · 2024-01-26T01:22:22Z

@lgrammel Hi Lars, I tried to test this one locally with no luck, it is showing this error:

 ⚠ ./app/api/chat-speech-elevenlabs/route.ts
Attempted import error: 'forwardModelFusionSpeechStream' is not exported from 'ai' (imported as 'forwardModelFusionSpeechStream').

I go to node_modules/ai and I see the function there, not sure if I need to do anything else. (I cloned the fork, checkout to the branch and run the example)

It is ready to test?

Thanks!

lgrammel · 2024-01-26T09:47:45Z

@lgrammel Hi Lars, I tried to test this one locally with no luck, it is showing this error:
 ⚠ ./app/api/chat-speech-elevenlabs/route.ts
Attempted import error: 'forwardModelFusionSpeechStream' is not exported from 'ai' (imported as 'forwardModelFusionSpeechStream').
I go to node_modules/ai and I see the function there, not sure if I need to do anything else. (I cloned the fork, checkout to the branch and run the example)

It is ready to test?

Thanks!

Have you rebuilt the ai package? The easiest way is to just rebuild the whole repository (pnpm i, pnpm build) and then try out the example.

llermaly · 2024-01-27T15:46:27Z

@lgrammel Hi Lars, I tried to test this one locally with no luck, it is showing this error:
 ⚠ ./app/api/chat-speech-elevenlabs/route.ts
Attempted import error: 'forwardModelFusionSpeechStream' is not exported from 'ai' (imported as 'forwardModelFusionSpeechStream').
I go to node_modules/ai and I see the function there, not sure if I need to do anything else. (I cloned the fork, checkout to the branch and run the example)
It is ready to test?
Thanks!
Have you rebuilt the ai package? The easiest way is to just rebuild the whole repository (pnpm i, pnpm build) and then try out the example.

That did the trick thank you!. I was doing npm run dev , I did pnpm build , npm start and it worked.

It works really, really fast. I hope we can get this merged very soon.

llermaly · 2024-01-30T14:22:59Z

Hi @MaxLeiter! did you have a chance to take a look?

tgonzales · 2024-02-22T20:54:45Z

Hello @lgrammel I saw that you changed from eleven labs to LMNT, there is a technical reason for this, eleven labs supports multi languages, LMNT still has no plans to launch this, wouldn't it be interesting to keep both options?

Thank you and congratulations for the excellent work

lgrammel · 2024-02-23T09:18:07Z

Hello @lgrammel I saw that you changed from eleven labs to LMNT, there is a technical reason for this, eleven labs supports multi languages, LMNT still has no plans to launch this, wouldn't it be interesting to keep both options?

Thank you and congratulations for the excellent work

Thanks. We want to use the official elevenlabs node SDK, but it does not support duplex streaming yet: elevenlabs/elevenlabs-js#4

In the meantime, you could use modelfusion elevenlabs with the adapter that I had in an earlier version of this PR.

Iven2132 · 2024-03-05T09:14:36Z

@lgrammel Hi! I can't find the example app for speech streaming in the Vercel AI SDK repo. where it's gone?

lgrammel · 2024-03-05T09:24:18Z

@lgrammel Hi! I can't find the example app for speech streaming in the Vercel AI SDK repo. where it's gone?

this feature has not been merged yet

Iven2132 · 2024-03-05T10:33:55Z

Hi @MaxLeiter Can you merge this?

Iven2132 · 2024-03-09T08:36:41Z

Hi @MaxLeiter Can you please approve this?

llermaly · 2024-03-09T18:12:48Z

bump

pixelcatgg · 2024-03-10T16:11:42Z

we could really use this as well 🙏 thank you so much for the work on this

shaper · 2024-03-15T18:22:30Z

examples/next-lmnt/app/api/chat/route.ts

+const speech = new Speech(process.env.LMNT_API_KEY || 'no key');
+
+// Note: The LMNT SDK does not work on edge yet (as of v1.1.2)
+// export const runtime = 'edge';


Hi @lgrammel FYI @kaikato just merged a short README in lmnt-node describing how we got this working -- if you see an even better way let us know, but with the one change to the next.config.js file it should work with edge. lmnt-com/lmnt-node#32

I thought to add that when we were hacking on vercel/ai-chatbot#151 I did see issues that looked like a challenge re: websockets staying alive on edge and so for deployment I switched to nodejs and didn't look further at the time. You can see the deployment focused work I did atop that PR here: https://github.com/shaper/lmnt-ai-chatbot/commits/main/

@lgrammel @MaxLeiter When TTS will come?

allenchuang · 2024-05-01T19:21:29Z

Any reason why this is closed? TTS is a great feature to have

solanacryptodev · 2024-05-14T23:54:17Z

Would be cool to see TTS added with the addition of gpt-4o

alokwhitewolf · 2024-05-20T05:57:58Z

@lgrammel / @MaxLeiter
Any follow up plans on adding TTS to vercel AI ?

alicercedigital · 2024-09-08T20:58:06Z

we are very excited about this!

Speech streaming prototype.

da585bb

lgrammel self-assigned this Jan 17, 2024

lgrammel added 16 commits January 18, 2024 17:04

Extract forwardLmntSpeechStream. Cleanup route code.

feb1868

Add 11labs example.

0333d04

Switch voice.

28cb758

Refactor to useSWR.

afc5a7d

Cleanup.

fa8826b

Prettier fix.

d988947

Set key for CI.

0e73024

Extract useMediaSource hook.

ebfde61

Add chat example.

58430b9

Add styles.

2e0de35

Edge support for elevenlabs.

e7bcfce

Fix bug.

e621899

Add useCallback.

d3ce315

Improve LMNT streaming.

93fbef3

Revert.

5bffd51

Cleanup.

759e890

lgrammel changed the title ~~[WIP] Speech streaming prototype.~~ [RFC] Speech streaming prototype Jan 23, 2024

lgrammel requested a review from MaxLeiter January 23, 2024 18:54

Add missing dependencies.

0ce4678

Merge branch 'main' into lg/use-speech

1974628

llermaly mentioned this pull request Jan 27, 2024

Text to Speech utils? #885

Open

Merge branch 'main' into lg/use-speech

02ac372

lgrammel added 3 commits February 19, 2024 10:54

Switch LMNT voice to 'lily'.

9b0af88

Revert changes.

8e90ad6

Hide audio elements.

7287847

lgrammel changed the title ~~[RFC] Speech streaming prototype~~ Speech streaming for LMNT & ElevenLabs Feb 19, 2024

lgrammel added 3 commits February 19, 2024 11:31

Mark forward functions as experimental.

80c060e

Mark speechUrl as experimental.

740b6a1

Mark appendSpeech as experimental.

bc0f578

lgrammel changed the title ~~Speech streaming for LMNT & ElevenLabs~~ Experimental speech streaming for LMNT & ElevenLabs (useChat/useCompletion React) Feb 19, 2024

Remove elevenlabs integration.

e552976

lgrammel changed the title ~~Experimental speech streaming for LMNT & ElevenLabs (useChat/useCompletion React)~~ Experimental speech streaming for LMNT (useChat/useCompletion React) Feb 19, 2024

lgrammel added 3 commits February 19, 2024 12:24

Add LMNT provider docs.

8ae1f59

Add experimental_speechURL API docs.

8b62b2f

Add API docs for experimental_forwardLmntSpeechStream.

c9c9f8e

lgrammel marked this pull request as ready for review February 19, 2024 13:18

Add changeset.

acb8e83

lgrammel mentioned this pull request Feb 19, 2024

Proposing onToken callback method for useChat hook #793

Open

shaper reviewed Mar 15, 2024

View reviewed changes

lgrammel closed this Apr 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Experimental speech streaming for LMNT (useChat/useCompletion React) #922

Experimental speech streaming for LMNT (useChat/useCompletion React) #922

lgrammel commented Jan 17, 2024 •

edited

Loading

untilhamza commented Jan 18, 2024

llermaly commented Jan 24, 2024

llermaly commented Jan 26, 2024

lgrammel commented Jan 26, 2024

llermaly commented Jan 27, 2024 •

edited

Loading

llermaly commented Jan 30, 2024

tgonzales commented Feb 22, 2024

lgrammel commented Feb 23, 2024

Iven2132 commented Mar 5, 2024

lgrammel commented Mar 5, 2024

Iven2132 commented Mar 5, 2024

Iven2132 commented Mar 9, 2024

llermaly commented Mar 9, 2024

pixelcatgg commented Mar 10, 2024

shaper Mar 15, 2024

shaper Mar 15, 2024

Iven2132 Mar 18, 2024

allenchuang commented May 1, 2024

solanacryptodev commented May 14, 2024

alokwhitewolf commented May 20, 2024

alicercedigital commented Sep 8, 2024

Experimental speech streaming for LMNT (useChat/useCompletion React) #922

Experimental speech streaming for LMNT (useChat/useCompletion React) #922

Conversation

lgrammel commented Jan 17, 2024 • edited Loading

Summary

Notes

untilhamza commented Jan 18, 2024

llermaly commented Jan 24, 2024

llermaly commented Jan 26, 2024

lgrammel commented Jan 26, 2024

llermaly commented Jan 27, 2024 • edited Loading

llermaly commented Jan 30, 2024

tgonzales commented Feb 22, 2024

lgrammel commented Feb 23, 2024

Iven2132 commented Mar 5, 2024

lgrammel commented Mar 5, 2024

Iven2132 commented Mar 5, 2024

Iven2132 commented Mar 9, 2024

llermaly commented Mar 9, 2024

pixelcatgg commented Mar 10, 2024

shaper Mar 15, 2024

Choose a reason for hiding this comment

shaper Mar 15, 2024

Choose a reason for hiding this comment

Iven2132 Mar 18, 2024

Choose a reason for hiding this comment

allenchuang commented May 1, 2024

solanacryptodev commented May 14, 2024

alokwhitewolf commented May 20, 2024

alicercedigital commented Sep 8, 2024

lgrammel commented Jan 17, 2024 •

edited

Loading

llermaly commented Jan 27, 2024 •

edited

Loading