To use an audio as data source, just add data_type as audio and pass in the path of the audio (local or hosted).

We use Deepgram to transcribe the audiot to text, and then use the generated text as the data source.

You would require an Deepgram API key which is available here to use this feature.

Without customization

import os
from embedchain import App

os.environ["DEEPGRAM_API_KEY"] = "153xxx"

app = App()
app.add("introduction.wav", data_type="audio")
response = app.query("What is my name and how old am I?")
print(response)
# Answer: Your name is Dave and you are 21 years old.

Was this page helpful?