with AssemblyAI and Google Gemini?
Emit new event when a transcribed audio file from AssemblyAI is ready. See the documentation
Export your completed transcripts in SRT (srt) or VTT (vtt) format, which can be used for subtitles and closed captions in videos. See the documentation
Generates content from text input using the Google Gemini API. See the documentation
Fetches a specific transcribed result from the AssemblyAI API. See the documentation
Generates content from both text and image input using the Gemini API. See the documentation
Generate embeddings from text input using Google Gemini. See the documentation
The AssemblyAI API provides powerful speech recognition and natural language processing capabilities. It allows users to transcribe audio, analyze sentiment, detect topics, and more. In Pipedream, you can leverage these features to create automated workflows that process audio and text data. Connect AssemblyAI to various apps and services, trigger actions based on the API's output, and build robust, serverless data pipelines.
import { axios } from "@pipedream/platform"
export default defineComponent({
props: {
assemblyai: {
type: "app",
app: "assemblyai",
}
},
async run({steps, $}) {
const data = {
"audio_url": `{{your_audio_url}}`,
//for testing, try: https://storage.googleapis.com/aai-web-samples/espn-bears.m4a
}
return await axios($, {
method: "POST",
url: `https://api.assemblyai.com/v2/transcript`,
headers: {
"authorization": `${this.assemblyai.$auth.api_key}`,
},
data,
})
},
})
The Google Gemini API is a cutting-edge tool from Google that enables developers to leverage AI models like Imagen and MusicLM to create and manipulate images and music based on textual descriptions. With Pipedream, you can harness this API to automate workflows that integrate AI-generated content into a variety of applications, from generating visuals for social media posts to composing background music for videos. Pipedream's serverless platform allows you to connect Google Gemini API with other apps to create complex, event-driven workflows without managing infrastructure.
import { axios } from "@pipedream/platform"
export default defineComponent({
props: {
google_gemini: {
type: "app",
app: "google_gemini",
}
},
async run({steps, $}) {
const data = `{{your_promptt}}`;
//E.g. {"contents":[{"parts":[{"text":"Write a story about a magic backpack"}]}]}
return await axios($, {
method: "POST",
url: `https://generativelanguage.googleapis.com/v1beta/models/gemini-pro:generateContent`,
headers: {
"Content-Type": "application/json",
},
params: {
key: `${this.google_gemini.$auth.api_key}`,
},
data
})
},
})