with IBM Cloud - Speech to Text and Google Gemini?
Generates content from text input using the Google Gemini API. See the documentation
Generates content from both text and image input using the Gemini API. See the documentation
Generate embeddings from text input using Google Gemini. See the documentation
The IBM Cloud - Speech to Text API transforms spoken language into written text, offering a powerful tool for creating transcriptions, enabling voice control and command features, and feeding speech into analytics platforms. With Pipedream, you can build automated workflows that leverage this capability, such as transcribing meetings in real-time, analyzing customer service calls for sentiment and keywords, or even creating subtitles for videos. The ability to connect with other apps on Pipedream allows for complex workflows that can turn spoken data into actionable insights or accessible content.
import { axios } from "@pipedream/platform"
export default defineComponent({
props: {
ibm_cloud_speech_to_text: {
type: "app",
app: "ibm_cloud_speech_to_text",
}
},
async run({steps, $}) {
const data = {
"text": `hello world`,
}
return await axios($, {
method: "post",
url: `${this.ibm_cloud_speech_to_text.$auth.instance_url}/v1/synthesize`,
headers: {
"Content-Type": `application/json`,
"Accept": `audio/wav`,
},
auth: {
username: `apikey`,
password: `${this.ibm_cloud_speech_to_text.$auth.api_key}`,
},
data,
})
},
})
The Google Gemini API is a cutting-edge tool from Google that enables developers to leverage AI models like Imagen and MusicLM to create and manipulate images and music based on textual descriptions. With Pipedream, you can harness this API to automate workflows that integrate AI-generated content into a variety of applications, from generating visuals for social media posts to composing background music for videos. Pipedream's serverless platform allows you to connect Google Gemini API with other apps to create complex, event-driven workflows without managing infrastructure.
import { axios } from "@pipedream/platform"
export default defineComponent({
props: {
google_gemini: {
type: "app",
app: "google_gemini",
}
},
async run({steps, $}) {
const data = `{{your_promptt}}`;
//E.g. {"contents":[{"parts":[{"text":"Write a story about a magic backpack"}]}]}
return await axios($, {
method: "POST",
url: `https://generativelanguage.googleapis.com/v1beta/models/gemini-pro:generateContent`,
headers: {
"Content-Type": "application/json",
},
params: {
key: `${this.google_gemini.$auth.api_key}`,
},
data
})
},
})