What do you want to automate

with IBM Cloud - Speech to Text and Google Gemini?

Prompt, edit and deploy AI agents that connect to IBM Cloud - Speech to Text, Google Gemini and 2,500+ other apps in seconds.

Trusted by 1,000,000+ developers from startups to Fortune 500 companies

Adyen logo
Appcues logo
Bandwidth logo
Checkr logo
ChartMogul logo
Dataminr logo
Gopuff logo
Gorgias logo
LinkedIn logo
Logitech logo
Replicated logo
Rudderstack logo
SAS logo
Scale AI logo
Webflow logo
Warner Bros. logo
Adyen logo
Appcues logo
Bandwidth logo
Checkr logo
ChartMogul logo
Dataminr logo
Gopuff logo
Gorgias logo
LinkedIn logo
Logitech logo
Replicated logo
Rudderstack logo
SAS logo
Scale AI logo
Webflow logo
Warner Bros. logo
Generate Content from Text with the Google Gemini API

Generates content from text input using the Google Gemini API. See the documentation

 
Try it
Generate Content from Text and Image with the Google Gemini API

Generates content from both text and image input using the Gemini API. See the documentation

 
Try it
Generate Embeddings with the Google Gemini API

Generate embeddings from text input using Google Gemini. See the documentation

 
Try it
Integrate the IBM Cloud - Speech to Text API with the Google Gemini API
Setup the IBM Cloud - Speech to Text API trigger to run a workflow which integrates with the Google Gemini API. Pipedream's integration platform allows you to integrate IBM Cloud - Speech to Text and Google Gemini remarkably fast. Free for developers.

Overview of IBM Cloud - Speech to Text

The IBM Cloud - Speech to Text API transforms spoken language into written text, offering a powerful tool for creating transcriptions, enabling voice control and command features, and feeding speech into analytics platforms. With Pipedream, you can build automated workflows that leverage this capability, such as transcribing meetings in real-time, analyzing customer service calls for sentiment and keywords, or even creating subtitles for videos. The ability to connect with other apps on Pipedream allows for complex workflows that can turn spoken data into actionable insights or accessible content.

Connect IBM Cloud - Speech to Text

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
import { axios } from "@pipedream/platform"
export default defineComponent({
  props: {
    ibm_cloud_speech_to_text: {
      type: "app",
      app: "ibm_cloud_speech_to_text",
    }
  },
  async run({steps, $}) {
    const data = {
      "text": `hello world`,
    }
    return await axios($, {
      method: "post",
      url: `${this.ibm_cloud_speech_to_text.$auth.instance_url}/v1/synthesize`,
      headers: {
        "Content-Type": `application/json`,
        "Accept": `audio/wav`,
      },
      auth: {
        username: `apikey`,
        password: `${this.ibm_cloud_speech_to_text.$auth.api_key}`,
      },
      data,
    })
  },
})

Overview of Google Gemini

The Google Gemini API is a cutting-edge tool from Google that enables developers to leverage AI models like Imagen and MusicLM to create and manipulate images and music based on textual descriptions. With Pipedream, you can harness this API to automate workflows that integrate AI-generated content into a variety of applications, from generating visuals for social media posts to composing background music for videos. Pipedream's serverless platform allows you to connect Google Gemini API with other apps to create complex, event-driven workflows without managing infrastructure.

Connect Google Gemini

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
import { axios } from "@pipedream/platform"
export default defineComponent({
  props: {
    google_gemini: {
      type: "app",
      app: "google_gemini",
    }
  },
  async run({steps, $}) {
    const data = `{{your_promptt}}`;
      //E.g. {"contents":[{"parts":[{"text":"Write a story about a magic backpack"}]}]}
    return await axios($, {
      method: "POST",
      url: `https://generativelanguage.googleapis.com/v1beta/models/gemini-pro:generateContent`,
      headers: {
        "Content-Type": "application/json",        
      },      
      params: {
        key: `${this.google_gemini.$auth.api_key}`,
      },
      data
    })
  },
})

Trusted by 1,000,000+ developers from startups to Fortune 500 companies

Adyen logo
Appcues logo
Bandwidth logo
Checkr logo
ChartMogul logo
Dataminr logo
Gopuff logo
Gorgias logo
LinkedIn logo
Logitech logo
Replicated logo
Rudderstack logo
SAS logo
Scale AI logo
Webflow logo
Warner Bros. logo
Adyen logo
Appcues logo
Bandwidth logo
Checkr logo
ChartMogul logo
Dataminr logo
Gopuff logo
Gorgias logo
LinkedIn logo
Logitech logo
Replicated logo
Rudderstack logo
SAS logo
Scale AI logo
Webflow logo
Warner Bros. logo