Why am I getting an out of memory error for uploading 30-min voice recordings in my workflow with Google Drive, AWS S3, OpenAI Whisper, and Notion?

This topic was automatically generated from Slack. You can find the original thread here.

Hey everyone,

I’m trying to create a workflow where I upload voice recordings from my phone to a Google Drive folder, have those recordings pushed into an AWS S3 bucket, transcribe them with the OpenAI Whisper API, and then upload the transcription to my Notion database. These steps work for recordings under ~10 minutes, but anything longer keeps hitting an out of memory error. That seems odd to me, given a 30-minute audio recording is at most ~20 MB and my workflow memory limit is set to ~4800 MB. Any thoughts on why I may be getting this error? I even delete the variables from memory in the step where I save the files into a variable before uploading them to S3.
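For context, the S3 step buffers the whole file in a variable; streaming the download straight into the upload would keep memory flat regardless of recording length. A rough sketch of that shape (assuming a direct download URL for the Drive file and the `@aws-sdk/client-s3` / `@aws-sdk/lib-storage` packages; bucket and key names are placeholders):

```javascript
import axios from "axios";
import { S3Client } from "@aws-sdk/client-s3";
import { Upload } from "@aws-sdk/lib-storage";

// Pipe the Drive download straight into S3 instead of buffering the
// whole recording in memory first.
export async function streamToS3(driveDownloadUrl, bucket, key) {
  const { data: audioStream } = await axios.get(driveDownloadUrl, {
    responseType: "stream", // keeps memory use flat regardless of file size
  });

  const upload = new Upload({
    client: new S3Client({}), // region + credentials from the environment
    params: { Bucket: bucket, Key: key, Body: audioStream },
  });

  await upload.done();
}
```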


Could you do me a favor and take a look at the OpenAI Transcription action and see if its memory use could be improved? I noticed this too when testing with large files

just to confirm, are you hitting the Whisper / Audio API directly, or are you using our built-in Create Transcription action? Either way we should improve our action, but just curious

Figured it out, it was actually a missing param for large body lengths. The default value capped those 30+min audios
PR: [OpenAI] Fix memory exceeded for large audio files (30+ min) by andrewjschuang · Pull Request #5929 · PipedreamHQ/pipedream · GitHub
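For anyone hitting the same thing later: the relevant knob is the HTTP client’s default request-body cap. A hedged sketch of the idea (assuming the request goes through axios with a multipart form; see the PR above for the actual change):

```javascript
import axios from "axios";
import FormData from "form-data";
import { createReadStream } from "fs";

// Whisper takes multipart/form-data; without raising axios's default
// body-size cap, a 30+ minute recording can fail on the request itself.
const form = new FormData();
form.append("file", createReadStream("/tmp/recording.mp3"));
form.append("model", "whisper-1");

const { data } = await axios.post(
  "https://api.openai.com/v1/audio/transcriptions",
  form,
  {
    headers: {
      ...form.getHeaders(),
      Authorization: `Bearer ${process.env.OPENAI_API_KEY}`,
    },
    maxBodyLength: Infinity,    // lift the default request-body limit
    maxContentLength: Infinity, // and the response-size limit
  },
);
```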

awesome :pray: , approved!

I’m working on this as well - nice to know I’m not the only one dealing with this issue today haha

just merged that fix, should be out in a few minutes, let’s see if that helps!

Heck yeah! Was just about to swap out my transcription step with Deepgram, but if I can keep Whisper I will

nice, yeah there should be nothing preventing us from streaming large files to the Whisper API in the Pipedream execution environment, hopefully we can reduce the needed memory to the minimum for y’all

If it helps, I’ve enabled support access for my workflow: https://pipedream.com/@tomfrankly/test-chat-api-test-p_brCyb6n/build

do you see the option to update that action in your workflow now? The new version should be out

yep, will test on my 24mb test file now.

Ha, well one thing I can share: the limit isn’t 25mb

{"error":{"message":"Maximum content size limit (26214400) exceeded (26401220 bytes read)","type":"server_error","param":null,"code":null}}

Reconverting to 22m

do you know of a service that can connect to Pipedream and split mp3 files?

From what I gather, ffmpeg is out since you’d have to actually install it on the host machines

you can actually use ffmpeg in the Pipedream execution environment via an npm installer, let me share another example
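Roughly something like this — an untested sketch, assuming the `@ffmpeg-installer/ffmpeg` and `fluent-ffmpeg` npm packages, splitting into ~10-minute chunks with a stream copy so nothing gets re-encoded:

```javascript
import ffmpegInstaller from "@ffmpeg-installer/ffmpeg";
import ffmpeg from "fluent-ffmpeg";

// Point fluent-ffmpeg at the static binary shipped by the npm installer,
// so nothing needs to be installed on the host machine.
ffmpeg.setFfmpegPath(ffmpegInstaller.path);

// Split one long mp3 into ~10-minute segments with ffmpeg's segment
// muxer; "-c copy" avoids a re-encode (and the memory that would need).
export function splitMp3(inputPath, outPattern = "/tmp/chunk-%03d.mp3") {
  return new Promise((resolve, reject) => {
    ffmpeg(inputPath)
      .outputOptions(["-f segment", "-segment_time 600", "-c copy"])
      .output(outPattern)
      .on("end", resolve)
      .on("error", reject)
      .run();
  });
}
```

Each chunk then stays under the Whisper size cap and can be transcribed separately, with the text stitched back together afterwards.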

Could you describe exactly what you’d like to do? I’d love to test our new ai.m.pipedream.net endpoint on this use case, it worked pretty well for another ffmpeg example

will look at that in a second - first, I want to say that the fix worked!

awesome