This topic was automatically generated from Slack. You can find the original thread here.
Hello everyone, everything good ?
I would like to know if it is possible to do this configuration in pipedream to run docTR
thanks
This topic was automatically generated from Slack. You can find the original thread here.
Hello everyone, everything good ?
I would like to know if it is possible to do this configuration in pipedream to run docTR
thanks
Hi Marcelo, thanks for the link.
Both of these settings are configurable by environment variables, so you can set these within your Pipedream dashboard to comply with these compatibility issues.
- Disable the usage of the
multiprocessingpackage by setting theDOCTR_MULTIPROCESSING_DISABLEenvironment variable toTRUE. This step is necessary because the package uses the/dev/shmdirectory for shared memory.
So settingDOCTR_MULTIPROCESSING_DISABLEtotruein your workspace environment variables in Pipedream should help with this configuration.
- Change the caching directory used by docTR for models. By default, it is set to
~/.cache/doctr, which is outside the/tmpdirectory on AWS Lambda. You can modify this by setting theDOCTR_CACHE_DIRenvironment variable.
So you can set theDOCTR_CACHE_DIRto/tmpwithin your workspace’s environment varibles, and that should help with that configuration.
Thanks Pierce for the guidance, but I’m facing the Import Error
libGL.so.1: cannot open shared object file: No such file or directory
You might want to report that to the maintainers of the project. There might be an undocumented environment variable you’ll need to set in order to exclude that dependency.
ok then, thanks again Pierce !
Sure thing ![]()
Hi Pierce, how are you? Would it be possible for you to help me clarify the mindee/doctr repository team in the issue I opened?
Thanks - the opencv dependency wasn’t listed in that initial requirements page you sent me. I didn’t realize that was required in the environment.
Unfortunately at this time we don’t support custom docker containers so you can pick and choose your own dependencies for workflows. However, this is an active discussion internally. We know there are cases were you need to bring in binaries, but we can’t install custom binaries on request because it would install unwanted binaries to all Pipedream customer workflows.
Please leave a comment on this Github issue with your use case, it helps us aggregate all of the potential use cases and make sure yours is included as we design this feature:
ok, thanks Pierce