Replies: 1 comment
-
Have you tried an alternative malloc implementation for memory management? You can load one through an environment variable.
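The reply above does not name an allocator or a variable, so the following is only a sketch under assumptions: it assumes the variable meant is `LD_PRELOAD` (the usual way to swap the allocator on Linux) and that a jemalloc build sits at the hypothetical path `/opt/lib/libjemalloc.so.2`. The helper name `with_preloaded_allocator` is also made up for illustration.

```python
import os

# Hypothetical path: where a jemalloc shared library might live (e.g. in a Lambda layer).
ALLOCATOR_LIB = "/opt/lib/libjemalloc.so.2"


def with_preloaded_allocator(lib_path, base_env=None):
    """Return a copy of the environment with lib_path placed first in LD_PRELOAD.

    Assumption: LD_PRELOAD is the environment variable the reply refers to.
    The input mapping is not modified.
    """
    env = dict(os.environ if base_env is None else base_env)
    existing = env.get("LD_PRELOAD")
    # Keep any library that is already preloaded, but put ours in front.
    env["LD_PRELOAD"] = f"{lib_path}:{existing}" if existing else lib_path
    return env


# Possible usage (handler.py is a hypothetical script name):
#   import subprocess, sys
#   subprocess.run([sys.executable, "handler.py"],
#                  env=with_preloaded_allocator(ALLOCATOR_LIB))
```

The variable has to be set before the Python process starts, which is why the sketch builds an environment for a child process. On Lambda the same value could instead be set in the function's environment variable configuration.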
-
I'm trying to use the Python ONNX Runtime in AWS Lambda (10 GB RAM) to run a small-ish neural net; the ONNX-converted model is 319 KB.
When running inference, I get out-of-memory exceptions: "Failed to allocate memory for requested buffer of size 7993016064".
What can I do to reduce memory use? I went through the docs but didn't find anything that would massively reduce it.
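To put the numbers from the post side by side, here is a small arithmetic sketch. It assumes "10 GB" means 10 GiB, "319 KB" means 319 × 1024 bytes, and, purely for illustration, that the buffer would hold 4-byte float32 values; none of that is confirmed by the error message.

```python
REQUESTED_BYTES = 7_993_016_064      # size quoted in the error message
MODEL_BYTES = 319 * 1024             # assumption: KB = 1024 bytes
LAMBDA_LIMIT_BYTES = 10 * 1024**3    # assumption: 10 GB = 10 GiB


def to_gib(num_bytes):
    """Convert a byte count to GiB."""
    return num_bytes / 1024**3


def element_count(num_bytes, bytes_per_element=4):
    """Number of elements the buffer would hold (float32 assumed by default)."""
    return num_bytes // bytes_per_element


if __name__ == "__main__":
    print(f"requested buffer: {to_gib(REQUESTED_BYTES):.2f} GiB")
    print(f"share of the Lambda limit: {REQUESTED_BYTES / LAMBDA_LIMIT_BYTES:.0%}")
    print(f"buffer / model size: {REQUESTED_BYTES / MODEL_BYTES:,.0f}x")
    print(f"float32 elements: {element_count(REQUESTED_BYTES):,}")
```

A single request of roughly 7.4 GiB, tens of thousands of times larger than the model file, suggests the allocation is for an input or intermediate tensor rather than for the weights, so the input shapes fed to the session may be worth checking first.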