when i try to run inference case with --model_type full in A100_40G,show oom error. How to solve this problem。