
Partition Mutable Buffer as Core ML State #5165


Merged: 7 commits merged into pytorch:main on Sep 10, 2024

Conversation

YifanShenSZ
Collaborator

@YifanShenSZ commented Sep 8, 2024

#4830 opened the door toward delegating mutable buffers. In this PR, we tag the mutable buffers in the Core ML partitioner so that they are delegated as Core ML state.

With #5143, we are able to run the stateful Core ML delegate.
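The tagging step described here can be sketched as a toy simplification. Note that `tag_mutable_buffers`, the `"coreml_state"` tag, and the plain-dict representation are all illustrative assumptions, not the actual ExecuTorch partitioner API (which operates on an exported program's graph signature):

```python
# Hypothetical sketch of tagging mutable buffers as Core ML state.
# All names here are illustrative, not the real ExecuTorch API.

def tag_mutable_buffers(placeholders, buffers_to_mutate):
    """Partition placeholder names: mutable buffers become Core ML
    state; everything else stays an ordinary model input."""
    tags = {}
    for name in placeholders:
        if name in buffers_to_mutate:
            tags[name] = "coreml_state"  # delegated as Core ML state
        else:
            tags[name] = "input"         # regular input to the delegate
    return tags

tags = tag_mutable_buffers(
    placeholders=["x", "kv_cache"],
    buffers_to_mutate={"kv_cache"},  # e.g. a KV cache updated in place
)
print(tags)  # {'x': 'input', 'kv_cache': 'coreml_state'}
```

The key idea is only the partitioning decision: state-like placeholders get a distinct tag so the lowering step can treat them differently from plain inputs.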


pytorch-bot bot commented Sep 8, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/5165

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 1 Unrelated Failure

As of commit 0bfa422 with merge base 13da62b (image):

NEW FAILURE - The following job has failed:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the "CLA Signed" label (managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed) on Sep 8, 2024
@YifanShenSZ
Collaborator Author

@cymbalrush @cccclai

@facebook-github-bot
Contributor

@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@cccclai
Contributor

cccclai commented Sep 8, 2024

Thanks! lintrunner seems to be failing. Could you help fix it?

@YifanShenSZ
Collaborator Author

> Thanks! lintrunner seems to be failing. Could you help fix it?

Fixed

@cccclai
Contributor

cccclai commented Sep 8, 2024

The Core ML end-to-end CI breaks in this PR; is that expected?

2024-09-08T04:30:41.7032910Z I 00:00:00.006578 executorch:cpuinfo_utils.cpp:62] Reading file /sys/devices/soc0/image_version
2024-09-08T04:30:41.7033530Z I 00:00:00.006641 executorch:cpuinfo_utils.cpp:78] Failed to open midr file /sys/devices/soc0/image_version
2024-09-08T04:30:41.7034130Z I 00:00:00.006656 executorch:cpuinfo_utils.cpp:158] Number of efficient cores 4
2024-09-08T04:30:41.7034630Z I 00:00:00.006658 executorch:main.cpp:65] Resetting threadpool with num threads = 4
2024-09-08T04:30:41.7035260Z I 00:00:00.010802 executorch:runner.cpp:58] Creating LLaMa runner: model_path=llama2.pte, tokenizer_path=tokenizer.bin
2024-09-08T04:30:41.7040040Z E 00:00:00.907857 executorch:coreml_backend_delegate.mm:163] CoreMLBackend: Failed to init the model.
2024-09-08T04:30:41.7040640Z E 00:00:00.907866 executorch:method.cpp:106] Init failed for backend CoreMLBackend: 0x23
2024-09-08T04:30:41.7041030Z ++ date +%H:%M:%S

Does that mean the CI job needs @cymbalrush's PR?

@YifanShenSZ
Collaborator Author

YifanShenSZ commented Sep 8, 2024

The Llama runner runs well on my local machine. Is the CI machine on macOS 14?

State is a new feature in macOS 15, so failure on older macOS is expected. I can make the stateful llama no longer the default option.
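The fallback described here, disabling the stateful path on hosts older than macOS 15, could look roughly like the following. This is a hypothetical sketch: `choose_llama_config` and the version constant are illustrative, not actual code from this PR.

```python
# Hypothetical sketch of gating the stateful llama export on OS version.
# Core ML state requires macOS 15; older hosts fall back to stateless.

MACOS_STATE_MIN_MAJOR = 15  # first macOS release supporting Core ML state

def choose_llama_config(host_macos_major, want_stateful=True):
    """Pick the stateful export only when the host can run it."""
    if want_stateful and host_macos_major >= MACOS_STATE_MIN_MAJOR:
        return "stateful"
    return "stateless"  # safe default for older hosts (e.g. macOS 14 CI)

print(choose_llama_config(14))  # -> stateless (the CI machine's case)
print(choose_llama_config(15))  # -> stateful
```

The design point is simply that feature availability is decided at export/run time from the host version, rather than assuming the newest OS.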

@cccclai
Contributor

cccclai commented Sep 8, 2024

> Is the CI machine on macOS 14?

Yeah, the CI machine is still macOS 14. Looks like it's passing now. Thanks!

@facebook-github-bot
Contributor

@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.


@cccclai left a comment


Looks great, thanks! Just some minor comments.

@cccclai
Contributor

cccclai commented Sep 8, 2024

Looks like there are still lint errors:

>>> Lint for backends/apple/coreml/test/test_coreml_partitioner.py:

  Warning (FLAKE8) F401
    'pytest' imported but unused
    See https://www.flake8rules.com/rules/F401.html.

          8  |
          9  |import executorch.exir
         10  |
    >>>  11  |import pytest
         12  |
         13  |import torch
         14  |import torchvision

…eful llama until CI machine upgraded to MacOS 15
@YifanShenSZ
Collaborator Author

YifanShenSZ commented Sep 8, 2024

Lint error fixed; comments addressed.

@facebook-github-bot
Contributor

@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@cccclai
Contributor

cccclai commented Sep 9, 2024

The test failing in trunk / test-coreml-model / macos-job (push) seems legit. Could you help fix it?

+ python -m examples.apple.coreml.scripts.export --model_name=mobilebert
  Torch version 2.5.0.dev20240901 has not been tested with coremltools. You may run into unexpected errors. Torch 2.3.0 is the most recent version that has been tested.
  Traceback (most recent call last):
    File "<frozen runpy>", line 198, in _run_module_as_main
    File "<frozen runpy>", line 88, in _run_code
    File "/Users/ec2-user/runner/_work/executorch/executorch/pytorch/executorch/examples/apple/coreml/scripts/export.py", line 160, in <module>
      model, example_inputs, _ = EagerModelFactory.create_model(
                                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
    File "/Users/ec2-user/runner/_work/executorch/executorch/pytorch/executorch/examples/models/model_factory.py", line 38, in create_model
      module = importlib.import_module(
               ^^^^^^^^^^^^^^^^^^^^^^^^
    File "/Users/ec2-user/runner/_work/_temp/conda_environment_10762964118/lib/python3.11/importlib/__init__.py", line 126, in import_module
      return _bootstrap._gcd_import(name[level:], package, level)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
    File "<frozen importlib._bootstrap>", line 1204, in _gcd_import
    File "<frozen importlib._bootstrap>", line 1176, in _find_and_load
    File "<frozen importlib._bootstrap>", line 1147, in _find_and_load_unlocked
    File "<frozen importlib._bootstrap>", line 690, in _load_unlocked
    File "<frozen importlib._bootstrap_external>", line 940, in exec_module
    File "<frozen importlib._bootstrap>", line 241, in _call_with_frames_removed
    File "/Users/ec2-user/runner/_work/executorch/executorch/pytorch/executorch/examples/models/mobilebert/__init__.py", line 7, in <module>
      from .model import MobileBertModelExample
    File "/Users/ec2-user/runner/_work/executorch/executorch/pytorch/executorch/examples/models/mobilebert/model.py", line 11, in <module>
      from transformers import AutoTokenizer, MobileBertModel  # @manual
      ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

@YifanShenSZ
Collaborator Author

YifanShenSZ commented Sep 9, 2024

Fixed. It's because coremltools 8.0b2 started supporting numpy 2.0, but the mobilebert test uses an old transformers version that requires numpy 1.x.

Temporarily added a numpy downgrade in the coreml install_requirements.sh. Will remove it once executorch migrates to numpy 2.0.
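The constraint being worked around can be illustrated with a small sketch. The helper below is purely hypothetical (the real fix is a version pin in the install script); it just encodes the compatibility rule stated above:

```python
# Hypothetical sketch of the version conflict described above:
# old transformers needs numpy 1.x, while coremltools 8.0b2 also
# accepts numpy 2.0 -- so the environment must be pinned to 1.x.

def numpy_ok_for(numpy_version, needs_numpy1):
    """Return True when the installed numpy satisfies the consumer."""
    major = int(numpy_version.split(".")[0])
    if needs_numpy1:
        return major < 2  # old transformers: numpy 1.x only
    return True           # coremltools 8.0b2: 1.x or 2.x both fine

print(numpy_ok_for("2.0.1", needs_numpy1=True))   # -> False: downgrade needed
print(numpy_ok_for("1.26.4", needs_numpy1=True))  # -> True
```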

@facebook-github-bot
Contributor

@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@cccclai merged commit f471556 into pytorch:main on Sep 10, 2024
100 of 104 checks passed