Support benchmark using prebuilt artifacts #8246


Open
guangy10 opened this issue Feb 6, 2025 · 7 comments
Labels

module: benchmark (Issues related to the benchmark infrastructure), module: user experience (Issues related to reducing friction for users), triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)

Comments

@guangy10
Contributor

guangy10 commented Feb 6, 2025

🚀 The feature, motivation and pitch

Enable this for the on-demand workflow only, to offer developers additional flexibility and efficiency.

Scenarios where benchmarking with prebuilt artifacts is needed:

  • Sometimes the .pte model comes from outside, e.g. from external partners, or is downloaded from the executorch community on Hugging Face, e.g. https://huggingface.co/executorch-community/DeepSeek-R1-Distill-Llama-8B/tree/main.
  • Developers who work on the runtime don't necessarily need to re-export the same model every time.
  • Developers who work on export may not need to rebuild the benchmark app every time.

UX:

  • via GitHub UI
  • via script

Source of the artifacts to be used in the benchmark workflow:

We will need to define the UX to support this feature. For example, allow users to upload prebuilt artifacts via a script that returns links to the uploaded artifacts. Users can then schedule an on-demand workflow via the UI, or do everything via the script (see the sketch below).

We will also need a retention policy and TTL for the uploaded artifacts.
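Below is a minimal sketch of what the script-driven flow could look like. The bucket name, key layout, workflow file name, and workflow input are all assumptions for illustration, not a decided interface:

```python
# Hypothetical end-to-end sketch: upload a prebuilt artifact, get a link back,
# then trigger the on-demand benchmark workflow with that link.
import subprocess

import boto3

BUCKET = "executorch-benchmark-artifacts"  # hypothetical bucket name


def upload_artifact(path: str, key: str) -> str:
    """Upload a prebuilt artifact (e.g. a .pte model) and return its URL."""
    boto3.client("s3").upload_file(path, BUCKET, key)
    return f"https://{BUCKET}.s3.amazonaws.com/{key}"


def trigger_benchmark(model_url: str) -> None:
    """Schedule the on-demand benchmark workflow via the GitHub CLI."""
    subprocess.run(
        ["gh", "workflow", "run", "android-perf.yml",  # assumed workflow name
         "-f", f"models={model_url}"],                 # assumed input name
        check=True,
    )


if __name__ == "__main__":
    url = upload_artifact("llama3.pte", "prebuilt/llama3.pte")
    print(f"Artifact uploaded: {url}")
    trigger_benchmark(url)
```

Either half can stand alone: a user could run only the upload step and paste the returned link into the GitHub UI when scheduling the on-demand run, which covers both UX options listed above.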

CC: @digantdesai @kimishpatel @cccclai

Alternatives

No response

Additional context

No response

RFC (Optional)

No response

cc @huydhn @kirklandsign @shoumikhin @mergennachin @byjlw

@guangy10 guangy10 added the feature, module: benchmark, and module: user experience labels Feb 6, 2025
@github-project-automation github-project-automation bot moved this to To triage in ExecuTorch DevX Feb 6, 2025
@guangy10
Contributor Author

guangy10 commented Feb 6, 2025

IMO this infra work can potentially benefit both #8249 and #8250

@digantdesai digantdesai added the triaged label Feb 6, 2025
@mergennachin mergennachin moved this from To triage to Ready in ExecuTorch DevX Feb 6, 2025
@huydhn
Contributor

huydhn commented Feb 11, 2025

Action item: circle back with @kirklandsign and @shoumikhin on what prebuilt artifacts need to be on the device besides the app and the exported model.

@shoumikhin
Contributor

@huydhn if you're asking about the benchmarking app, we need to put all .xcframework bundles under Frameworks and all .pte files under Resources, then build and run the app (see the sketch below).
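A minimal sketch of that staging step, assuming hypothetical locations for the app project and the downloaded artifacts:

```python
# Hypothetical sketch of staging prebuilt artifacts into the iOS benchmark app
# before building it; both paths below are assumptions.
import shutil
from pathlib import Path

APP_ROOT = Path("extension/benchmark/apple/Benchmark")  # assumed app project root
ARTIFACTS = Path("prebuilt")  # assumed location of downloaded/unzipped artifacts

# All .xcframework bundles go under Frameworks.
for fw in ARTIFACTS.glob("*.xcframework"):
    shutil.copytree(fw, APP_ROOT / "Frameworks" / fw.name, dirs_exist_ok=True)

# All .pte models go under Resources.
for pte in ARTIFACTS.glob("*.pte"):
    shutil.copy2(pte, APP_ROOT / "Resources" / pte.name)
```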

@guangy10
Contributor Author

@shoumikhin @kirklandsign I believe @huydhn is referring to the new artifacts generated by the profilers, e.g. ETDump, Chrome trace, simpleperf, etc., specifically those generated from this work:

For example, for users to be able to access the artifacts and visualize them as flame graphs, we will need to store the artifacts somewhere, e.g. S3. Same for ETDump.

@guangy10
Contributor Author

@shoumikhin @kirklandsign FYI, Huy will be on leave from Feb 25. If you need help storing the artifacts in the DB, accessing them from the dashboard UI, or showing the URL in the job log, Huy can help or give you a pointer before he goes on leave.

@guangy10 guangy10 moved this to Ready in ExecuTorch Benchmark Feb 12, 2025
@guangy10
Contributor Author

guangy10 commented Feb 12, 2025

Following up on the discussion around supporting artifacts generated from profilers: if we can provide a generic script that allows uploading arbitrary blobs (typically zipped) to S3, that should give us the flexibility to store any artifact from profiling (see the sketch at the end of this comment). wdyt @shoumikhin @kirklandsign @huydhn?

@huydhn is there a size limit for uploading to S3? If yes, what is it?

Put together with the work in #8245, the UX would look like what I described here: #8402 (comment)
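A minimal sketch of what such a generic upload script could look like; the bucket name and key layout are assumptions. Note that boto3's upload_file transparently switches to multipart upload for large files:

```python
# Hypothetical sketch of a generic "upload any blob" helper for profiler
# artifacts (ETDump, Chrome trace, simpleperf output, ...).
import shutil
from pathlib import Path

import boto3

BUCKET = "executorch-benchmark-artifacts"  # hypothetical bucket name


def upload_blob(path: str, job_id: str) -> str:
    """Zip an arbitrary file or directory, upload it to S3, and return the URL."""
    src = Path(path)
    # make_archive handles both directories and single files via base_dir.
    archive = shutil.make_archive(src.stem, "zip",
                                  root_dir=src.parent, base_dir=src.name)
    key = f"profiling/{job_id}/{Path(archive).name}"  # assumed key layout
    # upload_file uses multipart upload automatically for large files,
    # so the practical bound is S3's 5 TB per-object limit.
    boto3.client("s3").upload_file(archive, BUCKET, key)
    return f"https://{BUCKET}.s3.amazonaws.com/{key}"
```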

@guangy10
Contributor Author

Uploading from a Linux/macOS runner should be OK, per Huy.
Next step: experiment with triggering an upload from the device farm to see if there is a size limit.
