Skip to content

Conversation

@robell
Copy link
Collaborator

@robell robell commented Sep 27, 2023

Added shell of runtime Arm Backend for Ethos-U

./backends/arm/cmake/build.sh
will pull the appropriate compiler and build the ethos-u driver and delegate

This is yet to be linked into the executorch runtime

This also needs to be linked into a higher level app with the .pte in SRAM.

This is for early feedback - assuming some fairly heavy changes from here.

* basic build system amendments
* toolchain file for baremetal platforms
* use of ethos-u-core-driver submodule
* some scripts to pull a compiler and test-build the backend

Signed-off-by: Rob Elliott <[email protected]>
…M, this will be reworked as wrapped ethos buffers in .pte become available

Signed-off-by: Rob Elliott <[email protected]>
* Added cmmss submodule dependency
* Added command_stream.cpp into the build
* Added the target to the build script

Signed-off-by: Rob Elliott <[email protected]>
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 27, 2023
@netlify
Copy link

netlify bot commented Sep 27, 2023

Deploy Preview for resplendent-gnome-14e531 ready!

Name Link
🔨 Latest commit 7007120
🔍 Latest deploy log https://app.netlify.com/sites/resplendent-gnome-14e531/deploys/651ad0bf30e7700008821238
😎 Deploy Preview https://deploy-preview-501--resplendent-gnome-14e531.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

robell and others added 14 commits September 28, 2023 12:19
 * vela binaries returned from preprocess
 * Included in PTE captured from arm_tosa_e2e
 * currently assumes vela is on path

Signed-off-by: Rob Elliott <[email protected]>
 * added a scratch block in the vela_bin to preallocate it
 * added a vela_bin reading routine into ArmBackendEthosU
 * set pointers passed to vela based on vela_bin

Signed-off-by: Rob Elliott <[email protected]>
Also helps with c++11 compliance if we have less c++20 code esp when we
don't use it.
 * fix to encode and decode of vela_bin_stream block sizes
 * hardcoded input/output population to check operation behaviour
 * use manual.h to init and register backend

Signed-off-by: Rob Elliott <[email protected]>
Copy link
Contributor

@digantdesai digantdesai left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Feel free to ignore comments for now ;)

endif()

# Build Arm Baremetal backend
option(EXECUTORCH_BUILD_ARM_BAREMETAL
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice. If we are planning to have multiple delegate flavors - i.e. runtime logic - under arm name then we can do something like,

EXECUTORCH_BUILD_ARM && EXECUTORCH_BUILD_ARM_<RUNTIME_FLAVOR>

Suggested change
option(EXECUTORCH_BUILD_ARM_BAREMETAL
option(EXECUTORCH_BUILD_ARM

ET_SWITCH_REAL_TYPES_AND(Bool, b_type, ctx, "add", CTYPE_B, [&]() {
ET_SWITCH_REAL_TYPES_AND(Bool, common_type, ctx, "add", CTYPE_IN, [&]() {
ET_SWITCH_REAL_TYPES_AND(Bool, out_type, ctx, "add", CTYPE_OUT, [&]() {
// ET_SWITCH_REAL_TYPES_AND(Bool, a_type, ctx, "add", CTYPE_A, [&]() {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Another alternative is that we can switch the CPU-only example to softmax, while keeping the delegate example to add so we don't have to deal with this.

Because there is no good short-term solution for this besides we write a custom add op in ET which overwrites this Portable::Add

#include <cstring>
#include <numeric>
#include <type_traits>
#include <cinttypes>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

will be merged soon #558

- arg_meta: null
kernel_name: torch::executor::acosh_out

- op: add.out
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is temp hack we can do better i.e. supply a new file with one op as oppose to delete rest of the ops in the default one - I was just lazy here :p

namespace torch {
namespace executor {
void manual_override();
void digant_add_out(torch::executor::KernelRuntimeContext & context, EValue** stack);
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this was also for testing the manual.h hack - we don't need this


/**
* Case 2: A plain function pointer.
* Case 1: A plain function pointer.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nicer fix is almost ready - #554, #555

if (!initialized) { \
fprintf( \
ET_LOG_OUTPUT_FILE, \
printf( \
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

So this is because stderr is patched to garbage value at link time when I link libexecutorch.a - easiest solution is to copy this into core_platforms "app" and overwrite this weak symbol.

Long term plan can be to have platform/target/ArmBaremetal.cpp PAL layer.


tosaname = "out.tosa"
flatbuffer = tosa_fb.serialize()
f = open(os.path.join(tmpdir,tosaname), "wb")
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: use with for this resource

np_path = os.path.join(tmpdir,"output","out_sg0_vela.npz")
blocks = b''
with np.load(np_path, allow_pickle=False) as data:
# Emit the NPZ regions as:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

OK for now but this should be refactored into its own function to emit_block


# Compilation warnings
add_compile_options(
# -Wall
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we (ET runtime) want to enable these eventually (or at least be cool when someone enables them) but right now we are not ready I guess

robell added 3 commits October 2, 2023 09:39
 * Removed dependencies on aything but driver
 * moved to minimal invocation pattern

Signed-off-by: Rob Elliott <[email protected]>
 * currently assumes function signature
 * read relevant argument data from vela_bin

Signed-off-by: Rob Elliott <[email protected]>
@robell robell closed this Oct 6, 2023
@robell
Copy link
Collaborator Author

robell commented Oct 6, 2023

replaced by #595

@robell robell added the partner: arm For backend delegation, kernels, demo, etc. from the 3rd-party partner, Arm label Jul 4, 2024
Gasoonjia pushed a commit that referenced this pull request Jul 30, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. partner: arm For backend delegation, kernels, demo, etc. from the 3rd-party partner, Arm

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants