FireRedASR2 supported axcl backend#3272
Conversation
Summary of Changes

Hello, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed. This pull request expands the capabilities of the …

Highlights
No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info
⚙️ Run configuration
Configuration used: defaults
Review profile: CHILL
Plan: Pro
Run ID:
📒 Files selected for processing (3)
🚧 Files skipped from review as they are similar to previous changes (1)
📝 Walkthrough

Adds an AXCL-backed OfflineFireRedAsrCtcModel implementation, exposes AxclModel raw-output retrieval, and integrates the AXCL model into factories and recognizer creation paths with CMake and conditional compilation updates.
Sequence Diagram(s)

```mermaid
sequenceDiagram
    participant Factory
    participant OfflineModelAxcl
    participant AxclModel
    participant HostMemory
    Factory->>OfflineModelAxcl: Create(config) / Create(mgr, config)
    OfflineModelAxcl->>AxclModel: Load model (file or buffer) and Init()
    OfflineModelAxcl->>OfflineModelAxcl: NormalizeFeatures(features)
    OfflineModelAxcl->>AxclModel: SetInputTensors(padded_features, speech_length)
    OfflineModelAxcl->>AxclModel: Run()
    AxclModel-->>OfflineModelAxcl: GetOutputTensorDataRaw(logits_name)
    AxclModel-->>OfflineModelAxcl: GetOutputTensorDataRaw(lengths_name)
    OfflineModelAxcl->>HostMemory: Parse outputs -> create Ort::Value tensors
    OfflineModelAxcl-->>Factory: Return logits_tensor, lengths_tensor
```
Estimated code review effort: 🎯 4 (Complex) | ⏱️ ~50 minutes
🚥 Pre-merge checks: ✅ 2 passed | ❌ 1 failed (1 warning)
Actionable comments posted: 3
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@sherpa-onnx/csrc/axcl/offline-fire-red-asr-ctc-model-axcl.cc`:
- Around line 47-71: The Forward method is incorrectly using features_shape[1]
(padded width) as the valid frame count and silently truncating long utterances;
instead read the authoritative valid-frame count from the features_length tensor
(use p_features_length[0]) and use that as valid_frames, then check against the
model window (expected_frames) and reject or handle oversized utterances rather
than silently clipping. Update Forward (function Forward, variables
p_features_length, expected_frames, padded_features) to set valid_frames =
static_cast<int32_t>(p_features_length[0]), validate that valid_frames <=
expected_frames and if not log an error and exit (or implement explicit chunking
logic) before copying/padding; otherwise proceed to copy valid_frames worth of
data into padded_features.
- Around line 73-99: After calling model_->Run() and retrieving outputs via
model_->GetOutputTensorData(model_->OutputTensorNames()[0]) and [1], validate
that out_logits and out_lengths are non-empty and that
TensorShape(model_->OutputTensorNames()[0]) yields expected dimensions before
creating Ort::Value tensors and using std::copy or indexing out_lengths[0]; if
any check fails, return or throw an error (or log and abort) so you don't
dereference empty vectors or copy zero elements into p_logits or read
p_lengths[0]. Ensure checks reference the existing symbols: SetInputTensorData,
Run, GetOutputTensorData, out_logits, out_lengths,
TensorShape(model_->OutputTensorNames()[0]), std::copy, and p_lengths[0].
In `@sherpa-onnx/csrc/axcl/offline-fire-red-asr-ctc-model-axcl.h`:
- Around line 19-38: The class OfflineFireRedAsrCtcModelAxcl must explicitly
disable batch processing to avoid multi-stream calls hitting Forward() with
batch_size != 1; add an override of SupportBatchProcessing() in the
OfflineFireRedAsrCtcModelAxcl class that returns false (i.e., bool
SupportBatchProcessing() const override { return false; }) so the runtime will
not route batched inputs to this backend; update the class declaration to
include this method override alongside the existing methods (e.g., Forward,
VocabSize, SubsamplingFactor, Allocator, NormalizeFeatures).
ℹ️ Review info
⚙️ Run configuration
Configuration used: defaults
Review profile: CHILL
Plan: Pro
Run ID: f27920d6-2061-4d1d-973c-9b497eb5dc4a
📒 Files selected for processing (5)
- sherpa-onnx/csrc/CMakeLists.txt
- sherpa-onnx/csrc/axcl/offline-fire-red-asr-ctc-model-axcl.cc
- sherpa-onnx/csrc/axcl/offline-fire-red-asr-ctc-model-axcl.h
- sherpa-onnx/csrc/offline-ctc-model.cc
- sherpa-onnx/csrc/offline-recognizer-impl.cc
```cpp
std::vector<Ort::Value> Forward(Ort::Value features,
                                Ort::Value features_length) {
  auto features_shape = features.GetTensorTypeAndShapeInfo().GetShape();
  int32_t batch_size = features_shape[0];
  int32_t num_frames = features_shape[1];
  int32_t feat_dim = features_shape[2];

  const float *p_features = features.GetTensorData<float>();
  const int64_t *p_features_length = features_length.GetTensorData<int64_t>();

  if (batch_size != 1) {
    SHERPA_ONNX_LOGE("Only batch size 1 is supported by axcl. Given: %d",
                     batch_size);
    SHERPA_ONNX_EXIT(-1);
  }

  auto expected_shape = model_->TensorShape(model_->InputTensorNames()[0]);
  int32_t expected_frames = expected_shape[1];

  int32_t valid_frames = std::min<int32_t>(num_frames, expected_frames);
  std::vector<float> padded_features(expected_frames * feat_dim, 0.0f);
  std::copy(p_features, p_features + valid_frames * feat_dim,
            padded_features.begin());

  std::vector<int32_t> speech_length = {valid_frames};
```
Use features_length here and don't silently clip long utterances.
features_shape[1] is the padded tensor width, not the authoritative valid-frame count. With std::min(num_frames, expected_frames), any utterance longer than the AXCL window is truncated and the tail audio is lost instead of being rejected or chunked.
Suggested fix

```diff
-  int32_t num_frames = features_shape[1];
   int32_t feat_dim = features_shape[2];
@@
-  int32_t valid_frames = std::min<int32_t>(num_frames, expected_frames);
+  int32_t valid_frames = static_cast<int32_t>(p_features_length[0]);
+  if (valid_frames > expected_frames) {
+    SHERPA_ONNX_LOGE(
+        "Input has %d valid frames, but the AXCL FireRedASR CTC model only "
+        "accepts %d frames.",
+        valid_frames, expected_frames);
+    SHERPA_ONNX_EXIT(-1);
+  }
   std::vector<float> padded_features(expected_frames * feat_dim, 0.0f);
   std::copy(p_features, p_features + valid_frames * feat_dim,
             padded_features.begin());
```
```cpp
class OfflineFireRedAsrCtcModelAxcl : public OfflineCtcModel {
 public:
  explicit OfflineFireRedAsrCtcModelAxcl(const OfflineModelConfig &config);

  template <typename Manager>
  OfflineFireRedAsrCtcModelAxcl(Manager *mgr, const OfflineModelConfig &config);

  ~OfflineFireRedAsrCtcModelAxcl() override;

  std::vector<Ort::Value> Forward(Ort::Value features,
                                  Ort::Value features_length) override;

  int32_t VocabSize() const override;

  int32_t SubsamplingFactor() const override;

  OrtAllocator *Allocator() const override;

  void NormalizeFeatures(float *features, int32_t num_frames,
                         int32_t feat_dim) const override;
```
Declare this AXCL model as non-batchable.
OfflineCtcModel defaults SupportBatchProcessing() to true, but this backend hard-fails in Forward() when batch_size != 1. That mismatch can route multi-stream decode into a path that is guaranteed to abort.
Suggested fix

```diff
 class OfflineFireRedAsrCtcModelAxcl : public OfflineCtcModel {
  public:
   explicit OfflineFireRedAsrCtcModelAxcl(const OfflineModelConfig &config);
@@
   int32_t VocabSize() const override;

   int32_t SubsamplingFactor() const override;
+
+  bool SupportBatchProcessing() const override { return false; }

   OrtAllocator *Allocator() const override;
```

📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
```cpp
class OfflineFireRedAsrCtcModelAxcl : public OfflineCtcModel {
 public:
  explicit OfflineFireRedAsrCtcModelAxcl(const OfflineModelConfig &config);

  template <typename Manager>
  OfflineFireRedAsrCtcModelAxcl(Manager *mgr, const OfflineModelConfig &config);

  ~OfflineFireRedAsrCtcModelAxcl() override;

  std::vector<Ort::Value> Forward(Ort::Value features,
                                  Ort::Value features_length) override;

  int32_t VocabSize() const override;

  int32_t SubsamplingFactor() const override;

  bool SupportBatchProcessing() const override { return false; }

  OrtAllocator *Allocator() const override;

  void NormalizeFeatures(float *features, int32_t num_frames,
                         int32_t feat_dim) const override;
```
Code Review
This pull request adds support for the FireRedASR2 CTC model with the AXCL backend. The changes include a new implementation of the model (offline-fire-red-asr-ctc-model-axcl.cc) and its integration into the build system and model factory functions.
My review focuses on the new implementation. I've identified a few areas for improvement regarding performance, code clarity, and maintainability, such as optimizing the initialization of constant data and removing unused code. Overall, the changes look good and are consistent with the goal of the pull request.
Note: Security Review did not run due to the size of the PR.
```cpp
  int32_t feat_dim = features_shape[2];

  const float *p_features = features.GetTensorData<float>();
  const int64_t *p_features_length = features_length.GetTensorData<int64_t>();
```

```cpp
  ans.push_back(std::move(logits));
  ans.push_back(std::move(lengths));
```

```cpp
void NormalizeFeatures(float *features, int32_t num_frames,
                       int32_t feat_dim) const {
  if (static_cast<int32_t>(mean_.size()) != feat_dim) {
    SHERPA_ONNX_LOGE("Bad things happened");
```
```cpp
  mean_ = {10.498912811279297, 10.948603630065918, 11.889163970947266,
           12.634881973266602, 13.397452354431152, 14.010934829711914,
           14.450813293457031, 14.649748802185059, 14.791581153869629,
           14.72234058380127,  14.802156448364258, 14.86101245880127,
           15.077230453491211, 15.26024341583252,  15.328754425048828,
           15.397353172302246, 15.395853996276855, 15.34103775024414,
           15.4662446975708,   15.271865844726562, 15.108253479003906,
           15.295886993408203, 15.07359504699707,  15.177886009216309,
           15.0756254196167,   15.154109001159668, 15.051127433776855,
           15.130733489990234, 15.090286254882812, 15.099433898925781,
           15.128166198730469, 15.123964309692383, 15.144022941589355,
           15.198014259338379, 15.251392364501953, 15.329950332641602,
           15.4017972946167,   15.45089340209961,  15.500616073608398,
           15.435726165771484, 15.51086139678955,  15.44755744934082,
           15.510979652404785, 15.491739273071289, 15.538031578063965,
           15.608367919921875, 15.694382667541504, 15.762181282043457,
           15.821470260620117, 15.901959419250488, 15.907241821289062,
           15.925711631774902, 15.952259063720703, 16.000732421875,
           16.030330657958984, 16.060592651367188, 16.09003448486328,
           16.100107192993164, 16.091808319091797, 16.062585830688477,
           16.05771255493164,  15.997002601623535, 15.946383476257324,
           15.865278244018555, 15.778145790100098, 15.67629623413086,
           15.569791793823242, 15.515979766845703, 15.472077369689941,
           15.423379898071289, 15.382068634033203, 15.345854759216309,
           15.301891326904297, 15.26984691619873,  15.165450096130371,
           15.004508972167969, 14.87544059753418,  14.564188003540039,
           14.031693458557129, 13.159259796142578};

  inv_stddev_ = {
      0.2522108852863312,  0.23741021752357483, 0.23185651004314423,
      0.23331022262573242, 0.23203925788402557, 0.22906658053398132,
      0.22519451379776,    0.22010253369808197, 0.21958276629447937,
      0.22198699414730072, 0.22393390536308289, 0.22370608150959015,
      0.22321352362632751, 0.2220749408006668,  0.22118520736694336,
      0.22136786580085754, 0.2220366895198822,  0.222808837890625,
      0.22362081706523895, 0.224283829331398,   0.22464141249656677,
      0.22580783069133759, 0.22700978815555573, 0.22852766513824463,
      0.22993983328342438, 0.23110738396644592, 0.23227347433567047,
      0.23270530998706818, 0.23330524563789368, 0.23406001925468445,
      0.23448589444160461, 0.23556077480316162, 0.23632891476154327,
      0.23703691363334656, 0.2377307415008545,  0.23786373436450958,
      0.2380155622959137,  0.23858875036239624, 0.23943373560905457,
      0.2399062216281891,  0.24094033241271973, 0.24173252284526825,
      0.24236661195755005, 0.2430112659931183,  0.24341483414173126,
      0.243240088224411,   0.24262498319149017, 0.24218837916851044,
      0.24165891110897064, 0.241318941116333,   0.2413933277130127,
      0.24139994382858276, 0.241432324051857,   0.24122384190559387,
      0.24079066514968872, 0.24032147228717804, 0.24016834795475006,
      0.24034327268600464, 0.24069449305534363, 0.24123424291610718,
      0.24136029183864594, 0.24150611460208893, 0.24179506301879883,
      0.24160170555114746, 0.24221885204315186, 0.24253536760807037,
      0.24262426793575287, 0.2428186535835266,  0.24223484098911285,
      0.24199971556663513, 0.24160003662109375, 0.24074721336364746,
      0.23965489864349365, 0.23850350081920624, 0.2359732687473297,
      0.23006057739257812, 0.22904986143112183, 0.22814501821994781,
      0.22893856465816498, 0.23093441128730774};
```
These large hardcoded vectors for mean_ and inv_stddev_ are inefficiently initialized on every object creation and make the code hard to read.
Consider defining them as static const std::array at file scope and assigning them to the member vectors in Init(). This will improve both performance and maintainability.
For example:

```cpp
// At file scope
namespace {
static const std::array<float, 80> kFireRedAsrCtcMean = {{
    // ... values
}};
}  // namespace

// In Init()
mean_.assign(kFireRedAsrCtcMean.begin(), kFireRedAsrCtcMean.end());
```

A similar change should be applied to inv_stddev_.
…CTC model to use it