Accelerate the mean operations without axis #589

ThreeMonth03 · 2025-09-26T18:37:28Z

In this pull request, I accelerate the mean operations.
For 1D contiguous array(unrolling, wo simd):

For 3D non contiguous array(unrolling + multithread):

ThreeMonth03

@yungyuc , could you please review this pull request when you are available?

ThreeMonth03 · 2025-09-26T18:39:21Z

cpp/modmesh/buffer/SimpleArray.hpp

+template <typename A, typename T>
+class SimpleArrayMixinSum
+{
+


Move sum operation to a seperate class because of complex optimization.

ThreeMonth03 · 2025-09-26T18:42:54Z

cpp/modmesh/buffer/SimpleArray.hpp

+    value_type sum_contiguous() const
+    {
+        auto athis = static_cast<A const *>(this);
+        value_type result;
+        if constexpr (is_complex_v<value_type>)
+        {
+            result = value_type{};
+        }
+        else
+        {
+            result = 0;
+        }
+        sum_unrolled_generic(athis->data(), athis->size(), 1, result);
+        return result;
+    }


I forget to implement simd for common data type. Would it become a seperate pull request?

ThreeMonth03 · 2025-09-26T18:44:45Z

cpp/modmesh/buffer/SimpleArray.hpp

+        return total;
+    }
+
+    void sum_unrolled_generic(const value_type * data_ptr, size_t size, size_t stride, value_type & result) const


I'm not sure whether it is really unroll the loop.

It's hard to tell. If you are not sure about it, why adding it?

yungyuc

Good progress. Points to address:

Do not use threads at the time being. We need a system for controlling threading from outside the computing kernel and it is outside the scope of speeding up one operation.
Make functions static when you can.
Clarify why adding seemingly unrolled loop that you are not sure about.

yungyuc · 2025-09-28T14:28:11Z

cpp/modmesh/buffer/SimpleArray.hpp

 private:
-    void check_c_contiguous(small_vector<size_t> const & shape,
-                            small_vector<size_t> const & stride) const
+    bool is_c_contiguous(small_vector<size_t> const & shape,


This can be static.

yungyuc · 2025-09-28T14:29:46Z

cpp/modmesh/buffer/SimpleArray.hpp

+        return true;
+    }
+
+    void check_c_contiguous(small_vector<size_t> const & shape,


This can be static.

yungyuc · 2025-09-28T14:47:24Z

cpp/modmesh/buffer/SimpleArray.hpp

+        const size_t prefix_len = ndim - 1;
+        const size_t total_combinations = calculate_total_combinations(shape, prefix_len);
+
+        const size_t num_threads = static_cast<size_t>(std::thread::hardware_concurrency());


We are not ready for using threads. Without a system to control how to use threads from outside the computing kernel here, the performance and resource consumption are not predictable.

yungyuc · 2025-09-28T14:49:10Z

cpp/modmesh/buffer/SimpleArray.hpp

+        return total;
+    }
+
+    void sum_unrolled_generic(const value_type * data_ptr, size_t size, size_t stride, value_type & result) const


It's hard to tell. If you are not sure about it, why adding it?

Accelerate the mean operations.

03e791e

ThreeMonth03 changed the title ~~Accelerate the mean operations.~~ Accelerate the mean operations without axis. Sep 26, 2025

ThreeMonth03 commented Sep 26, 2025

View reviewed changes

yungyuc requested changes Sep 28, 2025

View reviewed changes

yungyuc assigned ThreeMonth03 Sep 28, 2025

yungyuc added performance Profiling, runtime, and memory consumption array Multi-dimensional array implementation labels Sep 28, 2025

yungyuc changed the title ~~Accelerate the mean operations without axis.~~ Accelerate the mean operations without axis Sep 28, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Accelerate the mean operations without axis #589

Accelerate the mean operations without axis #589

Uh oh!

ThreeMonth03 commented Sep 26, 2025

Uh oh!

ThreeMonth03 left a comment

Uh oh!

ThreeMonth03 Sep 26, 2025

Uh oh!

ThreeMonth03 Sep 26, 2025

Uh oh!

ThreeMonth03 Sep 26, 2025

Uh oh!

yungyuc Sep 28, 2025

Uh oh!

yungyuc left a comment

Uh oh!

yungyuc Sep 28, 2025

Uh oh!

yungyuc Sep 28, 2025

Uh oh!

yungyuc Sep 28, 2025

Uh oh!

yungyuc Sep 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Accelerate the mean operations without axis #589

Are you sure you want to change the base?

Accelerate the mean operations without axis #589

Uh oh!

Conversation

ThreeMonth03 commented Sep 26, 2025

Uh oh!

ThreeMonth03 left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

yungyuc left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants