Feature/can rx perf #11

miDeb · 2025-07-09T21:19:54Z

wip, will get the opportunity to continue some work on this during the weekend

src/can/CANDriverKvaser.cpp

raffael0 · 2025-07-21T15:00:44Z

@miDeb are you still planning on working on this? Otherwise I'd take over.

miDeb · 2025-07-21T15:05:22Z

@raffael0 sorry for being inactive the last 2 weeks; feel free to take this over if you have the capacity

raffael0 · 2025-07-21T15:35:04Z

No worries. I'll see what I can do

use std::string_view and avoid unnecessary copies; instead of using 2 fixed-size buffers, use dynamically allocated strings that are then reused
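A rough sketch of the idea in this commit message, for readers who have not seen the diff. It is not the code from InfluxDbWriter.cpp: `appendPoint`, `formatBatch`, and the measurement and field names are hypothetical, and the record layout only loosely follows the InfluxDB line protocol. It shows `std::string_view` parameters (no temporary strings at the call site) and one caller-owned `std::string` that is cleared and refilled instead of reallocated.

```cpp
#include <cassert>
#include <cstdint>
#include <string>
#include <string_view>

// Hypothetical helper (not from the PR): appends one record to `out`,
// reusing whatever capacity `out` already has.
inline void appendPoint(std::string& out, std::string_view measurement,
                        std::string_view field, double value,
                        std::uint64_t timestamp)
{
    out.append(measurement);  // string_view argument: no temporary std::string
    out.push_back(' ');
    out.append(field);
    out.push_back('=');
    out.append(std::to_string(value));
    out.push_back(' ');
    out.append(std::to_string(timestamp));
    out.push_back('\n');
}

// Formats a small batch into a caller-owned buffer. clear() resets the length
// but, on common standard library implementations, keeps the allocated capacity.
inline const std::string& formatBatch(std::string& buffer,
                                      std::uint64_t firstTimestamp)
{
    buffer.clear();
    appendPoint(buffer, "sensors", "pressure", 1.5, firstTimestamp);
    appendPoint(buffer, "sensors", "temperature", 20.25, firstTimestamp + 1);
    return buffer;
}
```

With this shape, the buffer typically stops allocating once it has grown to the size of the largest batch, which is the point of replacing the two fixed-size buffers.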

…es the execution time of the callback thread, which will hopefully alleviate the weird sampling frequency drops. Additionally, the memory leak in Node.cpp was fixed

Copilot

Pull Request Overview

This PR introduces performance improvements for CAN reception, refactors the InfluxDB writer to use modern C++ features, and implements a threaded message processing system for Kvaser CAN drivers.

Key Changes:

- Refactors InfluxDB writer to use std::string buffers and std::format instead of C-style arrays and sprintf
- Introduces a dedicated thread-based message queue for Kvaser CAN reception using the readerwriterqueue library
- Modernizes CAN driver callback signatures with type aliases and lambda expressions

Reviewed Changes

Copilot reviewed 18 out of 18 changed files in this pull request and generated 5 comments.


| File | Description |
| --- | --- |
| src/logging/InfluxDbWriter.cpp | Refactored to use std::string buffers and std::format for data formatting |
| src/can/CanKvaserReceiveThread.cpp | New threaded message processing implementation for Kvaser CAN |
| src/can/CANManager.cpp | Updated to use lambda expressions instead of std::bind |
| include/can/CANDriver.h | Added type alias for CAN receive callback signature |
| CMakeLists.txt | Added readerwriterqueue dependency |
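To illustrate the callback change listed for CANManager.cpp and CANDriver.h, here is a minimal sketch of a type alias plus the `std::bind` and lambda variants side by side. The alias name `CanRxCallback`, its parameter list, and the `Manager` class are assumptions made for this example; the real signature in include/can/CANDriver.h may differ.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <functional>

// Hypothetical alias; the actual parameters in CANDriver.h are not shown in this PR page.
using CanRxCallback =
    std::function<void(std::uint8_t busChannel, std::uint32_t canId,
                       const std::uint8_t* payload, std::size_t length)>;

class Manager  // stand-in for CANManager
{
public:
    void onReceive(std::uint8_t, std::uint32_t canId,
                   const std::uint8_t*, std::size_t length)
    {
        lastId = canId;
        bytesSeen += length;
    }

    // Before: std::bind needs one placeholder per parameter.
    CanRxCallback makeBoundCallback()
    {
        using namespace std::placeholders;
        return std::bind(&Manager::onReceive, this, _1, _2, _3, _4);
    }

    // After: a lambda names the parameters and forwards them explicitly.
    CanRxCallback makeLambdaCallback()
    {
        return [this](std::uint8_t ch, std::uint32_t id,
                      const std::uint8_t* data, std::size_t len) {
            onReceive(ch, id, data, len);
        };
    }

    std::uint32_t lastId = 0;
    std::size_t bytesSeen = 0;
};
```

Both variants produce the same callable; the lambda mainly makes the forwarded arguments visible at the call site and gives clearer compiler errors when the signature changes.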


src/can/CanKvaserReceiveThread.cpp

src/logging/InfluxDbWriter.cpp

src/can/CanKvaserReceiveThread.cpp

src/logging/InfluxDbWriter.cpp

- added a mutex to the enqueue since sometimes the driver runs two callbacks at the same time

This reverts commit e95b536.

This reverts commit b7bfe5e.

raffael0 · 2025-08-31T13:00:22Z

@miDeb Can you please review the code? I had to remove your influx performance improvements because they threw very weird runtime exceptions (Resource busy). I think it may have to do with the multithreaded addDatapoint calls.

src/can/CanKvaserReceiveThread.cpp

miDeb · 2025-09-01T10:05:31Z

Is there a measurable perf improvement? And can we tell if it's good enough now?

Otherwise changes look good to me 👍

raffael0 · 2025-09-01T12:20:05Z

No, there is no measurable performance impact (yet). I am still evaluating it on the hardware.
Here is the sampling frequency from a test I did yesterday. No sequences or commands were sent during this time. As you can see, there wasn't a huge dip like the ones we've seen in previous coldflows/hotfires (the one at the start is a restart of the server). We are conducting a reference test today to trigger the frequency dip and verify that it improved.

What I already noticed is that the driver doesn't use a single thread for the callbacks, i.e. multiple threads may call the callback at the same time. In the previous solution this was just ignored, but now I added a mutex to prevent concurrent access to the queue. My current theory is that there is one thread per channel, but I haven't looked into it in detail so far.
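A minimal sketch of that mutex, under the one-thread-per-channel theory. `CanFrame`, `GuardedQueue`, and `simulateCallbacks` are hypothetical names, and a plain `std::queue` stands in for the actual queue from the readerwriterqueue library, which as far as I know is only safe with a single producer.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <mutex>
#include <queue>
#include <thread>
#include <vector>

struct CanFrame  // hypothetical, reduced to two fields
{
    std::uint32_t id;
    std::uint8_t channel;
};

class GuardedQueue
{
public:
    // May be called from any driver callback thread.
    void enqueue(const CanFrame& frame)
    {
        std::lock_guard<std::mutex> lock(producerMutex);
        frames.push(frame);
    }

    std::size_t size()
    {
        std::lock_guard<std::mutex> lock(producerMutex);
        return frames.size();
    }

private:
    std::mutex producerMutex;
    std::queue<CanFrame> frames;  // stand-in for the single-producer queue
};

// Simulates `channels` driver threads, each delivering `framesPerChannel` frames.
inline std::size_t simulateCallbacks(std::uint8_t channels,
                                     std::uint32_t framesPerChannel)
{
    GuardedQueue queue;
    std::vector<std::thread> threads;
    for (std::uint8_t ch = 0; ch < channels; ++ch)
    {
        threads.emplace_back([&queue, ch, framesPerChannel] {
            for (std::uint32_t i = 0; i < framesPerChannel; ++i)
                queue.enqueue(CanFrame{i, ch});
        });
    }
    for (auto& t : threads)
        t.join();
    return queue.size();  // no frame is lost when every enqueue takes the lock
}
```

The lock keeps frames from one channel in their arrival order, but the interleaving between channels depends on which thread acquires the mutex first.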

These multithreaded callbacks concern me a bit, since they might make the readouts non-deterministic and mess up the ordering of the datapoints. But since the software design is better than the previous solution, I do not think it got any worse, and I guess that's fine for now.

Edit: Also, I forgot to push the revert commits of your influx stuff. I pushed them now.
Edit: Here is the data rate from today's test. I think it's pretty safe to say that the patch fixes the issue.

raffael0 · 2025-09-01T17:26:50Z

The latest commit changes the dequeue to a blocking version with a 1 ms timeout. I looked at a profile and it looks a lot better.

This commit decouples the kvaser message receive callback from the message handling in the llserver. This is done by adding a new thread and a queue. Messages are added to the queue and removed/processed as soon as possible by a new message handling thread. Previously the long message parsing led to sampling rate dropoffs. The new version appears to have fixed the bug. Additionally, this commit fixes a memory leak in Node.cpp.
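The design described in this commit message can be sketched roughly as follows. This is not the code from CanKvaserReceiveThread.cpp: `ReceiveThread` and its members are hypothetical, frames are reduced to a CAN id, and a `std::condition_variable` with a 1 ms `wait_for` stands in for the blocking dequeue from the readerwriterqueue library so the example stays self-contained.

```cpp
#include <atomic>
#include <cassert>
#include <chrono>
#include <condition_variable>
#include <cstdint>
#include <deque>
#include <mutex>
#include <thread>

class ReceiveThread  // hypothetical stand-in for the new receive thread
{
public:
    ReceiveThread() : worker([this] { run(); }) {}
    ~ReceiveThread() { stop(); }

    // Called from the driver callback: only queues the frame and returns.
    void enqueue(std::uint32_t canId)
    {
        {
            std::lock_guard<std::mutex> lock(mutex);
            pending.push_back(canId);
        }
        notEmpty.notify_one();
    }

    // Lets the worker drain the queue, then joins it.
    void stop()
    {
        running = false;
        if (worker.joinable())
            worker.join();
    }

    std::uint64_t processedCount() const { return processed.load(); }

private:
    void run()
    {
        for (;;)
        {
            std::uint32_t canId = 0;
            {
                std::unique_lock<std::mutex> lock(mutex);
                // Blocking dequeue with a 1 ms timeout: sleeps while idle,
                // but wakes regularly to check the stop flag.
                if (!notEmpty.wait_for(lock, std::chrono::milliseconds(1),
                                       [this] { return !pending.empty(); }))
                {
                    if (!running)
                        return;  // stopped and nothing left to process
                    continue;
                }
                canId = pending.front();
                pending.pop_front();
            }
            handle(canId);  // slow work happens outside the lock
        }
    }

    void handle(std::uint32_t) { ++processed; }  // placeholder for message parsing

    std::mutex mutex;
    std::condition_variable notEmpty;
    std::deque<std::uint32_t> pending;
    std::atomic<bool> running{true};
    std::atomic<std::uint64_t> processed{0};
    std::thread worker;  // declared last so the other members exist before it starts
};
```

The callback path now costs one lock and one push, so a slow `handle` delays processing but no longer holds up the driver thread that delivers the frames.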

raffael0 · 2025-09-05T08:28:09Z

I merged manually

Fixes a bug where the whole llserver would freeze up for ~10 seconds. Ultimately InfluxDB is responsible for this, since it sometimes takes >200 ms to respond to an insert instead of the normal 2-20 ms. This PR fixes it by **completely** decoupling the sending thread from the rest of the llserver. I believe that this is also the root cause behind #11.

miDeb commented Jul 9, 2025

src/can/CANDriverKvaser.cpp (outdated)

miDeb and others added 4 commits August 28, 2025 19:05

modernize InfluxDbWriter.cpp

b7bfe5e

use std::string_view and avoid unnecessary copies; instead of using 2 fixed-size buffers, use dynamically allocated strings that are then reused

move handling of can rx messages to new thread

69c352a

create new sockets when launching influx sender threads

e95b536

Messages are now cached in a queue before being processed. This reduc…

0d5ba54

…es the execution time of the callback thread, which will hopefully alleviate the weird sampling frequency drops. Additionally, the memory leak in Node.cpp was fixed

raffael0 force-pushed the feature/can-rx-perf branch from 5cc386e to 0d5ba54 Compare August 28, 2025 17:05

raffael0 requested a review from Copilot August 28, 2025 21:52

Copilot AI reviewed Aug 28, 2025


raffael0 and others added 4 commits August 30, 2025 20:11

- Fixed suggestions

681de9c

- added a mutex to the enqueue since sometimes the driver runs two callbacks at the same time

removed move operation which broke the reference

cfcde19

Revert "create new sockets when launching influx sender threads"

7a1bde6

This reverts commit e95b536.

Revert "modernize InfluxDbWriter.cpp"

dcb0315

This reverts commit b7bfe5e.

miDeb commented Sep 1, 2025

src/can/CanKvaserReceiveThread.cpp (outdated)

raffael0 marked this pull request as ready for review September 1, 2025 12:20

changed the queue to a blocking queue

1420327

raffael0 closed this Sep 5, 2025
