Effective way to get data in and out of an AudioFrame #194

microbit-carlos · 2024-04-11T15:10:37Z

This is mostly an issue to discuss what is the most effective way to get data into and from an AudioFrame and if anything needs to be implemented to improve this.

For use cases where we want to send and receive microphone recordings, one approach would be to record a few seconds of audio into an AudioFrame, break it down into smaller chunks to be sent, and then stitch them back together on the other side.

When trying to build a few examples with radio we tried a few different methods and had some struggles, so these are some of the notes from that.

Options for breaking down a larger AudioFrame into smaller chunks

The main approach we followed was to create a bytes or bytearray object from the AudioFrame and use slicing
- This works but creates an unnecessary copy of the data, which takes time and memory
- Lists should also work, but would be more wasteful of resources
The most efficient way, that currently works, is possibly to use a memoryview
- This is a lesser-known feature, which we haven't really used or documented in the past for micro:bit users, so while perfectly valid, we would need to make it more visible in the docs
Another thing we tried, but didn't work, was slicing the AudioFrame directly
- AudioFrame doesn't accept negative indexes or slices #188
- This might be the most intuitive way to do this
Are there any other ways to do this?

So my suggestion would be to go from something like:

my_audio = microphone.record(duration=AUDIO_DURATION)
audio_mv = memoryview(my_audio)
for i in range(0, len(my_audio), PACKET_SIZE):
    radio.send_bytes(audio_mv[i:i+PACKET_SIZE])

To skip the memory view and be able to use slices directly (#188):

my_audio = microphone.record(duration=AUDIO_DURATION)
for i in range(0, len(my_audio), PACKET_SIZE):
    radio.send_bytes(my_audio[i:i+PACKET_SIZE])

It's a small change, but I think it can help avoid users converting to a bytes object instead:

my_audio = microphone.record(duration=AUDIO_DURATION)
audio_bytes = bytes(my_audio)
for i in range(0, len(audio_bytes), PACKET_SIZE):
    radio.send_bytes(audio_bytes[i:i+PACKET_SIZE])
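
As a rough illustration of why the memoryview route avoids copies, here is a sketch that runs on desktop Python, with a plain bytearray standing in for the AudioFrame (an assumption, since the real object only exists on the micro:bit). The `iter_chunks` helper is hypothetical and not part of any micro:bit API. Each chunk it yields is a view onto the original buffer rather than a new bytes object:

```python
def iter_chunks(buffer, packet_size):
    """Yield consecutive packet_size-long views of buffer without copying."""
    view = memoryview(buffer)
    for start in range(0, len(view), packet_size):
        # Slicing a memoryview creates another view, not a copy of the data.
        yield view[start:start + packet_size]


# Assumption: a bytearray stands in for the AudioFrame from microphone.record().
fake_audio = bytearray(range(10))
chunks = list(iter_chunks(fake_audio, 4))
chunk_lengths = [len(chunk) for chunk in chunks]

# A write to the underlying buffer is visible through the first chunk,
# which shows that the chunk shares memory with the original data.
fake_audio[0] = 255
first_chunk_sees_change = chunks[0][0] == 255
```

The last chunk is shorter than the others when the buffer length is not a multiple of the packet size, so the receiving side should not assume every packet is full.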

Options for combining smaller chunks into a larger AudioFrame

Modules like radio, uart and spi have the option to either return a bytes object, or write into an existing buffer.
There isn't a way for the receive_into() methods to write into a buffer offset, so we cannot write directly into a larger buffer
The micro:bit version of MicroPython doesn't support bytearray slice assignment (e.g. my_bytearray[1:3] = (1,2)), so I don't think there is currently a way to inject the bytes data directly into a pre-allocated larger buffer
We do have the bytearray.extend() method, so we can grow a bytearray as we receive data packets
- Not sure what the allocation policy is, or how this is implemented internally in MicroPython, but there is the potential for wasteful allocations while growing the bytearray
Ideally we could add the received data directly into an AudioFrame.
- Updating all the receive_into() methods is more intrusive than updating AudioFrame, so updating AudioFrame would be my preferred option
- Option a) slice assignment: my_audioframe[i:i+PACKET_SIZE] = received_bytes
- Option b) provide a new method similar to insert(i, x) but that can take a buffer
- Option c) update existing copyfrom(buffer, index=0)
- My preference would be c), as it feels intuitive enough, and insert() already exists, so for b) we would have to find a different name

So my suggestion would be to go from something like:

my_audio = audio.AudioFrame(duration=AUDIO_DURATION)
audio_bytes = bytearray()
packets_received = 0
while packets_received < TOTAL_PACKETS:
    radio_data = radio.receive_bytes()
    if radio_data:
        audio_bytes.extend(radio_data)
        packets_received += 1
my_audio.copyfrom(audio_bytes)
audio.play(my_audio)

To something like this:

my_audio = audio.AudioFrame(duration=AUDIO_DURATION)
packets_received = 0
while packets_received < TOTAL_PACKETS:
    radio_data = radio.receive_bytes()
    if radio_data:
        my_audio.copyfrom(radio_data, packets_received * PACKET_SIZE)
        packets_received += 1
audio.play(my_audio)

It's only two lines, but it could save us from expensive reallocations that would also increase memory fragmentation.
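
To make the intended behaviour of option c) more concrete, here is a desktop-Python sketch of what copyfrom(buffer, index) could do. `FakeAudioFrame` is a hypothetical stand-in written for this illustration, not the real AudioFrame. The bounds check and the ValueError are assumptions about how an out-of-range write might be handled. It relies on memoryview slice assignment, which works on CPython but is not something I have confirmed on the micro:bit port:

```python
class FakeAudioFrame:
    """Hypothetical stand-in for AudioFrame with the proposed copyfrom()."""

    def __init__(self, size):
        self._data = bytearray(size)  # allocated once, never resized

    def __len__(self):
        return len(self._data)

    def __bytes__(self):
        return bytes(self._data)

    def copyfrom(self, buffer, index=0):
        # Assumed semantics: copy buffer into the frame starting at index.
        end = index + len(buffer)
        if index < 0 or end > len(self._data):
            raise ValueError("buffer does not fit at this index")
        # Same-length slice assignment writes in place without reallocating.
        memoryview(self._data)[index:end] = buffer


PACKET_SIZE = 4
original = bytes(range(12))
# Simulated radio packets, in the order they would be received.
packets = [original[i:i + PACKET_SIZE] for i in range(0, len(original), PACKET_SIZE)]

frame = FakeAudioFrame(len(original))
for packets_received, radio_data in enumerate(packets):
    frame.copyfrom(radio_data, packets_received * PACKET_SIZE)

reassembled = bytes(frame)
```

The frame is allocated once at its final size and every packet lands at its own offset, so nothing grows or gets reallocated during reception.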

microbit-carlos · 2024-04-29T18:29:55Z

A lot of this functionality could be covered via:

AudioFrame proposal: Reference external buffer #205

microbit-carlos · 2024-07-18T15:33:07Z

This has been enabled by the feature as described in:

AudioFrame proposal: Reference external buffer #205 (comment)

microbit-carlos added this to the 2.2.0-beta.1 milestone Apr 11, 2024

This was referenced Apr 12, 2024

AudioFrame internal used_size marker accessors #196

Closed

WIP: Audio recording and playback #163

Draft

microbit-carlos mentioned this issue Apr 26, 2024

AudioFrame.copyfrom() should move the internal used_size marker #190

Closed

microbit-carlos mentioned this issue May 2, 2024

AudioFrame proposal: Reference external buffer #205

Closed

microbit-carlos closed this as completed Jul 18, 2024
