Introduce a long lived section of the heap. #547
Conversation
…loc. gc_alloc's API is changing and we shouldn't need to care about it. So, we switch to m_malloc which has the default behavior we expect.
string instead of the heap.
I'm still looking into fixing the tests, so sit tight.
I would like to go first :)
e0795f6 to 51e361c
It can now render the heap layout over a sequence of ram dumps. The mpy analysis is also better at parsing mpy files.
51e361c to da330f0
Ok, this is ready for review.
py/gc.c
Outdated
MP_STATE_MEM(gc_last_free_atb_index) = 0;
// Set last free ATB index to the end of the heap.
MP_STATE_MEM(gc_last_free_atb_index) = MP_STATE_MEM(gc_alloc_table_byte_len) - 1;
Lines 150 and 152 are both setting MP_STATE_MEM(gc_last_free_atb_index), so line 150 is wrong or redundant?
150 was wrong. Good catch!
py/gc_long_lived.c
Outdated
mp_raw_code_t* raw_code = MP_OBJ_TO_PTR(fun_bc->const_table[i]);
if (raw_code->kind == MP_CODE_BYTECODE) {
    raw_code->data.u_byte.bytecode = gc_make_long_lived((byte*) raw_code->data.u_byte.bytecode);
// TODO(tannewt): Do we actually want to recurse here? |
Is this still a question?
I'm still unsure about it but the comment isn't useful so I removed it.
py/gc_long_lived.c
Outdated
fun_bc->const_table = gc_make_long_lived((mp_uint_t*) fun_bc->const_table);
// extra_args stores keyword only argument default values.
size_t words = gc_nbytes(fun_bc) / sizeof(mp_uint_t*);
for (size_t i = 0; i < words - 4; i++) { |
What's the 4? Is that a number of bytes? Could it be 8 on 64-bit-word impls?
It's the number of pointers stored in mp_obj_fun_bc_t before the extra_args array. Is there another way to get the array length?
The struct defn is:
typedef struct _mp_obj_fun_bc_t {
mp_obj_base_t base;
mp_obj_dict_t *globals; // the context within which this function was defined
const byte *bytecode; // bytecode for the function
const mp_uint_t *const_table; // constant table
// the following extra_args array is allocated space to take (in order):
// - values of positional default args (if any)
// - a single slot for default kw args dict (if it has them)
// - a single slot for var args tuple (if it takes them)
// - a single slot for kw args dict (if it takes them)
mp_obj_t extra_args[];
} mp_obj_fun_bc_t;
I'm not sure why it doesn't say [4]. Then I think you could use sizeof(). And if it's a VLA (variable length array), you can use sizeof() also. Found this: https://stackoverflow.com/questions/14995870/behavior-of-sizeof-on-variable-length-arrays-c-only. But I'm not sure this is worth fixing.
I guess it's allocated separately and assigned there?
It's done through a cast, so I'm not sure if sizeof would work: https://github.com/adafruit/circuitpython/blob/master/py/objfun.c#L356
Fantastic!
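For what it's worth, here is a hedged sketch of the alternative floated above: deriving the header size from the struct layout with offsetof rather than hard-coding 4. This only illustrates the idea and assumes mp_obj_fun_bc_t comes from py/objfun.h; it is not what the PR ended up doing.

#include <stddef.h>
#include "py/objfun.h"  // for mp_obj_fun_bc_t (header name assumed)

// Number of pointer-sized words in mp_obj_fun_bc_t before the extra_args
// flexible array member. offsetof is valid on a struct with a flexible
// array member in C99, so this avoids the magic number 4.
#define FUN_BC_HEADER_WORDS (offsetof(mp_obj_fun_bc_t, extra_args) / sizeof(mp_obj_t))

// The loop bound from the diff above could then read:
//   size_t words = gc_nbytes(fun_bc) / sizeof(mp_uint_t*);
//   for (size_t i = 0; i < words - FUN_BC_HEADER_WORDS; i++) { ... }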
This adapts the allocation process to start from either end of the heap
when searching for free space. The default behavior is identical to the
existing behavior where it starts with the lowest block and looks higher.
Now it can also look from the highest block and lower depending on the
long_lived parameter to gc_alloc. As the heap fills, the two sections may
overlap. When they overlap, a collect may be triggered in order to keep
the long lived section compact. However, free space is always eligible
for each type of allocation.
The heap previously would end up looking something like:
[image: couldn't crisp 20k in]
Afterwards it's:
[image: krispy 20k with long lived]
Video of it working here: https://www.youtube.com/watch?v=S0uEZqxOWOc
By starting from either end of the heap we have the ability to separate
short lived objects from long lived ones. This separation reduces heap
fragmentation because long lived objects are easy to densely pack.
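A rough way to picture the search direction is the toy model below. The names and the bool bitmap are invented for illustration; the real py/gc.c works on its 2-bit allocation table and is more involved.

// Toy model of the two-ended search, for illustration only.
#include <stdbool.h>
#include <stddef.h>

#define NUM_BLOCKS 64
static bool block_used[NUM_BLOCKS];

// Find n free blocks, scanning low-to-high for short lived allocations and
// high-to-low for long lived ones. Returns the first block index or -1.
static int find_free_run(size_t n, bool long_lived) {
    if (n == 0 || n > NUM_BLOCKS) {
        return -1;
    }
    if (!long_lived) {
        for (size_t i = 0; i + n <= NUM_BLOCKS; i++) {
            size_t run = 0;
            while (run < n && !block_used[i + run]) {
                run++;
            }
            if (run == n) {
                return (int)i;
            }
        }
    } else {
        for (size_t i = NUM_BLOCKS; i >= n; i--) {
            size_t start = i - n;
            size_t run = 0;
            while (run < n && !block_used[start + run]) {
                run++;
            }
            if (run == n) {
                return (int)start;
            }
        }
    }
    return -1; // in the real allocator this is where a collect could be triggered
}

Short lived allocations then fill the heap from the bottom while long lived ones pack against the top, which is what keeps the long lived section dense.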
Most objects are short lived initially but may be made long lived when
they are referenced by a type or module. This involves copying the
memory and then letting the collect phase free the old portion.
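The promotion step can be pictured roughly as an allocate-from-the-top plus copy. This is a hedged sketch that assumes a gc_alloc taking a long_lived flag as described above; it is not a copy of the PR's actual gc_make_long_lived code.

#include <stdbool.h>
#include <stddef.h>
#include <string.h>
#include "py/gc.h"

// Illustrative sketch of promoting an object to the long lived section.
void *make_long_lived(void *ptr) {
    if (ptr == NULL) {
        return NULL;
    }
    size_t n_bytes = gc_nbytes(ptr);              // size of the existing block
    void *copy = gc_alloc(n_bytes, false, true);  // allocate from the high end of the heap
    if (copy == NULL) {
        return ptr;                               // out of memory: keep the old block
    }
    memcpy(copy, ptr, n_bytes);
    // The old block is intentionally not freed here; once nothing references
    // it any more, the next collect phase reclaims it, as described above.
    return copy;
}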
QSTR pools and chunks are always long lived because they are never freed.
The reallocation, collection and free processes are largely unchanged. They
simply also maintain an index to the highest free block as well as the lowest.
These indices are used to speed up the allocation search until the next collect.
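Continuing the toy sketch from above, that bookkeeping amounts to two cached search-start indices which allocations move inwards and which frees (or a sweep) push back outwards. Again, this is illustrative only, not the PR's exact code.

// Illustrative bookkeeping for the two cached search-start indices.
static size_t lowest_free_block;   // where the next short lived search begins
static size_t highest_free_block;  // where the next long lived search begins

// Called when blocks [start, end] are freed (e.g. during the sweep phase).
static void note_blocks_freed(size_t start, size_t end) {
    if (start < lowest_free_block) {
        lowest_free_block = start;
    }
    if (end > highest_free_block) {
        highest_free_block = end;
    }
}

// Called when blocks [start, end] are taken by an allocation.
static void note_blocks_taken(size_t start, size_t end, bool long_lived) {
    if (!long_lived && start == lowest_free_block) {
        lowest_free_block = end + 1;      // next short lived search starts higher
    }
    if (long_lived && end == highest_free_block && start > 0) {
        highest_free_block = start - 1;   // next long lived search starts lower
    }
}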
In practice, this change may slightly slow down import statements with the
benefit that memory is much less fragmented afterwards. For example, a test
import into a 20k heap that leaves ~6k free previously had a largest
contiguous free space of ~400 bytes. After this change, the largest contiguous
free space is over 3400 bytes.