Simple implementation of malloc and free with sbrk system call.

To get familiar with heap memory operation in linux, I try to implement a easy one of malloc and free with leverage of system call sbrk().

void *malloc(size_t size);
// return pointer to allocted heap memory with assigned size.
void free(void *ptr);
// release the allocated heap memory with *ptr.
void * sbrk(intptr_t incr);
// return prior program break.

First, we start the topic with some notes about sbrk:

  • The current value of the program break may be determined by calling sbrk(0).
  • The sbrk() function raises the program break by incr bytes, thus allocating at least incr bytes of new memory in the data segment.
  • The sbrk() function returns the prior program break value if successful; otherwise the value (void *)-1 is returned and the global variable errno is set to indicate the error.

malloc() and free() are wrapper functions of brk() and sbrk() to manipulate program break and maintain a free list holding current unused segment in heap memory. It could decrease the frequency of using system call for moving program break by dispatching and recycling memory within this free list.

Then we talk about my implementation, here is the link: https://github.com/bladesu/linux_app_practice/blob/master/memory_allocate/impl_malloc_free.c

The definition of struct for memory block in free list call MBLOCK in my implementation. Here I define last = NULL for the first one in free list and next = NULL for tail one. In my implementation, a little track here is that the tail block can not be returned or merged into larger block for shorten the flow control and reducing the number of code.

typedef struct __MBL
{
    size_t length;
    struct __MBL *last;
    struct __MBL *next;
} MBLOCK;

In heap memory, a MBLCOK should look like following adjacent pattern:

Low memory address  --> high memory address

MBLOCK *block= 0x50000000                      MBLOCK *next_block= 0x50000100
        |<----------length=0xff------------->|
[lengthL][addressL][addressL]     ...          [lengthL][addressL][addressL]]
                                            
                 content=(address to next free block)=0x500000ff
        content=(address to last free block)=...
content=0xff

For a dispatched heap memory segment with required length = length1, it would look like following pattern:

Low memory address  --> high memory address

         head of dispatched block 0x50000008     head of other free block  
[lengthL][  some data   ]        ...              [lengthL][addressL][addressL] ...
        |<-- length1-->|         

content=length1

Next, we are going to talk about some complicated part in the impl. I think most difficult part is considering the boundary of data. Let us look some implemented function.

About the idea of split() function, which to split free block with assiged length into allocated one (left) and smaller free block(right).

/* Original block on free list*/
head of free block (MBLOCK)
[lengthL][addressL][addressL][<-               __free_space1                   ->]
         |<------------------------ "block->length" ---------------------------->|
/* Split one free block into two */                                             
         head allocated block       remained free block                         
[lengthL][<-   allocated space    ->][lengthL][addressL][addressL][ __free_space2]
         |<-   assigned length    ->|                                             
  • code of split():
void split(MBLOCK *_block, const size_t _length)
{
    MBLOCK *right = (MBLOCK *)(((void *)_block) + _length + sizeof(size_t));
    right->next = _block->next;
    right->last = _block->last;
    right->length = _block->length - _length - sizeof(size_t);

    MBLOCK *left = _block;
    left->length = _length;
    // It is __HEAD
    if (_block->last == NULL)
    {
        // move __HEAD to right
        __HEAD = right;
    }
}

A critical part I think that is the heap memory expansion when no more available heap space to provide. Here the function __expand_newblock(MBLOCK *last_block, intptr_t _length) attach a space to current program break and make a new block to free list.

  • idea of __expand_newblock()
1. Current end of memory around program break.
     (current program break)
[lengthL][addressL][addressL]

2. Expansion for requested _length byte.
                                                             
[lengthL][addressL][addressL][   "_length" - 2 * addressL    ]
Asked    |<----------------- "_length" --------------------->|                           
Expanded mem_size            |<----------------------------->|[lengthL][addressL][addressL]
                                                                       (new program break)
// Therefore, 
// Expanded = _length + (- 2 * addressL + 2 * addressL)  + lengthL. = _length + lengthL
  • code: __expand_newblock()
/*
    append new allocated heap space to last_block.
*/
void *__expand_newblock(MBLOCK *last_block, intptr_t _length)
{
    intptr_t mem_size = (_length > _ALLOCK_INTERVAL) ? _length + sizeof(size_t) : _ALLOCK_INTERVAL;
    void *ptr;
    // EXPAND __CURRENT_HEAP_PTR
    if ((ptr = __get_expanded_heap(mem_size)) != NULL)
        __CURRENT_HEAP_PTR = ptr;
    else
        return NULL;
    // attach new block
    last_block->length = _length;
    last_block->next = __CURRENT_HEAP_PTR - sizeof(MBLOCK) + sizeof(size_t);
    // fill data of the new block.
    last_block->next->last = last_block;
    last_block = last_block->next;
    last_block->length = sizeof(MBLOCK) - sizeof(size_t);
    last_block->next = NULL;
    return ptr;
}

About free(), there are 3 induced cases in this implementation. It should be noted the block to join back to free list would not located after the least one of free block.

  • case 1: The block to join is located in front of first free block.
[ block to join ]       [ first free block ]
  • case 2: The block to join is located in front of first free block with no additional space.
[ block to join ][ first free block ]
  • case 3: The block to join is located between two free block. [ free block i ] [ block to join ] [ free block i+1 ]

More detail in my github: https://github.com/bladesu/linux_app_practice/blob/master/memory_allocate/impl_malloc_free.c

References:

comments powered by Disqus