srdja / Collections-C
A library of generic data structures for the C language.
Home Page: http://srdja.github.io/Collections-C
License: GNU Lesser General Public License v3.0
slist.c:744:20: warning: Access to field 'next' results in a dereference of a null pointer (loaded from variable 'flip')
flip->next = prev;
~~~~ ^
The questionable code is at https://github.com/srdja/Collections-C/blob/master/src/slist.c#L747
for (i = 0; i < list->size; i++) {
flip->next = prev;
prev = flip;
flip = next;
next = next ? next->next : NULL;
}
As I understand it, clang is worried about the loop's termination condition: on one hand, the list size is used as the loop bound; on the other hand, next is checked against NULL on each iteration, which suggests to the analyzer that the size may not yet be exhausted while flip is already NULL. Maybe it's worth changing the loop to iterate by following the next pointers until NULL, to make it clearer?
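As a hedged sketch of that suggestion, using a minimal stand-in node type (not the library's actual node definition): terminating on the node pointer itself, rather than on a counter, makes the non-NULL invariant visible to the analyzer.

```c
#include <assert.h>
#include <stddef.h>

/* Minimal stand-in for the library's list node; illustrative only. */
struct node {
    int val;
    struct node *next;
};

/* Reverse by walking until the current node is NULL instead of
 * counting up to list->size, so `flip` is provably non-NULL at
 * every dereference. */
static struct node *reverse(struct node *head)
{
    struct node *prev = NULL;
    struct node *flip = head;

    while (flip) {
        struct node *next = flip->next;
        flip->next = prev;
        prev = flip;
        flip = next;
    }
    return prev;
}
```

The behavior is unchanged for a well-formed list; the only difference is that the exit condition and the dereference now use the same variable.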
I would like to point out that identifiers like "__HASHTABLE_H__" and "__TREESET_H_" do not conform to the naming rules of the C language standard: identifiers that begin with an underscore followed by an uppercase letter, or that contain a double underscore, are reserved for the implementation. Would you like to adjust your selection to unique, non-reserved names?
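For illustration, a guard spelling that stays outside the reserved namespace could look like this (the exact prefix is only a suggestion, not an existing project convention):

```c
/* No leading underscore, no double underscore anywhere. */
#ifndef COLLECTIONS_C_HASHTABLE_H
#define COLLECTIONS_C_HASHTABLE_H

/* ... declarations ... */

#endif /* COLLECTIONS_C_HASHTABLE_H */
```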
There should be a more convenient way to free data after it's removed from a container. I suggest adding a callback parameter to the remove-type functions, so that instead of:
void *e;
container_remove_at(c, 3, &e);
free(e);
we could write something like this:
container_remove_at(c, 3, NULL, free);
Nice to see you!
I've been reading the source code of Array and found that it only stores the address of each element. All of the test code adds automatic variables from the same scope to the array. If I add an automatic variable from a different scope and then access it from the original scope, the program crashes with a core dump, because the variable from the other scope has been destroyed.
array_get(ar, 0, (void*) &str);
Shouldn't it be:
array_get_at(ar, 0, (void*) &str);
Currently all errors are reported by returning an error code, which is in itself not a problem; however, if a function should also return a value, problems arise. One is that the error codes have to share a type with the return value, and second, the error codes limit the range of valid values that the function can return.
A better approach would be to reserve the function's return value for errors, while having the function's result returned through a parameter.
For example:
int foo_new(struct foo *f, int baz) {
    if (baz) {
        *f = (struct foo) {.bar = 3};
        return OK;
    } else {
        // something went wrong
        return ERR;
    }
}
This way the caller can check the return code to see if the value was set or not before using it.
If we did the inverse where the result is returned through function return and the error code set through a parameter like so:
struct foo foo_new(int baz, int *status) {
    if (baz) {
        *status = OK;
        return (struct foo) {.bar = 3};
    } else {
        // something went wrong
        *status = ERR;
        return ??? // What do we return?
    }
}
We wouldn't be able to handle failures properly.
The return-error / set-result approach may not be as ergonomic as the original way, but it allows for more flexibility in error reporting.
You should be able to perform a binary search on a sorted array.
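A sketch of what such a function could look like; buffer and size stand in for the Array's internals, array_binary_search and int_cmp are hypothetical names, and the comparator follows the qsort(3) convention.

```c
#include <assert.h>
#include <stddef.h>

/* Classic binary search over an array of element pointers.
 * Returns the index of a match, or (size_t)-1 if not found.
 * This is a proposed sketch, not an existing library function. */
static size_t array_binary_search(void **buffer, size_t size, void *key,
                                  int (*cmp)(const void *, const void *))
{
    size_t lo = 0, hi = size;

    while (lo < hi) {
        size_t mid = lo + (hi - lo) / 2; /* avoids overflow of lo + hi */
        int c = cmp(buffer[mid], key);
        if (c == 0)
            return mid;
        if (c < 0)
            lo = mid + 1;
        else
            hi = mid;
    }
    return (size_t)-1;
}

/* qsort-style comparator for int elements. */
static int int_cmp(const void *a, const void *b)
{
    int x = *(const int *)a, y = *(const int *)b;
    return (x > y) - (x < y);
}
```

The same comparator the caller used to sort the array would be passed in, so the search order and the sort order always agree.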
Ring buffers (or circular buffers) are pretty important data structures that are used in many embedded applications (keyboards, drivers, etc). They're pretty similar to queues and are often pretty easy to implement, as the data they store usually consists of relatively small integer values.
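A minimal sketch of the idea, with illustrative names only, using a fixed power-of-two capacity so the wrap-around is a cheap bit mask rather than a modulo:

```c
#include <assert.h>
#include <stddef.h>

#define RB_CAP 8 /* must be a power of two for the mask trick */

/* Fixed-capacity FIFO ring buffer; head is the next read slot,
 * tail the next write slot, count the number of stored items. */
struct ring {
    int buf[RB_CAP];
    size_t head, tail, count;
};

static int rb_push(struct ring *r, int v)
{
    if (r->count == RB_CAP)
        return 0; /* full */
    r->buf[r->tail] = v;
    r->tail = (r->tail + 1) & (RB_CAP - 1);
    r->count++;
    return 1;
}

static int rb_pop(struct ring *r, int *out)
{
    if (r->count == 0)
        return 0; /* empty */
    *out = r->buf[r->head];
    r->head = (r->head + 1) & (RB_CAP - 1);
    r->count--;
    return 1;
}
```

Both operations are O(1) and allocation-free after setup, which is exactly what makes the structure attractive in interrupt handlers and drivers.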
Collections with iterators should also have zip iterators.
ArrayZipIterator zip;
array_zip_iter_init(&zip, array1, array2);
void *e1, *e2;
while (array_zip_iter_next(&zip, &e1, &e2) != CC_ITER_END) {
...
}
Basically this gives you a simpler and more efficient API when you want to iterate over two collections in lockstep.
When using list_add_all(list1, list2), calling list_destroy_free on both lists will cause a SIGSEGV. The solution, as I see it, is that list_add_all needs to do a deep copy of the data.
Looking at list_add_all, it calls list_add, which in turn calls list_add_last. That function creates a new Node in the list, but the data is not copied from the input; only the pointers are copied. I suspect the element size will need to be known somehow at copy time.
Minimal scenario to reproduce the issue:
HashTable *ht;
hashtable_new(&ht);
hashtable_add(ht, "1", "1");
hashtable_add(ht, "2", "2");
hashtable_add(ht, "3", "3");
Array *arr;
hashtable_get_values(ht, &arr);
printf("Array size: %i\n", (int)array_size(arr));
array_remove_at(arr, 0, NULL);
printf("Array size: %i\n", (int)array_size(arr));
Expecting to see
Array size: 3
Array size: 2
Getting
Array size: 3
==19255== Invalid read of size 8
==19255== at 0x4C2DECE: memcpy@GLIBC_2.2.5 (vg_replace_strmem.c:1021)
==19255== by 0x5E41A48: array_remove_at (in /.../reqs/lib/libcollectc.so)
==19255== ...
==19255== Address 0xb01b358 is 0 bytes after a block of size 24 alloc'd
==19255== at 0x4C29BC3: malloc (vg_replace_malloc.c:299)
==19255== by 0x5E414D4: array_new_conf (in /.../reqs/lib/libcollectc.so)
==19255== by 0x5E459FF: hashtable_get_values (in /.../reqs/lib/libcollectc.so)
==19255== ...
==19255==
Array size: 2
If I create the array manually using array_new, I don't get the same issue. Looking at the source code of array_remove_at, I can see there is a call to memmove (https://github.com/srdja/Collections-C/blob/master/src/array.c#L317), which I'm guessing calls memcpy internally, but unfortunately I have no guesses as to why that produces the error.
Any ideas what might be causing it and whether there is a way to work around it?
Hi,
The treetable data structure compares the pointers that are added as keys. The function used is cc_common_comp_ptr. Since it is possible to add any pointer, this data structure will often compare pointers that come from different objects. In general, this is also the function you use to check whether two pointers are equal, in the context of ArrayContains for example.
However, comparing unrelated pointers using < and > is undefined behavior. It is not guaranteed that if ptr1 < ptr2 returns 0, then ptr2 < ptr1 will not also return 0, even if the pointers are different. This might happen due to compiler optimisations.
See this description.
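One common workaround is to compare the pointers' integer representations instead; the uintptr_t conversion is implementation-defined, but on any single implementation it yields one consistent total order, which is all an ordered container needs. A sketch (ptr_cmp is a hypothetical name):

```c
#include <assert.h>
#include <stdint.h>

/* Comparator that avoids relational comparison of unrelated
 * pointers by comparing their uintptr_t representations. The
 * conversion is implementation-defined, but on a given
 * implementation it gives a consistent total order. */
static int ptr_cmp(const void *a, const void *b)
{
    uintptr_t x = (uintptr_t)a, y = (uintptr_t)b;
    return (x > y) - (x < y);
}
```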
In the file collections-c.h, vecor.h is referenced, but that file does not exist in the src directory.
valgrind shows many memory leaks when running list_test, and they do not disappear when calling list_destroy_free(). The function list_remove_all_free(), called by list_destroy_free(), looks suspicious, as it calls unlink_all(list, false) while the second parameter is supposed to be true, as I understand it.
However, after changing it to true, the binary starts failing on memory allocation:
(gdb) backtrace
#0 0x00007ffff7aaeb9d in _int_malloc () from /usr/lib/libc.so.6
#1 0x00007ffff7ab141a in calloc () from /usr/lib/libc.so.6
#2 0x0000000000403b89 in list_add_last (list=0x609010, element=0x6092d0) at ../src/list.c:190
#3 0x0000000000401d1e in list_1234 (l=l@entry=0x7fffffffe5f0) at list_test.c:1486
#4 0x00000000004028ad in test_list_remove_at () at list_test.c:959
#5 0x0000000000400728 in main (argc=<optimized out>, argv=<optimized out>) at list_test.c:77
Similar to
list_contains_value(List *list, void *element, int (*cmp) (const void*, const void*))
{
....
if (cmp(node->data, element) == 0)
e_count++;
....
}
Newbie here, so please bear with me, but I think I am missing something obvious.
What is the purpose of hashtable_hash_ptr @ https://github.com/srdja/Collections-C/blob/master/src/hashtable.c#L918 ? My understanding is that it takes a pointer value and computes a hash of it.
But since a pointer is 4-8 bytes on most systems, would it not be more convenient to just use the pointer value itself as the "hash" and save the hashing operations?
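For what it's worth, the pointer value is a valid hash, but allocator alignment tends to zero out its low bits, and a table that selects buckets with hash & (capacity - 1) would then use only a fraction of its buckets. A sketch contrasting the identity hash with a mixing step (the mixer shown is the MurmurHash3 finalizer, used here purely for illustration; it is not necessarily what the library uses):

```c
#include <assert.h>
#include <stdint.h>

/* Identity hash: cheap, but for 16-byte-aligned allocations the
 * low 4 bits are always zero, so bucket selection via
 * `hash & (capacity - 1)` clusters badly. */
static uint64_t hash_identity(const void *p)
{
    return (uint64_t)(uintptr_t)p;
}

/* MurmurHash3 64-bit finalizer: spreads entropy from the high
 * bits into the low bits that bucket indexing actually uses. */
static uint64_t hash_mixed(const void *p)
{
    uint64_t h = (uint64_t)(uintptr_t)p;
    h ^= h >> 33;
    h *= 0xff51afd7ed558ccdULL;
    h ^= h >> 33;
    h *= 0xc4ceb9fe1a85ec53ULL;
    h ^= h >> 33;
    return h;
}
```

So the hashing step trades a few arithmetic operations for a much more uniform bucket distribution.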
Thanks.
This function does not remove the element that was last returned by deque_iter_next; it removes the next item.
Collections should have *_filter and *_filter_mut functions so that one may do something like this:
bool predicate(void *e) {
    int el = *((int*) e);
    if (el > 5) return true;
    return false;
}
...
// filters out all integers greater than 5 by mutating the array
array_filter_mut(array1, predicate);
// filters out all integers greater than 5 and stores the filter results in a new array
Array *new_array;
if (array_filter(array2, predicate, &new_array) != CC_OK)
...
After list_iter_init initializes the iterator, iter->last is assigned NULL.
Then, if list_iter_add is invoked, the line link_after(iter->last, new_node); will crash.
table->key_cmp(replace->key, key) == 0 --> fails since replace has a key of NULL and strcmp cannot handle NULL pointers.
Hi,
First of all, thank you for the cool implementations of the collections. I'm using the array implementation to implement a heap.
I have this function:
bool heap_insert(Heap *heap, void *element)
{
    size_t pos = array_size(heap->v) + 1;
    for (; pos > 1; pos = pos / 2)
        if (element > array_get(heap->v, pos / 2))
            if (!array_add_at(heap->v, array_get(heap->v, pos / 2), pos))
                return false;
    if (pos > array_capacity(heap->v))
        return array_add(heap->v, element);
    return array_add_at(heap->v, element, pos);
}
I'm getting a seg fault when array_add_at(heap->v, element, pos) is executed.
(gdb) p ar->capacity
$7 = 8
(gdb) n
Program received signal SIGSEGV, Segmentation fault.
0x00007fff8fb64522 in _platform_memmove$VARIANT$Nehalem () from /usr/lib/system/libsystem_platform.dylib
(gdb) bt
#0 0x00007fff8fb64522 in _platform_memmove$VARIANT$Nehalem () from /usr/lib/system/libsystem_platform.dylib
#1 0x00000001000016c1 in array_add_at (ar=0x100104a70, element=0x7fff5fbffb54, index=1) at ../src/array.c:184
#2 0x00000001000012b3 in heap_insert (heap=, element=) at ../src/heap.c:97
#3 0x0000000100000f69 in test_heap_insert () at heap_test.c:23
#4 main (argc=, argv=) at heap_test.c:12
(gdb) l
179 if (ar->size == ar->capacity && !expand_capacity(ar))
180 return false;
181
182 size_t shift = (ar->size - index) * sizeof(void*);
183
184 memmove(&(ar->buffer[index + 1]),
185 &(ar->buffer[index]),
186 shift);
187
188 ar->buffer[index] = element;
So, the question I have here is: what is the expected behavior of array_add_at()? I see the initial capacity is 8, the size is 0, and I'm trying to insert an element at position 1. It should go through, right? Somehow, the call to memmove() fails :(
In the function hashtable_new_conf in hashtable.c, I found:
table->buckets = conf->mem_calloc(table->capacity, sizeof(TableEntry));
This confuses me. The type of table->buckets is TableEntry**. Why not just use:
table->buckets = conf->mem_calloc(table->capacity, sizeof(TableEntry*));
I modified the former to the latter and ran the tests, and everything still works!
There are potential memory leaks in functions like the one below: if the caller provides 'out', they can free the element afterwards; but if not, the memory will leak.
enum cc_stat slist_iter_remove(SListIter *iter, void **out)
{
    if (!iter->current)
        return CC_ERR_VALUE_NOT_FOUND;

    void *e = unlink(iter->list, iter->current, iter->prev);
    iter->current = NULL;
    iter->index--;

    if (out)
        *out = e;

    return CC_OK;
}
It might be convenient for collections to have a fold/reduce function so that you may do something like this:
void fn(void *e1, void *e2, void *result)
{
    int el1 = *((int*)e1);
    int el2 = *((int*)e2);
    *((int*) result) = el1 + el2;
}
...
// accumulate the values of the array into result
int result;
array_reduce(array, fn, &result);
The computation of the number of blocks to move in array_remove is wrong. Whenever the array is full, and memory is moved, there is an extra 8 bytes that are moved inside the array's buffer.
A pull request is incoming to fix that.
The code shouldn't assume that memory allocation never fails.
When the size of the array is 0, the variable j will underflow and become very large. When the function is then called, a segmentation fault will be triggered.
What could be the possible reason for a HashTable iterator's while loop body executing even though hashtable_size is reported as 0?
It must be something I'm doing wrong, as a simplified test doesn't reproduce the issue: the while loop body is not executed on an empty HashTable inside the test.
Code structure is roughly like this:
struct MyStruct {
...
HashTable *my_table;
};
extern struct MyStruct **my_struct;
void init_data() {
my_struct = (struct MyStruct **) shm_malloc(sizeof(struct MyStruct *));
(*my_struct) = (struct MyStruct *) shm_malloc(sizeof(struct MyStruct));
memset(*my_struct, 0, sizeof(struct MyStruct));
...
(*my_struct)->my_table = NULL;
HashTableConf htc;
hashtable_conf_init(&htc);
htc.mem_alloc = &shm_malloc_func;
htc.mem_calloc = &shm_calloc_func;
htc.mem_free = &shm_free_func;
if (hashtable_new_conf(&htc, &((*my_struct)->my_table)) != CC_OK) {
LM_ERR("Failed to initialise hashtable!\n");
}
}
And the part of the code that iterates over the hashtable:
// Code that accesses the hashtable
HashTableIter hti;
hashtable_iter_init(&hti, (*my_struct)->my_table);
printf("table size = %i\n", (int) hashtable_size((*my_struct)->my_table));
printf("table capacity = %i\n", (int) hashtable_capacity((*my_struct)->my_table));
TableEntry *entry;
while (hashtable_iter_next(&hti, &entry) != CC_ITER_END) {
// do something
}
As mentioned, table size is reported as 0 and capacity as 16.
Does anyone have any idea what might be going wrong?
Issue repro steps
Add 4 numbers 1, 2, 3, 4 to a deque.
Apply deque_remove_first on the deque.
Get the first number using deque_get_first. (We get 2, as expected.)
But if we apply deque_get_buffer on the deque and read buff[0], we get 1.
Root cause
enum cc_stat deque_remove_first(Deque *deque, void **out)
{
    if (deque->size == 0)
        return CC_ERR_OUT_OF_RANGE;

    void *element = deque->buffer[deque->first];
    deque->first = (deque->first + 1) & (deque->capacity - 1);
    deque->size--;

    if (out)
        *out = element;

    return CC_OK;
}
In the above code, we only advance the first index by one step; we don't move the buffer data, which causes the confusion. deque_get_first reads the value indexed at first, not buff[0].
Suggestion for fix
We could return the buffer starting from first in the function deque_get_buffer:
const void* const *deque_get_buffer(Deque const * const deque)
{
return (const void* const*) (deque->buffer + deque->first);
}
Another way is to memmove the data in deque_remove_first itself.
Let me know your thoughts on this @srdja!
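A toy demonstration of the behavior described above (not the library's Deque, just a stand-in power-of-two circular buffer): removing the first element only advances the first index, so buffer[0] keeps its old value while buffer[first] holds the logical front.

```c
#include <assert.h>
#include <stddef.h>

#define CAP 4 /* power of two, so wrap-around is a bit mask */

/* Stand-in circular buffer state after adding 1, 2, 3, 4. */
int buffer[CAP] = {1, 2, 3, 4};
size_t first = 0, size = 4;

/* Remove the logical front: only the index moves, not the data. */
static int remove_first(void)
{
    int e = buffer[first];
    first = (first + 1) & (CAP - 1);
    size--;
    return e;
}

/* The logical front lives at buffer[first], not buffer[0]. */
static int get_first(void)
{
    return buffer[first];
}
```

After one remove_first(), get_first() yields 2 while buffer[0] still contains 1, which is exactly the mismatch a caller of deque_get_buffer observes.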
Would you like to replace the double quotes around file names in include statements with angle brackets?
Collections should also have a *_contains function that looks for values instead of pointers.
bool equals(void *e1, void *e2) {
    int el1 = *((int*)e1);
    int el2 = *((int*)e2);
    if (el1 == el2) return true;
    return false;
}
...
int a = 10;
size_t count = array_contains(array, &a, equals);
I once worked with the bitset of C++; it's quite handy in competitive coding and is also useful for work related to OS studies. So I think it should be implemented.
The documentation gives some hints that the default hashtable and hashset implementations assume C strings as keys. However, the "official" documentation uses some defines that have been removed in the meantime, i.e. in 9b4c3cb. As of now, using the hashing data structures with anything but strings as keys is rather a PITA. There is also no test case that would show how much boilerplate code is required (in comparison to string keys).
I propose re-adding the lost comparison functions and defines, and adding new explicit functions that initialize hashsets and hashtables for use with pointers and strings (and maybe even a generic version, if it makes sense) respectively, e.g. hashtable_new_str and hashtable_new_ptr.
I think memory allocated to the buckets is unnecessarily large:
table->buckets = conf->mem_calloc(table->capacity, sizeof(TableEntry));
should be changed to:
table->buckets = conf->mem_calloc(table->capacity, sizeof(TableEntry *));
The same goes for the resize case.
Hash tables are an excellent data structure for amortized constant time operations. But, calculating the hash is not actually O(1). It's more like O(h), where h is the length of the hashed key.
With a ternary search tree, it's possible to get similar results to those obtained from a Hash Table without the need of resizing the table, or having extra unused entries as with the hash table.
Like the standard C headers do:
#ifdef __cplusplus
extern "C" {
#endif
Small typo at this line: NLLL should be NULL?
The declarations in the C header files should be wrapped in extern "C" for use from C++, shouldn't they?
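A conventional pattern, for illustration (the guard name and function are made up): the declarations compile unchanged under a C compiler, while a C++ compiler wraps them so the symbols keep C linkage.

```c
#ifndef COLLECTIONS_C_EXAMPLE_H
#define COLLECTIONS_C_EXAMPLE_H

/* __cplusplus is only defined by C++ compilers, so a C build
 * sees plain declarations while C++ gets C linkage. */
#ifdef __cplusplus
extern "C" {
#endif

void example_function(int x);

#ifdef __cplusplus
} /* extern "C" */
#endif

#endif /* COLLECTIONS_C_EXAMPLE_H */
```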
I would like to see a list which is basically a linked list, but with the ability to add callback functions for the following actions:
Similar to: ObservableCollections in C#
I would be willing to implement this.
What do you think? Would you like to see it implemented in the default linked list or in a separate structure?
hashtable.c line 185
enum cc_stat hashtable_add(HashTable *table, void *key, void *val)
{
...
return true;
...
}
I wonder which cc_stat value would be returned... (probably 1).
The function does not return the removed element though the description says it should:
/**
* Removes the last returned element by <code>treeset_iter_next()</code>
* function without invalidating the iterator and optionally sets the
* out parameter to the value of the removed element.
*
* @note This Function should only ever be called after a call to <code>
* treeset_iter_next()</code>.
*
* @param[in] iter The iterator on which this operation is performed
* @param[out] out Pointer to where the removed element is stored, or NULL
* if it is to be ignored
*
* @return CC_OK if the element was successfully removed, or
* CC_ERR_KEY_NOT_FOUND.
*/
void treeset_iter_remove(TreeSetIter *iter)
{
treetable_iter_remove(&(iter->i));
}
The same applies to
void treetable_iter_remove(TreeTableIter *iter)
{
remove_node(iter->table, iter->current);
}
What is the desired behavior?
Hello,
I think there is an issue with the ring buffer capacity. The buffer is allocated in the following way:
ringbuf->buf = rconf->mem_calloc(rconf->capacity, sizeof(Rbuf))
I am pretty sure it should be sizeof(uint64_t) instead. Right now, for a ring buffer of capacity 10, instead of allocating 80 bytes, it allocates 640 bytes!
It is also possible to call rbuf_peek(rbuf, 79) on a ring buffer of capacity 10.
If I am right, I'll happily submit a PR to fix this.
Does the library provide a vector data structure? By a vector, I mean a data structure similar to the C++/STL vector, i.e. a right-extensible array containing elements placed in contiguous storage that you can access by integer indexing.
The library provides a deque data structure; however, its elements (not their addresses) are not placed in a contiguous sequence of bytes.
EDIT The issue is solved since vectors are implemented in array.c.
EDIT 2
After checking more closely, it appears that an array object is not a true container: the array stores only a reference to your data element:
enum cc_stat array_add(Array *ar, void *element)
{
    if (ar->size >= ar->capacity) {
        enum cc_stat status = expand_capacity(ar);
        if (status != CC_OK)
            return status;
    }
    ar->buffer[ar->size] = element;
    ar->size++;
    return CC_OK;
}
And the hash table has the same problem. I see few use cases where this kind of implementation could be of interest.
Different statuses are returned in the same situation, namely when an iterator reaches the end of the array:
* @return CC_OK if a next element pair is returned, or CC_ITER_END if the end of one
* of the arrays has been reached.
*/
enum cc_stat array_zip_iter_next(ArrayZipIter *iter, void **out1, void **out2)
* @return CC_OK if the element was successfully removed, or CC_ERR_OUT_OF_RANGE.
*/
enum cc_stat array_zip_iter_remove(ArrayZipIter *iter, void **out1, void **out2)
Which status should be used to be consistent? I'd suggest using CC_ITER_END whenever an iterator is involved.
@srdja I am curious if you would entertain the idea of using cmake instead of make/autotools.
The advantage would be a little bit more modern build system, integration with testing frameworks, possibly building .rpm/.deb easier.
Speaking of testing frameworks, I was wondering why not use something like CppUTest instead of rolling your own.
I understand the portability aspect of autotools, but cmake is readily available on many platforms.
/**
* Returns the number of occurrences of the element within the specified array.
*
* @param[in] ar the array that is being searched
* @param[in] element the element that is being searched for
*
* @return the number of occurrences of the element
*/
size_t array_contains(Array *ar, void *element)
{
    size_t o = 0;
    size_t i;
    for (i = 0; i < ar->size; i++) {
        if (element == ar->buffer[i])   <------------ this line
            o++;
    }
    return o;
}
In array.c you are comparing only the addresses, not the actual data; you need a comparison function as an argument to array_contains.
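A hedged sketch of the suggested variant; buffer and size stand in for the Array's internals, array_contains_value is a proposed name, and the comparator follows the qsort(3) convention.

```c
#include <assert.h>
#include <stddef.h>

/* Counts elements that compare equal by value, not by address.
 * Proposed signature, not an existing library function. */
static size_t array_contains_value(void **buffer, size_t size,
                                   void *element,
                                   int (*cmp)(const void *, const void *))
{
    size_t o = 0;
    for (size_t i = 0; i < size; i++)
        if (cmp(buffer[i], element) == 0)
            o++;
    return o;
}

/* qsort-style comparator for int elements. */
static int int_cmp(const void *a, const void *b)
{
    int x = *(const int *)a, y = *(const int *)b;
    return (x > y) - (x < y);
}
```

Passing the comparator keeps the pointer-equality behavior available too: a comparator that compares the addresses themselves reproduces the old semantics.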
@srdja
This fixes an out-of-bounds lookup, which results in a segfault when popping/heapifying:
diff --git a/src/pqueue.c b/src/pqueue.c
index 0293922..875d40c 100644
--- a/src/pqueue.c
+++ b/src/pqueue.c
@@ -306,13 +306,13 @@ static void pqueue_heapify(PQueue *pq, size_t index)
size_t R = CC_RIGHT(index);
size_t tmp = index;
+ if (L >= pq->size || R >= pq->size)
+ return;
+
void *left = pq->buffer[L];
void *right = pq->buffer[R];
void *indexPtr = pq->buffer[index];
- if (L >= pq->size || R >= pq->size)
- return;
-
if (pq->cmp(indexPtr, left) < 0) {
indexPtr = left;
index = L;