libk  kconf.md at [de2d78ff77]

File mod/kconf/kconf.md artifact 6e72a9b762 part of check-in de2d78ff77


# kconf

while there are a number of existing configuration parser libraries out there, they all have their problems, and all depend on libc. since configuration string parsing is a fairly core functionality that would force many programs to either write their own code or drag libc back into their dependencies, supplying a basic parser with libk makes sense. it may also make sense at some point to add support for a simple binary configuration format, to make it easier to marshal out complex runtime data structures.

# basic principles

kconf is intended to be fast, lightweight, transparent, and low-overhead.

to initialize a kconf structure, we begin by supplying a list of *atoms.* like in Xlib, an atom is a performant way to reference a known string of text from a compiled program. we can do this in two different ways, either by generating the values ourselves at compile time with an enum, or by generating them at runtime using the appropriate kconf interface. note that if you supply your own with an enum, because zero is used for in-band error signalling, the first atom should always be an error token. alternately, it may be explicitly set to a non-zero number and a return value of zero can be checked for implicitly or against a literal 0.

note that, in a recurring libk pattern, if an element count of zero (or `null`) is passed to a function that takes an array of nullables, that function will treat its array argument as a null-terminated array to be counted at runtime. to actually pass an empty array, the pointer to the array itself should be `null`.

kconf has two modes: key-value mode, a very lightweight mode suited for simple applications that only require a few parameters to be specialized, and record mode, which is heavier-weight but allows retrieving and working with complex datastructures.

in key-value mode, kconf is initialized by filling out the appropriate `kconf` struct with the parameters required and then passing it to the appropriate function.

    enum atoms { parse_error, user, pw, email, atoms_count };
    kconf_kv kc = {
    	.decl = { atomct, {
    		{ user,     {4, "user"    }, kconf_string },
    		{ password, {8, "password"}, kconf_string },
    		{ age,      {5, "age"     }, kconf_u8 }
    	}}
    };

	/* with runtime counts:
     * enum atoms { parse_error, user, pw, email };
     * kconf_kv kc = {
     * 	.decl = { null, {
     * 		{ user,     {0, "user"    }, kconf_string },
     * 		{ password, {0, "password"}, kconf_string },
     * 		{ age,      {0, "age"     }, kconf_u8 },
	 *		{ null }
     * 	}}
     * }; */

	/* with macros:
     * enum atoms { parse_error, user, password, email };
     * kconf_kv kc = {
     * 	.decl = {Kmpsa(kconf_decl, {
	 * 		Kconf_atom(user, string),
	 * 		Kconf_atom(password, string),
	 * 		Kconf_atom(age, u8)
	 * 	})};
	 * }; */

when invoked, `kconf_kv_read` populates an array with values read from the config file, and returns a structure containing a status report. kconf can be invoked on a kfile handle (with `kconf_kv_parse_file()`) or a string in memory (with `kconf_kv_parse_string()`), but in the normal course of usage, you'll simply use the `kconf_kv_read()` function, which automatically loads a configuration file from the appropriate location for the system (such as `$XDG_CONFIG_HOME/app` on linux or `~/Library/app preferences` on Mac OS X, where 'app' is the name the binary was invoked with. conveniently, this enables the user to have different 'profiles' simply by creating symbolic links to the binary).

    kconf_val conf[atoms_count];
    kconf_result kr = kconf_read(kc, conf);
    struct { ksraw username, pw; u8 age; } user;
    user.username = conf[user].string;
    user.pw = conf[password].string;
    user.age = conf[age].u8;

kconf record mode works somewhat differently. rather than access the values directly, you'll call the `kconf_rec_read()` function (or `kconf_rec_parse_file()` / `kconf_rec_parse_string()`) to retrieve a struct of type `kconf_digest,` which can then be queried with the `kconf_query()` function. kconf_query takes two arguments, a `struct kconf_digest*` and a `kconf_path.`

    kconf_digest db = kconf_rec_read();
    ksraw user_six_name = kconf_query(&db, (kconf_path){"user", 6, "name", 0} ).val.string;
    /* or with macros:
     * ksraw user_six_name = kconf_query(&db, Kconf_path("user", 6, "name")).val.string;
     */

# types
 * `typedef kconf_path kconf_path_elt[]` - a `kconf_path` is used to unambiguously identify a single field in a `kconf_digest`. it consists of a zero-terminated array of names and machine word-size integers. for instance, the path `(kconf_path){"user", 6, "name", 0}` can be used to identify the name of the sixth member of the "user" array. note that lists are **1-indexed**, to prevent collision with the list terminator.

## struct kconf_kv

## struct kconf_rec

## struct kconf_digest

## enum kconf_type
  * `string`- a string, represented as a ksraw
  * `bool` - a boolean value, either true or false
  * `file` - a file, represented by a filename in system-local format. the file will be opened and a reference to it will be returned in a `kfile` object.
  * additionally, kconf_type will have a member for every kind of machine integer the architecture supports. these members will be named according to libk integer types, so if the value a signed 16-bit integer, you would name its type as `s16`.
 
## struct kconf_atom
 * `sz key`
 * `ksraw string`
 * `kconf_type type`

## struct kconf_result
  * `enum kconf_cond cond`
  
## struct kconf_answer
 * `enum kconf_cond cond`
 * `union kconf_val val`
  
## union kconf_val
  * kconf_val is a union containing fields that match the enumerations in `kconf_type` - for instance, `ksraw string,` `bool bool,` `kfile file,` or `s16`.

## union kconf_path\_elt
 * `const char* name`
 * `sz index`

# functions
 * `struct kconf_result kconf_kv_read()`
 * `struct kconf_result kconf_kv_parse_file()`
 * `struct kconf_result kconf_kv_parse_string()`
 * `struct kconf_digest kconf_rec_read()`
 * `struct kconf_digest kconf_rec_parse_file()`
 * `struct kconf_digest kconf_rec_parse_string()`
 * `struct kconf_answer kconf_query()`

# macros

 * `Kconf_atom(id, atom, type)` - a convenience macro for definining an atom structure (`kconf_atom`). inserts the text `{id, {sizeof(#atom), #atom}, kconf_##type}`
 * `Kconf_atomi(atom, type)`- equivalent to `Kconf_atom(atom,atom,type)`
 * `Kconf_atomp(atom,type)` - a convenience macro for definining an pre-defined atom structure (`kconf_atom`), under the assumption that the C enumerator name is the string used to name the field in the configuration file with a prefix attached. define the prefix with `#define KVconf_prefix` before use. inserts the text `{KVconf_prefix##atom, {sizeof(#atom), #atom}, kconf_##type}`
 * `Kconf_path(path...)` - provides a slightly nicer path syntax than a zero-terminated compound literal (but still requires compound literal support). inserts the text `((kconf_path) {path, 0})`

# file format