Overview
Comment: | add documentation compilation |
---|---|
Downloads: | Tarball | ZIP archive | SQL archive |
Timelines: | family | ancestors | descendants | both | trunk |
Files: | files | file ages | folders |
SHA3-256: |
8d6d792515273e92a2fa145eeb768cb2 |
User & Date: | lexi on 2019-08-19 22:40:04 |
Other Links: | manifest | tags |
Context
2019-08-19
| ||
23:57 | add detection loop for syscall headers check-in: c7732c41c9 user: lexi tags: trunk | |
22:40 | add documentation compilation check-in: 8d6d792515 user: lexi tags: trunk | |
05:42 | switch over to new build mechanism and formally deprecate makefiles check-in: 34c625a47b user: lexi tags: trunk | |
Changes
Modified build.sh from [e19ff3649f] to [669569b504].
47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 ... 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 ... 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 |
*) say "cannot determine correct binary format to use for target $target"; exit 1;; esac # first, we establish the # parameters of the build has cc && _cc=cc has clang && _cc=clang has gcc && _cc=gcc cc=${cc:-$_cc} has m4 && _m4=m4 m4=${m4:-$_m4} has nasm && asm=nasm has yasm && asm=yasm export gen=${gen:-gen} library=${libkind:-static} # {static|shared|both} case $library in static) build_static_library=yes build_shared_library=no;; shared) build_static_library=no build_shared_library=yes;; ................................................................................ # generate errno tables grep -h "#[ ]*define[ ]\+E" $p_headers_errno | sed 's;^#[\t ]*define[\t ]\+\(E[A-Z0-9]\+\).*$;k_platform_error_\1 \1;' > $gen/error_names.tbl cat $p_headers_errno $gen/error_names.tbl | cpp -P >$gen/error_numbers.tbl awk -f arch/errtbl.awk <$gen/error_numbers.tbl >$gen/error_table.h # generate symbol tables for error handling functions mkdir -p $out/k awk -f global/gen-conds.awk <global/modules >$out/k/internal.egroup.h awk -f global/gen-ident.awk <global/modules >$gen/internal.ident.c comp_co $gen/internal.ident.c $out/internal.ident.o # first pass: copy all headers into place, # including ones we need to generate ................................................................................ comp_mac "$h" "$out/k/$dest" done done # second pass: generate manpage, html, and pdf # versions of the documentation from the md src mkdir -p $out/doc/{man,html,pdf} global/build-manpage.sh libk.md for mod in ${modules[@]}; do for doc in $(scan $mod '*.md'); do base="$(basename $doc)" stem="${base%%.md}" global/build-manpage.sh "$doc" global/build-html.sh "$doc" global/build-pdf.sh "$doc" done done # third pass: compile sources and save the # resulting object files to $out, tracking # which is a runtime or function unit. exe's # will not be compiled until a later pass fn_objects=() |
| | > > > > > | | | | | | | | < < | | > |
47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 ... 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 ... 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 |
*) say "cannot determine correct binary format to use for target $target"; exit 1;; esac # first, we establish the # parameters of the build has cc && _cc=cc has clang && _cc=clang has gcc && _cc=gcc # prefer gcc cc=${cc:-$_cc} has m4 && _m4=m4 m4=${m4:-$_m4} has nasm && asm=nasm has yasm && asm=yasm # prefer yasm export gen=${gen:-gen} library=${libkind:-static} # {static|shared|both} doc=${doc:-yes} export doc_html=${doc_html:-yes} export doc_pdf=${doc_pdf:-yes} export doc_man=${doc_man:-yes} case $library in static) build_static_library=yes build_shared_library=no;; shared) build_static_library=no build_shared_library=yes;; ................................................................................ # generate errno tables grep -h "#[ ]*define[ ]\+E" $p_headers_errno | sed 's;^#[\t ]*define[\t ]\+\(E[A-Z0-9]\+\).*$;k_platform_error_\1 \1;' > $gen/error_names.tbl cat $p_headers_errno $gen/error_names.tbl | cpp -P >$gen/error_numbers.tbl awk -f arch/errtbl.awk <$gen/error_numbers.tbl >$gen/error_table.h # generate symbol tables for error handling functions mkdir -p "$out/k" awk -f global/gen-conds.awk <global/modules >$out/k/internal.egroup.h awk -f global/gen-ident.awk <global/modules >$gen/internal.ident.c comp_co $gen/internal.ident.c $out/internal.ident.o # first pass: copy all headers into place, # including ones we need to generate ................................................................................ comp_mac "$h" "$out/k/$dest" done done # second pass: generate manpage, html, and pdf # versions of the documentation from the md src if test "$doc" = "yes"; then global/build-manpage.sh libk.md for mod in ${modules[@]}; do for doc in $(scan $mod '*.md'); do base="$(basename $doc)" stem="${base%%.md}" global/build-manpage.sh "$doc" done done fi # third pass: compile sources and save the # resulting object files to $out, tracking # which is a runtime or function unit. exe's # will not be compiled until a later pass fn_objects=() |
Deleted global/build-html.sh version [045bd446a1].
1 2 3 4 5 6 7 8 9 10 11 |
#!/usr/bin/env bash (test -d global && test -f build.sh) || { echo >&2 "($0) run [1m$me[21m from root of [1mlibk[21m source directory" exit 1 } . global/common.sh reqpack cmark "generate documentation" say "facility under construction" exit 0 |
< < < < < < < < < < < |
Modified global/build-manpage.sh from [b795aa8a1c] to [790e90297c].
2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 |
(test -d global && test -f build.sh) || { echo >&2 "($0) run [1m$me[21m from root of [1mlibk[21m source directory" exit 1 } . global/common.sh reqpack cmark "generate documentation" file="$1" filename="$(basename "$file")" stem=${filename%%.md} mandest="$out/doc/man" section=${section:-4} out="$mandest/$stem.$section" desc_rec="$(grep -m1 "^$stem:" global/modules)" if test "$desc_rec" != ""; then desc=" - $(echo $desc_rec | cut -d: -f4)" else desc="" fi date="$(date -r"$file" "+%d %A %Y")" mkdir -p "$mandest" echo >"$out" ".TH ${stem^^} \"$section\" \"$date\" \"hale.su libk [r$build]\" \"modules\"" echo >>"$out" ".SH NAME" echo >>"$out" "$stem$desc" echo >>"$out" ".SH SYNOPSIS" dhead=$(grep -m1 -n "^# description$" "$file") if test $? -eq 0; then descline=$(echo "$dhead" | cut -d: -f1) offset=1 else descline=$(grep -m2 -n "^#" "$file" | tail -n1 | cut -d: -f1) offset=0 fi tail -n+2 $file | head -n$(expr $descline - 2) | cmark -t man >>"$out" echo >>"$out" ".SH DESCRIPTION" tail -n+$(expr $descline + $offset) "$file" | cmark -t man >> "$out" if has gzip; then gzip -f "$out" out="$out.gz" fi say "wrote manpage for $stem to $out" |
| > | > > > | | > > > > < | | | | | | | > > > > > > > > > > > > > > > | < < > > > > | | > |
2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 |
(test -d global && test -f build.sh) || { echo >&2 "($0) run [1m$me[21m from root of [1mlibk[21m source directory" exit 1 } . global/common.sh reqpack cmark "generate documentation" check gen "a directory for generated build dependencies" file="$1" filename="$(basename "$file")" stem=${filename%%.md} mandest=${out_manpage:-"$out/doc/man"} htmldest=${out_html:-"$out/doc/html"} pdfdest=${out_pdf:-"$out/doc/pdf"} section=${manual_section:-4} man="$mandest/$stem.$section" html="$htmldest/$stem.html" pdf="$pdfdest/$stem.pdf" fmt="$gen/$stem.tmp" desc_rec="$(grep -m1 "^$stem:" global/modules)" if test "$desc_rec" != ""; then desc=" - $(echo $desc_rec | cut -d: -f4)" else desc="" fi date="$(date -r"$file" "+%d %A %Y")" echo >"$fmt" ".TH ${stem^^} \"$section\" \"$date\" \"hale.su libk [r$build]\" \"modules\"" echo >>"$fmt" ".SH NAME" echo >>"$fmt" "$stem$desc" echo >>"$fmt" ".SH SYNOPSIS" dhead=$(grep -m1 -n "^# description$" "$file") if test $? -eq 0; then descline=$(echo "$dhead" | cut -d: -f1) offset=1 else descline=$(grep -m2 -n "^#" "$file" | tail -n1 | cut -d: -f1) offset=0 fi tail -n+2 $file | head -n$(expr $descline - 2) | cmark -t man >>"$fmt" echo >>"$fmt" ".SH DESCRIPTION" tail -n+$(expr $descline + $offset) "$file" | cmark -t man >> "$fmt" test "$doc_html" = "yes" && { mkdir -p "$htmldest" groff -Thtml -Kutf8 -m man "$fmt" > "$html" say "wrote html page for $stem to $html" } test "$doc_pdf" = "yes" && { mkdir -p "$pdfdest" groff -Tpdf -Kutf8 -m man "$fmt" > "$pdf" say "wrote pdf for $stem to $pdf" } test "$doc_man" = "yes" && { mkdir -p "$mandest" if has gzip; then man="$man.gz" gzip -c -f "$fmt" > "$man" else mv "$fmt" "$man" fi say "wrote manpage for $stem to $man" } |
Deleted global/build-pdf.sh version [045bd446a1].
1 2 3 4 5 6 7 8 9 10 11 |
#!/usr/bin/env bash (test -d global && test -f build.sh) || { echo >&2 "($0) run [1m$me[21m from root of [1mlibk[21m source directory" exit 1 } . global/common.sh reqpack cmark "generate documentation" say "facility under construction" exit 0 |
< < < < < < < < < < < |
Modified libk.md from [debe7693f6] to [a982938576].
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 .. 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 ... 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 ... 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 |
# libk libk is intended as a modernized replacement *(not* reimplementation) for libc. ## manifesto normally, all C binaries (and binaries from other languages, depending on the platform) use a combination of libraries to get things done: POSIX libraries (interfaces common to UNIX-like operating systems) and libc, the C standard library. unlike POSIX, libc is part of the C language -- it's a standardized interface to various critical parts of the operating system, things like IO, system clock access, random number generation, and more. it's also a piece of shit. libc is ancient, and it shows. it contains decades worth of cruft, masses of different interfaces with completely different design, horrible hacks to get around the fundamental shifts in basic computer architecture that have occurred over the past half-century, and vendor-specific extensions that make porting code a nightmare. using it is painful, tedious, error-prone, and unsafe. for various reasons, there are many different implementations of libc, but all of them have that same broken, bloated interface in common. as far as i can tell, there are been no serious attempts to create an actual *alternative* to libc - a new system interface that takes into account the decades of painful lessons we programmers have learned since the heydays of UNIX. hence, libk. libk aims to offer a better, safer, and most importantly, less unpleasant foundation for modern code in C or any other language. it also aims to be much smaller, simpler, and faster than glibc to build so that there's no arduous bootstrapping process for new architectures. currently, the only dependency on libc in any form is `arch/typesize.c`, a small binary tool which uses libc IO routines to print type information it calculates; however, this could also be augmented to use POISX IO routines, or even potentially libk IO routines to remove any external dependency at all -- the work would be nontrivial, but fully feasible. further, the file it creates can also _in extremis_ be created by hand. the final compiled libc binaries and headers do not depend on or reference libc in any way; typesize is only a makedepend. # goals libk's goals are far-reaching, and suggestions are welcome. note however that libk is *not* intended to be a kitchen-sink library like libiberty. it's meant to do one thing, and to it well: to provide an easy- and pleasant-to-use foundation for modern open source projects. below is a list of some of the project's major goals. 1. **IO.** libc's basic input/output mechanisms are dreadful, built at entirely the wrong level of abstraction. libk is intended to make many more primitives available to the user, and offer a sliding scale of abstraction so libk is suitable for a wide range of needs. 2. **file manipulation.** libc's file manipulation primitives are a relic of a bygone age and in dire need of upgrading. 3. **terminal manipulation.** libc has no provision for simple output formatting, a task that requires a combination of ANSI codes and in some cases pty manipulation with POSIX APIs, both of which are somewhat dark wizardry. this situation forces many innocent coders to drag in the entire unholy bulk of the aptly named library `ncurses`, much of whose code has been utterly obsolete for the last twenty years and whose API is one of the most singularly hateful ones in existence. libk therefore should offer a simple, straightforward way to do gracefully-degrading terminal sorcery. 4. **memory management.** the single memory management function `malloc()` provided by libc is absolutely pitiful. this is 2019. modern applications have much more exotic allocation needs, and a standard library should offer a range of allocators and management techniques, as well as abstract pointer objects so that pointers to objects of different allocation types (including static or stack allocation!) can be mixed freely and safely. 5. **intrinsic reentrancy.** because *jesus christ,* libc. 6. **interprocess communication.** libc offers no useful IPC abstractions over the paltry array of tools POSIX &co. give us to work with. we can do better. 7. **tooling.** libk is intended as more than just a library. it's also intended to work with some basic tooling to automate tasks that current binary tooling is inadequate for -- for instance, embedding binary data into a program binary. (see module [kgraft](kgraft)) 8. **modularity.** libk is not part of the C specification and it isn't always going to be practical for developers to expect the entire library to be present on the end-user's computer. so libk is designed to be usable in many different ways -- as a traditional library, as a static library, in full form or with only components needed by the developer, to be distributed either on its own or as part of a binary. 9. **compatibility.** code that links against libk should be able to compile and run on any operating system. in the ideal case (Linux or FreeBSD) it will be able to do so without touching any other system libraries; for less ideal environments like Windows, libk will when necessary abstract over system libraries or libc itself. 10. **sane error-handling.** every time you type `errno` god murders a puppy. # dependencies libk is designed to be as portable and depedency-free as possible. ideally, it will be possible to compile code against libk using nothing but libk itself. compiling libk is also designed to be as easy as possible. it has only two external dependencies, the macro processor [m4], needed for compile-time header generation, and the [GNU make] utility, whose advanced features are needed to perform the relatively complex task of building all of libk from the ground up. [m4]: http://www.gnu.org/software/m4 [GNU make]: http://www.gnu.org/software/make a different macro processor, gpp, was used in early versions of libk, however, it was so obscure and took so much overly fragile infrastructure to make it work that the cleaner syntax just wasn't worth it; i've since deleted the gpp infra and ported the macro files to m4. # naming conventions one of the most frustrating things about libc is its complete and total *lack* of a naming convention. in C, every function and global is injected into a single global namespace, including macros. this means that every libc header you include scatters words all over that namespace, potentially clobbering your function with a macro! ................................................................................ 3. capital letters are only used in macro prefixes. 4. low-level function names are prefixed with the API they call into. for example, the function that performs the POSIX syscall `write` is named `kio_posix_fd_write`. a wrapper around the Windows function `CreateProcess()` might be called `kproc_win_createprocess`. ## atoms libk uses the concept of "atoms" (small, regular strings of text) to standardize common references, such as operating systems or processor architectures. ### operating systems these atoms will be used to reference operating systems. * Linux: `lin` 1. aaaa * Haiku: `hai` * Android: `and` * FreeBSD: `fbsd` * NetBSD: `nbsd` * OpenBSD: `obsd` * Darwin/Mac OS X/iOS: `dar` * MS-DOS: `dos` * FreeDOS: `fdos` * Windows: `win` * Windows MinGW: `mgw` ### file extensions * C function implementations: `*.c` * C module headers: `*.h` * ancillary C headers: `*.inc.h` * assembly code: `*.s` ### arches these atoms will be used to reference particular system architectures. these will mostly be used in the filenames of assembly code. * Intel/AMD x86: `x86` * ARM: `arm` (aarch64 is specified by `os=arm bits=64`) * MIPS: `mips` * Itanium: `ia64` (no bits) * PowerPC: `ppc` ................................................................................ each module may contain any number of files of the name `*.exe.c`. this files will be treated as *tools* by the build system and compiled as executables, rather than libraries. they should be compiled to `out/$module.$tool` the repository root and each module may also contain the directory `misc`. this directory may be used to store miscellaneous data such as ABI references, developer discussions, and roadmaps. if the `misc` directory is deleted, this must not affect the library or build system's function in any way - that is, nothing outside a `misc` folder may reference a `misc` folder or anything inside it, including documentation. the `misc` directory should be removed when its contents are no longer needed. in most cases, the repository wiki and forum should be used instead of the `misc` folder. the folder `arch` in the root of the repository contains syscall tables and ABI implementations for various architectures. ## build sysem libk uses a very simple build system. the entire project is built with the `build.sh` script in the project root. `build.sh` need not be modified so long as all you're adding is functions, runtime stubs, headers, documentation files, or macro files in line with the standard naming convention. it will need to be modified to add or remove modules. it does not track dependencies, recompiling the entire project in one fell swoop. while in theory this could be slow and inefficient, in practice, it's not meaningfully slower on the average run than the old make-based system was. the shell-based build system was chosen for a number of reason. firstly, libk originally used a GNU make-based build system, but this was unwieldy and unreliably - make simply does not have the necessary capabilities to build a project of this nature. the original build system relied heavily on recursion, which is impossible to use while preserving idempotency. multithreaded building was likewise impossible. relying on make also had a number of subtler downsides. GNU make's capacities vary significantly across versions, and not all users will necessarily have access to the correct version in their repos. even a version as recent as make 4.1 couldn't build a project that build without issue in 4.2. given that libk is explicitly intended to be widely portable, this variance was alarming. make is also very indirect - it's a pain in the ass to tell it what to do sometimes, and it's well nigh on impossible to decipher the intricate way in which a sufficiently complex set of makefiles interlocks. this is both user- and developer-hostile. by contrast, a simple, linear build script with clearly delineated functions makes the build process much easier to understand, and therefore to tweak should a user need to modify some aspect of the process to compile `libk` on her system. finally, make simply has a smaller install base than `bash`. with the `build.sh` build system, more people can successfully build libk with less effort. any time you make a change to the build script, always specify precisely what changed in your commit log and what affects this might have on the surrounding code. be sure to comment any new code as thoroughly as you can. the goal is that anyone who is familiar with bash should be able to learn the build process simply by reading `build.sh`. # design principles there are four overriding principles that guide the design of libk. 1. it should be easy to write code that uses it. 2. it should be easy to read code that uses it. ................................................................................ rule_kind_undialectical, rule_action_expropriate, rule_target_bourgeoisie }; this makes code much more legible and has the added benefit of making the definitions easier to expand at a later date if new functionality is needed without breaking the API or ABI. ## build process to build libk, you must invoke `build.sh` with the proper parameters. at minimum you must set the following environment variables: * `os={atom}` - an atom representing the operating system you are building libk for * `arch={atom}` - an atom representing the processor architecture you are building libk for * `bits={atom}` - if your processor has multiple variants with different word lengths (such as x86-32 vs. x86-64), specify the word length in this variable; otherwise, leave it unset. further optional variables may be set to control the build process and determine what targets it produces. * `library=static {static|shared|both}` - this variable controls whether the build process will produce `libk.a`, `libk.so`, or both. * `out=out {path}` - an alternate path to store build artifacts in * `gen=gen {path}` - an alternate path to store generated source in * `cc=<autodetect> {executable}` - the compiler to compile C sources with * `m4=<autodetect> {executable}` - the m4 binary to compile the macro sources with * `asm=<autodetect> {executable}` - the assembler to assemble the assembly listings with. it must take Intel-syntax input and handle nasm-style macros. only `yasm` and `nasm` are likely to be viable. two other shell scripts complete the build system: * `install.sh` - installs compiled libraries, objects, documentation, and headers into the appropriate directories. ## authors so far, this is a one-woman show. contributions are welcome however. * lexi hale <lexi@hale.su> ## caveats the main coder, lexi hale, is first and foremost a writer, not a coder. this is a side-project of hers and will remain so unless it picks up a significant amount of attention. while MRs adding support for Windows, OS X, and other operating systems will be gratefully accepted, the maintainer is a Linux and FreeBSD developer, will not be writing such support infrastructure herself, and has limited ability even to vet code for those platforms. ## license libk is released under the terms of the [GNU AGPLv3](LICENSE). contributors do not relinquish ownership of the code they contribute, but agree to release it under the same terms as the overall project license. the AGPL may seem like an inappropriately restrictive license for a project with such grandiose ambitions. it is an ideological choice. i selected it because libk is intended very specifically as a contribution to the *free software* community, a community that i hope will continue to grow at the expense of closed-source ecosystems. i have no interest in enabling people or corporations to profit from keeping secrets, especially not with my own free labor (or anyone else's, for that matter). if you disagree with this philosophy, you are welcome to continue using libc. ## what does the k stand for? nothing. it was chosen in reference to libc - the letter C was part of the original roman alphabet, while K was added later by analogy to the Greek kappa ‹κ›. in my native language, the older letter ‹c› can make a number of different sounds based on context, including [k] and [s], while ‹k› is fairly consistently used for the sound [k]. and for orthographical reasons, [k] is often represented by the digraph ‹ck› - that is, a C followed by a K. hopefully the analogies are obvious. this project has nothing to do with KDE. |
| | > | | | < | | | | > > > > > > > > > > > > > > > > > > > > > > > < < < < < < < < < < < < < < < < < < < | | | | |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 .. 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 ... 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 ... 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 |
# libk libk is intended as a modernized replacement *(not* reimplementation) for libc. # manifesto normally, all C binaries (and binaries from other languages, depending on the platform) use a combination of libraries to get things done: POSIX libraries (interfaces common to UNIX-like operating systems) and libc, the C standard library. unlike POSIX, libc is part of the C language -- it's a standardized interface to various critical parts of the operating system, things like IO, system clock access, random number generation, and more. it's also a piece of shit. libc is ancient, and it shows. it contains decades worth of cruft, masses of different interfaces with completely different design, horrible hacks to get around the fundamental shifts in basic computer architecture that have occurred over the past half-century, and vendor-specific extensions that make porting code a nightmare. using it is painful, tedious, error-prone, and unsafe. for various reasons, there are many different implementations of libc, but all of them have that same broken, bloated interface in common. as far as i can tell, there are been no serious attempts to create an actual *alternative* to libc - a new system interface that takes into account the decades of painful lessons we programmers have learned since the heydays of UNIX. hence, libk. libk aims to offer a better, safer, and most importantly, less unpleasant foundation for modern code in C or any other language. it also aims to be much smaller, simpler, and faster than glibc to build so that there's no arduous bootstrapping process for new architectures. currently, the only dependency on libc in any form is `arch/typesize.c`, a small binary tool which uses libc IO routines to print type information it calculates; however, this could also be augmented to use POISX IO routines, or even potentially libk IO routines to remove any external dependency at all -- the work would be nontrivial, but fully feasible. further, the file it creates can also _in extremis_ be created by hand. the final compiled libc binaries and headers do not depend on or reference libc in any way; typesize is only a makedepend. ## goals libk's goals are far-reaching, and suggestions are welcome. note however that libk is *not* intended to be a kitchen-sink library like libiberty. it's meant to do one thing, and to it well: to provide an easy- and pleasant-to-use foundation for modern open source projects. below is a list of some of the project's major goals. 1. **IO.** libc's basic input/output mechanisms are dreadful, built at entirely the wrong level of abstraction. libk is intended to make many more primitives available to the user, and offer a sliding scale of abstraction so libk is suitable for a wide range of needs. 2. **file manipulation.** libc's file manipulation primitives are a relic of a bygone age and in dire need of upgrading. 3. **terminal manipulation.** libc has no provision for simple output formatting, a task that requires a combination of ANSI codes and in some cases pty manipulation with POSIX APIs, both of which are somewhat dark wizardry. this situation forces many innocent coders to drag in the entire unholy bulk of the aptly named library `ncurses`, much of whose code has been utterly obsolete for the last twenty years and whose API is one of the most singularly hateful ones in existence. libk therefore should offer a simple, straightforward way to do gracefully-degrading terminal sorcery. 4. **memory management.** the single memory management function `malloc()` provided by libc is absolutely pitiful. this is 2019. modern applications have much more exotic allocation needs, and a standard library should offer a range of allocators and management techniques, as well as abstract pointer objects so that pointers to objects of different allocation types (including static or stack allocation!) can be mixed freely and safely. 5. **intrinsic reentrancy.** because *jesus christ,* libc. 6. **interprocess communication.** libc offers no useful IPC abstractions over the paltry array of tools POSIX &co. give us to work with. we can do better. 7. **tooling.** libk is intended as more than just a library. it's also intended to work with some basic tooling to automate tasks that current binary tooling is inadequate for -- for instance, embedding binary data into a program binary. (see module [kgraft](kgraft)) 8. **modularity.** libk is not part of the C specification and it isn't always going to be practical for developers to expect the entire library to be present on the end-user's computer. so libk is designed to be usable in many different ways -- as a traditional library, as a static library, in full form or with only components needed by the developer, to be distributed either on its own or as part of a binary. 9. **compatibility.** code that links against libk should be able to compile and run on any operating system. in the ideal case (Linux or FreeBSD) it will be able to do so without touching any other system libraries; for less ideal environments like Windows, libk will when necessary abstract over system libraries or libc itself. 10. **sane error-handling.** every time you type `errno` god murders a puppy. 11. **ease of distribution.** libk should enable to user to create completely static binaries, free of any local dependency and trivial to distribute. # dependencies libk is designed to be as portable and depedency-free as possible. ideally, it will be possible to compile code against libk using nothing but libk itself. compiling libk is also designed to be as easy as possible. it has only two external dependencies, the macro processor [m4], needed for compile-time header generation, and optionally the [commonmark] compiler `cmark`, which is only needed if you wish to typeset the documentation in manpage, html, or pdf format. this process also requires `groff`, but `groff` is a standard part of most UNIX systems. [m4]: http://www.gnu.org/software/m4 [commonmark]: http://www.commonmark.org/ a different macro processor, gpp, was used in early versions of libk, however, it was so obscure and took so much overly fragile infrastructure to make it work that the cleaner syntax just wasn't worth it; i've since deleted the gpp infra and ported the macro files to m4. # naming conventions one of the most frustrating things about libc is its complete and total *lack* of a naming convention. in C, every function and global is injected into a single global namespace, including macros. this means that every libc header you include scatters words all over that namespace, potentially clobbering your function with a macro! ................................................................................ 3. capital letters are only used in macro prefixes. 4. low-level function names are prefixed with the API they call into. for example, the function that performs the POSIX syscall `write` is named `kio_posix_fd_write`. a wrapper around the Windows function `CreateProcess()` might be called `kproc_win_createprocess`. ## atoms libk uses the concept of "atoms" (small, regular strings of text) to standardize common references, such as operating systems or processor architectures. **operating systems** these atoms will be used to reference operating systems. * Linux: `lin` * Haiku: `hai` * Android: `and` * FreeBSD: `fbsd` * NetBSD: `nbsd` * OpenBSD: `obsd` * Darwin/Mac OS X/iOS: `dar` * MS-DOS: `dos` * FreeDOS: `fdos` * Windows: `win` * Windows MinGW: `mgw` **file extensions** * C function implementations: `*.c` * C module headers: `*.h` * ancillary C headers: `*.inc.h` * assembly code: `*.s` **arches** these atoms will be used to reference particular system architectures. these will mostly be used in the filenames of assembly code. * Intel/AMD x86: `x86` * ARM: `arm` (aarch64 is specified by `os=arm bits=64`) * MIPS: `mips` * Itanium: `ia64` (no bits) * PowerPC: `ppc` ................................................................................ each module may contain any number of files of the name `*.exe.c`. this files will be treated as *tools* by the build system and compiled as executables, rather than libraries. they should be compiled to `out/$module.$tool` the repository root and each module may also contain the directory `misc`. this directory may be used to store miscellaneous data such as ABI references, developer discussions, and roadmaps. if the `misc` directory is deleted, this must not affect the library or build system's function in any way - that is, nothing outside a `misc` folder may reference a `misc` folder or anything inside it, including documentation. the `misc` directory should be removed when its contents are no longer needed. in most cases, the repository wiki and forum should be used instead of the `misc` folder. the folder `arch` in the root of the repository contains syscall tables and ABI implementations for various architectures. # build system libk uses a very simple build system. the entire project is built with the `build.sh` script in the project root. `build.sh` need not be modified so long as all you're adding is functions, runtime stubs, headers, documentation files, or macro files in line with the standard naming convention. it will need to be modified to add or remove modules. it does not track dependencies, recompiling the entire project in one fell swoop. while in theory this could be slow and inefficient, in practice, it's not meaningfully slower on the average run than the old make-based system was. the shell-based build system was chosen for a number of reason. firstly, libk originally used a GNU make-based build system, but this was unwieldy and unreliably - make simply does not have the necessary capabilities to build a project of this nature. the original build system relied heavily on recursion, which is impossible to use while preserving idempotency. multithreaded building was likewise impossible. relying on make also had a number of subtler downsides. GNU make's capacities vary significantly across versions, and not all users will necessarily have access to the correct version in their repos. even a version as recent as make 4.1 couldn't build a project that build without issue in 4.2. given that libk is explicitly intended to be widely portable, this variance was alarming. make is also very indirect - it's a pain in the ass to tell it what to do sometimes, and it's well nigh on impossible to decipher the intricate way in which a sufficiently complex set of makefiles interlocks. this is both user- and developer-hostile. by contrast, a simple, linear build script with clearly delineated functions makes the build process much easier to understand, and therefore to tweak should a user need to modify some aspect of the process to compile `libk` on her system. finally, make simply has a smaller install base than `bash`. with the `build.sh` build system, more people can successfully build libk with less effort. any time you change build script, always specify precisely what changed in your commit log and what affects this might have on the surrounding code. be sure to comment any new code as thoroughly as you can. the goal is that anyone who is familiar with bash should be able to learn the build process simply by reading `build.sh`. ## build process to build libk, you must invoke `build.sh` with the proper parameters. at minimum you must set the following environment variables: * `os={atom}` - an atom representing the operating system you are building libk for * `arch={atom}` - an atom representing the processor architecture you are building libk for * `bits={atom}` - if your processor has multiple variants with different word lengths (such as x86-32 vs. x86-64), specify the word length in this variable; otherwise, leave it unset. further optional variables may be set to control the build process and determine what targets it produces. * `library=static {static|shared|both}` - this variable controls whether the build process will produce `libk.a`, `libk.so`, or both. * `out=out {path}` - an alternate path to store build artifacts in * `gen=gen {path}` - an alternate path to store generated source in * `cc=<autodetect> {executable}` - the compiler to compile C sources with * `m4=<autodetect> {executable}` - the m4 binary to compile the macro sources with * `asm=<autodetect> {executable}` - the assembler to assemble the assembly listings with. it must take Intel-syntax input and handle nasm-style macros. only `yasm` and `nasm` are likely to be viable. * `doc=yes {yes|no}` - whether to typeset the documentation (very slow with all three formats set to "yes") * `doc_html=yes {yes|no}` - enable or disable html output of the documentation * `doc_pdf=yes {yes|no}` - enable or disable pdf output of the documentation * `doc_man=yes {yes|no}` - enable or disable manpage output of the documentation two other shell scripts complete the build system: * `install.sh` - installs compiled libraries, objects, documentation, and headers into the appropriate directories. # design principles there are four overriding principles that guide the design of libk. 1. it should be easy to write code that uses it. 2. it should be easy to read code that uses it. ................................................................................ rule_kind_undialectical, rule_action_expropriate, rule_target_bourgeoisie }; this makes code much more legible and has the added benefit of making the definitions easier to expand at a later date if new functionality is needed without breaking the API or ABI. # authors so far, this is a one-woman show. contributions are welcome however. * lexi hale <lexi@hale.su> # caveats the main coder, lexi hale, is first and foremost a writer, not a coder. this is a side-project of hers and will remain so unless it picks up a significant amount of attention. while MRs adding support for Windows, OS X, and other operating systems will be gratefully accepted, the maintainer is a Linux and FreeBSD developer, will not be writing such support infrastructure herself, and has limited ability even to vet code for those platforms. # license libk is released under the terms of the [GNU AGPLv3](LICENSE). contributors do not relinquish ownership of the code they contribute, but agree to release it under the same terms as the overall project license. the AGPL may seem like an inappropriately restrictive license for a project with such grandiose ambitions. it is an ideological choice. i selected it because libk is intended very specifically as a contribution to the *free software* community, a community that i hope will continue to grow at the expense of closed-source ecosystems. i have no interest in enabling people or corporations to profit from keeping secrets, especially not with my own free labor (or anyone else's, for that matter). if you disagree with this philosophy, you are welcome to continue using libc. # what does the k stand for? nothing. it was chosen in reference to libc - the letter C was part of the original roman alphabet, while K was added later by analogy to the Greek kappa ‹κ›. in my native language, the older letter ‹c› can make a number of different sounds based on context, including [k] and [s], while ‹k› is fairly consistently used for the sound [k]. and for orthographical reasons, [k] is often represented by the digraph ‹ck› - that is, a C followed by a K. hopefully the analogies are obvious. this project has nothing to do with KDE. |