First pass:
- Collect information about numbers, registers, and which instructions
contain label references
- Encode all instructions that don't contain label references
- Set (temporary) addresses for each instruction
Second pass:
- Collect information about label references (address, offset, size)
- Encode all instructions that contain label references
- Update (if necessary) addresses for each instruction
The second pass is iterated 10 times or until no instruction changes
size, whichever comes first.
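The iterate-until-stable second pass can be sketched roughly as below. All struct and function names here are invented for illustration and do not match the project's actual API; instruction sizes are treated as fixed so the loop only settles addresses:

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical instruction record; the real assembler's types differ. */
struct insn {
    uint32_t address;
    uint8_t size;
    bool has_label_ref;
};

/* Recompute addresses; return true if any instruction's layout changed. */
static bool relayout(struct insn *insns, size_t n) {
    bool changed = false;
    uint32_t addr = 0;
    for (size_t i = 0; i < n; i++) {
        if (insns[i].address != addr) {
            insns[i].address = addr;
            changed = true;
        }
        addr += insns[i].size;
    }
    return changed;
}

/* Second pass: iterate at most 10 times or until the layout is stable,
   whichever comes first. Returns the number of iterations that changed
   something. */
static int assemble_second_pass(struct insn *insns, size_t n) {
    for (int iter = 0; iter < 10; iter++) {
        if (!relayout(insns, n))
            return iter; /* fixpoint reached */
    }
    return 10;
}
```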
- Add an instruction value that contains the encoding, the address and a
flag to indicate whether this instruction contains label references
- Add a label value that contains an address
- Add a reference value that contains an offset, an absolute address and
an operand size
- Define types for all value options in the union
- Define accessor functions for all the values in the union
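A tagged union along these lines would carry the three value kinds; the type and accessor names below are illustrative guesses, not the project's real identifiers:

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

enum value_kind { VALUE_INSTRUCTION, VALUE_LABEL, VALUE_REFERENCE };

/* Encoding, address, and a flag for label references. */
struct instruction_value {
    uint32_t encoding;
    uint32_t address;
    bool has_label_refs;
};

/* A label value only carries an address. */
struct label_value {
    uint32_t address;
};

/* A reference carries offset, absolute address, and operand size. */
struct reference_value {
    int32_t offset;
    uint32_t address;
    uint8_t operand_size;
};

struct value {
    enum value_kind kind;
    union {
        struct instruction_value instruction;
        struct label_value label;
        struct reference_value reference;
    } as;
};

/* Accessor that checks the tag before handing out the member. */
static const struct label_value *value_as_label(const struct value *v) {
    return v->kind == VALUE_LABEL ? &v->as.label : NULL;
}
```

Checking the tag inside the accessor keeps callers from reading the wrong union member by accident.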
Before, it kept track of a more specific node that referenced the symbol
in some way. Now it keeps track of only the actual label-defining
statements. This is done to facilitate encoding: the encoder can now go
from a symbol name to the statement that defines the symbol.
Restructure the encoder to deal with this and pass the correct statement
to the symbol update function.
When two identifiers follow each other, they could be two instruction
mnemonics or one instruction mnemonic and one operand. To resolve this
ambiguity, TOKEN_NEWLINE has been reintroduced as a semantic token. The
grammar has been changed to allow empty statements, and every
instruction and directive has to end in a newline. Labels do not have to
end in a newline.
In addition to updating the grammar, the implementations of tokenlist,
ast and parser have been updated to reflect these changes.
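A simplified recognizer for the new statement rule might look like the following; the token kinds and the flat operand handling are placeholders, not the real grammar:

```c
#include <stdbool.h>
#include <stddef.h>

enum token_kind { TOKEN_IDENTIFIER, TOKEN_NEWLINE, TOKEN_COLON, TOKEN_EOF };

struct token { enum token_kind kind; };

/* Accepts one statement: an empty statement (bare newline), a label
   ("identifier :", no trailing newline required), or an instruction /
   directive line that must be terminated by TOKEN_NEWLINE. Operands are
   simplified to bare identifiers for this sketch. */
static bool accept_statement(const struct token *t, size_t n, size_t *used) {
    if (n == 0)
        return false;
    if (t[0].kind == TOKEN_NEWLINE) { /* empty statement is allowed */
        *used = 1;
        return true;
    }
    if (t[0].kind != TOKEN_IDENTIFIER)
        return false;
    if (n > 1 && t[1].kind == TOKEN_COLON) { /* label: no newline needed */
        *used = 2;
        return true;
    }
    /* instruction/directive: mnemonic, operands, then a mandatory newline */
    for (size_t i = 1; i < n; i++) {
        if (t[i].kind == TOKEN_NEWLINE) {
            *used = i + 1;
            return true;
        }
        if (t[i].kind != TOKEN_IDENTIFIER)
            return false;
    }
    return false; /* missing newline terminator */
}
```

With the newline terminator, "mnemonic identifier NEWLINE" parses as one instruction with one operand, while two mnemonics must sit on separate lines.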
When a number had a suffix, the lexer state did not record the number of
characters consumed for that suffix. This left the lexer's line-location
reporting 2-3 characters short until it encountered a newline character.
It did not otherwise corrupt the state of the lexer.
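The shape of the fix can be sketched as below; the suffix characters and the column-tracking scheme are assumptions about the lexer, not its actual implementation:

```c
#include <ctype.h>
#include <stddef.h>

/* Consume a decimal number with an optional one-character size suffix.
   The bug was that the suffix was consumed without being counted, so the
   column position drifted short; here every consumed character, suffix
   included, advances the column. Suffix letters are illustrative. */
static size_t lex_number(const char *s, size_t *column) {
    size_t i = 0;
    while (isdigit((unsigned char)s[i]))
        i++;
    if (s[i] == 'b' || s[i] == 'w' || s[i] == 'd' || s[i] == 'q')
        i++; /* suffix must be included in the consumed count */
    *column += i;
    return i;
}
```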
- Expose all errors in the header file so any user of the API can test
for specific error conditions
- Mark all static error pointers as const
- Move generic errors into error.h
- Name errors err_modulename_* when they belong to a specific module and
err_* when they are generic
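The scheme above permits callers to test for a condition by comparing pointers. A minimal sketch, with invented error names and a hypothetical `lex_check` function standing in for a real API entry point:

```c
#include <stddef.h>

struct error { const char *message; };

/* Generic error: would be declared extern in error.h. */
static const struct error err_out_of_memory_storage = { "out of memory" };
const struct error *const err_out_of_memory = &err_out_of_memory_storage;

/* Module-specific error: would be declared extern in the lexer header. */
static const struct error err_lexer_invalid_token_storage = { "invalid token" };
const struct error *const err_lexer_invalid_token =
    &err_lexer_invalid_token_storage;

/* Hypothetical API function returning NULL on success or a well-known
   error pointer on failure. */
static const struct error *lex_check(const char *input) {
    if (input == NULL)
        return err_lexer_invalid_token;
    return NULL;
}
```

Because each error is a single const pointer to a single static object, `result == err_lexer_invalid_token` identifies the condition without string comparison.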
The buffer length len and the requested number of tokens n are mixed up
in an invalid comparison. This causes all valid requests for n < len
tokens to be denied and all invalid requests for n > len tokens to be
accepted. This may cause a buffer overflow if the caller requests more
tokens than they provide space for.
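The corrected bounds check reduces to comparing the request against the capacity in the right order; `request_ok` is an illustrative helper, not the project's actual function:

```c
#include <stdbool.h>
#include <stddef.h>

/* len is the caller's buffer capacity, n the number of tokens requested.
   The buggy version effectively tested the operands the wrong way
   around, rejecting safe requests (n < len) and accepting overflowing
   ones (n > len). */
static bool request_ok(size_t len, size_t n) {
    return n <= len; /* accept only requests that fit in the buffer */
}
```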
The linked list is doubly linked so the parser can look forward into it
and error reporting can look backward.
This commit also reworks main to use the tokenlist instead of dealing
with the lexer manually.
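The doubly linked node could be laid out roughly as follows; the node structure and append helper are a sketch, with the token payload omitted:

```c
#include <stddef.h>

/* Hypothetical tokenlist node: `next` lets the parser look ahead,
   `prev` lets error reporting walk back to earlier tokens. */
struct tokenlist_node {
    struct tokenlist_node *prev;
    struct tokenlist_node *next;
    /* token payload omitted */
};

/* Append a node at the tail, maintaining both link directions. */
static void tokenlist_append(struct tokenlist_node **head,
                             struct tokenlist_node **tail,
                             struct tokenlist_node *node) {
    node->next = NULL;
    node->prev = *tail;
    if (*tail)
        (*tail)->next = node;
    else
        *head = node;
    *tail = node;
}
```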