rubyx/lib/typed/compiler
2016-12-14 13:22:47 +02:00
..
assignment.rb finish assignment tests 2016-12-10 18:48:18 +02:00
basic_values.rb modulize basic_value 2016-12-09 13:40:10 +02:00
call_site.rb refoactor call site some 2016-12-09 12:22:37 +02:00
class_field.rb fix class def tests and code 2016-12-10 22:41:49 +02:00
class_statement.rb small fix 2016-12-10 18:07:38 +02:00
collections.rb modulize field_access and field_def 2016-12-09 14:04:06 +02:00
field_access.rb fix more expression tests 2016-12-10 15:18:37 +02:00
field_def.rb change method locals to type object too (same as args) 2016-12-13 19:18:17 +02:00
function_statement.rb fixes from the method argument change 2016-12-13 18:49:45 +02:00
if_statement.rb refactor while statement 2016-12-09 14:29:06 +02:00
name_expression.rb change method locals to type object too (same as args) 2016-12-13 19:18:17 +02:00
operator_expression.rb modulize operator_expression 2016-12-09 14:17:01 +02:00
README.md small things 2016-12-14 13:22:47 +02:00
return_statement.rb modulize return and list 2016-12-09 14:19:22 +02:00
statement_list.rb fix statement list compilation 2016-12-11 12:13:42 +02:00
while_statement.rb refactor while statement 2016-12-09 14:29:06 +02:00

Compiling

The typed syntax tree is created by the ruby compiler.

The code in this directory compiles the typed tree to the register machine code, and Parfait object structure.

If this were an interpreter, we would just walk the tree and do what it says. Since it's not things are a little more difficult, especially in time.

When compiling we deal with two times, compile-time and run-time. All the headache comes from mixing those two up.*

Similarly, the result of compiling is two-fold: a static and a dynamic part.

  • the static part are objects like the constants, but also defined classes and their methods
  • the dynamic part is the code, which is stored as streams of instructions in the MethodSource

Too make things a little simpler, we create a very high level instruction stream at first and then run transformation and optimization passes on the stream to improve it.

The compiler has a method for each class of typed tree, named along on_xxx with xxx as the type

Compiler holds scope

The Compiler instance can hold arbitrary scope needed during the compilation. A class statement sets the current @type scope , a method definition the @method. If either are not set when needed compile errors will follow. So easy, so nice.

All code is encoded as a stream of Instructions in the MethodSource. Instructions are stored as a list of Blocks, and Blocks are the smallest unit of code, which is always linear.

Code is added to the method (using add_code), rather than working with the actual instructions. This is so each compiling method can just do it's bit and be unaware of the larger structure that is being created. The general structure of the instructions is a graph (with if's and whiles and breaks and what), but we build it to have one start and one end (return).

Messages and frames

Since the machine is oo we define it in objects.

Also it is important to define how instructions operate, which is is in a physical machine would be by changing the contents of registers or some stack.

Our machine is not a register machine, but an object machine: it operates directly on objects and also has no separate stack, only objects. There is only one object which is accessible, basically meaning pinned to a register, the Message.

One can think of the Message as an oo replacement of the stack.

When a TypedMethod needs to make a call, it creates a NewMessage object. Messages contain return addresses (yes, plural) and arguments.

The important thing here is that Messages and Frames are normal objects.