new post about future of types

2017-09-05 16:40:32 +03:00
parent 67b73a6707
commit b8d2d18f96
4 changed files with 165 additions and 1 deletions
--- a/_posts/2017-04-07-how-not-to-interpret.md
+++ b/_posts/2017-04-07-how-not-to-interpret.md
@@ -75,7 +75,7 @@ thus experience a cache miss and have their correct method found.

 ## Up next

-More grunt-work. Now that Vool replaces the ast the ode from rubyx/passes has to be "ported" to use it. That means:
+More grunt-work. Now that Vool replaces the ast the code from rubyx/passes has to be "ported" to use it. That means:
 - class extraction and class object creation
 - method extraction and creation
 - type creation by ivar analysis
--- a/_posts/2017-09-05-more-types-will-come-but-later.md
+++ b/_posts/2017-09-05-more-types-will-come-but-later.md
@@ -0,0 +1,77 @@
+---
+layout: news
+author: Torsten
+---
+
+While work on Mom (Minimal object machine) continues, i can see the futures a little clearer.
+Alas, for now the shortest route is best, so the future will have to wait. But here is what i'm
+thinking.
+
+## Types today
+
+The [architecture](/rubyx/layers.html) document outlines this in more detail, but in short:
+- types are immutable
+- every object has a type (which may change)
+- a type implements the interface of a class at a given time
+- a type is defined by a list of attribute names
+
+![Types diagram](/assets/types.jpg)
+
+### How classes work
+
+So the interesting thing here is how the classes work. Seeing as they are open, attributes can
+be added and removed, but the types are immutable.
+
+The solution is easy: when a new attribute is added to a class, a new type is created.
+
+The *instance type* is then updated to point to the current type. This means that new objects will
+be created with the new type, and old ones will keep their old type. Until the attribute is
+added to them too, in which case their *type* is updated too.
+
+**Methods** btw are stored at the Type, as they encode the knowledge of the memory layout
+that comes with the type, into the code of the method. Remember: full data hiding, only objects
+methods can access the variables, hence the type needs to be know only for *self*.
+
+## The future of types
+
+But what i wanted to talk about is how this picture is going to change in the future.
+To understand why we might want to, let's look at method dispatch on an instance variable.
+
+When you write something like @me.length , the compiler can check that @me is indeed an instance variable by checking the type of self. But since not information is stored about the type of
+*me* , a dynamic dispatch is needed to call *length*.
+
+The simple idea is to get rid of this dynamic dispatch by storing the type of instance variables
+too. This makes a lot calls faster, but it does come at significant cost:
+- every assignment to the variable has to be checked for type.
+- many more types must be created to differentiate the variables by name **and** type.
+
+Both of those don't maybe sound soo bad at first, but it's the cumulative effects that make a
+difference. Instance assignment is one of the only two ways to move data around in a oo machine.
+That's a lot of checking. And Types hold the methods, so for every new type *all* methods have
+to be *a* stored, and *b* created/compiled .
+
+But off course the biggest thing is all the coding this entails. So that's why it's in the future :-)
+
+## Multilayered Mom
+
+Just a note on Mom: this was meant to be a bridge between the language layer (vool) and the machine
+layer (risc). This step, from tree and statements, to list and low level instructions was deemed
+to big, so the abstract Minimal Object Machine is supposed to be a layer in between those.
+And it is off course.
+
+What i didn't fully appreciate before starting was that the two things are related. I mean
+statements lend themselves to a tree, while having instruction in a tree is kind of silly.
+Similarly statements in a list doesn't really make sense either. So it ended up being a two step
+process inside Mom.
+
+The *first* pass that transforms vool, keeps the tree structure. But it does introduce Mom's own
+instructions. It turns out that this is sensible for exactly the linear parts of code.
+
+The *second* pass flattens the remaining control structures into jumps and labels. The result
+maps to the risc layer 1 to n, meaning every Mom instruction simple expands into one or usually
+more risc instructions.
+
+In the future i envision that this intermediate representation at the Mom level will be a
+good place for further optimisations, but we shall see. At least the code is still recognisable,
+meaning relatively easy to reason about. This is a property that the risc layer really does
+not have anymore.
--- a/assets/types.jpg
+++ b/assets/types.jpg
--- a/assets/types.uxf
+++ b/assets/types.uxf
@@ -0,0 +1,87 @@
+<?xml version="1.0" encoding="UTF-8" standalone="no"?>
+<diagram program="umlet" version="14.2">
+  <zoom_level>10</zoom_level>
+  <element>
+    <id>UMLClass</id>
+    <coordinates>
+      <x>380</x>
+      <y>60</y>
+      <w>210</w>
+      <h>70</h>
+    </coordinates>
+    <panel_attributes>_Type_
+--
+hash: unique id
+list of attribute names</panel_attributes>
+    <additional_attributes/>
+  </element>
+  <element>
+    <id>UMLClass</id>
+    <coordinates>
+      <x>710</x>
+      <y>60</y>
+      <w>210</w>
+      <h>70</h>
+    </coordinates>
+    <panel_attributes>_Object_
+--
+type
+list of attribute values</panel_attributes>
+    <additional_attributes/>
+  </element>
+  <element>
+    <id>UMLClass</id>
+    <coordinates>
+      <x>10</x>
+      <y>60</y>
+      <w>150</w>
+      <h>70</h>
+    </coordinates>
+    <panel_attributes>_Class_
+--
+instance_type</panel_attributes>
+    <additional_attributes/>
+  </element>
+  <element>
+    <id>Relation</id>
+    <coordinates>
+      <x>150</x>
+      <y>100</y>
+      <w>250</w>
+      <h>60</h>
+    </coordinates>
+    <panel_attributes>lt=-&gt;
+m1=n
+m2=1
+&lt; implements</panel_attributes>
+    <additional_attributes>230.0;30.0;110.0;30.0;10.0;10.0</additional_attributes>
+  </element>
+  <element>
+    <id>Relation</id>
+    <coordinates>
+      <x>580</x>
+      <y>70</y>
+      <w>150</w>
+      <h>50</h>
+    </coordinates>
+    <panel_attributes>lt=&lt;-
+m1=1
+m2=1
+is of &gt;</panel_attributes>
+    <additional_attributes>10.0;20.0;130.0;20.0</additional_attributes>
+  </element>
+  <element>
+    <id>Relation</id>
+    <coordinates>
+      <x>150</x>
+      <y>50</y>
+      <w>250</w>
+      <h>60</h>
+    </coordinates>
+    <panel_attributes>lt=-&gt;
+m1=1
+m2=1
+current &gt;</panel_attributes>
+    <additional_attributes>10.0;40.0;120.0;10.0;230.0;10.0</additional_attributes>
+  </element>
+</diagram>