ruby-x.github.io/app/views/pages/rubyx/layers.html.haml

134 lines
6.3 KiB
Plaintext
Raw Normal View History

2018-04-11 09:45:50 +02:00
= render "pages/rubyx/menu"
%h1=title "RubyX architectural layers"
2018-04-10 18:50:07 +02:00
%p
To implement an object system to execute object oriented languages takes a large system.
The parts or abstraction layers are detailed below.
%p
It is important to understand the approach first though, as it differs from the normal
interpretation. The idea is to
%strong compile
ruby. The argument is often made that
typed languages are faster, but i dont believe in that. I think dynamic languages
just push more functionality into the “virtual machine” and it is in fact only the
2018-07-25 11:10:08 +02:00
compilation to binaries that gives static languages their speed. This is the reason
2018-04-10 18:50:07 +02:00
to compile ruby.
%p.center.three_width
2020-02-07 15:03:36 +01:00
= image_tag "architecture.jpg" , alt: "Architectural layers"
2018-04-11 09:45:50 +02:00
2018-07-25 11:10:08 +02:00
%h3#ruby Ast + Ruby
2018-04-10 18:50:07 +02:00
%p
To compile and run ruby, we first need to parse ruby. While parsing ruby is quite
a difficult task, it has already been implemented in pure ruby
2018-04-11 09:45:50 +02:00
= succeed "." do
=ext_link "here" ,"https://github.com/whitequark/parser"
The output of the parser is
2018-04-10 18:50:07 +02:00
an ast, which holds information about the code in instances of a single
%em Node
class.
2018-04-11 09:45:50 +02:00
Nodes have a type attribute (which you sometimes see in s-expressions) and a list of children.
2018-04-10 18:50:07 +02:00
%p
2018-07-25 11:10:08 +02:00
The first layer in RubyX is the
%b Ruby Layer
that basically models the ast in a concrete syntax tree. This means there is one class
per node type.
2018-04-10 18:50:07 +02:00
%p
2019-10-04 00:22:34 +02:00
The Ruby layer is then used to transform the data into the Sol layer. Eg: Implicit
2018-07-25 11:10:08 +02:00
block passing is made explicit, conditions are simplified, arguments are simplified,
and syntactic sugar is removed.
2018-04-11 09:45:50 +02:00
2019-10-04 00:22:34 +02:00
%h3#simple-object-language Simple Object Language
2018-04-10 18:50:07 +02:00
%p
2019-10-04 00:22:34 +02:00
Simpe, in this context, means that it is much simpler than ruby.
2018-04-10 18:50:07 +02:00
%p
The main purpose is to simplify existing oo languages down to its core components: mostly
2018-07-25 11:10:08 +02:00
calling, assignment, continuations and exceptions. Typed classes for each language
2019-10-04 00:22:34 +02:00
construct exist and are responsible to transform a statement into SlotMachine level below.
2018-04-10 18:50:07 +02:00
%p
2019-10-04 00:22:34 +02:00
Examples for things that exist in ruby but are broken down in Sol are
2018-04-10 18:50:07 +02:00
%em unless
2018-04-11 09:45:50 +02:00
,
%em ternary operator
,
%em do while
or
%em for
loops and other similar syntactic sugar.
2019-10-04 00:22:34 +02:00
%h3#slot-machine SlotMachine
2018-04-10 18:50:07 +02:00
%p
2019-10-04 00:22:34 +02:00
We compile Sol statements into SlotMachine instructions. SlotMachine is a machine,
which means it has instructions. But unlike a cpu (or the risc layer below) it does not
have memory, only objects.
It also has no registers, and together these two things mean that all information is
stored in objects. Also the calling convention is object based and uses Frame and
Message instances to save state.
2018-04-10 18:50:07 +02:00
%p
Objects are typed, and are in fact the same objects the language operates on. Just the
functionality is expressed through instructions. While source methods are defined on
2019-10-04 00:22:34 +02:00
classes as Sol, when they are compiled to binary, they are made type specific. These
CallableMethods hold the binary and are stored in the Type.
2018-04-10 18:50:07 +02:00
%p
2019-10-04 00:22:34 +02:00
The SlotMachine level exists to make the transition to Risc easier. It has a very abstract,
high level instruction set, where each single instruction may resolve to many (even
2019-10-04 00:22:34 +02:00
tens of) lower level instructions. But it breaks down Sol's tree into an instruction
2018-04-11 09:45:50 +02:00
list, which is conceptually a much easier input for the next layer.
%p
2019-10-04 00:22:34 +02:00
SlotMachine uses a class hierachy to represent instructions, and this fact may be used to
extend the systm functionality. This feature is used in the Builtin layer of functions.
These functions are coded at the SlotMachine level, as they
can not be expressed in ruby. Examples include instance variable access, integer ops...
2018-04-11 09:45:50 +02:00
2018-04-10 18:50:07 +02:00
%h3#risc Risc
%p
The Register machine layer is a relatively close abstraction of risc hardware,
but without the quirks that for example arm has.
2018-04-10 18:50:07 +02:00
The Risc machine has registers, indexed addressing, operators, branches and everything
2019-10-04 00:22:34 +02:00
needed to implement SlotMachine. It does not try to abstract every possible machine feature
2018-04-10 18:50:07 +02:00
(like llvm), but rather “objectifies” the general risc view to provide what is needed for
2019-10-04 00:22:34 +02:00
the Slot layer, the next layer up (and actually Builtin functions).
2018-04-10 18:50:07 +02:00
%p
The machine has its own (abstract) instruction set, and the mapping to arm is quite
straightforward. Since the instruction set is implemented as derived classes, additional
instructions may be defined and used later, as long as translation is provided for them too.
In other words the instruction set is extensible (unlike cpu instruction sets).
%p
Basic object oriented concepts are needed already at this level, to be able to generate
a whole self contained system. Ie what an object is, a class, a method etc.
This minimal runtime is called parfait, and the same objects will be used at runtime
and compile time.
2018-04-10 18:50:07 +02:00
%p
Since working with at this low machine level (essentially assembler) is not easy to
follow for everyone (me :-), an interpreter was created (by me:-). Later a graphical
interface, a kind of
2018-04-10 18:50:07 +02:00
%a{:href => "https://github.com/ruby-x/rubyx-debugger"} visual debugger
was added.
Visualizing the control flow and being able to see values updated immediately helped
tremendously in creating this layer. And the interpreter helps in testing, ie keeping it
working in the face of developer change.
2018-04-11 09:45:50 +02:00
%h3 Target assembler
2018-04-10 18:50:07 +02:00
%p
Risc is the last abstract layer, it is then translated into machine dependent code.
This is not binary yet, more an oo version of assembler, where each instruction
is represented by an object.
2018-04-10 18:50:07 +02:00
%p
Arm is a risc architecture, but anyone who knows it will attest, with its own quirks.
2018-04-11 09:45:50 +02:00
For example any instruction may be executed conditionally, ie
%em every
instruction carries bits to make it check the status register. Or the fact that there
is no 32bit register load instruction. It is possible to create very dense code using
all the arm special features, but this is not implemented yet.
%p
The Arm::Translator translates RegisterInstructions to ArmInstructions.
%h3#binary-and-elf Elf and Binary
%p
A physical machine will run binaries containing instructions that the cpu understands,
in a format the operating system understands (elf).
%p
The previously generated objects must be able to convert themselves to binary.
These binary codes are wrapped and stored into binary files of elf format.
Arm and elf subdirectories hold the code for these layers.
2018-04-11 09:45:50 +02:00
and the Elf::ObjectWriter creates Linux binaries.