ruby-x.github.io/app/views/posts/_2019-08-20-parsing-parfait-and-other-goodies.haml
Torsten Rüger 90977d1a7c First version of new post
Compiling Parfait, better mom, builtin etc
2019-08-20 22:06:37 +03:00

205 lines
10 KiB
Plaintext

%p
As we rubyx will eventually need to parse and compile itself, i am very happy to
report success on the first steps on that journey. Also benchmarks, better design
and another conference are on the list.
%h2 Compiling parfait
%p
As a recap, Parfait is that part of the core library that we need already during
compilation. Ie the compiler creates Parfait objects during compilation and uses
Parfait code to do this. This off course is a conundrum, which is solved by using the
Parfait Code as is in the compiler (and some ruby module magic to avoid name clashes)
%p
Naturally any meaningful program that the compiler generates will use Parfait
and so Parfait must be available at run-time, ie parsed and compiled. Since i have been
busy doing the basics, this has been on the ToDo for a long while.
%p
Now, finally, most the basics are inplace and i have started what feels like a tremendous
task. In fact i have succefully compiled
%em three files.
Object, DataObject and Integer, to be precise.
%p
The significance of this is actually much greater (especially since there are no tests)
yet. Parfait, as part of rubyx, is what one may call ideomatic ruby, ie real world ruby.
Off course i have to smoothen out a few bugs before compiling actually worked, but
surprisingly little. In other words the compiler is functional enough to
compile larger more feature ritch ruby programs, but more on that below.
%h2 Design improvements
%p
The overall design has been like in the picture below for a while already.
Alas, the implementation oof this architecture was slightly lacking.
To be precise, when mom code was generated, it was immediately converted to risc.
In other words the layer only existed conceptually, or in transit.
%p.center.three_width
= image_tag "architecture.png" , alt: "Architectural layers"
%p
Now the code works
%em exactly
as advertised. Ruby comes in from the top and binary code out at the bottom.
But more than that, every layer is a distinct step, in fact there are methods on the
topmost compiler object to create every level down from ruby. This is obviously
very handy for testing.
%h2 Automated binary tests
%p
Speaking of testing, we are at over 1600 tests, which is more than 200 up from before
the design rewrite. At over 15000 assertions this is still 95%, in other words everything
apart from a few fails. And with parallel execution still fast.
%p.center.full_width
= image_tag "1600_tests.png" , alt: "Lots of test, never boring"
%p
But the main achievement a couple of weeks ago was the integration of binary testing
int the automated test flow. Specifically on
%em Travis.
%p
This uses a feature of Qemu that i had not know before, namely that one can get qemu
to run binaries from a different target on a machine, by simply calling it with
qemu-arm.
%p
I had done previous testing of binaries via ssh, usually to an qemu emulated pi on my
machine. This setup is vastly more complicated, as described
=ext_link "here" , "/arm/qemu.html"
and i had shied away from that. Meaning they would happen irregularily and all that.
My only consolation was that the test would run on the interpreter, but off course that
does not test the arm and elf genertion.
%p
The actual tests that i am talking about are a growing number of "mains" tests, found in
the tets/mains directory.
These are actal programs that calculate or output stuff. They are complete system
tests in the sense that we only test their output (system output).
%p
As we usually link to "a.out" files (thus overwriting and avoiding cleanup), the actual
invocation of qemu for a binary is really simple:
%pre
%code
qemnu-arm ./a.out
but that still leaves you to generate that binary. This can be done by using the
rubyxc compiler and linking the resulting object file (see bug #13). But sine i too am a
lazy programmer i have automated these steps into the rubyxc compiler, and so one
can just compile/link/execute a source file like this:
%pre
%code
:preserve
./bin/rubyxc execute test/mains/source/fibo__8.rb
This will compile, link and execute this specific fibonachi test. This output
of this test will be 8, as encoded in the file name.
So this and 20 others will be tested as binaries now every time travid does its thing.
%p
BTW, i have also created arubyxc command to execute a file via the interpreter.
This can sometimes yield better errors when things go wrong.
%pre
%code
:preserve
./bin/rubyxc interpret test/mains/source/puts_Hello-there_11.rb
And as a second btw, i also added an option to the compiler, so one can control the
Parfait factory size with the option --parfait.
%h2 Misc other news
%h3 Microbenchmarks
%p
At the last conference in Hamburg, someone asked the fair question: So how fast is it?
It's been so long that i did tests, that i could only mumble.
Now i finished updating the tests, but it will be a while before i can answer the
question more fully.
%p
So for starters, because the functionality of the compiler is limited, i did very small
benchmarks. Very small means 20 lines or less, loops, string output, fibonacchi, both
linear and recursive. I realized too late, that that will tell most about integer
performance.
%p
Now because of the early days, i will not go into detail here. In general speed was not
as fast as i had hoped from by 4 year old benchmarks, about the same as mri. I
will have to do some work on the calling convention and probably some on integer
handling too. I think i can quite easily shave 30-50% off, and that alone should
verify the saying that all benchmarks are lies. Like the one where rubyx is doing
"hello world" faster than C. Yes, C, not mri. But only because i switched the buffering
off, because also rubyx does not buffer (apples and oranges ...)
%p
So i will take this round as inspiration to do some optimisation and performance
measuring. And come back to it later.
%h3 Implicit returns
%p
As part of parsing Parfait, i implemented a first version of implicit returns.
Low hanging fruits, and in fact most common use cases, included constants and calls.
So when a method ends in a simple variable, constant, or a call, a return will be added.
More complex rules like returns for if's or while will ave to wait, but i found that i
personally don't tend to use them anyway.
%p
Since class methods are basically methods (of the meta class), adding the unified
return handling to them was easy too.
%h3 Improved Block handling
%p
Block handling, at least the simple implicit kind, has worked for a while, but was in
several ways too complicated. The block was unneccessarily assigned to a local, and
compiling was handled by picking them out.
%p
This all stemmed from a misunderstanding, or lack of understanding: Blocks, or should
i say Lambdas, are constants. Just like a string or integer. They are created once at
compile time and can not change identity. In fact Methods and Classes are also contants,
and i reflected this in the Vool level by calling them Expressions, instead of
before Statements.
%p
So now the Lambda Expression is created and just added as an argument to the send.
Compiling thee Lambda is triggered by the constant creation, ie the step down from
vool to mom, and the block compiler added to method compiler automatically.
%h3 Vool coming into focus
%p
I've been saying ruby without the fluff, to descibe vool. And while that is true,
it is quite vague. Two major things have become clear about vool through the work above.
%p
Firstly, Vool has no complex or recursive send statements. Arguments must be variables or
constants. Calls are executed before and assigned to a temporary variable. In effect
recursive calls are flattened into a list, and as such the calling does not rely on a
stack as in ruby.
%p
Secondly, Vool distinguishes between expressions and statements. Like other lower level,
but not ruby. As a rule of thumb, Statements do things, Expression are things. In other
words, only expressions have value, statements (lke if or while) do not.
%h2 Plans
%h4 GrillRB conference
%p
I will speak in
=ext_link "Wrazlaw" , "https://grillrb.com/"
in about a week. The plan is to make a comparison with rails and focus on the
possibilities, rather than technical detail.
%h4 Calling
%p
The Calling can do with work and i noticed two mistakes i did. One is that creating
a new message for every call is unneccessarily complicated. Its is only in the
special case that a Proc is created that the return sequence (a mom instruciton) needs
to keep the message alive.
%h4 Integers
%p
I still want to hang on to Integers being objects, though creation is clealy costly.
In the future a full escape analysis will help off course, but for now it should be easy
enough to figure out wether an int is passed down. If not loops can be
destructively change the int. A simple special case is a the times method.
%h4 Mom instruction invocation
%p
I have this idea of being able to code more stuff higher up. To make that more
efficient i am thinking of macros or instruction invocation at the vool level.
Only inside Parfait off course. The basic idea would be to save the call/return
code, and have eg X.return_jump map to the Mom::ReturnJump Instruction. "Just" have
to figure out the passing semantics, or how that integrates intot the vools code.
%h4 Better Builtin
%p
The generation of the current builtin methods has always bothered me a bit.
It is true that some things just can not be expressed as ruby and so some
alternative mechanism is needed (even in c one can embed assembler).
%p
The main problem i have is that those methods don't check their arguments and as such
may cause core dumps. So they are to high level and hopefully all we really need is
that previous idea of being able to integrate Mom code into vool. As Mom is extensible
that should take care of any possible need. And we could code the methods normally as
part of Parfait, make them safe, and just use the lower level inside them. Lets see!.