ruby-x.github.io/app/views/posts/2019/_08-20-parsing-parfait-and-other-goodies.haml

229 lines
11 KiB
Plaintext
Raw Normal View History

%p
2019-08-21 16:30:26 +03:00
As rubyx will eventually need to parse and compile itself, i am very happy to
report success on the first steps towards that goal. Also better design, benchmarks,
and another conference are on the list.
%h2 Compiling parfait
%p
As a recap, Parfait is that part of the core library that we need already during
compilation. Ie the compiler creates Parfait objects during compilation and uses
Parfait code to do this. This off course is a conundrum, which is solved by using the
Parfait Code as is in the compiler (and some ruby module magic to avoid name clashes)
%p
Naturally any meaningful program that the compiler generates will use Parfait
and so Parfait must be available at run-time, ie parsed and compiled. Since i have been
busy doing the basics, this has been on the ToDo for a long while.
%p
2019-08-21 16:30:26 +03:00
Now, finally, most the basics are in place and i have started what feels like a tremendous
task. In fact i have successfully compiled
%em three files.
Object, DataObject and Integer, to be precise.
%p
2019-08-21 16:30:26 +03:00
The significance of this is actually much greater (especially since there are no tests
yet). Parfait, as part of rubyx, is what one may call ideomatic ruby, ie real world ruby.
Off course i had to smoothen out a few bugs before compiling actually worked, but
surprisingly little. In other words the compiler is functional enough to
2019-08-21 16:30:26 +03:00
compile larger, more feature rich ruby programs, but more on that below.
%h2 Design improvements
%p
The overall design has been like in the picture below for a while already.
2019-08-21 16:30:26 +03:00
Alas, the implementation of this architecture was slightly lacking.
2020-02-07 21:03:36 +07:00
To be precise, when SlotMachine code was generated, it was immediately converted to risc.
In other words the layer existed only conceptually, or in transit. (Also it was called mom)
%p.center.three_width
2020-02-07 21:03:36 +07:00
= image_tag "architecture.jpg" , alt: "Architectural layers"
%p
Now the code works
%em exactly
as advertised. Ruby comes in from the top and binary code out at the bottom.
But more than that, every layer is a distinct step, in fact there are methods on the
topmost compiler object to create every level down from ruby. This is obviously
2019-08-21 16:30:26 +03:00
very handy for testing, and by iself sped up testing by 30%, as risc is not renerated
before really needed.
%h2 Automated binary tests
%p
Speaking of testing, we are at over 1600 tests, which is more than 200 up from before
2019-08-21 16:30:26 +03:00
the design rewrite. At over 15000 assertions this is still 95% of the code, in other
words everything apart from a few fails. And with parallel execution still fast.
%p.center.full_width
= image_tag "1600_tests.png" , alt: "Lots of test, never boring"
%p
But the main achievement a couple of weeks ago was the integration of binary testing
2019-08-21 16:30:26 +03:00
into the automated test flow. Specifically on
%em Travis.
%p
This uses a feature of Qemu that i had not know before, namely that one can get qemu
to run binaries from a different target on a machine, by simply calling it with
qemu-arm.
%p
2019-08-21 16:30:26 +03:00
Previously i had done testing of binaries via ssh, usually to an qemu emulated pi on my
machine. This setup is vastly more complicated, as described
=ext_link "here" , "/arm/qemu.html"
2019-08-21 16:30:26 +03:00
and i had shied away from automating that. Meaning they would happen irregularily and
all that. My only consolation was that the test would run on the interpreter, but off
course that does not test the arm and elf genertion.
%p
The actual tests that i am talking about are a growing number of "mains" tests, found in
2019-08-21 16:30:26 +03:00
the tests/mains directory.
These are actual programs that calculate or output stuff. They are complete system
tests in the sense that we only test their output (system output and exit code).
%p
As we usually link to "a.out" files (thus overwriting and avoiding cleanup), the actual
invocation of qemu for a binary is really simple:
%pre
%code
2019-08-21 16:30:26 +03:00
qemu-arm ./a.out
but that still leaves you to generate that binary. This can be done by using the
rubyxc compiler and linking the resulting object file (see bug #13). But sine i too am a
lazy programmer i have automated these steps into the rubyxc compiler, and so one
can just compile/link/execute a source file like this:
%pre
%code
:preserve
./bin/rubyxc execute test/mains/source/fibo__8.rb
2019-08-21 16:30:26 +03:00
This will compile, link and execute this specific fibonacci test. The exit code
of this test will be 8, as encoded in the file name.
2019-08-21 16:30:26 +03:00
Now this test, and 20 others, will be run as binaries every time travis does its thing.
%p
2019-08-21 16:30:26 +03:00
BTW, i have also created a rubyxc command to execute a file via the interpreter.
This can sometimes yield better errors when things go wrong.
%pre
%code
:preserve
./bin/rubyxc interpret test/mains/source/puts_Hello-there_11.rb
And as a second btw, i also added an option to the compiler, so one can control the
Parfait factory size with the option --parfait.
%h2 Misc other news
2019-08-21 16:30:26 +03:00
%h3 Microbenchmarks
%p
At the last conference in Hamburg, someone asked the fair question: So how fast is it?
2019-08-21 16:30:26 +03:00
It's been so long that i did
=ext_link "tests," , "/misc/soml_benchmarks.html"
that i could only mumble.
Now i finished updating the tests, but it will be a while before i can answer the
question more fully.
%p
So for starters, because the functionality of the compiler is limited, i did very small
benchmarks. Very small means 20 lines or less, loops, string output, fibonacchi, both
2019-08-21 16:30:26 +03:00
linear and recursive. I realized too late, that all that will tell about is integer
performance.
%p
2019-08-21 16:30:26 +03:00
Now because it's early days, i will not go into detail here. In general speed was not
as fast as i had hoped for, about the same as mri. I
will have to do some work on the calling convention and probably some on integer
handling too. I think i can quite easily shave 30-50% off, and that alone should
verify the saying that all benchmarks are lies. Like the one where rubyx is doing
"hello world" faster than C. Yes, C, not mri. But only because i switched the buffering
off, because also rubyx does not buffer (apples and oranges ...)
%p
So i will take this round as inspiration to do some optimisation and performance
measuring. And come back to it later.
%h3 Implicit returns
%p
As part of parsing Parfait, i implemented a first version of implicit returns.
Low hanging fruits, and in fact most common use cases, included constants and calls.
So when a method ends in a simple variable, constant, or a call, a return will be added.
2019-08-21 16:30:26 +03:00
More complex rules like returns for if's or while will have to wait, but i found that i
personally don't tend to use them anyway.
%p
Since class methods are basically methods (of the meta class), adding the unified
return handling to them was easy too.
%h3 Improved Block handling
%p
Block handling, at least the simple implicit kind, has worked for a while, but was in
2019-08-21 16:30:26 +03:00
several ways too complicated. The block was unnecessarily assigned to a local, and
compiling was handled by looking for blocks statements, not during the normal flow.
%p
This all stemmed from a misunderstanding, or lack of understanding: Blocks, or should
i say Lambdas, are constants. Just like a string or integer. They are created once at
compile time and can not change identity. In fact Methods and Classes are also contants,
2020-02-07 21:03:36 +07:00
and i reflected this in the SOL level by calling them Expressions, instead of
before Statements.
%p
So now the Lambda Expression is created and just added as an argument to the send.
2019-08-21 16:30:26 +03:00
Compiling the Lambda is triggered by the constant creation, ie the step down from
2020-02-07 21:03:36 +07:00
SOL to SlotMachine, and the block compiler added to method compiler automatically.
2020-02-07 21:03:36 +07:00
%h3 SOL coming into focus
%p
2020-02-07 21:03:36 +07:00
I've been saying ruby without the fluff, to descibe SOL (Previously vool). And while
that is true,
it is quite vague. Two major things have become clear about SOL through the work above.
%p
2020-02-07 21:03:36 +07:00
Firstly, SOL has no complex or recursive send statements. Arguments must be variables or
constants. Calls are executed before and assigned to a temporary variable. In effect
recursive calls are flattened into a list, and as such the calling does not rely on a
stack as in ruby.
%p
2020-02-07 21:03:36 +07:00
Secondly, SOL distinguishes between expressions and statements. Like other lower level
2019-08-21 16:30:26 +03:00
languages, but not ruby. As a rule of thumb, Statements do things, Expression are things.
In other words, only expressions have value, statements (like if or while) do not.
%h2 Plans
%h4 Calling
%p
The Calling can do with work and i noticed two mistakes i did. One is that creating
2019-08-21 16:30:26 +03:00
a new message for every call is unneccessarily complicated. It is only in the
2020-02-07 21:03:36 +07:00
special case that a Proc is created that the return sequence (a SlotMachine instruction)
needs to keep the message alive.
2019-08-21 16:30:26 +03:00
%p
The other is that having arguments and local variables as seperate arrays may be handy
and easy to code. But it does add an extra indirection for every access _and_ store.
Since Mom is memory based, and Mom translates to risc, that does amount to a lot of
instructions.
%h4 Integers
%p
I still want to hang on to Integers being objects, though creation is clealy costly.
In the future a full escape analysis will help off course, but for now it should be easy
2019-08-21 16:30:26 +03:00
enough to figure out wether an int is passed down. If not loops can be made to
destructively change the int.
2020-02-07 21:03:36 +07:00
%h4 SlotMachine instruction invocation
%p
I have this idea of being able to code more stuff higher up. To make that more
2020-02-07 21:03:36 +07:00
efficient i am thinking of macros or instruction invocation at the SOL level.
Only inside Parfait off course. The basic idea would be to save the call/return
2020-02-07 21:03:36 +07:00
code, and have the compiler map eg X.return_jump to the Mom::ReturnJump Instruction.
"Just" have to figure out the passing semantics, or how that integrates into the SOL code.
2019-08-21 16:30:26 +03:00
%h4 Better Builtin
%p
The generation of the current builtin methods has always bothered me a bit.
It is true that some things just can not be expressed as ruby and so some
2020-02-07 21:03:36 +07:00
alternative mechanism is needed (even in c one can/must embed assembler).
%p
The main problem i have is that those methods don't check their arguments and as such
2019-08-21 16:30:26 +03:00
may cause core dumps. So they are too high level and hopefully all we really need is
2020-02-07 21:03:36 +07:00
that previous idea of being able to integrate SlotMachine code into SOL. As SlotMachine
is extensible
that should take care of any possible need. And we could code the methods normally as
2019-08-21 16:30:26 +03:00
part of Parfait, make them safe, and just use the lower level inside them. Lets see!
%h4 Compiling Parfait tests
%p
Since Parfait is part of rubyx, we have off course unit tests for it. The plan is to
parse the tests too, and run them as a test for both Prfait and the compiler.
Off course this will involve writing some mini version of minitest that the compiler
can actually handle (Why do i have the feeling that the real minitest involves too much
2020-02-07 21:03:36 +07:00
magic).
2019-08-21 16:30:26 +03:00
%h4 GrillRB conference
%p
Last, but not least, i will speak in
=ext_link "Wrocław" , "https://grillrb.com/"
in about a week. The plan is to make a comparison with rails and focus on the
possibilities, rather than technical detail. See you there :-)
2020-02-07 21:03:36 +07:00
%p
PS: Talk is now online
=link_to "here." , "/slides/grillrb"