Laythe

A toy programming language originall based on the 2nd book of Crafting Interpreters. See git tag v0.1.0 for a fully compliant lox implementations. Since v0.1.0 I've continued adding features and exploring what's possible to learn and do in the PL world.

Getting Started

This project can be built using the typical set of cargo commands for building, testing, running and benching.

In debug

cargo build

In Release

cargo build --release

To run the test suite run the following.

cargo test

A set of benchmarks have been setup using criterion. To run these again use cargo.

cargo bench

If you have the the flamegraph cargo subcommand you can profile a script by the following.

cargo flamegraph --bin=laythe [filepath]

To run the repl use.

cargo run [--release]

To run a script use.

cargo run [--release] [filepath]

Notable differences

I've continued to work on the language with a few extensions. Below are some of the differences

Additions

Built in Classes: Laythe now has machinery to give all types methods. Some simple examples include .str() methods to get a string representation of each type.

> let x = true;
> x.str()
'true'

Lambdas: There are now function expressions. This was actually a very minimal change to enable this as it reuses almost all the the function machinery.

> let func = |x| x * 2;
> let withBody |name| { print "hi! " + name; };
> func(5)
10
> withBody("John")
hi! john

Static Methods: Classes now support static methods using the static keyword.

class WithStatic {
  static example() { 
    return 'example';
  }
}

> WithStatic.example()
'example'

New Collection Types: Laythe now has lists and maps as part of the language both supporting literals.

> let list = [1, false, nil, 3, clock];
> let map = { 'key1', 10, 'key2': false, 15: nil };
> list[2];
nil
> map[false];
15

Map support all types with objects supported by reference equality. Since strings are interned this gives the desire value comparison results most would expect.

Modified

Gc The gc is setup quite differently from clox mostly because dealing with lifetimes in the base implementation was nearly impossible. Instead objects as of 5/8/2020 Act as a sort of smart pointer into a Vec<Box<Value>>. The gc takes a context for determining when should be considered roots. Gc is also explicitly turned off during compilation as values are only freed during the main loop.

Performance

In the whole performance is unfortunately only about 50~60% that of clox. For the internal hashmap I use the hashbrown crate with the fnv hasher which should be about the best setup for small hashes as all hashes are 16bytes. This still falls quite short of the brutally simply method used in clox. Beyond that I have not implemented the NAN tagging optimization. I'm hoping to eventually close the gap here.

Running the original benchmark suite on a 2015 dell xps we have.

benchmark	clox	Laythe	relative speed	notes
binary_tress.lox	total: 0.781831	total: 0.7417039	1.05	This likely due to the small number of properties and the init optimization. We essentially avoid hashing altogether in this bench
equality.lox	loop: 2.42498 elapsed: 2.70574	loop: 2.632479 elapsed: 2.936534	0.92	Similar to above we have essentially zero hashing and again perform pretty equal to clox
fib.lox	total: 1.43954	total: 2.613307	0.55	Here we have some hashing from the `fib` lookup but even making this local shows a difference. It appears there is still some performance difference in function calling here
instantiation.lox	total: 0.904372	total: 1.75085	0.51	Again localizing this gives a decent speedup but hashing and function calls still seem to slow Laythe down
invocation.lox	total: 0.433293	total: 1.181278	0.366	Now the hashing speed difference is quite apparent
method_call.lox	total: 0.286027	total: 0.531728	0.53	Same as above
properties.lox	total: 0.732466	total: 1.725661	0.42	Same as above
trees.lox	total: 3.48647	total: 6.325482	0.55	Same as above
zoo.lox	total: 0.474	total: 1.167421	0.406	Same as above

Future Ideas

These are some features / changes I'm considering adding to the language.

Gradual typing

I'm a huge fan of Typescript and believe it's gradual typing approach is quite fantastic. I'd like to incorporate a similar system here. Beyond just the normal type errors I'd like to have more of a runtype check at typed and untyped boundaries, at least enable that with a flag.

// Possible type syntax
fn unTyped(x) {
  return typed(x)
}

fn typed(x: number) -> string {
  return typedInner(x);
}

fn typedInner(x: number) -> string {
  return x.str();
}

> unTyped(10);
'10'
> unTyped(20);
'20'
≥ unTyped('15')
typeError 'typed' expected number received string'

Here I would think it could be useful if laythe automatically injected some runtime type checking before it's called in unTyped. The runtime check could then be emitted inside of typed as we know the value comes from a trust checked typedInner. I think this approach would really help extend the usefulness of the typescript model. I know outside of the strict flags in ts that you know the wrong type might still slip through. Here I think we can inject runtime checks to assert call boundaries are truly what the claim to be.

Implicit Return

I think this is just a nice to have and can really cleanup some code, but I think the rust / ruby etc. implicit return really is visually nice

fn example() {
  10 // returned
}

Move Instance hash to Class

The number of instance to classes in almost every case should be relatively large. In this case it makes a lot of sense in terms of memory to instead store the field mapping on the class. Where it HashMap<Managed<String>, usize>. The instance could then just hold a vector of Values which should pretty significantly reduce memory pressure and usage.

To accompany this we'll need to close all instances so you can no longer add field after the initializer. I think this behavior is not very desirable to begin with, plus it leads to a lot of potential in terms of gradual typing and optimization.

As a side effect of this change we should be able to emit new instruction of getting field 0-N directly and completely avoiding the hash.

mspraggs / laythe Goto Github PK

laythe's Introduction

Laythe

Getting Started

In debug

In Release

Notable differences

Additions

Modified

Performance

Future Ideas

Gradual typing

Implicit Return

Move Instance hash to Class

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent