Summer Status Update

Why Sponsor Oils? | blog | oilshell.org

Summer Status Update

2018-07-21

I plan to release OSH 0.5 next week! This post is an update on what's happened since the last release.

Table of Contents

Recurse Center

Oil Subproject Review

Shell Trace Tool

OPy

Footnote

Recurse Center

In late March, I wrote a post to relieve myself of Oil-related "duties" while attending the Recurse Center.

But a few things happened in the meantime, and I ended up working on Oil a lot:

Guido van Rossum tweeted about my project [1], which attracted new contributors. I kept my promise to review their pull requests quickly.

I'm embarrassed that the resulting commits have been sitting unreleased in the repo since May, but I'll fix this shortly.
Pair programming is a core part of the Recurse Center experience. It turned out that many people were interested in pairing on Oil, so it was natural to work on it. You'll see this in the release notes.
After the handling the logistics of traveling and housing, it felt good to work on Oil, rather than something more speculative and unfamiliar.

Oil Subproject Review

In the same post on Recurse Center (abbreviated RC from now on), I wrote about these subprojects:

A shell trace tool, i.e. a better sh -x.
Static analysis of shell scripts, e.g. for deploying self-contained app bundles.
The OPy compiler.
A source code browser.

I've decided to focus on two of these projects. They address the main problem I see: that OSH is basically a slower and less featureful version of bash right now!

While I've built a solid foundation for the project, I'm not surprised that it doesn't have regular users. I want to change that in the coming months.

Shell Trace Tool

This is a "carrot" for OSH, or at least a candidate for one. (The Oil language is still the main carrot, but it's a longer-term design and implementation effort.)

Before leaving for New York, I prototyped a web-based shell trace tool with my friend Eric.

I conjectured that we could use bash as the client, rather than Oil. I came up with some horrible hacks involving the $PS4 variable, which controls the output of sh -x.

This led to a working demo, but I'm not sure how successful this experiment was. When I get back to San Francisco, I plan to release it and solicit feedback.

OPy

The OPy compiler is necessary to turn OSH into a "production quality" shell. It's a risky and unusual implementation strategy, but I'm optimistic based on the progress so far.

I haven't worked on it too much this summer, but I had some realizations based on conversations with Darius Bacon, who wrote one of the articles I mentioned here:

Compiling to C code is probably easier than compiling to bytecode and writing my own VM. It's more direct, and will solve the "double interpretation" problem.
There's more than one way to compile a subset of Python to C code. PyPy compiles a control flow graph to C, but higher-level translations are also possible.
The C code strategy and the bytecode strategy have a lot of work in common, so I don't have to choose between them now.

For example, Python 3.8 will remove the BREAK_LOOP and CONTINUE_LOOP opcodes and express them in terms of jumps. I want to reproduce this change in OPy. It's useful whether we compile to bytecode or native code. (Python issue; credit for bringing this to my attention.)

My time at RC has given me many ideas for blog posts, which fall in these categories:

Lexing (continuing where I left off)
Phrases to Explain Oil
Five-Minute Talks I Gave at RC
Book Reviews

I'll sketch these ideas in the next post.

Footnote

[1] I e-mailed Guido about his recent blog post on pgen, the bespoke parser generator that powers Python. (Oil uses pgen2, the Python port of pgen.)

Among other things, this conversation clarified that "regular expression" with respect to pgen refers to EBNF iteration syntax like + and *, as opposed to the recursion of plain BNF.

Related: other posts on #parsing. I've written a lot about parsing shell, but parsing Python is interesting too.