Beautiful Racket: Into the rapids: more basic

Beautiful Racket / tutorials

basic/parser.rkt

#lang brag
b-program : [b-line] (/NEWLINE [b-line])*
b-line : b-line-num [b-statement] (/":" [b-statement])* [b-rem]
@b-line-num : INTEGER
@b-statement : b-end | b-print | b-goto | b-let | b-input
b-rem : REM
b-end : /"end"
b-print : /"print" [b-printable] (/";" [b-printable])*
@b-printable : STRING | b-expr
b-goto : /"goto" b-expr
b-let : [/"let"] b-id /"=" (b-expr | STRING)
b-input : /"input" b-id
@b-id : ID
b-expr : b-sum
b-sum : b-number (/"+" b-number)*
@b-number : INTEGER | DECIMAL | b-id


#lang brag
b-program : [b-line] (/NEWLINE [b-line])*
b-line : b-line-num [b-statement] (/":" [b-statement])* [b-rem]
@b-line-num : INTEGER
@b-statement : b-end | b-print | b-goto | b-let | b-input
b-rem : REM
b-end : /"end"
b-print : /"print" [b-printable] (/";" [b-printable])*
@b-printable : STRING | b-expr
b-goto : /"goto" b-expr
b-let : [/"let"] b-id /"=" (b-expr | STRING)
b-input : /"input" b-id
@b-id : ID
b-expr : b-sum
b-sum : b-number (/"+" b-number)*
@b-number : INTEGER | DECIMAL | b-id


Associativity: operations within an infix expression are applied from left to right. So a infix expression like this:

1 + 2 - 3 + 4 - 5 + 6

1
1 + 2 - 3 + 4 - 5 + 6

Corresponds to this Racket expression, where the left-to-right infix operations are converted to inner-to-outer prefix operations:

(+ (- (+ (- (+ 1 2) 3) 4) 5) 6)

1
(+ (- (+ (- (+ 1 2) 3) 4) 5) 6)

Precedence: certain operations need to be applied before others. We probably remember this from sixth-grade math, but just in case—if we have an infix expression like this:

1 + 2 * 3 + 4 * 5 + 6

1
1 + 2 * 3 + 4 * 5 + 6

The * operations have higher precedence than the +. So the expression is evaluated as if it were written like so, and then the operations are applied in the usual left-to-right way:

1 + (2 * 3) + (4 * 5) + 6

1
1 + (2 * 3) + (4 * 5) + 6

In Racket, we would write the original expression like so:

(+ (+ (+ 1 (* 2 3)) (* 4 5)) 6)

1
(+ (+ (+ 1 (* 2 3)) (* 4 5)) 6)

Subexpressions: the usual precedence rules can be overridden by using parenthesized subexpressions. In effect, subexpressions are another layer of precedence. Starting with the previous example, we can force the addition operations to happen first by parenthesizing them, and then the remaining operations go left to right:

(1 + 2) * (3 + 4) * (5 + 6)

1
(1 + 2) * (3 + 4) * (5 + 6)

In Racket, we’d write the expression this way:

(* (* (+ 1 2) (+ 3 4)) (+ 5 6))

1
(* (* (+ 1 2) (+ 3 4)) (+ 5 6))


Add support for infix expressions to Racket itself. Then in our BASIC interpreter, we can just pass through whole infix expressions, and our new-and-improved Racket will do the rest.

This is actually a totally reasonable idea. As we know, Racket itself is designed to be extensible (for instance, with macros). We could create a tiny domain-specific language for interpreting infix expressions that we can invoke from within Racket. (Or even better, we could use one that already exists.)

We could also update our parser with new rules, so it automatically parses infix expressions into simpler nested expressions with the correct associativity and precedence. Then we update our expander to support the evaluation of these smaller expressions.

Since we already know how to make parser rules, this will be our approach.

grammar.rkt

test.rkt

grammar.rkt

test.rkt

grammar.rkt

basic/lexer.rkt

#lang br
(require brag/support)

(define-lex-abbrev digits (:+ (char-set "0123456789")))

(define-lex-abbrev reserved-terms (:or "print" "goto" "end" "+"
":" ";" "let" "=" "input" "-" "*" "/" "^" "mod" "(" ")"))

(define basic-lexer
  (lexer-srcloc
   ["\n" (token 'NEWLINE lexeme)]
   [whitespace (token lexeme #:skip? #t)]
   [(from/stop-before "rem" "\n") (token 'REM lexeme)]
   [reserved-terms (token lexeme lexeme)]
   [(:seq alphabetic (:* (:or alphabetic numeric "$")))
    (token 'ID (string->symbol lexeme))]
   [digits (token 'INTEGER (string->number lexeme))]
   [(:or (:seq (:? digits) "." digits)
         (:seq digits "."))
    (token 'DECIMAL (string->number lexeme))]
   [(:or (from/to "\"" "\"") (from/to "'" "'"))
    (token 'STRING
           (substring lexeme
                      1 (sub1 (string-length lexeme))))]))

(provide basic-lexer)

#lang br
(require brag/support)

(define-lex-abbrev digits (:+ (char-set "0123456789")))

(define-lex-abbrev reserved-terms (:or "print" "goto" "end" "+"
":" ";" "let" "=" "input" "-" "*" "/" "^" "mod" "(" ")"))

(define basic-lexer
  (lexer-srcloc
   ["\n" (token 'NEWLINE lexeme)]
   [whitespace (token lexeme #:skip? #t)]
   [(from/stop-before "rem" "\n") (token 'REM lexeme)]
   [reserved-terms (token lexeme lexeme)]
   [(:seq alphabetic (:* (:or alphabetic numeric "$")))
    (token 'ID (string->symbol lexeme))]
   [digits (token 'INTEGER (string->number lexeme))]
   [(:or (:seq (:? digits) "." digits)
         (:seq digits "."))
    (token 'DECIMAL (string->number lexeme))]
   [(:or (from/to "\"" "\"") (from/to "'" "'"))
    (token 'STRING
           (substring lexeme
                      1 (sub1 (string-length lexeme))))]))

(provide basic-lexer)


Addition (+) and subtraction (-).

Multiplication (*), division (/), and modulo (mod).

Negation (- with one argument rather than two).

Exponentiation (^).

Parenthesized subexpressions.

basic/parser.rkt

#lang brag
b-program : [b-line] (/NEWLINE [b-line])*
b-line : b-line-num [b-statement] (/":" [b-statement])* [b-rem]
@b-line-num : INTEGER
@b-statement : b-end | b-print | b-goto | b-let | b-input
b-rem : REM
b-end : /"end"
b-print : /"print" [b-printable] (/";" [b-printable])*
@b-printable : STRING | b-expr
b-goto : /"goto" b-expr
b-let : [/"let"] b-id /"=" (b-expr | STRING)
b-input : /"input" b-id
@b-id : ID
b-expr : b-sum
b-sum : [b-sum ("+"|"-")] b-product
b-product : [b-product ("*"|"/"|"mod")] b-neg
b-neg : ["-"] b-expt
b-expt : [b-expt "^"] b-value
@b-value : b-number | b-id | /"(" b-expr /")"
@b-number : INTEGER | DECIMAL

#lang brag
b-program : [b-line] (/NEWLINE [b-line])*
b-line : b-line-num [b-statement] (/":" [b-statement])* [b-rem]
@b-line-num : INTEGER
@b-statement : b-end | b-print | b-goto | b-let | b-input
b-rem : REM
b-end : /"end"
b-print : /"print" [b-printable] (/";" [b-printable])*
@b-printable : STRING | b-expr
b-goto : /"goto" b-expr
b-let : [/"let"] b-id /"=" (b-expr | STRING)
b-input : /"input" b-id
@b-id : ID
b-expr : b-sum
b-sum : [b-sum ("+"|"-")] b-product
b-product : [b-product ("*"|"/"|"mod")] b-neg
b-neg : ["-"] b-expt
b-expt : [b-expt "^"] b-value
@b-value : b-number | b-id | /"(" b-expr /")"
@b-number : INTEGER | DECIMAL

basic/expr.rkt

#lang br
(provide b-expr b-sum b-product b-neg b-expt)

(define (b-expr expr)
  (if (integer? expr) (inexact->exact expr) expr))

(define-macro-cases b-sum
  [(_ VAL) #'VAL]
  [(_ LEFT "+" RIGHT) #'(+ LEFT RIGHT)]
  [(_ LEFT "-" RIGHT) #'(- LEFT RIGHT)])

(define-macro-cases b-product
  [(_ VAL) #'VAL]
  [(_ LEFT "*" RIGHT) #'(* LEFT RIGHT)]
  [(_ LEFT "/" RIGHT) #'(/ LEFT RIGHT 1.0)]
  [(_ LEFT "mod" RIGHT) #'(modulo LEFT RIGHT)])

(define-macro-cases b-neg
  [(_ VAL) #'VAL]
  [(_ "-" VAL) #'(- VAL)])

(define-macro-cases b-expt
  [(_ VAL) #'VAL]
  [(_ LEFT "^" RIGHT) #'(expt LEFT RIGHT)])

#lang br
(provide b-expr b-sum b-product b-neg b-expt)

(define (b-expr expr)
  (if (integer? expr) (inexact->exact expr) expr))

(define-macro-cases b-sum
  [(_ VAL) #'VAL]
  [(_ LEFT "+" RIGHT) #'(+ LEFT RIGHT)]
  [(_ LEFT "-" RIGHT) #'(- LEFT RIGHT)])

(define-macro-cases b-product
  [(_ VAL) #'VAL]
  [(_ LEFT "*" RIGHT) #'(* LEFT RIGHT)]
  [(_ LEFT "/" RIGHT) #'(/ LEFT RIGHT 1.0)]
  [(_ LEFT "mod" RIGHT) #'(modulo LEFT RIGHT)])

(define-macro-cases b-neg
  [(_ VAL) #'VAL]
  [(_ "-" VAL) #'(- VAL)])

(define-macro-cases b-expt
  [(_ VAL) #'VAL]
  [(_ LEFT "^" RIGHT) #'(expt LEFT RIGHT)])

basic/expr.rkt

#lang br
(provide b-expr b-sum b-product b-neg b-expt)

(define (b-expr expr)
  (if (integer? expr) (inexact->exact expr) expr))

(define-cases b-sum
  [(_ arg) arg]
  [(_ left op right) ((case op
                        [("+") +]
                        [("-") -]) left right)])

(define-cases b-product
  [(_ arg) arg]
  [(_ left op right) ((case op
                        [("*") *]
                        [("/") (λ (l r) (/ l r 1.0))]
                        [("mod") modulo]) left right)])

(define-cases b-neg
  [(_ val) val]
  [(_ _ val) (- val)])

(define-cases b-expt
  [(_ val) val]
  [(_ left _ right) (expt left right)])

#lang br
(provide b-expr b-sum b-product b-neg b-expt)

(define (b-expr expr)
  (if (integer? expr) (inexact->exact expr) expr))

(define-cases b-sum
  [(_ arg) arg]
  [(_ left op right) ((case op
                        [("+") +]
                        [("-") -]) left right)])

(define-cases b-product
  [(_ arg) arg]
  [(_ left op right) ((case op
                        [("*") *]
                        [("/") (λ (l r) (/ l r 1.0))]
                        [("mod") modulo]) left right)])

(define-cases b-neg
  [(_ val) val]
  [(_ _ val) (- val)])

(define-cases b-expt
  [(_ val) val]
  [(_ left _ right) (expt left right)])

Beautiful Racket / tutorials

Into the rapids: more basic

Previous oversimplifications

Associativity & precedence

Angles of attack

Parsing with associativity & precedence

Lexer updates

Parser updates

Expander updates

Testing our expressions

Using functions instead of macros

Beau­tiful Racket / tuto­rials

Into the rapids: more basic

Beautiful Racket / tutorials