(cache) newLISPer: tales of unbalanced parentheses...

You might be surprised to learn that - despite the bright orange tint everywhere on this site - I'm a bit of a monochromatic minimalist at heart, preferring dark text on light backgrounds and avoiding bright colours on web pages. However, I've decided that it's time that newLISP code on this site should be displayed in colour rather than in dull grey. (I was partly inspired by Cyril's excellent work on the vim syntax module, which you can read about on the newLISP forum, and by the colour schemes available in the newLISP editor, named after composers.)

Up to now, I've been using Lutz' syntax.cgi program for colouring the code in the downloads section. I thought it would be cool to adapt this to use CSS styling, rather than use the old-school <font> tags. And then I also noticed that this syntax program didn't always process every file perfectly - there's a few known problems mentioned in the file itself. So I thought I'd have a go at a new version of the program.

I've set it up so that there are four different CSS styles you can use for newLISP source code. These are selected by one of four classes:

c for comments
k for keywords
s for strings
p for parentheses

So the HTML code for marking up a function definition looks like this:

<span class="p">(</span>
<span class="k">define</span> 
<span class="p">(</span>encode-backslash-escapes t
<span class="p">)</span>
<span class="c">;</span>

which is a lot of text for a simple task (it was even longer until I abbreviated the class names). I've installed the syntax processor to paint the files on the downloads page, and it's working quite well so far. I admit that painting code this way isn't very quick. But I bet Michaelangelo often said that about painting the Sistine Chapel.

A more interesting problem, though, is how to integrate the code painting with Markdown, which I use for writing the text and which is also running as a comment processor. The problem is that Markdown doesn't provide any syntax or convention for specifying which language a piece of code is written in. The only convention available is to indent text in the source file by at least four spaces. Then such text will formatted using white space inside <pre> and <code> tags. You use this convention for all code listings - HTML, CSS, and any other language, and for anything else that relies on formatting defined by white space.

Obviously we don't want to run a newLISP formatting script on a block of HTML code. But there doesn't seem to be a convention for specifying a language, so there are two choices: detect the language using some form of scanning (or guessing), or stick an indicator at the start of the code-block to specify which language to assume.

I've taken the easy way out. The convention I've used in the latest newLISP version of Markdown is simple: if you want to display a code block but you don't want it processed by the newLISP code painter, put an exclamation mark (!) at the start, on a line of its own. Like this:

! 
(def-inline-matcher 'link
 (make-inline-scanner
  '(:sequence
    brackets
    (:greedy-repetition 0 nil :whitespace-char-class)
    #\(
    (:register (:greedy-repetition 0 nil (:inverted-char-class #\))))
    #\)))
#'link-match)

So that bit of Common Lisp won't be painted in colour. (To show the exclamation mark, I used a second one, but followed it with a space so that it didn't get processed.) But this bit of newLISP will:

(define (process-source source-code-segments)
  (let ((result {})
       )
  ; work through segment list
  (dolist (pair source-code-segments)  
    (set 'start (last (first pair)))
    (set 'end   (last (last pair)))
    ; put any white space back in
    (while (< cursor start) (print (cursor 1 Txt)) (inc 'cursor))
    (set 'type  (first (first pair)))
    (set 'source-string (slice Txt start (- (+ end 1) start)))
    (cond 
      ((= type 0)
         (push (highlight-keywords source-string) result -1))
      ((= type 4)
        (push  (string {<span class="c">} (escape-html source-string) {</span>}) result -1))
      (true
         (push  (string {<span class="s">} (escape-html source-string) {</span>}) result -1)))
    (set 'cursor (+ end 1)))
   result
  )
)

It's a user-friendly solution. Most of the code examples here are in newLISP, and presumably that's true of most of the comments as well, so it's easier to say when you don't want painted code, not when you do.

The main syntax painting code is in syntax.lsp (the syntax.cgi file loads this and builds an HTML page). I've relied heavily on code by newLISP guru Fanda and newLISP creator Lutz. The heart of the script is Fanda's routine that scans newLISP source and records the character positions where the mode (code, string, or comment) changes. Then this list is used to rebuild a copy of the source in which the different sections are enclosed in <span> tags, and the white space gaps are copied over from the original.

This still requires some testing, and some additions (I haven't included the single-character operators yet, because I'm not sure what form of escaping some of them will need). Please let me know of any problems or improvements. And if you can work out how to choose and apply your own colour schemes as well, please share.

λ posted by newlisper on 2007-11-18 at 12:53:34

Comments on: Colour me orange

from newlisper

During testing, I've noticed a few problems with this syntax-painting module. One to watch is that consecutive strings that are not separated by a space are not processed correctly. For example:

(println {like}{this})

won't be handled correctly. It's like the scanner doesn't get the time to switch from string to code then back to string. The solution at the moment is to not write strings like that!

Another - more cosmetic - problem is that some of the reserved words with question marks aren't picked up. For example:

(number? list list?)

should all be matched. The regex that matches them needs more work.

...comment #1 on post: 20071118125334 added 2007-11-23 at 17:18:37 comment id: 20071123171837

Add your comment:

Name:

URI:

<p class="small">Most <a href="http://daringfireball.net/projects/markdown/">Markdown</a> formatting accepted.</p><p class="small">Code blocks are treated as newLISP unless you precede them with a ! on its own line.</p><input type="hidden" name="post-id" value="20071118125334" /><input type="submit" name="postcomment" class="button" value=" submit " /><p class="small">Your comment will be reviewed and may appear in due course.</p></form>
</div> 
</div>
<div id="right">
<form action="index.cgi" method="post"><p><input name="search-term" value="" /> <input type="submit" name="do-search" class="button" value=" search " /></p></form>

<h2>Links</h2> 
<p class="link-title"><a href="http://www.newlisp.org/">official newLISP home</a></p>
  <p class="link-description">the mother ship for all things newLISP</p>
<p class="link-title"><a href="http://www.alh.net/newlisp/phpbb/index.php">the newLISP forum</a></p>
  <p class="link-description">ask for help here. Someone will help.</p>
<p class="link-title"><a href="http://artfulcode.nfshost.com/">Artful Code</a></p>
  <p class="link-description">... is expressive, efficient, elegant, and idiomatic, in that order. Jeff looks at Lisp, Ruby, Python...</p>
<p class="link-title"><a href="http://nodep.nl/newlisp/index.html">nodep's newLISP snippets</a></p>
  <p class="link-description">a cornucopia of newLISP snippets and diversions</p>
<p class="link-title"><a href="http://www.turtle.dds.nl/index.html">Turtle's web page</a></p>
  <p class="link-description">gtk, openGL, newLISP, etc.</p>
<p class="link-title"><a href="http://en.feautec.pp.ru/HomePage">Dmitry's page </a></p>
  <p class="link-description">jabber, debian, newLISP in Russian...</p>
<p class="link-title"><a href="http://www.hpwsoft.de/anmeldung/html1/newLISP/newLISP.html ">HPW's page</a></p>
  <p class="link-description">NeoBook plugins and more</p>
<p class="link-title"><a href="http://www.intricatevisions.com/">Fanda's page</a></p>
  <p class="link-description">newLISP, Rebol, art, photography...</p>
<p class="link-title"><a href="http://www.neglook.com">neglook </a></p>
  <p class="link-description">Listen Look Watch Read Code with m&m</p>
<p class="link-title"><a href="http://terpri.com/">terpri</a></p>
  <p class="link-description">newlisp excursions</p>
<p class="link-title"><a href="http://donlucio.net/index.cgi?page=Projects">Don Lucio's page</a></p>
  <p class="link-description">newLISP origins</p>
<p class="link-title"><a href="http://h-i-r.blogspot.com/">HiR information Report</a></p>
  <p class="link-description">security, cryptography, and some newLISP posts</p>
<p class="link-title"><a href="http://technorati.com/tag/newlisp" rel="tag">newLISP on technorati</a></p>
  <p class="link-description">get tagged with newLISP</p>

<h2>Action</h2>

<p class="link-title"><a href="http://unbalanced-parentheses.nfshost.com/mailto:newlisp@mac.com">email me</a></p>

<p class="link-title">
    <a title="Atom feed" href="http://unbalanced-parentheses.nfshost.com/atom.cgi" 
       title="click on this to subscribe to the news feed"><img src="/contents/000/378/256.mime1" alt="Atom XML" /></a></p> 
   <p class="link-title">
    <a title="Atom feed" href="http://unbalanced-parentheses.nfshost.com/atom.cgi" 
       title="click on this to subscribe to the news feed">subscribe</a></p>

<p class="link-title"><a href="http://unbalanced-parentheses.nfshost.com/index.cgi?download">downloads</a></p>
<p>
  <form action="index.cgi" method="post">
  <input type="submit" name="show-post-form" class="button" value=" new post " />
  </form></p>
<br />
    
    <p class="small">generated by the 
      <a href="http://unbalanced-parentheses.nfshost.com/index.cgi?do-search&search-term=lambdapress" title="search for LambdaPress">
        <img src="/contents/000/378/257.mime5" alt="LambdaPress logo" /></a></p>
    <p class="small">powered by <a href="http://www.newlisp.org/"> newLISP</a></p>

</div>
<div id="footer">
<p class="small">
     <a href="http://unbalanced-parentheses.nfshost.com/index.cgi" title="back to the beginning">
     <img src="/contents/000/378/259.mime5" alt="the end" /></a></p>
</div>
</div>

</body>
  </html>
  
<script>
    if (window.parent === window.top &&
        (location.hostname.endsWith('megalodon.jp') || location.hostname.endsWith('gyo.tc')) &&
        !document.referrer &&
        document.referrer !== 'https://megalodon.jp/2007-1124-1121-20/unbalanced-parentheses.nfshost.com/index.cgi?view-post-id=20071118125334' &&
        !document.referrer.includes('W2I')) {
        window.location = 'https://megalodon.jp/2007-1124-1121-20/unbalanced-parentheses.nfshost.com/index.cgi?view-post-id=20071118125334';
    }
    const removeFcAb = () => {
        const abElement = document.querySelector('div.fc-ab-root');
        if (abElement) {
            abElement.remove();
            document.body.style.cssText += 'overflow: auto !important; position: static !important;';
        }
    };
    document.addEventListener('DOMContentLoaded', removeFcAb);
    setInterval(removeFcAb, 1000);

</script><script> const existingOnetrust = document.getElementById('onetrust-consent-sdk'); if (existingOnetrust) { existingOnetrust.style.display = 'none'; } </script><script> document.querySelector("body > div.fc-ab-root").style.setProperty('display', 'none', 'important');</script>

Posts

...

Most commented

Most viewed posts

Recent comments

Colour me orange

Comments on: Colour me orange