S-expressions: Difference between revisions

m
 
(36 intermediate revisions by 9 users not shown)
Line 21:
 
The reader should be able to read the following input
<langsyntaxhighlight lang="lisp">((data "quoted data" 123 4.5)
(data (!@# (4.5) "(more" "data)")))</langsyntaxhighlight>
and turn it into a native datastructure. (see the [[#Pike|Pike]], [[#Python|Python]] and [[#Ruby|Ruby]] implementations for examples of native data structures.)
 
Line 32:
Let the writer produce pretty printed output with indenting and line-breaks.
<br><br>
 
=={{header|11l}}==
{{trans|Nim}}
 
<syntaxhighlight lang="11l">T Token
T.enum Kind
INT
FLOAT
STRING
IDENT
LPAR
RPAR
END
 
Kind kind
String val
 
F (kind, val = ‘’)
.kind = kind
.val = val
 
F lex(input_str)
[Token] result
V pos = 0
 
F current()
R I @pos < @input_str.len {@input_str[@pos]} E Char("\0")
 
L pos < input_str.len
V ch = input_str[pos]
I ch == ‘(’
pos++
result.append(Token(Token.Kind.LPAR))
E I ch == ‘)’
pos++
result.append(Token(Token.Kind.RPAR))
E I ch C ‘0’..‘9’
V num = ‘’
V kind = Token.Kind.INT
L current() C ‘0’..‘9’
num ‘’= current()
pos++
I current() == ‘.’
num ‘’= current()
kind = FLOAT
pos++
L current() C ‘0’..‘9’
num ‘’= current()
pos++
result.append(Token(kind, num))
E I ch C (‘ ’, "\t", "\n", "\r")
pos++
E I ch == ‘"’
V str = ‘’
pos++
L current() != ‘"’
str ‘’= current()
pos++
pos++
result.append(Token(Token.Kind.STRING, str))
E
V BannedChars = Set([‘ ’, "\t", ‘"’, ‘(’, ‘)’, ‘;’])
V ident = ‘’
L current() !C BannedChars
ident ‘’= current()
pos++
result.append(Token(Token.Kind.IDENT, ident))
 
result.append(Token(Token.Kind.END))
R result
 
F indent(s, count)
R (count * ‘ ’)‘’s.replace("\n", "\n"(count * ‘ ’))
 
T SExpr
T.enum Kind
INT
FLOAT
STRING
IDENT
LIST
 
Kind kind
String val
[SExpr] children
 
F (kind, val = ‘’)
.kind = kind
.val = val
 
F to_str()
I .kind C (SExpr.Kind.INT, SExpr.Kind.FLOAT, SExpr.Kind.IDENT)
R .val
E I .kind == STRING
R ‘"’(.val)‘"’
E I .kind == LIST
V result = ‘(’
L(i, ex) enumerate(.children)
I ex.kind == LIST & ex.children.len > 1
result ‘’= "\n"
result ‘’= indent(ex.to_str(), 2)
E
I i > 0
result ‘’= ‘ ’
result ‘’= ex.to_str()
R result‘)’
assert(0B)
 
V input_str = ‘
((data "quoted data" 123 4.5)
(data (!@# (4.5) "(more" "data)")))
V tokens = lex(input_str)
V pos = 0
 
F current()
R I :pos < :tokens.len {:tokens[:pos]} E Token(Token.Kind.END)
 
F parse() -> SExpr
V token = current()
:pos++
I token.kind == INT
R SExpr(SExpr.Kind.INT, token.val)
E I token.kind == FLOAT
R SExpr(SExpr.Kind.FLOAT, token.val)
E I token.kind == STRING
R SExpr(SExpr.Kind.STRING, token.val)
E I token.kind == IDENT
R SExpr(SExpr.Kind.IDENT, token.val)
E I token.kind == LPAR
V result = SExpr(SExpr.Kind.LIST)
L current().kind !C (Token.Kind.RPAR, Token.Kind.END)
result.children.append(parse())
assert(current().kind != END, ‘Missing right paren ')'’)
:pos++
R result
assert(0B)
 
print(parse().to_str())</syntaxhighlight>
 
{{out}}
<pre>
(
(data "quoted data" 123 4.5)
(data
(!@# (4.5) "(more" "data)")))
</pre>
 
=={{header|Ada}}==
Line 39 ⟶ 186:
Specification of package S_Expr:
 
<langsyntaxhighlight Adalang="ada">with Ada.Strings.Unbounded;
private with Ada.Containers.Indefinite_Vectors;
 
Line 93 ⟶ 240:
end record;
 
end S_Expr;</langsyntaxhighlight>
 
The implementation of S_Expr:
 
<langsyntaxhighlight Adalang="ada">with Ada.Integer_Text_IO, Ada.Float_Text_IO;
 
package body S_Expr is
Line 151 ⟶ 298:
end Print;
 
end S_Expr;</langsyntaxhighlight>
 
Specification and Implementation of S_Expr.Parser (a child package of S_Expr):
 
<langsyntaxhighlight Adalang="ada">generic -- child of a generic package must be a generic unit
package S_Expr.Parser is
 
Line 161 ⟶ 308:
-- the result of a parse process is always a list of expressions
 
end S_Expr.Parser;</langsyntaxhighlight>
 
<langsyntaxhighlight Adalang="ada">with Ada.Integer_Text_IO, Ada.Float_Text_IO;
 
package body S_Expr.Parser is
Line 272 ⟶ 419:
end Parse;
 
end S_Expr.Parser;</langsyntaxhighlight>
 
The main program Test_S_Expr:
 
<langsyntaxhighlight Adalang="ada">with S_Expr.Parser, Ada.Text_IO;
 
procedure Test_S_Expr is
Line 298 ⟶ 445:
Expression_List.First.Print(Indention => 0);
-- Parse will output a list of S-Expressions. We need the first Expression.
end Test_S_Expr;</langsyntaxhighlight>
 
{{out}}
Line 326 ⟶ 473:
 
=={{header|ALGOL 68}}==
<langsyntaxhighlight lang="algol68"># S-Expressions #
CHAR nl = REPR 10;
# mode representing an S-expression #
Line 452 ⟶ 599:
+ nl
)
)</langsyntaxhighlight>
{{out}}
<pre>
Line 559 ⟶ 706:
 
 
<langsyntaxhighlight APLlang="apl">sexp←{
wspace←' ',⎕TC ⍝ whitespace is space, tab, cr, lf
 
Line 622 ⟶ 769:
}
 
</syntaxhighlight>
</lang>
 
=={{header|Arturo}}==
<langsyntaxhighlight lang="rebol">code: {
((data "quoted data" 123 4.5)
(data (!@# (4.5) "(more" "data)")))
Line 632 ⟶ 779:
s: first to :block code
inspect.muted s
print as.code s</langsyntaxhighlight>
 
{{out}}
Line 660 ⟶ 807:
 
=={{header|AutoHotkey}}==
<langsyntaxhighlight AutoHotkeylang="autohotkey">S_Expressions(Str){
Str := RegExReplace(Str, "s)(?<![\\])"".*?[^\\]""(*SKIP)(*F)|((?<![\\])[)(]|\s)", "`n$0`n")
Str := RegExReplace(Str, "`am)^\s*\v+") , Cnt := 0
Line 675 ⟶ 822:
Res .= "`t"
return Res
}</langsyntaxhighlight>
Examples:<langsyntaxhighlight AutoHotkeylang="autohotkey">Str =
(
((data da\(\)ta "quot\\ed data" 123 4.5)
("data" (!@# (4.5) "(mo\"re" "data)")))
)
MsgBox, 262144, , % S_Expressions(Str)</langsyntaxhighlight>
{{out}}
<pre>(
Line 707 ⟶ 854:
 
=={{header|C}}==
<langsyntaxhighlight lang="c">#include <stdio.h>
#include <stdlib.h>
#include <ctype.h>
Line 932 ⟶ 1,079:
print_expr(x, 0);
return 0;
}</langsyntaxhighlight>
{{out}}<syntaxhighlight lang="text">input is:
((data da\(\)ta "quot\\ed data" 123 4.5)
("data" (!@# (4.5) "(mo\"re" "data)")))
Line 956 ⟶ 1,103:
)
)
)</langsyntaxhighlight>
 
=={{header|C sharp|C#}}==
Line 963 ⟶ 1,110:
Git repository with code and tests can be found here: https://github.com/ichensky/SExpression/tree/rosettacode
 
<langsyntaxhighlight lang="csharp">
using System;
using System.Collections.Generic;
Line 1,010 ⟶ 1,157:
}
 
</syntaxhighlight>
</lang>
<langsyntaxhighlight lang="csharp">
using System;
using System.Collections.Generic;
Line 1,196 ⟶ 1,343:
}
}
</syntaxhighlight>
</lang>
<langsyntaxhighlight lang="csharp">
using System;
using System.Collections.Generic;
Line 1,218 ⟶ 1,365:
}
}
</syntaxhighlight>
</lang>
{{out}}
<pre>
Line 1,231 ⟶ 1,378:
apart from writing it out, which fulfils this task's requirements. With some more work
this code might actually be useful.
<langsyntaxhighlight lang="cpp">#include <cctype>
#include <iomanip>
#include <iostream>
Line 1,501 ⟶ 1,648:
}
return 0;
}</langsyntaxhighlight>
 
{{out}}
Line 1,527 ⟶ 1,674:
 
=={{header|Ceylon}}==
<langsyntaxhighlight lang="ceylon">class Symbol(symbol) {
shared String symbol;
string => symbol;
Line 1,695 ⟶ 1,842:
prettyPrint(tree);
}
</syntaxhighlight>
</lang>
{{out}}
<pre>
Line 1,721 ⟶ 1,868:
=={{header|CoffeeScript}}==
{{improve|CoffeeScript|This solution does not reproduce unquoted strings as per task description}}
<langsyntaxhighlight lang="coffeescript">
# This code works with Lisp-like s-expressions.
#
Line 1,852 ⟶ 1,999:
console.log "output:\n#{pp output}\n"
console.log "round trip:\n#{sexp output}\n"
</syntaxhighlight>
</lang>
{{out}}
<syntaxhighlight lang="text">
> coffee sexp.coffee
input:
Line 1,884 ⟶ 2,031:
round trip:
(("data" "quoted data with escaped \"" 123 4.5 "14") ("data" ("!@#" (4.5) "(more" "data)")))
</syntaxhighlight>
</lang>
 
=={{header|Common Lisp}}==
Line 1,902 ⟶ 2,049:
Unfortunately, our pointy-haired boss has asked you to write a parser for an unusual s-expression syntax that uses square brackets instead of parenthesis. In most programming languages, this would necessitate writing an entire parser. Fortunately, the Common Lisp reader can be modified through the use of macro-characters to accomplish this task. When the reader parses a macro-character token, a function associated with the macro-character is called. As evidenced below, modifying the behavior of the Lisp reader by setting macro-character functions to handle additional sytax requires far less work than writing a complete parser from scratch.
 
<langsyntaxhighlight lang="lisp">(defun lsquare-reader (stream char)
(declare (ignore char))
(read-delimited-list #\] stream t))
(set-macro-character #\[ #'lsquare-reader) ;;Call the lsquare-reader function when a '[' token is parsed
(set-macro-character #\] (get-macro-character #\) nil)) ;;Do the same thing as ')' when a ']' token is parsed</langsyntaxhighlight>
Unit test code:
<langsyntaxhighlight lang="lisp">;;A list of unit tests. Each test is a cons in which the car (left side) contains the
;;test string and the cdr (right side) the expected result of reading the S-Exp.
(setf unit-tests
Line 1,923 ⟶ 2,070:
(dolist (test unit-tests)
(format t "String: ~23s Expected: ~23s Actual: ~s~%"
(car test) (cdr test) (read-from-string (car test)))))</langsyntaxhighlight>
{{out| Unit test output}}
<pre>CL-USER> (run-tests)
Line 1,957 ⟶ 2,104:
===Writing S-Expressions===
The next step in this task is to write a standard Lisp s-expression in the square bracket notation.
<langsyntaxhighlight lang="lisp">(defun write-sexp (sexp)
"Writes a Lisp s-expression in square bracket notation."
(labels ((parse (sexp)
Line 1,980 ⟶ 2,127:
(subseq str 0 last-char)
str)))))))
(concatenate 'string "[" (fix-spacing (parse sexp)) "]")))</langsyntaxhighlight>
Unit test code:
<langsyntaxhighlight lang="lisp">(setf unit-tests '(((1 2) (3 4)) (1 2 3 4) ("ab(cd" "mn)op")
(1 (2 (3 (4)))) ((1) (2) (3)) ()))
 
Line 1,988 ⟶ 2,135:
(dolist (test unit-tests)
(format t "Before: ~18s After: ~s~%"
test (write-sexp test))))</langsyntaxhighlight>
{{out|Unit test output}}
<pre>CL-USER> (run-tests)
Line 2,014 ⟶ 2,161:
a native floating point type, floating point numbers are not.
 
<langsyntaxhighlight lang="cowgol">include "cowgol.coh";
include "strings.coh";
include "malloc.coh";
Line 2,279 ⟶ 2,426:
print("Parsed:\n");
prettyprint(ParseSExp(str));
print_nl();</langsyntaxhighlight>
 
{{out}}
Line 2,308 ⟶ 2,455:
 
=={{header|D}}==
<langsyntaxhighlight lang="d">import std.stdio, std.conv, std.algorithm, std.variant, std.uni,
std.functional, std.string;
 
Line 2,394 ⟶ 2,541:
"Printed: ".write;
pTest.writeSexp;
}</langsyntaxhighlight>
{{out}}
<pre>Parsed: [[data, quoted data, 123, 4.5], [data, [!@#, [4.5], (more, data)]]]
Line 2,401 ⟶ 2,548:
=={{header|EchoLisp}}==
The '''(read-from-string input-string)''' function parses a string into an s-expression, which is the native representation of program/data in EchoLisp and the majority of Lisps .
<langsyntaxhighlight lang="lisp">
(define input-string #'((data "quoted data" 123 4.5)\n(data (!@# (4.5) "(more" "data)")))'#)
 
Line 2,418 ⟶ 2,565:
(first(rest s-expr))
→ (data (!@# (4.5) "(more" "data)"))
</syntaxhighlight>
</lang>
 
=={{header|F_Sharp|F#}}==
 
Implementation of S-expression parser in F# 4.7 language.
Line 2,431 ⟶ 2,578:
The file <code>SExpr.fs</code> containing the implementation:
 
<langsyntaxhighlight lang="fsharp">
module SExpr
(* This module is a very simple port of the OCaml version to F# (F-Sharp) *)
Line 2,663 ⟶ 2,810:
(* print_endline (string_of_sexpr_indent s) *)
printfn "%s" (string_of_sexpr_indent s)
</syntaxhighlight>
</lang>
 
 
Line 2,670 ⟶ 2,817:
Read the experession from a file of preset it in the code.
 
<langsyntaxhighlight lang="fsharp">
module Program
(* Learn more about F# at https://fsharp.org *)
Line 2,708 ⟶ 2,855:
(* return an integer exit code *)
0
</syntaxhighlight>
</lang>
 
{{out}}
Line 2,734 ⟶ 2,881:
Factor has a comprehensive prettyprinter which can print any Factor object in a readable way. Not only can we leverage it to easily print our native data structure, but we can also call <code>unparse</code> to convert it to a string. This leaves us with a string reminiscent of the original input, and we are able to take it the rest of the way with two simple regular expressions.
 
<langsyntaxhighlight lang="factor">USING: formatting kernel math.parser multiline peg peg.ebnf
regexp sequences prettyprint words ;
IN: rosetta-code.s-expressions
Line 2,760 ⟶ 2,907:
sexp>seq dup seq>sexp
"Native:\n%u\n\nRound trip:\n%s\n" printf
] bi</langsyntaxhighlight>
{{out}}
<pre>
Line 2,779 ⟶ 2,926:
 
=={{header|Go}}==
<langsyntaxhighlight lang="go">package main
 
import (
Line 2,952 ⟶ 3,099:
fmt.Println(s.i)
}
}</langsyntaxhighlight>
{{out}}
<pre>
Line 2,980 ⟶ 3,127:
 
=={{header|Haskell}}==
<langsyntaxhighlight lang="haskell">import qualified Data.Functor.Identity as F
import qualified Text.Parsec.Prim as Prim
import Text.Parsec
Line 3,019 ⟶ 3,166:
"((data \"quoted data\" 123 4.5)\n (data (!@# (4.5) \"(more\" \"data)\")))"
putStrLn ("The input:\n" ++ expr ++ "\n\nParsed as:")
p expr</langsyntaxhighlight>
{{Out}}
<pre>The input:
Line 3,030 ⟶ 3,177:
 
Or, parsing by hand (rather than with a parser combinator library) and printing a parse tree diagram:
<syntaxhighlight lang="haskell">{-# LANGUAGE TupleSections #-}
<lang haskell>import Data.Bifunctor (bimap)
import Data.Bifunctor (bimap)
import Data.List (mapAccumL)
import Data.List.Split (splitOn)
import Data.Maybe (catMaybes, fromMaybe, listToMaybe)
import Data.Tree (Forest, Tree (..), drawForest)
 
------------------------ DATA TYPE -----------------------
 
data Val
= Int Integer
Line 3,044 ⟶ 3,193:
| List [Val]
deriving (Eq, Show, Read)
 
instance Semigroup Val where
List a <> List b = List (a <> b)
instance Monoid Val where
mempty = List []
--------------------------- MAIN -------------------------
main :: IO ()
Line 3,053 ⟶ 3,208:
" (data (!@# (4.5) \"(more\" \"data)\")))"
]
parse = fst (parseExpr (tokenized expr))
putStrLn $
drawForest $
putStrLn $ treeDiagram $ forestFromVal parse
fmap show
putStrLn "Serialized from the parse tree:\n"
<$> fst (parseExpr (tokenized expr))
putStrLn $ litVal parse
 
------------------- S-EXPRESSION PARSER ------------------
 
parseExpr :: [String] -> ([Tree Val], [String])
parseExpr = gountil finished parseToken . (mempty,)
where
go tokens = until finished parseToken:: (Val, [String],) tokens)-> Bool
finished (_, []) = True
 
finished (_, resttoken : _) = null rest || ")" == head resttoken
 
parseToken parseToken:: (treesVal, [String]) =-> (treesVal, [String])
parseToken (treesv, "(" : rest) =
bimap
((treesv <>) . returnList . Node (Symbol "List")return)
tail
(goparseExpr rest)
parseToken (treesv, ")" : rest) = (treesv, rest)
parseToken (treesv, t : rest) = (v <> List [atom t], rest)
(trees <> [Node (atom t) []], rest)
 
----------------------- TOKEN PARSER ---------------------
 
atom :: String -> Val
atom [] = List []mempty
atom s@('"' : _) =
fromMaybe (List [])mempty (maybeRead ("String " <> s))
atom s = firstParse parses
headDef (Symbol s) $
where
catMaybes $
firstParse (x : _) = x
maybeRead . (<> (' ' : s)) <$> ["Int", "Float"]
parses =
catMaybes $
maybeRead
<$> ( fmap
(<> (' ' : s))
["Int", "Integer", "Float", "Double"]
<> ["Symbol \"" <> s <> "\""]
)
 
maybeRead :: String -> Maybe Val
maybeRead = fmap fst . listToMaybe . reads
 
----------------------- TOKENIZATION ---------------------
 
tokenized :: String -> [String]
tokenized s = quoteTokens '"' s >>= go
Line 3,106 ⟶ 3,254:
go token@('"' : _) = [token]
go s = words $ spacedBrackets s
 
quoteTokens :: Char -> String -> [String]
quoteTokens q s = zipsnd [0$ ..]mapAccumL go False (splitOn [q] s) >>= go
where
go (_,b []) = []s
go | b = (iFalse, k'"' : s <> "\"")
| even iotherwise = [k](True, s)
| otherwise = [q : k <> [q]]
 
spacedBrackets :: String -> String
spacedBrackets [] = []
spacedBrackets (c : cs)
| c `elem` "()" = ' ' : c : " " <> spacedBrackets cs
| otherwise = c : spacedBrackets cs</lang>
------------------------- DIAGRAMS -----------------------
treeDiagram :: Forest Val -> String
treeDiagram = drawForest . fmap (fmap show)
forestFromVal :: Val -> Forest Val
forestFromVal (List xs) = treeFromVal <$> xs
treeFromVal :: Val -> Tree Val
treeFromVal (List xs) =
Node (Symbol "List") (treeFromVal <$> xs)
treeFromVal v = Node v []
---------------------- SERIALISATION ---------------------
litVal (Symbol x) = x
litVal (Int x) = show x
litVal (Float x) = show x
litVal (String x) = '"' : x <> "\""
litVal (List [List xs]) = litVal (List xs)
litVal (List xs) = '(' : (unwords (litVal <$> xs) <> ")")
------------------------- GENERIC ------------------------
headDef :: a -> [a] -> a
headDef d [] = d
headDef _ (x : _) = x</syntaxhighlight>
{{Out}}
<pre>Symbol "List"
Line 3,147 ⟶ 3,322:
+- String "(more"
|
`- String "data)"</pre>
 
 
Serialized from the parse tree:
 
((data "quoted data" 123 4.5) (data (!@# (4.5) "(more" "data)")))</pre>
 
=={{header|Icon}} and {{header|Unicon}}==
Line 3,155 ⟶ 3,335:
The example takes single and double quotes. <br>
Single quotes were used instead of doubles in the input.
<langsyntaxhighlight Iconlang="icon">link ximage
 
procedure main()
Line 3,214 ⟶ 3,394:
}
return T
end</langsyntaxhighlight>
 
{{libheader|Icon Programming Library}}
Line 3,245 ⟶ 3,425:
This implementation does not support escape characters. If escape characters were added, we would need additional support in the tokenizer (an extra character class, and in the state table an extra column and two extra rows, or almost double the number of state transitions: 35 instead of 20), and additional support in the data language (unfmt would need to strip out escape characters and fmt would need to insert escape characters -- so each of these routines would also perhaps double in size.) And that's a lot of bulk for serialize/deserialize mechanism which, by design, cannot represent frequently used data elements (such as matrices and gerunds).
 
<langsyntaxhighlight lang="j">NB. character classes: 0: paren, 1: quote, 2: whitespace, 3: wordforming (default)
chrMap=: '()';'"';' ',LF,TAB,CR
 
Line 3,296 ⟶ 3,476:
 
readSexpr=: fmt L:0 @rdSexpr :.writeSexpr
writeSexpr=: wrSexpr @(unfmt L:0) :.readSexpr</langsyntaxhighlight>
 
 
Example use:
 
<langsyntaxhighlight lang="j"> readSexpr '((data "quoted data" 123 4.5) (data (!@# (4.5) "(more" "data)")))'
┌───────────────────────────┬────────────────────────────────┐
│┌─────┬───────────┬───┬───┐│┌─────┬────────────────────────┐│
Line 3,312 ⟶ 3,492:
└───────────────────────────┴────────────────────────────────┘
writeSexpr readSexpr '((data "quoted data" 123 4.5) (data (!@# (4.5) "(more" "data)")))'
((data "quoted data" 123 4.5) (data (!@# (4.5) "(more" "data)")))</langsyntaxhighlight>
 
=={{header|Java}}==
Line 3,322 ⟶ 3,502:
 
====LispTokenizer.java====
<langsyntaxhighlight lang="java">package jfkbits;
 
import java.io.BufferedReader;
Line 3,432 ⟶ 3,612:
{
}
}</langsyntaxhighlight>
 
====Token.java====
<langsyntaxhighlight lang="java">package jfkbits;
import java.io.StreamTokenizer;
 
Line 3,463 ⟶ 3,643:
}
}
}</langsyntaxhighlight>
 
====Atom.java====
<langsyntaxhighlight lang="java">package jfkbits;
 
import jfkbits.LispParser.Expr;
Line 3,482 ⟶ 3,662:
}
 
}</langsyntaxhighlight>
 
====StringAtom.java====
<langsyntaxhighlight lang="java">package jfkbits;
 
public class StringAtom extends Atom
Line 3,505 ⟶ 3,685:
}
}
</syntaxhighlight>
</lang>
 
====ExprList.java====
<langsyntaxhighlight lang="java">package jfkbits;
 
import java.util.AbstractCollection;
Line 3,578 ⟶ 3,758:
}
 
}</langsyntaxhighlight>
 
====LispParser.java====
<langsyntaxhighlight lang="java">package jfkbits;
 
 
Line 3,628 ⟶ 3,808:
 
}
</syntaxhighlight>
</lang>
 
====LispParserDemo.java====
<langsyntaxhighlight lang="java">import jfkbits.ExprList;
import jfkbits.LispParser;
import jfkbits.LispParser.ParseException;
Line 3,656 ⟶ 3,836:
}
}
}</langsyntaxhighlight>
 
=={{header|JavaScript}}==
(for a '''bug-fix''' concerning \" and \n in strings see the [[Talk:S-expressions#JavaScript_version_bugfix_for_%5C%22_and_%5Cn_in_strings|Discussion]])
===Procedural===
<langsyntaxhighlight JavaScriptlang="javascript">String.prototype.parseSexpr = function() {
var t = this.match(/\s*("[^"]*"|\(|\)|"|[^\s()"]+)/g)
for (var o, c=0, i=t.length-1; i>=0; i--) {
Line 3,697 ⟶ 3,878:
document.write('Invalid s-expr!', '<br>')
else
document.write('s-expr:<br>', sexpr, '<br><br>', sexpr.constructor != Array ? '' : 'pretty print:<br>' + sexpr.toPretty())</langsyntaxhighlight>
{{out}}
<pre>text:
Line 3,729 ⟶ 3,910:
 
===Functional===
Showing the parse tree in an indented JSON format, and writing out a reserialization:
Generating a diagram of the parse tree:
<langsyntaxhighlight JavaScriptlang="javascript">(() => {
"use strict";
 
// ------------------ S-EXPRESSION PARSINGEXPRESSIONS ------------------
const main = () => {
const expr = [
"((data \"quoted data\" 123 4.5)",
" (data (!@# (4.5) \"(more\" \"data)\")))"
]
.join("\n");
 
const [parse, residue] = parseExpr(
// Expr = (Float | Int | String | Symbol)
tokenized(expr)
);
 
return 0 < residue.length
// parseExpr [String] -> ([Tree Expr], [String])
? `Unparsed tokens: ${JSON.stringify(residue)}`
const parseExpr = tokens => {
: 0 < parse.length
? [
JSON.stringify(parse, null, 2),
"Reserialized from parse:",
parse.map(serialized).join(" ")
]
.join("\n\n")
: "Could not be parsed";
};
 
// ---------------- EXPRESSION PARSER ----------------
 
// parseExpr [String] -> ([Expr], [String])
const parseExpr = tokens =>
// A tuple of (parsed trees, residual tokens)
// derived from a list of tokens.
const p = until([, tkns]finished) =>(readToken)([
//[], Tokens depleted or closing bracket reached ?tokens
]);
0 === tkns.length || ")" === tkns[0];
 
const f = ([trees, tkns]) =>
0 < tkns.length ? (() => {
const [t, ...ts] = tkns;
 
// finished :: ([Expr], [String]) -> Bool
// Subforests are introduced by brackets,
const finished = ([, tokens]) =>
return "(" === t ? (
// True if no tokens remain, or the bimap(next
// closes a xs => treessub-expression.concat(
0 === tokens.length || ")" === tokens[0];
Node(symbolName("List"))(xs)
)
)(
xs => xs.slice(1)
)(
// recursively,
go(ts)
)
 
// and conclude where brackets close.
) : ")" === t ? (
Tuple(trees)(ts)
 
// readToken :: ([Expr], [String]) -> ([Expr], [String])
// Other tokens are appended leaves.
const readToken = ([xs, tokens]) :=> Tuple({
// A tuple of enriched expressions trees.concat(and
// depleted tokens.
Node(atom(t))([])
const [token, ...ts] = )tokens;
)(ts);
})() : Tuple(trees)(tkns);
 
const// goAn =open tsbracket =>introduces until(p)(f)(recursion over
// a sub-expression to Tuple([])(ts)define a sub-list.
);return "(" === token
? (() => {
const [expr, rest] = parseExpr(ts);
 
return go[xs.concat([expr]), rest.slice(tokens1)];
})()
: ")" === token
? [xs, token]
: [xs.concat(atom(token)), ts];
};
 
// ------------------- ATOM PARSER -------------------
 
// atom :: String -> Expr
const atom = s =>
0 < s.length ? (
? isNaN(s) ? (
? "\"'".includes(s[0]) ? (
? s.slice(1, -1)
) : {name: symbolName(s)}
) : parseFloat(s, 10)
) : "";
 
 
// symbolName :: String -> Symbol String
const symbolName = s => ({
"type": "symbol",
"name": s
});
 
 
Line 3,806 ⟶ 3,994:
// Brackets and quoted or unquoted atomic strings.
quoteTokens("\"")(s).flatMap(
segment => "\"" !== segment[0] ? (
? segment.replace(/([()])/gu, " $1 ")
.split(/\s+/u)
.filter(Boolean)
) : [segment]
);
 
Line 3,818 ⟶ 4,006:
// Alternating unquoted and quoted segments.
s => s.split(q).flatMap(
(k, i) => even(i) ? (
Boolean(k) ? (0 < k.length
? [k]
) : []
) : [`${q}${k}${q}`]
);
 
// ------------------ SERIALIZATION ------------------
 
// serialized :: Expr -> String
// ---------------------- TEST -----------------------
const mainserialized = ()e => {
const exprt = [typeof e;
"((data \"quoted data\" 123 4.5)",
" (data (!@# (4.5) \"(more\" \"data)\")))"
].join("\n");
 
constreturn [forest, unparsedTokens]"number" === parseExpr(t
tokenized(expr)? `${e}`
); : "string" === t
? `"${e}"`
 
return 0 < unparsedTokens.length ? ( : "object" === t
`Unparsed residue: ${JSON ? Array.stringifyisArray(unparsedTokense)}`
? `(${e.map(serialized).join(" ")})`
) : drawForest(
forest : e.map(fmapTree(showAtom))name
) : "?";
};
 
const showAtom = x =>
x.type !== "symbol" ? (
JSON.stringify(x)
) : `<${x.name}>`;
 
// --------------------- GENERIC ---------------------
 
// Node :: a -> [Tree a] -> Tree a
const Node = v =>
// Constructor for a Tree node which connects a
// value of some kind to a list of zero or
// more child trees.
xs => ({
type: "Node",
root: v,
nest: xs || []
});
 
 
// Tuple (,) :: a -> b -> (a, b)
const Tuple = a =>
b => ({
type: "Tuple",
"0": a,
"1": b,
length: 2,
*[Symbol.iterator]() {
for (const k in this) {
if (!isNaN(k)) {
yield this[k];
}
}
}
});
 
 
// bimap :: (a -> b) -> (c -> d) -> (a, c) -> (b, d)
const bimap = f =>
// Tuple instance of bimap.
// A tuple of the application of f and g to the
// first and second values respectively.
g => ([a, b]) => Tuple(f(a))(g(b));
 
 
// draw :: Tree String -> [String]
const draw = node => {
// shift :: String -> String -> [String] -> [String]
const shifted = (first, other, xs) => (
[first].concat(
Array.from({
length: xs.length - 1
}, () => other)
).map(
(y, i) => y.concat(xs[i])
)
);
// drawSubTrees :: [Tree String] -> [String]
const drawSubTrees = xs => {
const lng = xs.length;
 
return 0 < lng ? (
1 < lng ? (
["│"].concat(
shifted("├─ ", "│ ", draw(xs[0]))
)
).concat(
drawSubTrees(xs.slice(1))
) : ["│"].concat(
shifted("└─ ", " ", draw(xs[0]))
)
) : [];
};
 
return node.root.split("\n").concat(
drawSubTrees(node.nest)
);
};
 
 
// drawForest :: [Tree String] -> String
const drawForest = trees =>
trees.map(drawTree).join("\n");
 
 
// drawTree :: Tree String -> String
const drawTree = tree =>
draw(tree).join("\n");
 
 
// even :: Int -> Bool
Line 3,937 ⟶ 4,037:
// True if 2 is a factor of n.
0 === n % 2;
 
 
// fmapTree :: (a -> b) -> Tree a -> Tree b
const fmapTree = f => {
// A new tree. The result of a
// structure-preserving application of f
// to each root in the existing tree.
const go = t => Node(
f(t.root)
)(
t.nest.map(go)
);
 
return go;
};
 
 
Line 3,961 ⟶ 4,046:
f => {
const go = x =>
p(x) ? (
? x
) : go(f(x));
 
return go;
Line 3,969 ⟶ 4,054:
 
return main();
})();</langsyntaxhighlight>
{{Out}}
<pre><List>[
[
[
├─ <List>
{
│ │
├─ < "name": "data>"
},
│ │
├─ "quoted data",
123,
│ │
4.5
│ ├─ 123
],
[
│ └─ 4.5
{
"name": "data"
└─ <List>
},
├─ <data> [
{
"name": "!@#"
└─ <List>
},
├─ <!@#> [
4.5
├─ <List> ],
"(more",
└─ 4.5"data)"
]
]
├─ "(more"
]
]
└─ "data)"</pre>
 
Reserialized from parse:
 
((data "quoted data" 123 4.5) (data (!@# (4.5) "(more" "data)")))</pre>
 
=={{header|jq}}==
{{works with|jq}}
'''Also works with gojq, the Go implementation of jq'''
[[Category:PEG]]
This entry is based on a Parsing Expression Grammar (PEG) for S-expressions.
The idea is to pass a JSON object `{remainder:_, result:_ }` through a
jq pipeline corresponding to a PEG for S-expressions, consuming the
text in `.remainder` and building up `.result`.
For further details about this approach, see e.g.
[https://github.com/stedolan/jq/wiki/Parsing-Expression-Grammars jq as a PEG Engine].
<syntaxhighlight lang=jq>
# PEG infrastructure
def star(E): ((E | star(E)) // .) ;
 
### Helper functions:
# Consume a regular expression rooted at the start of .remainder, or emit empty;
# on success, update .remainder and set .match but do NOT update .result
def consume($re):
# on failure, match yields empty
(.remainder | match("^" + $re)) as $match
| .remainder |= .[$match.length :]
| .match = $match.string;
 
def parse($re):
consume($re)
| .result = .result + [.match] ;
 
def parseNumber($re):
consume($re)
| .result = .result + [.match|tonumber] ;
 
def eos: select(.remainder == "");
 
# whitespace
def ws: consume("[ \t\r\n]*");
 
def box(E):
((.result = null) | E) as $e
| .remainder = $e.remainder
| .result += [$e.result] # the magic sauce
;
 
# S-expressions
 
# Input: a string
# Output: an array representation of the input if it is an S-expression
def SExpression:
def string: consume("\"") | parse("[^\"]") | consume("\"");
def identifier: parse("[^ \t\n\r()]+");
def decimal: parseNumber("[0-9]+([.][0-9]*)?");
def hex: parse("0x[0-9A-Fa-f]+") ;
def number: hex // decimal;
def atom: ws | (string // number // identifier);
 
def SExpr: ws | consume("[(]") | ws | box(star(atom // SExpr)) | consume("[)]");
 
{remainder: .} | SExpr | ws | eos | .result;
 
SExpression
</syntaxhighlight>
'''Invocation:'''
<pre>
cat << EOF |
((data "quoted data" 123 4.5)
(data (!@# (4.5) "(more" "data)")))
EOF
jq -Rsc -f s-expression.jq
</pre>
{{output}}
<pre>
[[["data","\"quoted","data\"",123,4.5],["data",["!@#",[4.5],"\"",["more\"","\"data"],"\""]]]]
</pre>
 
=={{header|Julia}}==
<langsyntaxhighlight lang="julia">
function rewritequotedparen(s)
segments = split(s, "\"")
Line 4,075 ⟶ 4,237:
println("The processed native structure is:\n", nat)
println("The reconstructed string is:\n"), printAny(nat)
</syntaxhighlight>
</lang>
{{output}}<pre>
The input string is:
Line 4,091 ⟶ 4,253:
=={{header|Kotlin}}==
{{trans|JavaScript}}
<langsyntaxhighlight lang="groovy">// version 1.2.31
 
const val INDENT = 2
Line 4,188 ⟶ 4,350:
tokens2.prettyPrint()
}
}</langsyntaxhighlight>
 
{{out}}
Line 4,246 ⟶ 4,408:
Tested with Lua 5.3.5 and LPeg 1.0.2-1.
 
<langsyntaxhighlight lang="lua">lpeg = require 'lpeg' -- see http://www.inf.puc-rio.br/~roberto/lpeg/
 
imports = 'P R S C V match'
Line 4,272 ⟶ 4,434:
'S';
S = ws * lpar * C((atom + V'S')^0) * rpar / tolist
}</langsyntaxhighlight>
 
Now to use the <i>sexpr</i> pattern:
 
<langsyntaxhighlight lang="lua">eg_input = [[
((data "quoted data" 123 4.5)
(data (!@# (4.5) "(more" "data)")))
Line 4,300 ⟶ 4,462:
check(eg_produced, eg_expected)
print("checks out!") -- won't get here if any <i>check()</i> assertion fails
</syntaxhighlight>
</lang>
 
And here's the pretty printer, whose output looks like all the others:
 
<langsyntaxhighlight lang="lua">function pprint(expr, indent)
local function prindent(fmt, expr)
io.write(indent) -- no line break
Line 4,329 ⟶ 4,491:
end
 
pprint(eg_expected, '')</langsyntaxhighlight>
 
=={{header|Nim}}==
 
<langsyntaxhighlight lang="nim">import strutils
 
const Input = """
Line 4,477 ⟶ 4,639:
else: nil
 
echo parse()</langsyntaxhighlight>
 
{{out}}
Line 4,489 ⟶ 4,651:
 
=={{header|OCaml}}==
You may be interested in this [https://dev.realworldocaml.org/data-serialization.html chapter of the book Real World OCaml].
 
You may be interested by [https://realworldocaml.org/v1/en/html/data-serialization-with-s-expressions.html this chapter of the book Real World Ocaml]
 
The file <code>SExpr.mli</code> containing the interface:
 
<langsyntaxhighlight lang="ocaml">(** This module is a very simple parsing library for S-expressions. *)
(* Copyright (C) 2009 Florent Monnier, released under MIT license. *)
 
Line 4,522 ⟶ 4,683:
 
val string_of_sexpr_indent : sexpr list -> string
(** same than [string_of_sexpr] but with indentation *)</langsyntaxhighlight>
 
The file <code>SExpr.ml</code> containing the implementation:
 
<langsyntaxhighlight lang="ocaml">(** This module is a very simple parsing library for S-expressions. *)
(* Copyright (C) 2009 Florent Monnier, released under MIT license. *)
(* modified to match the task description *)
Line 4,703 ⟶ 4,864:
 
let print_sexpr_indent s =
print_endline (string_of_sexpr_indent s)</langsyntaxhighlight>
 
Then we compile this small module and test it in the interactive loop:
Line 4,741 ⟶ 4,902:
 
=={{header|Perl}}==
<langsyntaxhighlight lang="perl">#!/usr/bin/perl -w
use strict;
use warnings;
Line 4,795 ⟶ 4,956:
ref($_) eq 'ARRAY' ? sexpr2txt($_) : $$_
} @{$_[0]} ]})}
}</langsyntaxhighlight>
Check:
<langsyntaxhighlight lang="perl">my $s = sexpr(q{
 
((data "quoted data" 123 4.5)
Line 4,809 ⟶ 4,970:
 
# Convert back
print sexpr2txt($s)."\n";</langsyntaxhighlight>
Output:
<pre>$VAR1 = [
Line 4,838 ⟶ 4,999:
that may not be clear on the display: 4e-5 and 4-e5 may appear similar but the latter is probably a parse failure. It may
be more sensible for get_term() to raise an error if the scanf fails, than assume it is a symbol like it does now.
<!--<syntaxhighlight lang="phix">(phixonline)-->
<lang Phix>constant s_expr_str = """
<span style="color: #008080;">with</span> <span style="color: #008080;">javascript_semantics</span>
((data "quoted data" 123 4.5)
<span style="color: #008080;">constant</span> <span style="color: #000000;">s_expr_str</span> <span style="color: #0000FF;">=</span> <span style="color: #008000;">"""
(data (!@# (4.5) "(more" "data)")))"""
((data "quoted data" 123 4.5)
 
(data (!@# (4.5) "(more" "data)")))"""</span>
function skip_spaces(string s, integer sidx)
while sidx<=length(s) and find(s[sidx]," \t\r\n") do sidx += 1 end while
<span style="color: #008080;">function</span> <span style="color: #000000;">skip_spaces</span><span style="color: #0000FF;">(</span><span style="color: #004080;">string</span> <span style="color: #000000;">s</span><span style="color: #0000FF;">,</span> <span style="color: #004080;">integer</span> <span style="color: #000000;">sidx</span><span style="color: #0000FF;">)</span>
return sidx
<span style="color: #008080;">while</span> <span style="color: #000000;">sidx</span><span style="color: #0000FF;"><=</span><span style="color: #7060A8;">length</span><span style="color: #0000FF;">(</span><span style="color: #000000;">s</span><span style="color: #0000FF;">)</span> <span style="color: #008080;">and</span> <span style="color: #7060A8;">find</span><span style="color: #0000FF;">(</span><span style="color: #000000;">s</span><span style="color: #0000FF;">[</span><span style="color: #000000;">sidx</span><span style="color: #0000FF;">],{</span><span style="color: #008000;">' '</span><span style="color: #0000FF;">,</span><span style="color: #008000;">'\t'</span><span style="color: #0000FF;">,</span><span style="color: #008000;">'\r'</span><span style="color: #0000FF;">,</span><span style="color: #008000;">'\n'</span><span style="color: #0000FF;">})</span> <span style="color: #008080;">do</span> <span style="color: #000000;">sidx</span> <span style="color: #0000FF;">+=</span> <span style="color: #000000;">1</span> <span style="color: #008080;">end</span> <span style="color: #008080;">while</span>
end function
<span style="color: #008080;">return</span> <span style="color: #000000;">sidx</span>
 
<span style="color: #008080;">end</span> <span style="color: #008080;">function</span>
function get_term(string s, integer sidx)
-- get a single quoted string, symbol, or number.
<span style="color: #008080;">function</span> <span style="color: #000000;">get_term</span><span style="color: #0000FF;">(</span><span style="color: #004080;">string</span> <span style="color: #000000;">s</span><span style="color: #0000FF;">,</span> <span style="color: #004080;">integer</span> <span style="color: #000000;">sidx</span><span style="color: #0000FF;">)</span>
integer ch = s[sidx]
<span style="color: #000080;font-style:italic;">-- get a single quoted string, symbol, or number.</span>
string res = ""
<span style="color: #004080;">integer</span> <span style="color: #000000;">ch</span> <span style="color: #0000FF;">=</span> <span style="color: #000000;">s</span><span style="color: #0000FF;">[</span><span style="color: #000000;">sidx</span><span style="color: #0000FF;">]</span>
if ch='\"' then
<span style="color: #004080;">string</span> <span style="color: #000000;">res</span> <span style="color: #0000FF;">=</span> <span style="color: #008000;">""</span>
res &= ch
<span style="color: #008080;">if</span> <span style="color: #000000;">ch</span><span style="color: #0000FF;">=</span><span style="color: #008000;">'\"'</span> <span style="color: #008080;">then</span>
while 1 do
<span style="color: #000000;">res</span> <span style="color: #0000FF;">&=</span> <span style="color: #000000;">ch</span>
sidx += 1
<span style="color: #008080;">while</span> <span style="color: #000000;">1</span> <span style="color: #008080;">do</span>
ch = s[sidx]
<span style="color: #000000;">sidx</span> <span style="color: #0000FF;">+=</span> <span style="color: #000000;">1</span>
res &= ch
<span style="color: #000000;">ch</span> <span style="color: #0000FF;">=</span> <span style="color: #000000;">s</span><span style="color: #0000FF;">[</span><span style="color: #000000;">sidx</span><span style="color: #0000FF;">]</span>
if ch='\\' then
<span style="color: #000000;">res</span> <span style="color: #0000FF;">&=</span> <span style="color: #000000;">ch</span>
sidx += 1
<span style="color: #008080;">if</span> <span style="color: #000000;">ch</span><span style="color: #0000FF;">=</span><span style="color: #008000;">'\\'</span> <span style="color: #008080;">then</span>
ch = s[sidx]
<span style="color: #000000;">sidx</span> <span style="color: #0000FF;">+=</span> <span style="color: #000000;">1</span>
res &= ch
<span style="color: #000000;">ch</span> <span style="color: #0000FF;">=</span> <span style="color: #000000;">s</span><span style="color: #0000FF;">[</span><span style="color: #000000;">sidx</span><span style="color: #0000FF;">]</span>
elsif ch='\"' then
<span style="color: #000000;">res</span> <span style="color: #0000FF;">&=</span> <span style="color: #000000;">ch</span>
sidx += 1
<span style="color: #008080;">elsif</span> <span style="color: #000000;">ch</span><span style="color: #0000FF;">=</span><span style="color: #008000;">'\"'</span> <span style="color: #008080;">then</span>
exit
<span style="color: #000000;">sidx</span> <span style="color: #0000FF;">+=</span> <span style="color: #000000;">1</span>
end if
<span style="color: #008080;">exit</span>
end while
<span style="color: #008080;">end</span> <span style="color: #008080;">if</span>
else
<span style="color: #008080;">end</span> <span style="color: #008080;">while</span>
integer asnumber = (ch>='0' and ch<='9')
<span style="color: #008080;">else</span>
while not find(ch,") \t\r\n") do
<span style="color: #004080;">integer</span> <span style="color: #000000;">asnumber</span> <span style="color: #0000FF;">=</span> <span style="color: #0000FF;">(</span><span style="color: #000000;">ch</span><span style="color: #0000FF;">>=</span><span style="color: #008000;">'0'</span> <span style="color: #008080;">and</span> <span style="color: #000000;">ch</span><span style="color: #0000FF;"><=</span><span style="color: #008000;">'9'</span><span style="color: #0000FF;">)</span>
res &= ch
<span style="color: #008080;">while</span> <span style="color: #008080;">not</span> <span style="color: #7060A8;">find</span><span style="color: #0000FF;">(</span><span style="color: #000000;">ch</span><span style="color: #0000FF;">,{</span><span style="color: #008000;">')'</span><span style="color: #0000FF;">,</span><span style="color: #008000;">' '</span><span style="color: #0000FF;">,</span><span style="color: #008000;">'\t'</span><span style="color: #0000FF;">,</span><span style="color: #008000;">'\r'</span><span style="color: #0000FF;">,</span><span style="color: #008000;">'\n'</span><span style="color: #0000FF;">})</span> <span style="color: #008080;">do</span>
sidx += 1
<span style="color: #000000;">res</span> <span style="color: #0000FF;">&=</span> <span style="color: #000000;">ch</span>
if sidx>length(s) then exit end if
<span style="color: #000000;">sidx</span> <span style="color: #0000FF;">+=</span> <span style="color: #000000;">1</span>
ch = s[sidx]
<span style="color: #008080;">if</span> <span style="color: #000000;">sidx</span><span style="color: #0000FF;">></span><span style="color: #7060A8;">length</span><span style="color: #0000FF;">(</span><span style="color: #000000;">s</span><span style="color: #0000FF;">)</span> <span style="color: #008080;">then</span> <span style="color: #008080;">exit</span> <span style="color: #008080;">end</span> <span style="color: #008080;">if</span>
end while
<span style="color: #000000;">ch</span> <span style="color: #0000FF;">=</span> <span style="color: #000000;">s</span><span style="color: #0000FF;">[</span><span style="color: #000000;">sidx</span><span style="color: #0000FF;">]</span>
if asnumber then
<span style="color: #008080;">end</span> <span style="color: #008080;">while</span>
sequence scanres = scanf(res,"%f")
<span style="color: #008080;">if</span> <span style="color: #000000;">asnumber</span> <span style="color: #008080;">then</span>
if length(scanres)=1 then return {scanres[1][1],sidx} end if
<span style="color: #004080;">sequence</span> <span style="color: #000000;">scanres</span> <span style="color: #0000FF;">=</span> <span style="color: #7060A8;">scanf</span><span style="color: #0000FF;">(</span><span style="color: #000000;">res</span><span style="color: #0000FF;">,</span><span style="color: #008000;">"%f"</span><span style="color: #0000FF;">)</span>
-- error? (failed to parse number)
<span style="color: #008080;">if</span> <span style="color: #7060A8;">length</span><span style="color: #0000FF;">(</span><span style="color: #000000;">scanres</span><span style="color: #0000FF;">)=</span><span style="color: #000000;">1</span> <span style="color: #008080;">then</span> <span style="color: #008080;">return</span> <span style="color: #0000FF;">{</span><span style="color: #000000;">scanres</span><span style="color: #0000FF;">[</span><span style="color: #000000;">1</span><span style="color: #0000FF;">][</span><span style="color: #000000;">1</span><span style="color: #0000FF;">],</span><span style="color: #000000;">sidx</span><span style="color: #0000FF;">}</span> <span style="color: #008080;">end</span> <span style="color: #008080;">if</span>
end if
<span style="color: #000080;font-style:italic;">-- error? (failed to parse number)</span>
end if
<span style="color: #008080;">end</span> <span style="color: #008080;">if</span>
return {res,sidx}
<span style="color: #008080;">end</span> <span style="color: #008080;">if</span>
end function
<span style="color: #008080;">return</span> <span style="color: #0000FF;">{</span><span style="color: #000000;">res</span><span style="color: #0000FF;">,</span><span style="color: #000000;">sidx</span><span style="color: #0000FF;">}</span>
 
<span style="color: #008080;">end</span> <span style="color: #008080;">function</span>
function parse_s_expr(string s, integer sidx)
integer ch = s[sidx]
<span style="color: #008080;">function</span> <span style="color: #000000;">parse_s_expr</span><span style="color: #0000FF;">(</span><span style="color: #004080;">string</span> <span style="color: #000000;">s</span><span style="color: #0000FF;">,</span> <span style="color: #004080;">integer</span> <span style="color: #000000;">sidx</span><span style="color: #0000FF;">)</span>
sequence res = {}
<span style="color: #004080;">integer</span> <span style="color: #000000;">ch</span> <span style="color: #0000FF;">=</span> <span style="color: #000000;">s</span><span style="color: #0000FF;">[</span><span style="color: #000000;">sidx</span><span style="color: #0000FF;">]</span>
object element
<span style="color: #004080;">sequence</span> <span style="color: #000000;">res</span> <span style="color: #0000FF;">=</span> <span style="color: #0000FF;">{}</span>
if ch!='(' then ?9/0 end if
<span style="color: #004080;">object</span> <span style="color: #000000;">element</span>
sidx += 1
<span style="color: #008080;">if</span> <span style="color: #000000;">ch</span><span style="color: #0000FF;">!=</span><span style="color: #008000;">'('</span> <span style="color: #008080;">then</span> <span style="color: #0000FF;">?</span><span style="color: #000000;">9</span><span style="color: #0000FF;">/</span><span style="color: #000000;">0</span> <span style="color: #008080;">end</span> <span style="color: #008080;">if</span>
while 1 do
<span style="color: #000000;">sidx</span> <span style="color: #0000FF;">+=</span> <span style="color: #000000;">1</span>
sidx = skip_spaces(s,sidx)
<span style="color: #008080;">while</span> <span style="color: #000000;">1</span> <span style="color: #008080;">do</span>
-- error? (if past end of string/missing ')')
<span style="color: #000000;">sidx</span> <span style="color: #0000FF;">=</span> <span style="color: #000000;">skip_spaces</span><span style="color: #0000FF;">(</span><span style="color: #000000;">s</span><span style="color: #0000FF;">,</span><span style="color: #000000;">sidx</span><span style="color: #0000FF;">)</span>
ch = s[sidx]
<span style="color: #000080;font-style:italic;">-- error? (if past end of string/missing ')')</span>
if ch=')' then exit end if
<span style="color: #000000;">ch</span> <span style="color: #0000FF;">=</span> <span style="color: #000000;">s</span><span style="color: #0000FF;">[</span><span style="color: #000000;">sidx</span><span style="color: #0000FF;">]</span>
if ch='(' then
<span style="color: #008080;">if</span> <span style="color: #000000;">ch</span><span style="color: #0000FF;">=</span><span style="color: #008000;">')'</span> <span style="color: #008080;">then</span> <span style="color: #008080;">exit</span> <span style="color: #008080;">end</span> <span style="color: #008080;">if</span>
{element,sidx} = parse_s_expr(s,sidx)
<span style="color: #008080;">if</span> <span style="color: #000000;">ch</span><span style="color: #0000FF;">=</span><span style="color: #008000;">'('</span> <span style="color: #008080;">then</span>
else
<span style="color: #0000FF;">{</span><span style="color: #000000;">element</span><span style="color: #0000FF;">,</span><span style="color: #000000;">sidx</span><span style="color: #0000FF;">}</span> <span style="color: #0000FF;">=</span> <span style="color: #000000;">parse_s_expr</span><span style="color: #0000FF;">(</span><span style="color: #000000;">s</span><span style="color: #0000FF;">,</span><span style="color: #000000;">sidx</span><span style="color: #0000FF;">)</span>
{element,sidx} = get_term(s,sidx)
<span style="color: #008080;">else</span>
end if
<span style="color: #0000FF;">{</span><span style="color: #000000;">element</span><span style="color: #0000FF;">,</span><span style="color: #000000;">sidx</span><span style="color: #0000FF;">}</span> <span style="color: #0000FF;">=</span> <span style="color: #000000;">get_term</span><span style="color: #0000FF;">(</span><span style="color: #000000;">s</span><span style="color: #0000FF;">,</span><span style="color: #000000;">sidx</span><span style="color: #0000FF;">)</span>
res = append(res,element)
<span style="color: #008080;">end</span> <span style="color: #008080;">if</span>
end while
<span style="color: #000000;">res</span> <span style="color: #0000FF;">=</span> <span style="color: #7060A8;">append</span><span style="color: #0000FF;">(</span><span style="color: #000000;">res</span><span style="color: #0000FF;">,</span><span style="color: #000000;">element</span><span style="color: #0000FF;">)</span>
sidx = skip_spaces(s,sidx+1)
<span style="color: #008080;">end</span> <span style="color: #008080;">while</span>
return {res,sidx}
<span style="color: #000000;">sidx</span> <span style="color: #0000FF;">=</span> <span style="color: #000000;">skip_spaces</span><span style="color: #0000FF;">(</span><span style="color: #000000;">s</span><span style="color: #0000FF;">,</span><span style="color: #000000;">sidx</span><span style="color: #0000FF;">+</span><span style="color: #000000;">1</span><span style="color: #0000FF;">)</span>
end function
<span style="color: #008080;">return</span> <span style="color: #0000FF;">{</span><span style="color: #000000;">res</span><span style="color: #0000FF;">,</span><span style="color: #000000;">sidx</span><span style="color: #0000FF;">}</span>
 
<span style="color: #008080;">end</span> <span style="color: #008080;">function</span>
sequence s_expr
integer sidx
<span style="color: #004080;">sequence</span> <span style="color: #000000;">s_expr</span>
{s_expr,sidx} = parse_s_expr(s_expr_str,1)
<span style="color: #004080;">integer</span> <span style="color: #000000;">sidx</span>
if sidx<=length(s_expr_str) then
<span style="color: #0000FF;">{</span><span style="color: #000000;">s_expr</span><span style="color: #0000FF;">,</span><span style="color: #000000;">sidx</span><span style="color: #0000FF;">}</span> <span style="color: #0000FF;">=</span> <span style="color: #000000;">parse_s_expr</span><span style="color: #0000FF;">(</span><span style="color: #000000;">s_expr_str</span><span style="color: #0000FF;">,</span><span style="color: #000000;">1</span><span style="color: #0000FF;">)</span>
printf(1,"incomplete parse(\"%s\")\n",{s_expr_str[sidx..$]})
<span style="color: #008080;">if</span> <span style="color: #000000;">sidx</span><span style="color: #0000FF;"><=</span><span style="color: #7060A8;">length</span><span style="color: #0000FF;">(</span><span style="color: #000000;">s_expr_str</span><span style="color: #0000FF;">)</span> <span style="color: #008080;">then</span>
end if
<span style="color: #7060A8;">printf</span><span style="color: #0000FF;">(</span><span style="color: #000000;">1</span><span style="color: #0000FF;">,</span><span style="color: #008000;">"incomplete parse(\"%s\")\n"</span><span style="color: #0000FF;">,{</span><span style="color: #000000;">s_expr_str</span><span style="color: #0000FF;">[</span><span style="color: #000000;">sidx</span><span style="color: #0000FF;">..$]})</span>
 
<span style="color: #008080;">end</span> <span style="color: #008080;">if</span>
puts(1,"\nThe string:\n")
?s_expr_str
<span style="color: #7060A8;">puts</span><span style="color: #0000FF;">(</span><span style="color: #000000;">1</span><span style="color: #0000FF;">,</span><span style="color: #008000;">"\nThe string:\n"</span><span style="color: #0000FF;">)</span>
 
<span style="color: #0000FF;">?</span><span style="color: #000000;">s_expr_str</span>
puts(1,"\nDefault pretty printing:\n")
--?s_expr
<span style="color: #7060A8;">puts</span><span style="color: #0000FF;">(</span><span style="color: #000000;">1</span><span style="color: #0000FF;">,</span><span style="color: #008000;">"\nDefault pretty printing:\n"</span><span style="color: #0000FF;">)</span>
pp(s_expr)
<span style="color: #000080;font-style:italic;">--?s_expr</span>
 
<span style="color: #7060A8;">pp</span><span style="color: #0000FF;">(</span><span style="color: #000000;">s_expr</span><span style="color: #0000FF;">)</span>
puts(1,"\nBespoke pretty printing:\n")
--ppEx(s_expr,{pp_Nest,1,pp_StrFmt,-1,pp_IntCh,false,pp_Brkt,"()"})
<span style="color: #7060A8;">puts</span><span style="color: #0000FF;">(</span><span style="color: #000000;">1</span><span style="color: #0000FF;">,</span><span style="color: #008000;">"\nBespoke pretty printing:\n"</span><span style="color: #0000FF;">)</span>
ppEx(s_expr,{pp_Nest,4,pp_StrFmt,-1,pp_IntCh,false,pp_Brkt,"()"})</lang>
<span style="color: #000080;font-style:italic;">--ppEx(s_expr,{pp_Nest,1,pp_StrFmt,-1,pp_IntCh,false,pp_Brkt,"()"})</span>
<span style="color: #7060A8;">ppEx</span><span style="color: #0000FF;">(</span><span style="color: #000000;">s_expr</span><span style="color: #0000FF;">,{</span><span style="color: #004600;">pp_Nest</span><span style="color: #0000FF;">,</span><span style="color: #000000;">4</span><span style="color: #0000FF;">,</span><span style="color: #004600;">pp_StrFmt</span><span style="color: #0000FF;">,-</span><span style="color: #000000;">1</span><span style="color: #0000FF;">,</span><span style="color: #004600;">pp_IntCh</span><span style="color: #0000FF;">,</span><span style="color: #004600;">false</span><span style="color: #0000FF;">,</span><span style="color: #004600;">pp_Brkt</span><span style="color: #0000FF;">,</span><span style="color: #008000;">"()"</span><span style="color: #0000FF;">})</span>
<!--</syntaxhighlight>-->
{{out}}
<pre>
Line 4,945 ⟶ 5,109:
=={{header|PicoLisp}}==
The '[http://software-lab.de/doc/refA.html#any any]' function parses an s-expression from a string (indentical to the way '[http://software-lab.de/doc/refR.html#read read]' does this from an input stream).
<langsyntaxhighlight PicoLisplang="picolisp">: (any "((data \"quoted data\" 123 4.5) (data (!@# (4.5) \"(more\" \"data)\")))")
-> ((data "quoted data" 123 5) (data (!@# (5) "(more" "data)")))
 
Line 4,965 ⟶ 5,129:
+-- "(more"
|
+-- "data)"</langsyntaxhighlight>
Implementing a subset of 'any' explicitly:
<langsyntaxhighlight PicoLisplang="picolisp">(de readSexpr ()
(case (skip)
("(" (char) (readList))
Line 4,992 ⟶ 5,156:
(until (or (sp? (peek)) (member (peek) '("(" ")")))
(link (char)) ) )
(or (format X) (intern (pack X))) ) )</langsyntaxhighlight>
It can be used in a pipe to read from a string:
<langsyntaxhighlight PicoLisplang="picolisp">: (pipe (prin "((data \"quoted data\" 123 4.5) (data (!@# (4.5) \"(more\" \"data)\")))") (readSexpr))
-> ((data "quoted data" 123 5) (data (!@# (5) "(more" "data)")))</langsyntaxhighlight>
'[http://software-lab.de/doc/refS.html#sym sym]' does the reverse (i.e. builds a symbol (string) from an expression).
<langsyntaxhighlight PicoLisplang="picolisp">: (sym @@)
-> "((data \"quoted data\" 123 5) (data (!@# (5) \"(more\" \"data)\")))"</langsyntaxhighlight>
Implementing a subset of the built-in printer:
<langsyntaxhighlight PicoLisplang="picolisp">(de printSexpr (Expr Fun)
(cond
((pair Expr)
Line 5,013 ⟶ 5,177:
(mapc Fun (chop Expr))
(Fun "\"") )
(T (mapc Fun (chop Expr))) ) )</langsyntaxhighlight>
This can be used for plain printing
<langsyntaxhighlight PicoLisplang="picolisp">: (printSexpr
'((data "quoted data" 123 4.5) (data (!@# (4.5) "(more" "data)")))
prin )
((data "quoted data" 123 5) (data (!@# (5) "(more" "data)")))</langsyntaxhighlight>
or to collect the characters into a string:
<langsyntaxhighlight PicoLisplang="picolisp">: (pack
(make
(printSexpr
'((data "quoted data" 123 4.5) (data (!@# (4.5) "(more" "data)")))
link ) ) )
-> "((data \"quoted data\" 123 5) (data (!@# (5) \"(more\" \"data)\")))"</langsyntaxhighlight>
 
=={{header|Pike}}==
<langsyntaxhighlight lang="pike">class Symbol(string name)
{
string _sprintf(int type)
Line 5,133 ⟶ 5,297:
string input = "((data \"quoted data\" 123 4.5)\n (data (!@# (4.5) \"(more\" \"data)\")))";
array data = group(tokenizer(input))[0];
string output = sexp(data);</langsyntaxhighlight>
 
Output:
Line 5,145 ⟶ 5,309:
=={{header|Potion}}==
How values are stored: Tuples for list, integers for integers, floats for floats, strings for symbols, quoted strings for strings. This implementation is not the most elegant/succinct or practical (it's trusty and has no real error handling).
<langsyntaxhighlight lang="potion">isdigit = (c): 47 < c ord and c ord < 58.
iswhitespace = (c): c ord == 10 or c ord == 13 or c == " ".
 
Line 5,216 ⟶ 5,380:
parsesexpr("((data \"quoted data\" 123 4.5)
(data (!@# (4.5) \"(more\" \"data)\")))") string print
"\n" print</langsyntaxhighlight>
 
=={{header|Python}}==
===Procedural===
<lang python>import re
<syntaxhighlight lang="python">import re
 
dbg = False
Line 5,278 ⟶ 5,443:
print("\nParsed to Python:", parsed)
 
print("\nThen back to: '%s'" % print_sexp(parsed))</langsyntaxhighlight>
 
;Output:
Line 5,289 ⟶ 5,454:
;Simpler parser:
Note that in the example above the parser also recognises and changes the type of some tokens as well as generating a nested list. If that functionality is not needed, or better done elsewhere, then the parse function can be achieved more simply by just applying the regexp:
<langsyntaxhighlight lang="python">>>> from pprint import pprint as pp
>>> x = [[(t,v) for t,v in termtypes.groupdict().items() if v][0] for termtypes in re.finditer(term_regex, sexp)]
>>> pp(x)
Line 5,311 ⟶ 5,476:
('brackr', ')'),
('brackr', ')')]
>>> </langsyntaxhighlight>
 
===Functional===
Composing functionally, and writing out a tree diagram, and a serialization, of the parse.
<syntaxhighlight lang="python">'''S-expressions'''
 
from itertools import chain, repeat
import re
 
 
def main():
'''Sample s-expression parsed, diagrammed,
and reserialized from the parse tree.
'''
expr = "((data \"quoted data\" 123 4.5)\n" + (
" (data (!@# (4.5) \"(more\" \"data)\")))"
)
parse = parseExpr(tokenized(expr))[0]
print(
drawForest([
fmapTree(str)(tree) for tree
in forestFromExprs(parse)
])
)
print(
f'\nReserialized from parse:\n\n{serialized(parse)}'
)
 
 
# ----------------- S-EXPRESSION PARSER ------------------
 
# parseExpr :: [String] -> ([Expr], [String]
def parseExpr(tokens):
'''A tuple of a nested list with any
unparsed tokens that remain.
'''
return until(finished)(parseToken)(
([], tokens)
)
 
 
# finished :: ([Expr], [String]) -> Bool
def finished(xr):
'''True if no tokens remain,
or the next token is a closing bracket.
'''
r = xr[1]
return (not r) or (r[0] == ")")
 
 
# parseToken :: ([Expr], [String]) -> ([Expr], [String])
def parseToken(xsr):
'''A tuple of an expanded expression list
and a reduced token list.
'''
xs, r = xsr
h, *t = r
if "(" == h:
expr, rest = parseExpr(t)
return xs + [expr], rest[1:]
else:
return (xs, t) if ")" == h else (
xs + [atom(h)], t
)
 
# --------------------- ATOM PARSER ----------------------
 
# atom :: String -> Expr
def atom(s):
'''A Symbol, String, Float, or Int derived from s.
Symbol is represented as a dict with a 'name' key.
'''
def n(k):
return float(k) if '.' in k else int(k)
 
return s if '"' == s[0] else (
n(s) if s.replace('.', '', 1).isdigit() else {
"name": s
}
)
 
 
# --------------------- TOKENIZATION ---------------------
 
# tokenized :: String -> [String]
def tokenized(s):
'''A list of the tokens in s.
'''
return list(chain.from_iterable(map(
lambda token: [token] if '"' == token[0] else (
x for x in re.split(
r'\s+',
re.sub(r"([()])", r" \1 ", token)
) if x
) if token else [], (
x if (0 == i % 2) else f'"{x}"'
for (i, x) in enumerate(s.split('"'))
)
)))
 
 
# -------------------- SERIALIZATION ---------------------
 
# serialized :: Expr -> String
def serialized(e):
'''An s-expression written out from the parse tree.
'''
k = typename(e)
 
return str(e) if k in ['int', 'float', 'str'] else (
(
f'({" ".join([serialized(x) for x in e])})' if (
(1 < len(e)) or ('list' != typename(e[0]))
) else serialized(e[0])
) if 'list' == k else (
e.get("name") if 'dict' == k else "?"
)
)
 
 
# typename :: a -> String
def typename(x):
'''Name property of the type of a value.'''
return type(x).__name__
 
 
# ------------------- TREE DIAGRAMMING -------------------
 
# Node :: a -> [Tree a] -> Tree a
def Node(v):
'''Constructor for a Tree node which connects a
value of some kind to a list of zero or
more child trees.
'''
return lambda xs: {'type': 'Tree', 'root': v, 'nest': xs}
 
 
# append :: [a] -> [a] -> [a]
def append(a, b):
'''Concatenation.'''
return a + b
 
 
# draw :: Tree a -> [String]
def draw(node):
'''List of the lines of an ASCII
diagram of a tree.
'''
def shift_(h, other, xs):
return list(map(
append,
chain(
[h], (
repeat(other, len(xs) - 1)
)
),
xs
))
 
def drawSubTrees(xs):
return (
(
['|'] + shift_(
'├─ ', '│ ', draw(xs[0])
) + drawSubTrees(xs[1:])
) if 1 < len(xs) else ['|'] + shift_(
'└─ ', ' ', draw(xs[0])
)
) if xs else []
 
return (root(node)).splitlines() + (
drawSubTrees(nest(node))
)
 
 
# drawForest :: [Tree String] -> String
def drawForest(trees):
'''A simple unicode character representation of
a list of trees.
'''
return '\n'.join(map(drawTree, trees))
 
 
# drawTree :: Tree a -> String
def drawTree(tree):
'''ASCII diagram of a tree.'''
return '\n'.join(draw(tree))
 
 
# fmapTree :: (a -> b) -> Tree a -> Tree b
def fmapTree(f):
'''A new tree holding the results of
an application of f to each root in
the existing tree.
'''
def go(x):
return Node(
f(root(x))
)([go(v) for v in nest(x)])
return go
 
 
# forestFromExprs :: [Expr] -> [Tree Expr]
def forestFromExprs(es):
'''A list of expressions rewritten as a forest.
'''
return [treeFromExpr(x) for x in es]
 
 
# nest :: Tree a -> [Tree a]
def nest(t):
'''Accessor function for children of tree node.'''
return t.get('nest')
 
 
# root :: Tree a -> a
def root(t):
'''Accessor function for data of tree node.'''
return t.get('root')
 
 
# treeFromExprs :: Expr -> Tree Expr
def treeFromExpr(e):
'''An expression rewritten as a tree.
'''
return (
Node({"name": "List"})(forestFromExprs(e))
) if type(e) is list else (
Node(e)([])
)
 
 
# ----------------------- GENERIC ------------------------
 
# until :: (a -> Bool) -> (a -> a) -> a -> a
def until(p):
'''The result of repeatedly applying f until p holds.
The initial seed value is x.
'''
def go(f):
def loop(x):
v = x
while not p(v):
v = f(v)
return v
return loop
return go
 
 
# MAIN ---
if __name__ == '__main__':
main()</syntaxhighlight>
{{Out}}
<pre>{'name': 'List'}
|
├─ {'name': 'List'}
│ |
│ ├─ {'name': 'data'}
│ |
│ ├─ "quoted data"
│ |
│ ├─ 123
│ |
│ └─ 4.5
|
└─ {'name': 'List'}
|
├─ {'name': 'data'}
|
└─ {'name': 'List'}
|
├─ {'name': '!@#'}
|
├─ {'name': 'List'}
│ |
│ └─ 4.5
|
├─ "(more"
|
└─ "data)"
 
Reserialized from parse:
 
((data "quoted data" 123 4.5) (data (!@# (4.5) "(more" "data)")))</pre>
 
=={{header|Racket}}==
 
Racket has builtin support for S-expressions in the form of the read function.
<langsyntaxhighlight lang="racket">
#lang racket
(define input
Line 5,326 ⟶ 5,774:
 
(read (open-input-string input))
</syntaxhighlight>
</lang>
Output:
<pre>
Line 5,338 ⟶ 5,786:
This parses the task, but it isn't really a good lisp parser, because it always wants whitespace between lists, so <code>(()())</code> will fail ( <code>(() ())</code> wont)
 
<syntaxhighlight lang="raku" perl6line>grammar S-Exp {
rule TOP {^ <s-list> $};
 
Line 5,380 ⟶ 5,828:
say "the expression:\n$s-exp\n";
say "the Raku expression:\n{$raku_array.raku}\n";
say "and back:\n{s-exp_writer($raku_array)}";</langsyntaxhighlight>
 
{{out}}
Line 5,400 ⟶ 5,848:
 
It would normally be considered improper, but the literal string delimiters were left intact; making it much easier to understand what is/was being parsed.
<langsyntaxhighlight lang="rexx">/*REXX program parses an S-expression and displays the results to the terminal. */
input= '((data "quoted data" 123 4.5) (data (!@# (4.5) "(more" "data)")))'
say center('input', length(input), "═") /*display the header title to terminal.*/
Line 5,455 ⟶ 5,903:
exit 0 /*stick a fork in it, we're all done. */
/*──────────────────────────────────────────────────────────────────────────────────────*/
add!: if !='' then return; #=#+1; @.#=left("", max(0, tabs*(level-1)))!; !=; return</langsyntaxhighlight>
{{out|output|text=&nbsp; when using the default input:}}
<pre>
Line 5,484 ⟶ 5,932:
=={{header|Ruby}}==
{{works with|Ruby|1.9}}
<langsyntaxhighlight lang="ruby">class SExpr
def initialize(str)
@original = str
Line 5,605 ⟶ 6,053:
puts "original sexpr:\n#{sexpr.original}"
puts "\nruby data structure:\n#{sexpr.data}"
puts "\nand back to S-Expr:\n#{sexpr.to_sexpr}"</langsyntaxhighlight>
 
{{out}}
Line 5,622 ⟶ 6,070:
=={{header|Rust}}==
lib.rs:
<langsyntaxhighlight lang="rust">
//! This implementation isn't based on anything in particular, although it's probably informed by a
//! lot of Rust's JSON encoding code. It should be very fast (both encoding and decoding the toy
Line 6,013 ⟶ 6,461:
println!("{:?}", SExp::parse(ctx));
}
</langsyntaxhighlight>{{out}}
<pre>
Ok("((\"data\" \"quoted data\" 123 4.5) (\"data\" (\"!@#\" (4.5) \"(more\" \"data)\")))")
Line 6,030 ⟶ 6,478:
Using guile scheme 2.0.11
 
<langsyntaxhighlight lang="scheme">(define (sexpr-read port)
(define (help port)
(let ((char (read-char port)))
Line 6,073 ⟶ 6,521:
 
(format-sexpr (sexpr-read
(open-input-string "((data \"quoted data\" 123 4.5) (data (!@# (4.5) \"(more\" \"data)\")))")))</langsyntaxhighlight>
 
Output:
Line 6,098 ⟶ 6,546:
=={{header|Sidef}}==
{{trans|Perl}}
<langsyntaxhighlight lang="ruby">func sexpr(txt) {
txt.trim!
 
Line 6,147 ⟶ 6,595:
 
say s # dump structure
say sexpr2txt(s) # convert back</langsyntaxhighlight>
{{out}}
<pre>
Line 6,156 ⟶ 6,604:
=={{header|Tcl}}==
Note that because Tcl doesn't expose a type system (well, not in a conventional sense) the parts of the parsed out data structure are tagged lists; the first element is one of “<tt>string</tt>”, “<tt>int</tt>”, “<tt>real</tt>” and “<tt>atom</tt>” to indicate a leaf token, or “<tt>list</tt>” to indicate a sublist. A “native” data structure could also be generated, but then that would turn things into lists that are not in the original.
<langsyntaxhighlight lang="tcl">package require Tcl 8.5
 
proc fromSexp {str} {
Line 6,204 ⟶ 6,652:
return [lindex $content 0]
}
}</langsyntaxhighlight>
Demonstrating with the sample data:
<langsyntaxhighlight lang="tcl">set sample {((data "quoted data" 123 4.5)
(data (!@# (4.5) "(more" "data)")))}
set parsed [fromSexp $sample]
puts "sample: $sample"
puts "parsed: $parsed"
puts "regen: [toSexp $parsed]"</langsyntaxhighlight>
Output:
<pre>
Line 6,257 ⟶ 6,705:
Code:
 
<langsyntaxhighlight lang="txr">@(define float (f))@\
@(local (tok))@\
@(cases)@\
Line 6,300 ⟶ 6,748:
expr: @(format nil "~s" e)
junk: @junk
@(end)</langsyntaxhighlight>
 
Run:
Line 6,335 ⟶ 6,783:
Explanation of most confusing line:
 
<langsyntaxhighlight lang="txr"> @/\s*\(\s*/@(coll :vars (e))@(expr e)@/\s*/@(last))@(end)</langsyntaxhighlight>
 
First, we match an open parenthesis that can be embedded in whitespace. Then we have a <code>@(coll)</code> construct which terminates with <code>@(end)</code>. This is a repetition construct for collecting zero or more items. The <code>:vars (e)</code> argument makes the collect strict: each repetition must bind the variable <code>e</code>. More importantly, in this case, if nothing is
Line 6,344 ⟶ 6,792:
{{libheader|Wren-pattern}}
{{libheader|Wren-fmt}}
<langsyntaxhighlight ecmascriptlang="wren">import "./pattern" for Pattern
import "./fmt" for Fmt
 
var INDENT = 2
Line 6,431 ⟶ 6,879:
System.print("\nRecovered S-Expression (pretty print):")
prettyPrint.call(tokens)
}</langsyntaxhighlight>
 
{{out}}
9,655

edits