Execute SNUSP/Haskell: Difference between revisions

Content added Content deleted

Inline

Latest revision as of 09:00, 1 September 2022

This Haskell implementation supports commands from all the three SNUSP variants, as described on the Esolang SNUSP page.

Threads and 2D-data makes a purely functional implementation difficult, so most of the code works in the IO-Monad. There is an immutable array c for the code, a global mutable hashtable d for the data, and each thread has an instruction pointer ip, a memory pointer mp, and a call stack stack.

Design decisions (not covered by SNUSP specification):

Decrementing a zero memory cell sets it to zero.
The data area is infinite.
Threads block during read if no input is available, while other threads continue (as one of the examples requires).
As the SNUSP variants differ in the number of dimensions in data and code, make it easy to add even more dimensions.

The interpreter has been tested with the echo, thread, multiplication and multi-digit print examples.

The Haskell code starts with lots of imports:

import System.Environment
import System.IO
import System.Random

import Control.Monad

import Data.Char
import Data.List
import Data.Maybe
import Data.Array

import qualified Data.HashTable as H

Use a list as an index into an array:

type Index = [Int]

instance Ix a => Ix [a] where
  index ([],[]) []            = 0
  index (l:ls, u:us) (i:is)   = index (l,u) i + 
                                index (ls,us) is * rangeSize (l,u)
  range ([],[])               = [[]]
  range (l:ls, u:us)          = [i:is | is <- range (ls,us), i <- range (l,u)]
  inRange ([],[])      []     = True
  inRange (l:ls, u:us) (i:is) = inRange (l,u) i && inRange (ls,us) is
  rangeSize (ls,us)           = product $ map rangeSize $ zip ls us

or into an hashtable (the hash function could probably be improved):

cmpList :: Index -> Index -> Bool
cmpList []     []     = True
cmpList (x:xs) []     = x == 0 && cmpList xs []
cmpList []     (y:ys) = y == 0 && cmpList [] ys
cmpList (x:xs) (y:ys) = x == y && cmpList xs ys

hashList xs = H.hashInt $ foldr combine 0 xs

combine :: Int -> Int -> Int
combine x 0 = x
combine x y = z * (z+1) `div` 2 + x where z = x + y

Here it's important that index lists with trailing zeroes are treated just like this list without the zeroes, so we can handle any number of dimensions. We want the same flexibility when adding index lists:

(<+>) :: Index -> Index -> Index
[]     <+> ys     = ys
xs     <+> []     = xs
(x:xs) <+> (y:ys) = (x+y) : (xs <+> ys)

Some helper functions:

data Thread a = T {mp::a, ip::a, dir::a, stack::[(a,a)]} deriving Show

modify d t f = do
  let i = mp t
  x <- H.lookup d i
  let x' = fromMaybe 0 x
  H.delete d i
  H.insert d i (f x') -- H.update 
  return [t]

moveMp d t delta = return [t {mp=(mp t) <+> delta}]

readMp d t = H.lookup d (mp t) >>= return . fromMaybe 0

step t = t {ip=(ip t) <+> (dir t)}

dec :: Integer -> Integer
dec 0 = 0
dec x = x-1

toChar   = chr . fromInteger
fromChar = toInteger . ord

Now, the commands. Given a thread, return a list of threads valid after one simulation step. In that way, exec can handle forks and thread termination on errors.

-- Core SNUSP

exec '+'  d t = modify d t (+1)  
exec '-'  d t = modify d t (dec) 
exec '<'  d t = moveMp d t [-1] 
exec '>'  d t = moveMp d t [ 1]
exec ','  d t = getChar >>= modify d t . const . fromChar
exec '.'  d t = readMp d t >>= putChar . toChar >> return [t]
exec '\\' d t = return [t {dir=( d2:  d1:ds)}] where d1:d2:ds = dir t <+> [0,0]
exec '/'  d t = return [t {dir=(-d2: -d1:ds)}] where d1:d2:ds = dir t <+> [0,0]
exec '!'  d t = return [step t]
exec '?'  d t = readMp d t >>= \x -> return [if x == 0 then step t else t]

-- Modular SNUSP

exec '@'  d t = return [t {stack=(ip t, dir t):(stack t)}]
exec '#'  d   T{stack=[]}         = return []
exec '#'  d t@T{stack=(ip,dir):s} = return [step $ t {ip=ip, dir=dir, stack=s}]
   
-- Bloated SNUSP

exec ':'  d t = moveMp d t [0,-1]
exec ';'  d t = moveMp d t [0, 1]
exec '&'  d t = return [step t, t {stack=[]}]
exec '%'  d t = readMp d t >>= \x -> randomRIO (0,x) >>= modify d t . const

-- NOOP

exec _    d t = return [t]

The scheduler manages a list ts of active threads, and a list ks of threads waiting for input. If there are no more threads in either list, stop. If input is available, one blocked thread is executed. If no input is available and all threads are blocked, we block the interpreter, too (so the OS can do something else). Otherwise, try to execute one of the unblocked threads, first checking if it's still inside the code array.

start c = maybe (fst $ bounds $ c) fst $ find (\(_,x) -> x == '$') $ assocs c 

run c d = schedule [thread] [] False where 
  thread = T {mp=[1,1], ip=start c, dir=[1], stack=[]}
  exec' x d t = exec x d t >>= \ts -> return (ts,[])
  schedule' ts ks (ts',ks') = hReady stdin >>= schedule (ts++ts') (ks++ks')
  schedule [] []     _     = return ()
  schedule [] ks     False = hLookAhead stdin >> schedule' [] ks ([],[])
  schedule ts (k:ks) True  = exec' ',' d k  >>= schedule' ts ks
  schedule (t:ts) ks _     = check (step t) >>= schedule' ts ks 
  check t 
    | not $ bounds c `inRange` (ip t) = return ([],[])
    | x == ','                        = return ([],[t])
    | otherwise                       = exec' x d t 
    where x = c ! (ip t)

Finally, routines to run code from a string or a file, and the main program.

runString y s = do
  d <- H.new cmpList hashList
  let x = length s `div` y
  run (listArray ([1,1],[x,y]) s) d

runFile name = do
  s <- readFile name
  d <- H.new cmpList hashList
  let l = lines s
  let y = length l
  let x = maximum $ map length $ l
  let m = [([i,j],c) | (j,v) <- zip [1..] l, (i,c) <- zip [1..] v]
  let c = listArray ([1,1],[x,y]) (repeat ' ') // m
  run c d

main = do
  hSetBuffering stdin NoBuffering
  [s] <- getArgs
  runFile s

Extension

To demonstrate the ease of introducing even more dimensions, let's implement commands ( and ) to move the data pointer along the z-axis, and a command ^ to rotate the IP direction around the (1,1,1) axis (i.e., left becomes up, up becomes "farther" on the z-axis, "farther" becomes left, etc.).

exec '(' d t = moveMp d t [0,0,-1]
exec ')' d t = moveMp d t [0,0, 1]
exec '^' d t = return [t {dir=(d3:d1:d2:ds)}] where d1:d2:d3:ds = dir t <+> [0,0,0]

Revision as of 18:20, 9 May 2008 (view source) rosettacode>Mwn3d m (grammar) ← Older edit		Latest revision as of 09:00, 1 September 2022 (view source) PureFox (talk \| contribs) m (Fixed syntax highlighting.)
(10 intermediate revisions by 5 users not shown)
Line 1: {{implementation\|SNUSP}}{{collection\|RCSNUSP}}~~[[Category:Haskell]]~~ This [[Haskell]] implementation supports commands from all the three SNUSP variants, as described on the [~~http~~[eso:~~//esolangs.org/wiki/~~SNUSP \|Esolang SNUSP page]]. Threads and 2D-data makes a purely functional implementation difficult, so most of the code works in the IO-Monad. There is an immutable array ''c'' for the code, a global mutable hashtable ''d'' for the data, and each thread has an instruction pointer ''ip'', a memory pointer ''mp'', and a call stack ''stack''. Line 11: * As the SNUSP variants differ in the number of dimensions in data and code, make it easy to add even more dimensions. The interpreter has been tested with the ''echo'', ''thread'', ''multiplication'' and ''~~Multi~~multi-digit print'' examples. The Haskell code starts with lots of imports: <syntaxhighlight lang="haskell">import System.Environment▼ ~~<pre>~~ ▲import System.Environment import System.IO import System.Random Line 27 ⟶ 26: import Data.Array import qualified Data.HashTable as H</syntaxhighlight> ~~</pre>~~ Use a list as an index into an array: <syntaxhighlight lang="haskell">type Index = [Int] ~~<pre>~~ ~~type Index = [Int]~~ instance Ix a => Ix [a] where Line 43 ⟶ 40: inRange ([],[]) [] = True inRange (l:ls, u:us) (i:is) = inRange (l,u) i && inRange (ls,us) is rangeSize (ls,us) = product $ map rangeSize $ zip ls us</syntaxhighlight> ~~</pre>~~ or into an hashtable (the hash function could probably be improved): <syntaxhighlight lang="haskell">cmpList :: Index -> Index -> Bool▼ ~~<pre>~~ ▲cmpList :: Index -> Index -> Bool cmpList [] [] = True cmpList (x:xs) [] = x == 0 && cmpList xs [] Line 59 ⟶ 54: combine :: Int -> Int -> Int combine x 0 = x combine x y = z * (z+1) `div` 2 + x where z = x + y</syntaxhighlight> ~~</pre>~~ Here it's important that index lists with trailing ~~zeros~~zeroes are treated just like this list without the zeroes, so we can handle any number of dimensions. We want the same flexibility when adding index lists: <syntaxhighlight lang="haskell">(<+>) :: Index -> Index -> Index▼ ~~<pre>~~ ▲(<+>) :: Index -> Index -> Index [] <+> ys = ys xs <+> [] = xs (x:xs) <+> (y:ys) = (x+y) : (xs <+> ys)</syntaxhighlight> ~~</pre>~~ Some helper functions: <syntaxhighlight lang="haskell">data Thread a = T {mp::a, ip::a, dir::a, stack::[(a,a)]} deriving Show▼ ~~<pre>~~ ▲data Thread a = T {mp::a, ip::a, dir::a, stack::[(a,a)]} deriving Show modify d t f = do Line 95 ⟶ 86: toChar = chr . fromInteger fromChar = toInteger . ord</syntaxhighlight> ~~</pre>~~ Now, the commands. Given a thread, return a list of threads valid after one simulation step. In that way, ''exec'' can handle forks and thread termination on errors. <syntaxhighlight lang="haskell">-- Core SNUSP ~~<pre>~~ ~~-- Core SNUSP~~ exec '+' d t = modify d t (+1) Line 107 ⟶ 96: exec '<' d t = moveMp d t [-1] exec '>' d t = moveMp d t [ 1] exec ',' d t = ~~hGetChar stdin~~getChar >>= modify d t . const . fromChar exec '.' d t = readMp d t >>= putChar . toChar >> return [t] exec '\\' d t = return [t {dir=( d2: d1:ds)}] where d1:d2:ds = dir t <+> [0,0] Line 129 ⟶ 118: -- NOOP exec _ d t = return [t]</syntaxhighlight> ~~</pre>~~ The scheduler manages a list ''ts'' of active threads, and a list ''ks'' of threads waiting for input. If there are no more threads in either list, stop. If input is available, one blocked thread is executed. If no input is available and all threads are blocked, we block the interpreter, too (so the OS can do something else). Otherwise, try to execute one of the unblocked threads, first checking if it's still inside the code array. <syntaxhighlight lang="haskell">start c = maybe (fst $ bounds $ c) fst $ find (\(_,x) -> x == '$') $ assocs c ▼ ~~<pre>~~ ▲start c = maybe (fst $ bounds $ c) fst $ find (\(_,x) -> x == '$') $ assocs c run c d = schedule [thread] [] False where Line 149 ⟶ 136: \| x == ',' = return ([],[t]) \| otherwise = exec' x d t where x = c ! (ip t)</syntaxhighlight> ~~</pre>~~ Finally, routines to run code from a string or a file, and the main program. <syntaxhighlight lang="haskell">runString y s = do ~~<pre>~~ ~~runString y s = do~~ d <- H.new cmpList hashList let x = length s `div` y Line 173 ⟶ 158: hSetBuffering stdin NoBuffering [s] <- getArgs runFile s</syntaxhighlight> ~~</pre>~~ ===Extension=== To demonstrate the ease of introducing even more dimensions, let's implement commands ( and ) to move the data pointer along the z-axis, and a command ^ to rotate the IP direction around the (1,1,1) axis (i.e., left becomes up, up becomes "farther" on the z-axis, "farther" becomes left, etc.). <syntaxhighlight lang="haskell">exec '(' d t = moveMp d t [0,0,-1] exec ')' d t = moveMp d t [0,0, 1] exec '^' d t = return [t {dir=(d3:d1:d2:ds)}] where d1:d2:d3:ds = dir t <+> [0,0,0]</syntaxhighlight>