Foundations of Functional Programming: Trees

Motivation	Built-in Haskell lists are an asymmetric data structure.
	Access to the first element of a list takes constant time, whereas access to the last element takes time proportional to the length of the list.
	Concatenating two lists takes time proportional to the length of the first list.

Improvement	A binary tree with information at the leaves
	data Tree a
	  = Nil                     -- empty tree
	  | Leaf a                  -- leaf carrying information
	  | Fork (Tree a) (Tree a)  -- binary branch

Consistency condition	Nil occurs only as the root, never as a subtree of a Fork node
	invTree :: Tree a -> Bool
	invTree Nil          = True
	invTree (Leaf _x)    = True
	invTree (Fork Nil r) = False
	invTree (Fork l Nil) = False
	invTree (Fork l r)   = invTree l && invTree r

Smart constructors	for enforcing the consistency condition
	Here: a dedicated operator for concatenating trees
	infixr 5 <++>   -- same fixity as ++
	(<++>) :: Tree a -> Tree a -> Tree a
	Nil <++> t2  = t2
	t1  <++> Nil = t1
	t1  <++> t2  = Fork t1 t2
	
	cons :: a -> Tree a -> Tree a
	cons x t = Leaf x <++> t
	
	snoc :: Tree a -> a -> Tree a
	snoc t x = t <++> Leaf x
?	Runtime of <++>?
?	Runtime of cons and snoc?
	Using <++> instead of Fork guarantees consistency by construction
	Symmetry in the data structure ==> symmetry in the algorithms (cons and snoc)
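Example	A minimal sketch of how the smart constructors preserve the invariant while a direct use of Fork can break it. Since <++> only inspects the outermost constructor of each argument, a call takes constant time, and so do cons and snoc. The sample values viaSmart and viaFork are hypothetical and not part of the notes.

```haskell
-- Tree type, smart constructors and invariant as given in the notes.
data Tree a
  = Nil
  | Leaf a
  | Fork (Tree a) (Tree a)
  deriving (Eq, Show)

infixr 5 <++>
(<++>) :: Tree a -> Tree a -> Tree a
Nil <++> t2  = t2
t1  <++> Nil = t1
t1  <++> t2  = Fork t1 t2

cons :: a -> Tree a -> Tree a
cons x t = Leaf x <++> t

snoc :: Tree a -> a -> Tree a
snoc t x = t <++> Leaf x

invTree :: Tree a -> Bool
invTree Nil          = True
invTree (Leaf _)     = True
invTree (Fork Nil _) = False
invTree (Fork _ Nil) = False
invTree (Fork l r)   = invTree l && invTree r

-- Hypothetical sample values (not from the notes).
viaSmart :: Tree Int
viaSmart = snoc (cons 1 Nil) 2   -- evaluates to Fork (Leaf 1) (Leaf 2)

viaFork :: Tree Int
viaFork = Fork (Leaf 1) Nil      -- Nil below a Fork: invariant broken
```

Here invTree viaSmart is True and invTree viaFork is False.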

Transformation	Tree ==> list
	flatten :: Tree a -> [a]
	flatten Nil        = []
	flatten (Leaf x)   = [x]
	flatten (Fork l r) = flatten l ++ flatten r
	The data structure determines the structure of the algorithms
	The runtime is dominated by the runtime of ++
	The runtime is therefore not linear in the number of elements in the tree; for a tree that degenerates to the left it grows quadratically
?	Can this be improved?
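Example	A rough cost sketch for the naive flatten, assuming that xs ++ ys costs one step per element of xs. The names appendSteps, leftSpine and rightSpine are hypothetical helpers introduced only for this illustration.

```haskell
data Tree a
  = Nil
  | Leaf a
  | Fork (Tree a) (Tree a)
  deriving (Show)

-- Number of leaves in a tree.
size :: Tree a -> Int
size Nil        = 0
size (Leaf _)   = 1
size (Fork l r) = size l + size r

-- Assumed cost model: every Fork in the naive flatten performs one ++,
-- which copies as many elements as the left subtree has leaves.
appendSteps :: Tree a -> Int
appendSteps (Fork l r) = size l + appendSteps l + appendSteps r
appendSteps _          = 0

-- Hypothetical test trees with n >= 1 leaves.
leftSpine :: Int -> Tree Int      -- all Forks hang on the left
leftSpine n = foldl (\t x -> Fork t (Leaf x)) (Leaf 1) [2 .. n]

rightSpine :: Int -> Tree Int     -- all Forks hang on the right
rightSpine n = foldr (\x t -> Fork (Leaf x) t) (Leaf n) [1 .. n - 1]
```

Under this model a left spine with 100 leaves needs 99 + 98 + ... + 1 = 4950 steps, while a right spine of the same size needs only 99.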

Transformation	List ==> tree
1st attempt	naive
	build0 :: [a] -> Tree a
	build0 []       = Nil
	build0 (x : xs) = Leaf x <++> build0 xs
	==>
	build0 = foldr (\ x t -> Leaf x <++> t) Nil
	The runtime is linear in the number of elements
	The structure of the tree is isomorphic to the structure of the list
	Access to the last element takes as long as it does for lists
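Example	A small sketch that makes the list-like shape of build0 visible. The helpers stepsToFirst and stepsToLast are hypothetical; they count the Fork nodes passed on the way to the first and the last element.

```haskell
data Tree a
  = Nil
  | Leaf a
  | Fork (Tree a) (Tree a)
  deriving (Eq, Show)

infixr 5 <++>
(<++>) :: Tree a -> Tree a -> Tree a
Nil <++> t2  = t2
t1  <++> Nil = t1
t1  <++> t2  = Fork t1 t2

build0 :: [a] -> Tree a
build0 = foldr (\x t -> Leaf x <++> t) Nil

-- Hypothetical helpers: Fork nodes visited when walking to either end.
stepsToFirst :: Tree a -> Int
stepsToFirst (Fork l _) = 1 + stepsToFirst l
stepsToFirst _          = 0

stepsToLast :: Tree a -> Int
stepsToLast (Fork _ r) = 1 + stepsToLast r
stepsToLast _          = 0
```

For instance, build0 "abc" evaluates to Fork (Leaf 'a') (Fork (Leaf 'b') (Leaf 'c')). For a list of 100 elements, stepsToFirst yields 1 and stepsToLast yields 99.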

2nd attempt	Balanced tree
	build :: [a] -> Tree a
	build []  = Nil
	build [x] = Leaf x
	build xs  = Fork (build l) (build r)
	  where
	    (l, r) = splitAt (length xs `div` 2) xs
	Balanced tree
	The lengths of any two paths from the root to a leaf differ by at most 1
	The numbers of elements in the left and right subtrees differ by at most 1
	Access to the first and to the last element takes the same time (up to one step)
	The runtime of build is not linear in the number of elements: length and splitAt traverse the list on every level of the recursion, giving O(n log n)
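Example	A minimal sketch that checks the balancing property on sample input. minpath and maxpath count the nodes on the shortest and longest path from the root to a leaf, so a single Leaf counts as 1.

```haskell
data Tree a
  = Nil
  | Leaf a
  | Fork (Tree a) (Tree a)
  deriving (Eq, Show)

build :: [a] -> Tree a
build []  = Nil
build [x] = Leaf x
build xs  = Fork (build l) (build r)
  where
    (l, r) = splitAt (length xs `div` 2) xs

-- Number of nodes on the shortest path from the root to a leaf.
minpath :: Tree a -> Int
minpath Nil        = 0
minpath (Leaf _)   = 1
minpath (Fork l r) = (minpath l `min` minpath r) + 1

-- Number of nodes on the longest path from the root to a leaf.
maxpath :: Tree a -> Int
maxpath Nil        = 0
maxpath (Leaf _)   = 1
maxpath (Fork l r) = (maxpath l `max` maxpath r) + 1
```

For build [1..9] the two values are 4 and 5, for build [1..8] both are 4.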

3rd attempt	Balanced tree
	build2 :: [a] -> Tree a
	build2 [] = Nil
	build2 xs = build' (map Leaf xs)
	  where
	    build' [t] = t
	    build' ts  = build' (merge ts)
	
	    merge (t1 : t2 : ts) = Fork t1 t2 : merge ts
	    merge ts             = ts
	Balanced tree
	A somewhat weaker balancing criterion holds here
	The longest path from the root to a leaf is as short as possible, but individual paths may be considerably shorter
	The runtime of build2 is linear in the number of elements
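Example	A sketch that illustrates the weaker balancing criterion of build2 on sample input, again measuring paths in nodes with minpath and maxpath. With nine elements the last leaf is carried unchanged through every merge round and ends up directly below the root.

```haskell
data Tree a
  = Nil
  | Leaf a
  | Fork (Tree a) (Tree a)
  deriving (Eq, Show)

build2 :: [a] -> Tree a
build2 [] = Nil
build2 xs = build' (map Leaf xs)
  where
    -- Repeat pairwise merging until a single tree is left.
    build' [t] = t
    build' ts  = build' (merge ts)

    -- One round: combine neighbours, keep a leftover tree as it is.
    merge (t1 : t2 : ts) = Fork t1 t2 : merge ts
    merge ts             = ts

minpath :: Tree a -> Int
minpath Nil        = 0
minpath (Leaf _)   = 1
minpath (Fork l r) = (minpath l `min` minpath r) + 1

maxpath :: Tree a -> Int
maxpath Nil        = 0
maxpath (Leaf _)   = 1
maxpath (Fork l r) = (maxpath l `max` maxpath r) + 1
```

For build2 [1..9] the shortest path has 2 nodes and the longest 5, whereas the splitAt-based build of the 2nd attempt gives 4 and 5 for the same input. For [1..8] both paths have 4 nodes.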

Selector functions	analogous to head, tail, last and init
	head :: Tree a -> a
	head Nil        = error "head with Nil"
	head (Leaf x)   = x
	head (Fork l r) = head l
	
	last :: Tree a -> a
	last Nil        = error "last with Nil"
	last (Leaf x)   = x
	last (Fork l r) = last r
	
	tail :: Tree a -> Tree a
	tail Nil        = error "tail: empty tree"
	tail (Leaf _x)  = Nil
	tail (Fork l r) = tail l <++> r
	
	init :: Tree a -> Tree a
	init Nil        = error "init: empty tree"
	init (Leaf _x)  = Nil
	init (Fork l r) = l <++> init r
	Symmetry in Tree ==> symmetry between head and last and between tail and init

	The runtime is proportional to the length of the path from the root to the leaf
	All four functions are partial
	Similarities in the code, so abstract!

Splitting	into first/last element and rest/initial segment
	viewL :: Tree a -> Maybe (a, Tree a)
	viewL Nil        = Nothing
	viewL (Leaf x)   = Just (x, Nil)
	viewL (Fork l r) = Just (x, r1 <++> r)
	  where
	    Just (x, r1) = viewL l   -- safe only if the invariant holds: l is not Nil
	
	viewR :: Tree a -> Maybe (Tree a, a)
	viewR Nil        = Nothing
	viewR (Leaf x)   = Just (Nil, x)
	viewR (Fork l r) = Just (l <++> l1, x)   -- l precedes the initial segment l1 of r
	  where
	    Just (l1, x) = viewR r

	head, last, tail, init, safeHead, safeLast, safeTail and safeInit can all be derived from viewL and viewR
	-- mapMaybe :: (a -> b) -> Maybe a -> Maybe b is the map function for Maybe
	-- (the same as fmap), not Data.Maybe.mapMaybe
	head     = maybe (error "head with Nil") fst . viewL
	last     = maybe (error "last with Nil") snd . viewR
	tail     = maybe (error "tail with Nil") snd . viewL
	init     = maybe (error "init with Nil") fst . viewR
	safeHead = mapMaybe fst . viewL
	safeLast = mapMaybe snd . viewR
	safeTail = mapMaybe snd . viewL
	safeInit = mapMaybe fst . viewR
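Example	A sketch of how the two views can drive a traversal. The type of viewL matches the step function expected by unfoldr from Data.List, so a tree can be unfolded into a list from either end. The helpers toListL and toListR are hypothetical; viewR is written with l <++> l1 so that the element order of the initial segment is preserved.

```haskell
import Data.List  (unfoldr)
import Data.Tuple (swap)

data Tree a
  = Nil
  | Leaf a
  | Fork (Tree a) (Tree a)
  deriving (Eq, Show)

infixr 5 <++>
(<++>) :: Tree a -> Tree a -> Tree a
Nil <++> t2  = t2
t1  <++> Nil = t1
t1  <++> t2  = Fork t1 t2

-- Both views rely on the invariant that Nil never occurs below a Fork.
viewL :: Tree a -> Maybe (a, Tree a)
viewL Nil        = Nothing
viewL (Leaf x)   = Just (x, Nil)
viewL (Fork l r) = Just (x, r1 <++> r)
  where
    Just (x, r1) = viewL l

viewR :: Tree a -> Maybe (Tree a, a)
viewR Nil        = Nothing
viewR (Leaf x)   = Just (Nil, x)
viewR (Fork l r) = Just (l <++> l1, x)
  where
    Just (l1, x) = viewR r

-- Hypothetical helpers: peel off elements until the tree is empty.
toListL :: Tree a -> [a]            -- front to back
toListL = unfoldr viewL

toListR :: Tree a -> [a]            -- back to front
toListR = unfoldr (fmap swap . viewR)
```

For the tree Fork (Fork (Leaf 1) (Leaf 2)) (Leaf 3), toListL gives [1,2,3] and toListR gives [3,2,1].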

mapTree	Processing all elements of a tree independently of their position
	mapTree :: (a -> b) -> Tree a -> Tree b
	mapTree f Nil        = Nil
	mapTree f (Leaf x)   = Leaf (f x)
	mapTree f (Fork l r) = mapTree f l `Fork` mapTree f r

	map, mapMaybe and mapTree are very similar!
	Abstract and overload the map functions

Functor class	for all data types for which a map function can be defined
	class Functor f where              -- predefined
	  fmap :: (a -> b) -> f a -> f b
	
	instance Functor [] where          -- predefined
	  fmap = map
	
	instance Functor Maybe where       -- predefined
	  fmap = mapMaybe
	
	instance Functor Tree where
	  fmap = mapTree
	Functor is a class for type constructors with one parameter, not for concrete types
	The overloaded function is called fmap because the name map is already taken by lists
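Example	A short sketch of what the overloading buys: one function written against Functor works for lists, Maybe and Tree alike. The helper incrementAll is hypothetical and only serves as an illustration.

```haskell
data Tree a
  = Nil
  | Leaf a
  | Fork (Tree a) (Tree a)
  deriving (Eq, Show)

mapTree :: (a -> b) -> Tree a -> Tree b
mapTree _ Nil        = Nil
mapTree f (Leaf x)   = Leaf (f x)
mapTree f (Fork l r) = Fork (mapTree f l) (mapTree f r)

instance Functor Tree where
  fmap = mapTree

-- Hypothetical helper: written once, usable with every Functor instance.
incrementAll :: (Functor f, Num a) => f a -> f a
incrementAll = fmap (+ 1)
```

incrementAll [1, 2, 3] gives [2, 3, 4], incrementAll (Just 1) gives Just 2, and incrementAll (Fork (Leaf 1) (Leaf 2)) gives Fork (Leaf 2) (Leaf 3).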

Laws	that must hold for all instances
	fmap id         = id
	fmap f . fmap g = fmap (f . g)
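Example	A sketch that spot-checks both laws for mapTree on one sample tree and shows a candidate that has the right type but breaks them. mirrorMap and the sample tree are hypothetical, and a check on a single value is evidence, not a proof.

```haskell
data Tree a
  = Nil
  | Leaf a
  | Fork (Tree a) (Tree a)
  deriving (Eq, Show)

mapTree :: (a -> b) -> Tree a -> Tree b
mapTree _ Nil        = Nil
mapTree f (Leaf x)   = Leaf (f x)
mapTree f (Fork l r) = Fork (mapTree f l) (mapTree f r)

-- Hypothetical, deliberately wrong candidate for fmap: it type-checks,
-- but it also swaps the subtrees of every Fork.
mirrorMap :: (a -> b) -> Tree a -> Tree b
mirrorMap _ Nil        = Nil
mirrorMap f (Leaf x)   = Leaf (f x)
mirrorMap f (Fork l r) = Fork (mirrorMap f r) (mirrorMap f l)

sample :: Tree Int
sample = Fork (Leaf 1) (Fork (Leaf 2) (Leaf 3))

identityHolds :: Bool
identityHolds = mapTree id sample == sample

compositionHolds :: Bool
compositionHolds =
  (mapTree (+ 1) . mapTree (* 2)) sample == mapTree ((+ 1) . (* 2)) sample

mirrorBreaksIdentity :: Bool
mirrorBreaksIdentity = mirrorMap id sample /= sample
```

All three Booleans evaluate to True: mapTree passes both checks, while mirrorMap id changes the tree and therefore cannot be a lawful fmap.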

Filtering	of trees, analogous to filtering lists
	filterTree :: (a -> Bool) -> Tree a -> Tree a
	filterTree p Nil = Nil
	filterTree p t@(Leaf x)
	  | p x       = t
	  | otherwise = Nil
	filterTree p (Fork l r) = filterTree p l <++> filterTree p r
	-- NOT (Fork could leave Nil as a subtree and break the invariant):
	-- filterTree p (Fork l r) = filterTree p l `Fork` filterTree p r
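Example	A sketch that contrasts the two variants of the Fork case. filterFork is a hypothetical name for the rejected variant that rebuilds the tree with the plain constructor; the sample tree is also an assumption.

```haskell
data Tree a
  = Nil
  | Leaf a
  | Fork (Tree a) (Tree a)
  deriving (Eq, Show)

infixr 5 <++>
(<++>) :: Tree a -> Tree a -> Tree a
Nil <++> t2  = t2
t1  <++> Nil = t1
t1  <++> t2  = Fork t1 t2

invTree :: Tree a -> Bool
invTree Nil          = True
invTree (Leaf _)     = True
invTree (Fork Nil _) = False
invTree (Fork _ Nil) = False
invTree (Fork l r)   = invTree l && invTree r

-- Variant from the notes: the smart constructor drops empty subtrees.
filterTree :: (a -> Bool) -> Tree a -> Tree a
filterTree _ Nil = Nil
filterTree p t@(Leaf x)
  | p x       = t
  | otherwise = Nil
filterTree p (Fork l r) = filterTree p l <++> filterTree p r

-- Hypothetical name for the rejected variant using the raw constructor.
filterFork :: (a -> Bool) -> Tree a -> Tree a
filterFork _ Nil = Nil
filterFork p t@(Leaf x)
  | p x       = t
  | otherwise = Nil
filterFork p (Fork l r) = Fork (filterFork p l) (filterFork p r)

sample :: Tree Int
sample = Fork (Leaf 1) (Fork (Leaf 2) (Leaf 3))
```

filterTree even sample gives Leaf 2, which satisfies invTree. filterFork even sample gives Fork Nil (Fork (Leaf 2) Nil), which does not.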

Folding	of trees, analogous to folding (foldr) lists
Examples	sumTree :: Num a => Tree a -> a
	sumTree Nil        = 0                        -- c  = 0
	sumTree (Leaf x)   = x                        -- f  = id
	sumTree (Fork l r) = sumTree l + sumTree r    -- op = +
	
	size :: Tree a -> Int
	size Nil        = 0                           -- c  = 0
	size (Leaf _)   = 1                           -- f  = const 1
	size (Fork l r) = size l + size r             -- op = +
	
	minpath :: Tree a -> Int
	minpath Nil        = 0
	minpath (Leaf _)   = 1
	minpath (Fork l r) = (minpath l `min` minpath r) + 1
	
	maxpath :: Tree a -> Int
	maxpath Nil        = 0
	maxpath (Leaf _)   = 1
	maxpath (Fork l r) = (maxpath l `max` maxpath r) + 1
	
	flatten :: Tree a -> [a]
	flatten Nil        = []                       -- c  = []
	flatten (Leaf x)   = [x]                      -- f  = \ x -> [x]
	flatten (Fork l r) = flatten l ++ flatten r   -- op = ++
	There are three places where the algorithms differ
	A fold function therefore needs three additional parameters
.1	The result for the empty tree
.2	A unary function for processing a leaf
.3	A binary function for combining the results of the processed subtrees

	fold :: (b -> b -> b) -> (a -> b) -> b -> Tree a -> b
	fold op f c Nil        = c
	fold op f c (Leaf x)   = f x
	fold op f c (Fork l r) = fold op f c l `op` fold op f c r

Applications	sumTree :: Num a => Tree a -> a
	sumTree = fold (+) id 0
	
	size :: Tree a -> Int
	size = fold (+) (const 1) 0
	
	minpath :: Tree a -> Int
	minpath = fold (\ x y -> x `min` y + 1) (const 1) 0
	
	maxpath :: Tree a -> Int
	maxpath = fold (\ x y -> x `max` y + 1) (const 1) 0
	
	flatten :: Tree a -> [a]
	flatten = fold (++) (\ x -> [x]) []
	
	invTree :: Tree a -> Bool
	invTree Nil = True
	invTree t   = fold (&&) (const True) False t
	
	mapTree :: (a -> b) -> Tree a -> Tree b
	mapTree f = fold Fork (Leaf . f) Nil
	
	filterTree :: (a -> Bool) -> Tree a -> Tree a
	filterTree p = fold (<++>) f Nil
	  where
	    f x | p x       = Leaf x
	        | otherwise = Nil
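Example	A few further applications in the same style, as a sketch. elemTree, allTree and countIf are hypothetical names that do not occur in the notes; each one only picks a combining operator, a leaf function and a result for the empty tree.

```haskell
data Tree a
  = Nil
  | Leaf a
  | Fork (Tree a) (Tree a)
  deriving (Eq, Show)

fold :: (b -> b -> b) -> (a -> b) -> b -> Tree a -> b
fold op f c = go
  where
    go Nil        = c
    go (Leaf x)   = f x
    go (Fork l r) = go l `op` go r

-- Hypothetical applications of fold.
elemTree :: Eq a => a -> Tree a -> Bool       -- does x occur in the tree?
elemTree x = fold (||) (== x) False

allTree :: (a -> Bool) -> Tree a -> Bool      -- does p hold for every element?
allTree p = fold (&&) p True

countIf :: (a -> Bool) -> Tree a -> Int       -- how many elements satisfy p?
countIf p = fold (+) (\x -> if p x then 1 else 0) 0

sample :: Tree Int
sample = Fork (Leaf 1) (Fork (Leaf 2) (Leaf 3))
```

On the sample tree, elemTree 2 is True, elemTree 5 is False, allTree (> 0) is True and countIf odd is 2.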

Improvement	of the fold function
	fold :: (b -> b -> b) -> (a -> b) -> b -> Tree a -> b
	fold op f c = go
	  where
	    go Nil        = c
	    go (Leaf x)   = f x
	    go (Fork l r) = go l `op` go r

	The three additional parameters no longer have to be threaded through the recursive calls
	Runtime: function bodies can be inlined at the call site, so sumTree = fold (+) id 0 becomes
	sumTree = go
	  where
	    go Nil        = 0
	    go (Leaf x)   = id x
	    go (Fork l r) = go l + go r
Efficient flatten	with a runtime proportional to the number of elements in the tree
	flatten :: Tree a -> [a]
	flatten = go []
	  where
	    go acc Nil        = acc
	    go acc (Leaf x)   = x : acc
	    go acc (Fork l r) = go (go acc r) l
Idea	the tree is traversed once from right to left, and each element is prepended to the accumulator when its leaf is processed
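Example	A sketch that traces the accumulator version on a small tree and compares it with the ++-based version. The names flattenSlow, flattenFast and leftSpine are hypothetical and only used to keep both variants side by side.

```haskell
data Tree a
  = Nil
  | Leaf a
  | Fork (Tree a) (Tree a)
  deriving (Eq, Show)

-- Version based on ++.
flattenSlow :: Tree a -> [a]
flattenSlow Nil        = []
flattenSlow (Leaf x)   = [x]
flattenSlow (Fork l r) = flattenSlow l ++ flattenSlow r

-- Accumulator version: one (:) per leaf, no ++ at all.
flattenFast :: Tree a -> [a]
flattenFast = go []
  where
    go acc Nil        = acc
    go acc (Leaf x)   = x : acc
    go acc (Fork l r) = go (go acc r) l

-- Trace for t = Fork (Fork (Leaf 1) (Leaf 2)) (Leaf 3):
--   go [] t
--   = go (go [] (Leaf 3)) (Fork (Leaf 1) (Leaf 2))
--   = go [3] (Fork (Leaf 1) (Leaf 2))
--   = go (go [3] (Leaf 2)) (Leaf 1)
--   = go [2,3] (Leaf 1)
--   = [1,2,3]

-- Hypothetical worst case for the ++ version: n >= 1 leaves on a left spine.
leftSpine :: Int -> Tree Int
leftSpine n = foldl (\t x -> Fork t (Leaf x)) (Leaf 1) [2 .. n]
```

Both functions return the same list for every tree; only the number of steps differs. For leftSpine 1000 both yield [1 .. 1000].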

module Tree2
where
import Prelude hiding ( head, tail
                      , last, init
                      )
import qualified Prelude as P
import ShowTree
data Tree a = Nil
            | Leaf a
            | Fork (Tree a) (Tree a)
            deriving (Show)
-- invariant:
-- Nil only as root
invTree :: Tree a -> Bool
invTree Nil          = True
invTree (Leaf _x)    = True
invTree (Fork Nil r) = False
invTree (Fork l Nil) = False
invTree (Fork l r)   = invTree l && invTree r
-- slow flatten
flatten :: Tree a -> [a]
flatten Nil        = []
flatten (Leaf x)   = [x]
flatten (Fork l r) = flatten l ++ flatten r
-- fast flatten
flatten1 :: Tree a -> [a]
flatten1 t = go t []
  where
    go Nil        acc = acc
    go (Leaf x)   acc = x : acc
    go (Fork l r) acc = (go l . go r) acc
flatten1' :: Tree a -> [a]
flatten1' t = go t []
  where
    go Nil        = id
    go (Leaf x)   = (x :)
    go (Fork l r) = go l . go r
-- simple minded build, builds lists as trees
build0 :: [a] -> Tree a
build0 xs = foldr (<++>) Nil (map Leaf xs)
-- smart but a bit slow build due to splitAt and length
build1 :: [a] -> Tree a
build1 []  = Nil
build1 [x] = Leaf x
build1 xs  = Fork (build1 l) (build1 r)
  where
    (l, r) = splitAt (length xs `div` 2) xs
-- smart and fast build
build2 :: [a] -> Tree a
build2 xs
  | null xs   = Nil
  | otherwise = build' (map Leaf xs)
  where
    build' [t] = t
    build' ts  = build' (merge ts)
    merge (t1 : t2 : ts) = Fork t1 t2 : merge ts
    merge ts             = ts
-- some test trees
t0, t1, t2, t100 :: Tree Int
t0   = build0 [1..9]
t1   = build1 [1..9]
t2   = build2 [1..9]
t100 = build1 [1..100]
-- list like functions for trees
head :: Tree a -> a
head Nil         = error "head: empty tree"
head (Leaf x)    = x
head (Fork l _r) = head l
last :: Tree a -> a
last Nil         = error "last: empty tree"
last (Leaf x)    = x
last (Fork _l r) = last r
-- <++> is ++ for trees
infixr 5 <++>
(<++>) :: Tree a -> Tree a -> Tree a
Nil <++> t2  = t2
t1  <++> Nil = t1
t1  <++> t2  = Fork t1 t2
cons :: a -> Tree a -> Tree a
cons x t = Leaf x <++> t
snoc :: Tree a -> a -> Tree a
snoc t x = t <++> Leaf x
tail :: Tree a -> Tree a
tail Nil        = error "tail: empty tree"
tail (Leaf _x)  = Nil
tail (Fork l r) = tail l <++> r
init :: Tree a -> Tree a
init Nil        = error "init: empty tree"
init (Leaf _x)  = Nil
init (Fork l r) = l <++> init r
viewL :: Tree a -> Maybe (a, Tree a)
viewL Nil       = Nothing
viewL (Leaf x)  = Just (x, Nil)
viewL (Fork l r)= Just (x, r1 <++> r)
                  where
                    Just (x, r1) = viewL l
viewR :: Tree a -> Maybe (Tree a, a)
viewR Nil       = Nothing
viewR (Leaf x)  = Just (Nil, x)
viewR (Fork l r)= Just (l <++> l1, x)   -- l precedes l1, the initial segment of r
                  where
                    Just (l1, x) = viewR r
mapTree :: (a -> b) -> Tree a -> Tree b
mapTree f Nil        = Nil
mapTree f (Leaf x)   = Leaf (f x)
mapTree f (Fork l r) = Fork (mapTree f l) (mapTree f r)
instance Functor Tree where
  fmap = mapTree
filterTree :: (a -> Bool) -> Tree a -> Tree a
filterTree p Nil        = Nil
filterTree p (Leaf x)
    | p x               = Leaf x
    | otherwise         = Nil
filterTree p (Fork l r) = filterTree p l <++> filterTree p r
-- nice try:
-- filterTree p (Fork l r) = filterTree p l `Fork` filterTree p r
sumTree :: Num a => Tree a -> a
sumTree Nil        = 0
sumTree (Leaf x)   = x
sumTree (Fork l r) = sumTree l + sumTree r
size :: Tree a -> Int
size Nil        = 0
size (Leaf _)   = 1
size (Fork l r) = size l + size r
minpath :: Tree a -> Int
minpath Nil        = 0
minpath (Leaf _)   = 1
minpath (Fork l r) = (minpath l `min` minpath r) + 1
maxpath :: Tree a -> Int
maxpath Nil        = 0
maxpath (Leaf _)   = 1
maxpath (Fork l r) = (maxpath l `max` maxpath r) + 1
fold :: (b -> b -> b) -> (a -> b) -> b ->
        Tree a -> b
fold op f c Nil        = c
fold op f c (Leaf x)   = f x
fold op f c (Fork l r) = fold op f c l `op` fold op f c r
fold' :: (b -> b -> b) -> (a -> b) -> b ->
         Tree a -> b
fold' op f c = fold''
  where
    fold'' Nil        = c
    fold'' (Leaf x)   = f x
    fold'' (Fork l r) = fold'' l `op` fold'' r
sumTree' :: Num a => Tree a -> a
sumTree' = fold (+) id 0
size' :: Tree a -> Int
size' = fold (+) (const 1) 0
minpath' :: Tree a -> Int
minpath' = fold (\ x y -> x `min` y + 1) (const 1) 0
maxpath' :: Tree a -> Int
maxpath' = fold (\ x y -> x `max` y + 1) (const 1) 0
notNil :: Tree a -> Bool
notNil = fold (&&) (const True) False
-- slow flatten with fold
flatten' :: Tree a -> [a]
flatten' = fold (++) (\ x -> [x]) []
-- fast flatten, O(n)
flatten'' :: Tree a -> [a]
flatten'' = go []
  where
    go acc Nil = acc
    go acc (Leaf x) = x : acc
    go acc (Fork l r) = go (go acc r) l
-- mapTree with fold, like map for lists with foldr
mapTree' :: (a -> b) -> Tree a -> Tree b
mapTree' f = fold Fork (Leaf . f) Nil
-- --------------------
--
-- conversion of trees into pseudo graphics
showTree :: Show a => Tree a -> String
showTree = formatStringNTree . toNTree
  where
    toNTree Nil        = NTree "Nil" []
    toNTree (Leaf x)   = NTree ("Leaf " ++ show x) []
    toNTree (Fork l r) = NTree "Fork" [toNTree l, toNTree r]
-- formatted print of trees
printTree :: Show a => Tree a -> IO ()
printTree = putStrLn . showTree
-- --------------------
