外面沙塵滾滾一直向北去了,意識到年關到了,碼農們都回鄉過年去了,而我卻留在這裡玩弄“拉鏈”。不要想歪了,我說的不是褲襠拉鏈而是scalaz Zipper,一種泛函數據結構游標(cursor)。在函數式編程模式里的集合通常是不可變的(immutable collection),我們會發現在FP編程過....
外面沙塵滾滾一直向北去了,意識到年關到了,碼農們都回鄉過年去了,而我卻留在這裡玩弄“拉鏈”。不要想歪了,我說的不是褲襠拉鏈而是scalaz Zipper,一種泛函數據結構游標(cursor)。在函數式編程模式里的集合通常是不可變的(immutable collection),我們會發現在FP編程過程中處理不可變集合(immutable collection)數據的方式好像總是缺些什麼,比如在集合里左右逐步游動像moveNext,movePrev等等,在一個集合的中間進行添加、更新、刪除的功能更是欠奉了,這主要是因為操作效率問題。不可變集合只有對前置操作(prepend operation)才能獲得可靠的效率,即對集合首位元素的操作,能得到相當於O(1)的速度,其它操作基本上都是O(n)速度,n是集合的長度,也就是隨著集合的長度增加,操作效率會以倍數下降。還有一個原因就是編程時會很不方便,因為大多數程式都會對各種集合進行大量的操作,最終也會導致程式的複雜臃腫,不符合函數式編程要求的精簡優雅表達形式。我想可能就是因為以上各種原因,scalaz提供了Zipper typeclass幫助對不可變集合操作的編程。Zipper的定義如下:scalaz/Zipper.scala
final case class Zipper[+A](lefts: Stream[A], focus: A, rights: Stream[A])
它以Stream為基礎,A可以是任何類型,無論基礎類型或高階類型。Zipper的結構如上:當前焦點視窗、左邊一串數據元素、右邊一串,形似拉鏈,因而命名Zipper。或者這樣看會更形象一點:
final case class Zipper[+A](
lefts: Stream[A],
focus: A,
rights: Stream[A])
scalaz提供了Zipper構建函數可以直接用Stream生成一個Zipper:
trait StreamFunctions {
...
final def toZipper[A](as: Stream[A]): Option[Zipper[A]] = as match {
case Empty => None
case h #:: t => Some(Zipper.zipper(empty, h, t))
}
final def zipperEnd[A](as: Stream[A]): Option[Zipper[A]] = as match {
case Empty => None
case _ =>
val x = as.reverse
Some(Zipper.zipper(x.tail, x.head, empty))
}
...
zipperEnd生成倒排序的Zipper:
1 Stream(1,2,3).toZipper //> res2: Option[scalaz.Zipper[Int]] = Some(Zipper(<lefts>, 1, <rights>))
2 Stream("A","B","C").toZipper //> res3: Option[scalaz.Zipper[String]] = Some(Zipper(<lefts>, A, <rights>))
3 Stream(Stream(1,2),Stream(3,4)).toZipper //> res4: Option[scalaz.Zipper[scala.collection.immutable.Stream[Int]]] = Some(Z
4 //| ipper(<lefts>, Stream(1, ?), <rights>))
5 Stream(1,2,3).zipperEnd //> res5: Option[scalaz.Zipper[Int]] = Some(Zipper(<lefts>, 3, <rights>))
scalaz也為List,NonEmptyList提供了Zipper構建函數:
trait ListFunctions {
...
final def toZipper[A](as: List[A]): Option[Zipper[A]] =
stream.toZipper(as.toStream)
final def zipperEnd[A](as: List[A]): Option[Zipper[A]] =
stream.zipperEnd(as.toStream)
...
final class NonEmptyList[+A] private[scalaz](val head: A, val tail: List[A]) {
...
def toZipper: Zipper[A] = zipper(Stream.Empty, head, tail.toStream)
def zipperEnd: Zipper[A] = {
import Stream._
tail.reverse match {
case Nil => zipper(empty, head, empty)
case t :: ts => zipper(ts.toStream :+ head, t, empty)
}
}
...
都是先轉換成Stream再生成Zipper的。Zipper本身的構建函數是zipper,在NonEmptyList的Zipper生成中調用過:
trait ZipperFunctions {
def zipper[A](ls: Stream[A], a: A, rs: Stream[A]): Zipper[A] =
Zipper(ls, a, rs)
}
用這些串形結構的構建函數產生Zipper同樣很簡單:
1 List(1,2,3,4).toZipper //> res0: Option[scalaz.Zipper[Int]] = Some(Zipper(<lefts>, 1, <rights>))
2 List(List(1,2),List(2,3)).toZipper //> res1: Option[scalaz.Zipper[List[Int]]] = Some(Zipper(<lefts>, List(1, 2), <r
3 //| ights>))
4 NonEmptyList("A","C","E").toZipper //> res2: scalaz.Zipper[String] = Zipper(<lefts>, A, <rights>)
5 NonEmptyList(1,2,3).zipperEnd //> res3: scalaz.Zipper[Int] = Zipper(<lefts>, 3, <rights>)
6
有了串形集合的Zipper構建方法後我們再看看一下Zipper的左右游動函數:
final case class Zipper[+A](lefts: Stream[A], focus: A, rights: Stream[A]) {
...
/**
* Possibly moves to next element to the right of focus.
*/
def next: Option[Zipper[A]] = rights match {
case Stream.Empty => None
case r #:: rs => Some(zipper(Stream.cons(focus, lefts), r, rs))
}
/**
* Possibly moves to next element to the right of focus.
*/
def nextOr[AA >: A](z: => Zipper[AA]): Zipper[AA] =
next getOrElse z
/**
* Possibly moves to the previous element to the left of focus.
*/
def previous: Option[Zipper[A]] = lefts match {
case Stream.Empty => None
case l #:: ls => Some(zipper(ls, l, Stream.cons(focus, rights)))
}
/**
* Possibly moves to previous element to the left of focus.
*/
def previousOr[AA >: A](z: => Zipper[AA]): Zipper[AA] =
previous getOrElse z
/**
* Moves focus n elements in the zipper, or None if there is no such element.
*
* @param n number of elements to move (positive is forward, negative is backwards)
*/
def move(n: Int): Option[Zipper[A]] = {
@tailrec
def move0(z: Option[Zipper[A]], n: Int): Option[Zipper[A]] =
if (n > 0 && rights.isEmpty || n < 0 && lefts.isEmpty) None
else {
if (n == 0) z
else if (n > 0) move0(z flatMap ((_: Zipper[A]).next), n - 1)
else move0(z flatMap ((_: Zipper[A]).previous), n + 1)
}
move0(Some(this), n)
}
/**
* Moves focus to the start of the zipper.
*/
def start: Zipper[A] = {
val rights = this.lefts.reverse ++ focus #:: this.rights
this.copy(Stream.Empty, rights.head, rights.tail)
}
/**
* Moves focus to the end of the zipper.
*/
def end: Zipper[A] = {
val lefts = this.rights.reverse ++ focus #:: this.lefts
this.copy(lefts.tail, lefts.head, Stream.empty)
}
/**
* Moves focus to the nth element of the zipper, or the default if there is no such element.
*/
def moveOr[AA >: A](n: Int, z: => Zipper[AA]): Zipper[AA] =
move(n) getOrElse z
...
start,end,move,next,previous移動方式都齊了。還有定位函數:
...
/**
* Moves focus to the nearest element matching the given predicate, preferring the left,
* or None if no element matches.
*/
def findZ(p: A => Boolean): Option[Zipper[A]] =
if (p(focus)) Some(this)
else {
val c = this.positions
std.stream.interleave(c.lefts, c.rights).find((x => p(x.focus)))
}
/**
* Moves focus to the nearest element matching the given predicate, preferring the left,
* or the default if no element matches.
*/
def findZor[AA >: A](p: A => Boolean, z: => Zipper[AA]): Zipper[AA] =
findZ(p) getOrElse z
/**
* Given a traversal function, find the first element along the traversal that matches a given predicate.
*/
def findBy[AA >: A](f: Zipper[AA] => Option[Zipper[AA]])(p: AA => Boolean): Option[Zipper[AA]] = {
@tailrec
def go(zopt: Option[Zipper[AA]]): Option[Zipper[AA]] = {
zopt match {
case Some(z) => if (p(z.focus)) Some(z) else go(f(z))
case None => None
}
}
go(f(this))
}
/**
* Moves focus to the nearest element on the right that matches the given predicate,
* or None if there is no such element.
*/
def findNext(p: A => Boolean): Option[Zipper[A]] = findBy((z: Zipper[A]) => z.next)(p)
/**
* Moves focus to the previous element on the left that matches the given predicate,
* or None if there is no such element.
*/
def findPrevious(p: A => Boolean): Option[Zipper[A]] = findBy((z: Zipper[A]) => z.previous)(p)
...
操作函數如下:
...
/**
* An alias for insertRight
*/
def insert[AA >: A]: (AA => Zipper[AA]) = insertRight(_: AA)
/**
* Inserts an element to the left of focus and focuses on the new element.
*/
def insertLeft[AA >: A](y: AA): Zipper[AA] = zipper(lefts, y, focus #:: rights)
/**
* Inserts an element to the right of focus and focuses on the new element.
*/
def insertRight[AA >: A](y: AA): Zipper[AA] = zipper(focus #:: lefts, y, rights)
/**
* An alias for `deleteRight`
*/
def delete: Option[Zipper[A]] = deleteRight
/**
* Deletes the element at focus and moves the focus to the left. If there is no element on the left,
* focus is moved to the right.
*/
def deleteLeft: Option[Zipper[A]] = lefts match {
case l #:: ls => Some(zipper(ls, l, rights))
case Stream.Empty => rights match {
case r #:: rs => Some(zipper(Stream.empty, r, rs))
case Stream.Empty => None
}
}
/**
* Deletes the element at focus and moves the focus to the left. If there is no element on the left,
* focus is moved to the right.
*/
def deleteLeftOr[AA >: A](z: => Zipper[AA]): Zipper[AA] =
deleteLeft getOrElse z
/**
* Deletes the element at focus and moves the focus to the right. If there is no element on the right,
* focus is moved to the left.
*/
def deleteRight: Option[Zipper[A]] = rights match {
case r #:: rs => Some(zipper(lefts, r, rs))
case Stream.Empty => lefts match {
case l #:: ls => Some(zipper(ls, l, Stream.empty))
case Stream.Empty => None
}
}
/**
* Deletes the element at focus and moves the focus to the right. If there is no element on the right,
* focus is moved to the left.
*/
def deleteRightOr[AA >: A](z: => Zipper[AA]): Zipper[AA] =
deleteRight getOrElse z
/**
* Deletes all elements except the focused element.
*/
def deleteOthers: Zipper[A] = zipper(Stream.Empty, focus, Stream.Empty)
...
/**
* Update the focus in this zipper.
*/
def update[AA >: A](focus: AA) = {
this.copy(this.lefts, focus, this.rights)
}
/**
* Apply f to the focus and update with the result.
*/
def modify[AA >: A](f: A => AA) = this.update(f(this.focus))
...
insert,modify,delete也很齊備。值得註意的是多數Zipper的移動函數和操作函數都返回Option[Zipper[A]]類型,如此我們可以用flatMap把這些動作都連接起來。換句話說就是我們可以用for-comprehension在Option的context內實現行令編程(imperative programming)。我們可以通過一些例子來示範Zipper用法:
1 val zv = for {
2 z <- List(2,8,1,5,4,11).toZipper
3 s1 <- z.next
4 s2 <- s1.modify{_ + 2}.some
5 } yield s2 //> zv : Option[scalaz.Zipper[Int]] = Some(Zipper(<lefts>, 10, <rights>))
6
7 zv.get.show //> res8: scalaz.Cord = Zipper(Stream(2), 10, Stream(1,5,4,11))
8 zv.get.toList //> res9: List[Int] = List(2, 10, 1, 5, 4, 11)
9 ...
10 val zv = for {
11 z <- List(2,8,1,5,4,11).toZipper
12 s1 <- z.next
13 s2 <- s1.modify{_ + 2}.some
14 s3 <- s2.move(1)
15 s4 <- s3.delete
16 } yield s4 //> zv : Option[scalaz.Zipper[Int]] = Some(Zipper(<lefts>, 5, <rights>))
17
18 zv.get.show //> res8: scalaz.Cord = Zipper(Stream(10,2), 5, Stream(4,11))
19 zv.get.toList //> res9: List[Int] = List(2, 10, 5, 4, 11)
20 ...
21 val zv = for {
22 z <- List(2,8,1,5,4,11).toZipper
23 s1 <- z.next
24 s2 <- s1.modify{_ + 2}.some
25 s3 <- s2.move(1)
26 s4 <- s3.delete
27 s5 <- s4.findZ {_ === 11}
28 s6 <- if (s5.focus === 12) s5.delete else s2.insert(12).some
29 } yield s6 //> zv : Option[scalaz.Zipper[Int]] = Some(Zipper(<lefts>, 12, <rights>))
30
31 zv.get.show //> res8: scalaz.Cord = Zipper(Stream(10,2), 12, Stream(1,5,4,11))
32 zv.get.toList //> res9: List[Int] = List(2, 10, 12, 1, 5, 4, 11)
33 ...
34 val zv = for {
35 z <- List(2,8,1,5,4,11).toZipper
36 s1 <- z.next
37 s2 <- s1.modify{_ + 2}.some
38 s3 <- s2.move(1)
39 s4 <- s3.delete
40 s5 <- s4.findZ {_ === 11}
41 s6 <- if (s5.focus === 12) s5.delete else s2.insert(12).some
42 s7 <- s6.end.delete
43 s8 <- s7.start.some
44 } yield s8 //> zv : Option[scalaz.Zipper[Int]] = Some(Zipper(<lefts>, 2, <rights>))
45
46 zv.get.show //> res8: scalaz.Cord = Zipper(Stream(), 2, Stream(10,12,1,5,4))
47 zv.get.toList //> res9: List[Int] = List(2, 10, 12, 1, 5, 4)
我在上面的程式里在for{...}yield裡面逐條添加指令從而示範游標當前焦點和集合元素跟隨著的變化。這段程式可以說就是一段行令程式。
回到上面提到的效率和代碼質量討論。我們提過scalaz提供Zipper就是為了使集合操作編程更簡明優雅,實際情況是怎樣的呢?
舉個例子:有一串數字,比如:List(1,4,7,9,5,6,10), 我想找出第一個高點元素,它的左邊低,右邊高,在我們的例子里是元素9。如果我們嘗試用習慣的行令方式用索引去編寫這個函數:
def peak(list: List[Int]): Option[Int] = {
list.indices.find { index =>
val x = list(index)
index > 0 && index < list.size - 1 &&
x > list(index - 1) && x > list(index + 1)
}.map(list(_))
}
哇!這東西不但極其複雜難懂而且效率低下,重覆用find索引導致速度降到O(n * n)。如果用Array會把效率提高到O(n),不過我們希望用immutable方式。那麼用函數式編程方式呢?
def peak_fp(list: List[Int]): Option[Int] = list match {
case x :: y :: z :: tl if y > x && y > z => Some(y)
case x :: tl => peak(tl)
case Nil => None
}
用模式匹配(pattern matching)和遞歸演算法(recursion),這段程式好看多了,而且效率也可以提高到O(n)。
但我們再把情況搞得複雜一點:把高點值增高一點(+1)。還是用FP方式編寫:
def raisePeak(list: List[Int]): Option[List[Int]] = {
def rec(head: List[Int], tail: List[Int]): Option[List[Int]] = tail match {
case x :: y :: z :: tl if y > x && y > z =>
Some((x :: head).reverse ::: ((y +1) :: z :: tl))
case x :: tl => rec(x :: head, tl) case Nil => None
}
rec(List.empty, list)
}
代碼又變得臃腫複雜起來。看來僅僅用FP編程方式還不足夠,還需要用一些新的數據結構什麼的來幫助。scalaz的Zipper可以在這個場景里派上用場了:
def raisePeak_z(list: List[Int]): Option[List[Int]] = {
for {
zipper <- list.toZipper
peak <- zipper.positions.findNext( z =>
(z.previous, z.next) match {
case (Some(p), Some(n)) => p.focus < z.focus && n.focus < z.focus
case _ => false
})
} yield (peak.focus.modify(_ + 1).toStream.toList)
}
用Zipper來寫程式表達清楚許多。這裡用上了Zipper.positions:
/**
* A zipper of all positions of the zipper, with focus on the current position.
*/
def positions: Zipper[Zipper[A]] = {
val left = std.stream.unfold(this)(_.previous.map(x => (x, x)))
val right = std.stream.unfold(this)(_.next.map(x => (x, x)))
zipper(left, this, right)
}
positions函數返回類型是Zipper[Zipper[A]]符合findNext使用。我們前面已經提到:使用Zipper的成本約為O(n)。