AVL Trees CSE 373 Data Structures Lecture 8
12/26/03AVL Trees - Lecture 82 Readings Reading Section 4.4,
12/26/03AVL Trees - Lecture 83 Binary Search Tree - Best Time All BST operations are O(d), where d is tree depth minimum d is for a binary tree with N nodes What is the best case tree? What is the worst case tree? So, best case running time of BST operations is O(log N)
12/26/03AVL Trees - Lecture 84 Binary Search Tree - Worst Time Worst case running time is O(N) What happens when you Insert elements in ascending order? Insert: 2, 4, 6, 8, 10, 12 into an empty BST Problem: Lack of balance: compare depths of left and right subtree Unbalanced degenerate tree
12/26/03AVL Trees - Lecture 85 Balanced and unbalanced BST Is this balanced?
12/26/03AVL Trees - Lecture 86 Approaches to balancing trees Don't balance May end up with some nodes very deep Strict balance The tree must always be balanced perfectly Pretty good balance Only allow a little out of balance Adjust on access Self-adjusting
12/26/03AVL Trees - Lecture 87 Balancing Binary Search Trees Many algorithms exist for keeping binary search trees balanced Adelson-Velskii and Landis (AVL) trees (height-balanced trees) Splay trees and other self-adjusting trees B-trees and other multiway search trees
12/26/03AVL Trees - Lecture 88 Perfect Balance Want a complete tree after every operation tree is full except possibly in the lower right This is expensive For example, insert 2 in the tree on the left and then rebuild as a complete tree Insert 2 & complete tree
12/26/03AVL Trees - Lecture 89 AVL - Good but not Perfect Balance AVL trees are height-balanced binary search trees Balance factor of a node height(left subtree) - height(right subtree) An AVL tree has balance factor calculated at every node For every node, heights of left and right subtree can differ by no more than 1 Store current heights in each node
12/26/03AVL Trees - Lecture 810 Height of an AVL Tree N(h) = minimum number of nodes in an AVL tree of height h. Basis N(0) = 1, N(1) = 2 Induction N(h) = N(h-1) + N(h-2) + 1 Solution (recall Fibonacci analysis) N(h) > h ( 1.62) h-1 h-2 h
12/26/03AVL Trees - Lecture 811 Height of an AVL Tree N(h) > h ( 1.62) Suppose we have n nodes in an AVL tree of height h. n > N(h) (because N(h) was the minimum) n > h hence log n > h (relatively well balanced tree!!) h < 1.44 log 2 n (i.e., Find takes O(logn))
12/26/03AVL Trees - Lecture 812 Node Heights height of node = h balance factor = h left -h right empty height = height=2 BF=1-0= Tree A (AVL)Tree B (AVL)
12/26/03AVL Trees - Lecture 813 Node Heights after Insert height of node = h balance factor = h left -h right empty height = balance factor 1-(-1) = 2 Tree A (AVL)Tree B (not AVL)
12/26/03AVL Trees - Lecture 814 Insert and Rotation in AVL Trees Insert operation may cause balance factor to become 2 or –2 for some node only nodes on the path from insertion point to root node have possibly changed in height So after the Insert, go back up to the root node by node, updating heights If a new balance factor (the difference h left - h right ) is 2 or –2, adjust tree by rotation around the node
12/26/03AVL Trees - Lecture 815 Single Rotation in an AVL Tree
12/26/03AVL Trees - Lecture 816 Let the node that needs rebalancing be. There are 4 cases: Outside Cases (require single rotation) : 1. Insertion into left subtree of left child of. 2. Insertion into right subtree of right child of. Inside Cases (require double rotation) : 3. Insertion into right subtree of left child of. 4. Insertion into left subtree of right child of. The rebalancing is performed through four separate rotation algorithms. Insertions in AVL Trees
12/26/03AVL Trees - Lecture 817 j k XY Z Consider a valid AVL subtree AVL Insertion: Outside Case h h h
12/26/03AVL Trees - Lecture 818 j k X Y Z Inserting into X destroys the AVL property at node j AVL Insertion: Outside Case h h+1h
12/26/03AVL Trees - Lecture 819 j k X Y Z Do a right rotation AVL Insertion: Outside Case h h+1h
12/26/03AVL Trees - Lecture 820 j k X Y Z Do a right rotation Single right rotation h h+1h
12/26/03AVL Trees - Lecture 821 j k X Y Z Right rotation done! (Left rotation is mirror symmetric) Outside Case Completed AVL property has been restored! h h+1 h
12/26/03AVL Trees - Lecture 822 j k XY Z AVL Insertion: Inside Case Consider a valid AVL subtree h h h
12/26/03AVL Trees - Lecture 823 Inserting into Y destroys the AVL property at node j j k X Y Z AVL Insertion: Inside Case Does right rotation restore balance? h h+1h
12/26/03AVL Trees - Lecture 824 j k X Y Z Right rotation does not restore balance… now k is out of balance AVL Insertion: Inside Case h h+1 h
12/26/03AVL Trees - Lecture 825 Consider the structure of subtree Y… j k X Y Z AVL Insertion: Inside Case h h+1h
12/26/03AVL Trees - Lecture 826 j k X V Z W i Y = node i and subtrees V and W AVL Insertion: Inside Case h h+1h h or h-1
12/26/03AVL Trees - Lecture 827 j k X V Z W i AVL Insertion: Inside Case We will do a left-right double rotation...
12/26/03AVL Trees - Lecture 828 j k X V Z W i Double rotation : first rotation left rotation complete
12/26/03AVL Trees - Lecture 829 j k X V Z W i Double rotation : second rotation Now do a right rotation
12/26/03AVL Trees - Lecture 830 j k X V Z W i Double rotation : second rotation right rotation complete Balance has been restored h h h or h-1
12/26/03AVL Trees - Lecture 831 Implementation balance (1,0,-1) key right left No need to keep the height; just the difference in height, i.e. the balance factor; this has to be modified on the path of insertion even if you dont perform rotations Once you have performed a rotation (single or double) you wont need to go back up the tree
12/26/03AVL Trees - Lecture 832 Single Rotation RotateFromRight(n : reference node pointer) { p : node pointer; p := n.right; n.right := p.left; p.left := n; n := p } X YZ n You also need to modify the heights or balance factors of n and p Insert
12/26/03AVL Trees - Lecture 833 Double Rotation Implement Double Rotation in two lines. DoubleRotateFromRight(n : reference node pointer) { ???? } X n VW Z
12/26/03AVL Trees - Lecture 834 Insertion in AVL Trees Insert at the leaf (as for all BST) only nodes on the path from insertion point to root node have possibly changed in height So after the Insert, go back up to the root node by node, updating heights If a new balance factor (the difference h left - h right ) is 2 or –2, adjust tree by rotation around the node
12/26/03AVL Trees - Lecture 835 Insert in BST Insert(T : reference tree pointer, x : element) : integer { if T = null then T := new tree; T.data := x; return 1;//the links to //children are null case T.data = x : return 0; //Duplicate do nothing T.data > x : return Insert(T.left, x); T.data < x : return Insert(T.right, x); endcase }
12/26/03AVL Trees - Lecture 836 Insert in AVL trees Insert(T : reference tree pointer, x : element) : { if T = null then {T := new tree; T.data := x; height := 0; return;} case T.data = x : return ; //Duplicate do nothing T.data > x : Insert(T.left, x); if ((height(T.left)- height(T.right)) = 2){ if (T.left.data > x ) then //outside case T = RotatefromLeft (T); else //inside case T = DoubleRotatefromLeft (T);} T.data < x : Insert(T.right, x); code similar to the left case Endcase T.height := max(height(T.left),height(T.right)) +1; return; }
12/26/03AVL Trees - Lecture 837 Example of Insertions in an AVL Tree Insert 5, 40
12/26/03AVL Trees - Lecture 838 Example of Insertions in an AVL Tree Now Insert 45
12/26/03AVL Trees - Lecture 839 Single rotation (outside case) Imbalance Now Insert 34
12/26/03AVL Trees - Lecture 840 Double rotation (inside case) Imbalance Insertion of
12/26/03AVL Trees - Lecture 841 AVL Tree Deletion Similar but more complex than insertion Rotations and double rotations needed to rebalance Imbalance may propagate upward so that many rotations may be needed.
12/26/03AVL Trees - Lecture 842 Arguments for AVL trees: 1.Search is O(log N) since AVL trees are always balanced. 2.Insertion and deletions are also O(logn) 3.The height balancing adds no more than a constant factor to the speed of insertion. Arguments against using AVL trees: 1.Difficult to program & debug; more space for balance factor. 2.Asymptotically faster but rebalancing costs time. 3.Most large searches are done in database systems on disk and use other structures (e.g. B-trees). 4.May be OK to have O(N) for a single operation if total run time for many consecutive operations is fast (e.g. Splay trees). Pros and Cons of AVL Trees
12/26/03AVL Trees - Lecture 843 Double Rotation Solution DoubleRotateFromRight(n : reference node pointer) { RotateFromLeft(n.right); RotateFromRight(n); } X n VW Z