Binary Search Trees

Berlin Chen 陳柏琳台灣師範大學資工系副教授

參考資料來源：http://www.csie.ntu.edu.tw/~ds/

Binary Search Tree

• Binary search tree– Every element has a unique key

– The keys in a nonempty left subtree (right subtree) are smaller (larger) than the key in the root of subtree

– The left and right subtrees are also binary search trees• A recursive definition!

65 8022

Searching a Binary Search Tree

tree_pointer search(tree_pointer root, int key){/* return a pointer to the node that contains key. If ther

e is no such node, return NULL */

if (!root) return NULL;

if (key == root->data) return root;

if (key < root->data) return search(root->left_child, key);

return search(root->right_child,key);}

A Recursive Version

Each tree node:

- A key value (data)

- Two pointers to it

left and right children

Complexity: O(h)

tree_pointer search2(tree_pointer tree, int key)

while (tree) {

if (key == tree->data) return tree;

if (key < tree->data)

tree = tree->left_child;

else tree = tree->right_child;

return NULL;

A Iterative Version

• Search by rank permitted if– Each node have an additional field LeftSize which is 1 plus the n

umber of elements in the left subtree of the node

tree_pointer search3(tree_pointer tree, int rank)

while (tree) {

if (rank == tree->LeftSize) return tree;

if (rank < tree-> LeftSize)

tree = tree->left_child;

rank-= tree->LeftSize;

tree = tree->right_child;

return NULL;

Insert Node in Binary Search Tree

2 35 80

Insert 80 Insert 35

Insertion into A Binary Search Tree

void insert_node(tree_pointer *node, int num){tree_pointer ptr, temp = modified_search(*node, num); if (temp || !(*node)) { ptr = (tree_pointer) malloc(sizeof(node)); if (IS_FULL(ptr)) { fprintf(stderr, “The memory is full\n”); exit(1); } ptr->data = num; ptr->left_child = ptr->right_child = NULL; if (*node) if (num<temp->data) temp->left_child=ptr; else temp->right_child = ptr; else *node = ptr; }}

// Not NULL

// Return last visited tree node

Deletion for A Binary Search Tree

• Delete a leaf node

– The left (or right) child field of its parent is set to 0 and the node is simply disposed

2 35 80

Delete 35 Delete 8030

• Delete a nonleaf node with only one child

– The node is deleted and the single-child takes the place of the disposed node

2 35 80

Delete 5 30

• Delete a nonleaf node with only one child

• Delete a nonleaf node that has two children

– The largest element in its left subtree or the smallest one in the right tree will replace the disposed mode

2 35 80

Delete 30 5

or 402

• Delete a nonleaf node that has two children– Recursive adjustments might happen

10 30 50 70

Before deleting 60 After deleting 60

Exercise

• Implement a binary search which is equipped with– Insertion operation

– Search operation• Search by key• Search by Rank

– Deletion operation

Tree A

struct DEF_TREE{ int Key; struct Tree *Child; struct Tree *Brother; struct Tree *Father;

Graphs

Berlin Chen 陳柏琳台灣師範大學資工所助理教授

參考資料來源：http://www.csie.ntu.edu.tw/~ds/

Definition

• A graph G consists of two sets– A finite, nonempty set of vertices V(G)– A finite, possible empty set of edges E(G)– G(V,E) represents a graph

• An undirected graph (graph) is one in which the pair of vertices in a edge is unordered, (v0, v1) = (v1,v0)

• A directed graph (digraph) is one in which each edge is a directed pair of vertices, <v0, v1> != <v1,v0>

tail head

Examples for Graph

3 4 5 6G1

V(G1)={0,1,2,3} E(G1)={(0,1),(0,2),(0,3),(1,2),(1,3),(2,3)}V(G2)={0,1,2,3,4,5,6} E(G2)={(0,1),(0,2),(1,3),(1,4),(2,5),(2,6)}V(G3)={0,1,2} E(G3)={<0,1>,<1,0>,<1,2>}

complete undirected graph: n(n-1)/2 edgescomplete directed graph: n(n-1) edges

complete graph incomplete graph

Complete Graph

• A complete graph is a graph that has the maximum number of edges

– For undirected graph with n vertices, the maximum number of edges is n(n-1)/2

– For directed graph with n vertices, the maximum number of edges is n(n-1)

– Example: G1 is a complete graph

complete graph

Adjacent and Incident

• If (v0, v1) is an edge in an undirected graph– v0 and v1 are adjacent– The edge (v0, v1) is incident on vertices v0 and v1

• If <v0, v1> is an edge in a directed graph– v0 is adjacent to v1, and v1 is adjacent from v0

– The edge <v0, v1> is incident on v0 and v1

3 4 5 6

Loop and Multigraph

• Example of a graph with feedback loops and a multigraph (Figure 6.3)– Edges of the form (v, v) or <v, v> : self edge or self loop

self edgemultigraph:multiple occurrencesof the same edge

Subgraph and Path

• A subgraph of G is a graph G’ such that V(G’) is a subset of V(G) and E(G’) is a subset of E(G)

• A path from vertex vp to vertex vq in a graph G, is a sequence of vertices, vp, vi1, vi2, ..., vin, vq, such that (vp, vi1), (vi1, vi2), ..., (vin, vq) are edges in an undirected graph

• The length of a path is the number of edges on it

3 (i) (ii) (iii) (iv) (a) Some of the subgraph of G1

2(i) (ii) (iii) (iv)

(b) Some of the subgraph of G3

分開單一

Figure 6.4: subgraphs of G1 and G3

Simple Path and Cycle

• A simple path is a path in which all vertices except possibly the first and the last are distinct

• A cycle is a simple path in which the first and the last vertices are the same

• In an undirected graph G, two vertices, v0 and v1, are connected if there is a path in G from v0 to v1

• An undirected graph is connected if, for every pair of distinct vertices vi, vj, there is a path from vi to vj

3 4 5 6G1

connected

tree (acyclic graph)

Connected Component

• A connected component of an undirected graph is a maximal connected subgraph

• A tree is a graph that is connected and acyclic

• A directed graph is strongly connected if there is a directed path from vi to vj and also from vj to vi

• A strongly connected component is a maximal subgraph that is strongly connected

Figure 6.5: A graph with two connected components

G4 (not connected)

connected component (maximal connected subgraph)

Figure 6.6: Strongly connected components of G3

not strongly connectedstrongly connected component

(maximal strongly connected subgraph)

Degree

• The degree of a vertex is the number of edges incident to that vertex

• For directed graph, – the in-degree of a vertex v is the number of edges

that have v as the head

– the out-degree of a vertex v is the number of edgesthat have v as the tail

– if di is the degree of a vertex i in a graph G with n vertices and e edges, the number of edges is

( ) /0

undirected graph

degree0

3 4 5 6

1 1 1 1

directed graphin-degreeout-degree

in:1, out: 1

in: 1, out: 2

in: 1, out: 0

ADT for Undirected Graph

structure Graph is objects: a nonempty set of vertices and a set of undirected edges, where each

edge is a pair of vertices

functions: for all graph Graph, v, v1 and v2 Vertices

Graph Create()::=return an empty graph

Graph InsertVertex(graph, v)::= return a graph with v inserted. v has no incident edge

Graph InsertEdge(graph, v1,v2)::= return a graph with new edge between v1 and v2

Graph DeleteVertex(graph, v)::= return a graph in which v and all edges incident to it are removed

Graph DeleteEdge(graph, v1, v2)::=return a graph in which the edge (v1, v2) is removed

Boolean IsEmpty(graph)::= if (graph==empty graph) return TRUE else return FALSE

List Adjacent(graph,v)::= return a list of all vertices that are adjacent to v

Graph Representations

• Adjacency Matrix• Adjacency Lists• Adjacency Multilists

Adjacency Matrix

• Let G=(V,E) be a graph with n vertices.

• The adjacency matrix of G is a two-dimensional n by n array, say adj_mat

• If the edge (vi, vj) is in E(G), adj_mat[i][j]=1

• If there is no such edge in E(G), adj_mat[i][j]=0

• The adjacency matrix for an undirected graph is symmetric; the adjacency matrix for a digraph need not be symmetric

Examples for Adjacency Matrix

symmetric

undirected: n2/2directed: n2

Merits of Adjacency Matrix

• From the adjacency matrix, to determine the connection of vertices is easy

• The degree of a vertex is

• For a digraph, the row sum is the out_degree, while the column sum is the in_degree

adj mat i jj

_ [ ][ ]

0],[)(

ji ijAvind

0],[)(

ji jiAvoutd

sum up a column sum up a row

Data Structures for Adjacency Lists

#define MAX_VERTICES 50typedef struct node *node_pointer;typedef struct node { int vertex; struct node *link;};node_pointer graph[MAX_VERTICES];int n=0; /* vertices currently in use */

Each row in adjacency matrix is represented as an adjacency list

01234567

1 2 30 2 30 1 30 1 2

1 20 30 31 254 65 76G4

An undirected graph with n vertices and e edges ==> n head nodes and 2e list nodes

Interesting Operations

• Degree of a vertex in an undirected graph– Number of nodes in adjacency list

• Number of edges in a graph– Determined in O(n+e)

• Out-degree of a vertex in a directed graph (digraph)– Number of nodes in its adjacency list

• In-degree of a vertex in a directed graph (digraph)– Traverse the whole data structure– Or maintained in inversed adjacency lists

Compact Representation

[0] 9 [8] 23 [16] 2[1] 11 [9] 1 [17] 5[2] 13 [10] 2 [18] 4[3] 15 [11] 0 [19] 6[4] 17 [12] 3 [20] 5[5] 18 [13] 0 [21] 7[6] 20 [14] 3 [22] 6[7] 22 [15] 1

node[0] … node[n-1]: starting point for vertices

node[n]: n+2e+1 (upper bound of the array)

The vertices adjacent from vertex i are stored in array indexed from node[i] to node[i+1]-1

Inverse Adjacency Lists

1 NULL

0 NULL

1 NULL

Determine in-degree of a vertex in a fast way.

Figure 6.10: Inverse adjacency lists for G3

• A simplified version of the list structure for sparse matrix representation

tail head column link for head row link for tail

Figure 6.11: Alternate node structure for adjacency lists

Information for an edge for in-degree information

for out-degree information

2 NULL

1 0 NULL

0 1 NULL NULL

1 2 NULL NULL

Figure 6.12: Orthogonal representation for graph G3

head nodes

(shown twice)

3 2 NULL 1 0

2 3 NULL 0 1

3 1 NULL 0 2

2 0 NULL 1 3

headnodes vertax link

Order is of no significance

Figure 6.13:Alternate order adjacency list for G1

Exercise

• P. 344 Exercises 2, 10

Adjacency Multilists

• An edge in an undirected graph is represented by two nodes in adjacency list representation.

• Adjacency Multilists– lists in which nodes may be shared among several lists

(an edge is shared by two different paths)

m vertex1 vertex2 list1 list2

A Boolean mark indicating whether

or not the edge has been examined

Example for Adjacency Multlists

Lists: vertex 0: N0->N1->N2 vertex 1: N0->N3->N4 vertex 2: N1->N3->N5 vertex 3: N2->N4->N5

0 1 N1 N3

0 2 N2 N3

0 3 N4

1 2 N4 N5

1 3 N5

edge (0,1)

edge (0,2)

edge (0,3)

edge (1,2)

edge (1,3)

edge (2,3)

3 six edges

Some Graph Operations

• TraversalGiven G=(V,E) and vertex v, find all wV, such that w connects v– Depth First Search (DFS)

preorder tree traversal

– Breadth First Search (BFS)level order tree traversal

• Connected Components

• Spanning Trees

Example: Graph G and its adjacency lists

depth first search: v0, v1, v3, v7, v4, v5, v2, v6

breadth first search: v0, v1, v2, v3, v4, v5, v6, v7

• Figure 6.16: visit all vertices in G that are reachable form v0

Depth First Search (DFS, dfs)

void dfs(int v){ node_pointer w; visited[v]= TRUE; printf(“%5d”, v); for (w=graph[v]; w; w=w->link) if (!visited[w->vertex]) dfs(w->vertex);}

#define FALSE 0#define TRUE 1short int visited[MAX_VERTICES];

Data structureadjacency list: O(e)adjacency matrix: O(n2)

A Recursive Function

Breadth First Search (BFS, bfs)

typedef struct queue *queue_pointer;

typedef struct queue {

int vertex;

queue_pointer link;

void addq(queue_pointer *, queue_pointer *, int);

int deleteq(queue_pointer *);

Breadth First Searchvoid bfs(int v){ node_pointer w; queue_pointer front, rear; front = rear = NULL; printf(“%5d”, v); visited[v] = TRUE; addq(&front, &rear, v);

adjacency list: O(e)adjacency matrix: O(n2)

while (front) { v= deleteq(&front); for (w=graph[v]; w; w=w->link) if (!visited[w->vertex]) { printf(“%5d”, w->vertex); addq(&front, &rear, w->vertex); visited[w->vertex] = TRUE; } }}

Connected Components

void connected(void){ for (i=0; i<n; i++) visited[i]=0;

for (i=0; i<n; i++) { if (!visited[i]) { dfs(i); printf(“\n”); } }}

adjacency list: O(n+e)adjacency matrix: O(n2)

Exercise

• Implement DFS and BFS• Implement a function to determine the connected

components for a graph

3 4 5 6

Spanning Trees

• When graph G is connected, a depth first or breadth first search starting at any vertex will visit all vertices in G– Form a tree that includes all the vertices of G

• A spanning tree is any tree that consists solely of edges in G and that includes all the vertices

• E(G): T (tree edges) + N (nontree edges)where T: set of edges used during search

N: set of remaining edges

Examples of Spanning Tree

Possible spanning trees

Spanning Trees

• Either DFS or BFS can be used to create a spanning tree

– When DFS is used, the resulting spanning tree is known as a depth first spanning tree

– When BFS is used, the resulting spanning tree is known as a breadth first spanning tree

• While adding a nontree edge into any spanning tree, this will create a cycle

DFS VS. BFS Spanning Tree

3 4 5 6

DFS Spanning BFS Spanning

3 4 5 6

7 nontree edgecycle

Biconnected Graph

• A spanning tree is a minimal subgraph, G’, of G such that V(G’)=V(G) and G’ is connected

• Any connected graph with n vertices must have at least n-1 edges

• A biconnected graph is a connected graph that has no articulation points

3 4 5 6

biconnected graph

Articulation Point

• A vertex v is an articulation point iff the deletion of v, together with the deletion of all edges incident to v, leaves behind a graph has at least two connected components

3 4 5 6

biconnected component: a maximal connected subgraph H(no subgraph that is both biconnected and properly contains H)

biconnected componentsTwo biconnected components can

at most have one vertex at common

Find biconnected component of a connected undirected graphby depth first spanning tree

3 0 5 3 5

4 61 6

9 8 9 8

(a) depth first spanning tree (b)

dfndepthfirst

number

nontreeedge

(back edge)

nontreeedge

(back edge)

If u is an ancestor of v then dfn(u) < dfn(v)

Minimum Cost Spanning Tree

• The cost of a spanning tree of a weighted undirected graph is the sum of the costs of the edges in the spanning tree

• A minimum cost spanning tree is a spanning tree of least cost

• Three different algorithms can be used– Kruskal– Prim– Sollin

Select n-1 edges from a weighted graphof n vertices with minimum cost.

Greedy Strategy

• An optimal solution is constructed in stages

• At each stage, the best decision is made at thistime

• Since this decision cannot be changed later, we make sure that the decision will result in a feasible solution

• Typically, the selection of an item at each stage is based on a least cost or a highest profit criterion

Kruskal’s Idea

• Build a minimum cost spanning tree T by adding edges to T one at a time

• Select the edges for inclusion in T in nondecreasing order of the cost

• An edge is added to T if it does not form a cycle

• Since G is connected and has n > 0 vertices, exactly n-1 edges will be selected

Examples for Kruskal’s Algorithm

121824

Examples for Kruskal’s Algorithm

28 + 3 6cycle

Examples for Kruskal’s Algorithm0 5

cost = 10 +25+22+12+16+14

Kruskal’s Algorithm

T= {};while (T contains less than n-1 edges

&& E is not empty) {choose a least cost edge (v,w) from E;delete (v,w) from E;if ((v,w) does not create a cycle in T) add (v,w) to T else discard (v,w);}if (T contains fewer than n-1 edges) printf(“No spanning tree\n”);

Exercise

• The missionaries and cannibals problem is usually stated as follows. Three missionaries and three cannibals are on one side of a river, along with a boat that can hold one or two people. Find a way to get every one to the other side, without leaving a group of missionaries in one place outnumbered by the cannibals in that place.

Binary Search Trees

Documents

Expression Tree Binary Tree Search

Trees עצים ועצי חיפוש Chapter 5.5– Trees (91 – 97) Chapter 13– Binary Search Trees (244 – 262) חומר קריאה לשיעור זה Lecture3 of Geiger & Itai’s

Chapter 6. Binary Search Trees Internet Computing KUT Youn-Hee Han

Balanced Binary Search Tree

Balanced Trees. Maintaining Balance Binary Search Tree – Height governed by Initial order Sequence of insertion/deletion – Changes occur at leaf nodes

Chapter 12 Digital Search Structures - 南華大學chun/DS(II)-Ch12-DigitalSearchStructures.pdf · 1 C-C Tsai P.1 Chapter 12 Digital Search Structures Digital Search Trees Binary

Chapter 11 Multiway Trees. Content Points Orchards, Trees, and Binary Trees Traverse of Orchards and trees Huffman trees

Data Structures( 数据结构 ) Course 8: Search Trees. 2 西南财经大学天府学院 Chapter 8 search trees Binary search trees and AVL trees 8-1 Binary search trees Problem:

New 5.7 Binary Search Treescontents.kocw.net/KOCW/document/2014/hanbat/ahnkeehong1/... · 2016. 9. 9. · 5.7.4 Deletion from a Binary Search Tree 1) 단말노드인경우: 해당노드

資料結構的樹與二元樹（Trees and Binary Treeswayne.cif.takming.edu.tw/datastru/tree.pdf · 資料結構的樹與二元樹（Trees and Binary Trees）資訊科技系林偉川

Algoritmet dhe struktura e të dhënaveBST (Binary Search Trees) Prishtinë, 2015/2016 ©vehbineziri.com 3 • Rregulla e ruajtës në BST • Të gjitha vlerat në nën pemën e majtë

Binary Search Trees and Rare Super Carswtkrieger.faculty.noctrl.edu › csc210-spring2020 › docs › pk03_bst.pdf · Implementation Keep track of the root, the left, and right links

Part4 search trees

1 Jeff Edmonds York University COSC 2011 Lecture 6 Balanced Trees Dictionary/Map ADT Binary Search Trees Insertions and Deletions AVL Trees Rebalancing

Optimal Binary Search Trees with Costs Depending on the Access

Balanced Binary Trees

Binary Search Tree - National Tsing Hua Universitywkhon/algo08-tutorials/tutorial-bst.pdf · 2008-04-10 · Introduction What’s a binary search tree? –It’s a binary tree ! –For

Inferring Strings from Suffix Trees and Links on a Binary Alphabet

Binary Search Tree

Balanced Binary Search Tree - KOCWcontents.kocw.net/KOCW/document/2014/HankukForeign/... · 2016. 9. 9. · Complexities on Binary Search Tree • Search, insertion, and deletion