diff --git a/Submissions/002709710_HemaSree_Bojanapally/Assignment3.README.md b/Submissions/002709710_HemaSree_Bojanapally/Assignment3.README.md new file mode 100644 index 0000000..c9a21d8 --- /dev/null +++ b/Submissions/002709710_HemaSree_Bojanapally/Assignment3.README.md @@ -0,0 +1,13 @@ +Name: Hema Sree Bojanapally +NUID: 002709710 +email: bojanapally.h@northeastern.edu + +##Assignment 3 + +This assignment's main goal was to provide with a solid understanding of algorithmic concepts and real-world uses. The exercise also entailed in improving logical thinking and critical thinking, providing to take on challenging computational problems. + +Below are the learnings and the concepts covered as part of Assignment 3: +- Network Flow +- Bellman-Ford Algorithm +- Push-Relabel Algorithm +- Ford- Fulkerson Algorithm diff --git a/Submissions/002709710_HemaSree_Bojanapally/Assignment3.ipynb b/Submissions/002709710_HemaSree_Bojanapally/Assignment3.ipynb new file mode 100644 index 0000000..ba3ab47 --- /dev/null +++ b/Submissions/002709710_HemaSree_Bojanapally/Assignment3.ipynb @@ -0,0 +1,432 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": { + "id": "9cEVuujzVQMh" + }, + "source": [ + "##### 1. \n", + "Give brief definitions of the following terms:\n", + "\n", + "\n", + "* Residual Graph\n", + "* Ford-Fulkerson\n", + "* Capacity Scaling\n", + "* Cut Capacity\n", + "* Push- Relabel\n", + "\n", + "**Answer:**\n", + "\n", + "* **Residual Graph** : A residual graph is a graph used in the context of network flow algorithms, representing the remaining capacity on each edge after the initial flow has been sent. It allows for augmenting paths to be found and additional flow to be sent through the network.\n", + "* **Ford-Fulkerson** : Ford-Fulkerson algorithm is a method used to compute the maximum flow in a flow network. It iteratively augments the flow along augmenting paths in the residual graph until no more augmenting paths can be found. The algorithm terminates when there are no more augmenting paths, giving the maximum flow value.\n", + "* **Capacity Scaling** : Capacity scaling is an optimization technique used in network flow algorithms to improve their efficiency. It involves iteratively finding augmenting paths only for edges with capacities greater than a certain threshold, ignoring edges with smaller capacities. As the algorithm progresses, the threshold decreases, allowing the discovery of smaller augmenting paths.\n", + "* **Push - Relabel** : The push-relabel algorithm is a method for computing the maximum flow in a network flow graph. It maintains a preflow and uses operations like push and relabel to push excess flow from higher nodes to lower nodes until an optimal flow is achieved.\n", + "* **Cut Capacity** : In network flow problems, a cut is a partition of the vertices into two disjoint sets. The cut capacity is the sum of capacities of the edges that cross the cut from one set to the other. It represents the maximum flow that can pass from the source set to the sink set and plays a crucial role in various flow network algorithms.\n", + "\n", + "**Reflection: **\n", + "\n", + "By interacting with ChatGPT, I gained insights into the essential concepts, ensuring that the problem I designed reflected the core ideas of network flow algorithms.\n", + "\n", + "Balancing relevance and originality was a challenge, encouraging innovative problem variations. I learned the importance of clarity and uniqueness in problem design. Justifying the solution enhanced my problem-solving skills. This task highlighted the collaborative potential of tools like ChatGPT, emphasizing their value in problem-solving.\n", + "\n", + "\n", + "\n" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "f9stZL8vuTJ9" + }, + "source": [ + "##### 2. \n", + "**Problem Statement:** \n", + " Given a weighted directed graph represented as an adjacency matrix and starting node A and ending node E, find the shortest path from node A to E using the Bellman-Ford algorithm.\n", + "\n", + "**Input:**\n", + "* An adjacency matrix representing the weighted directed graph.\n", + "* Starting node A and ending node E.\n", + "\n", + "**Output:**\n", + "* The shortest path from node A to E.\n", + "* If there is no path from A to E, output \"No path exists.\"\n", + "\n", + "*Sample Input:*\n", + "\n", + "\n", + "![image.png]()\n", + "\n", + "*Sample Output:*\n", + "\n", + "\n", + "\n", + "\n", + "\n", + "![image.png]()\n", + "\n", + "**Constraints:**\n", + "* The graph is represented as an N x N adjacency matrix (1 <= N <= 100).\n", + "* The weights of edges can be positive, negative, or zero (-1000 <= weight <= 1000).\n", + "* Nodes are represented by uppercase letters (A to E).\n", + "* The graph may not be connected, and there might be isolated nodes.\n", + "* The graph can contain cycles.\n", + "\n", + "**Solution and Justification:**\n", + "\n", + "The algorithm handles negative weights by detecting negative cycles. If a negative cycle is detected during the N-1 iterations, it indicates the presence of an infinite negative weight path, and the algorithm returns an error. Otherwise, it finds the shortest path.\n", + "\n", + "\n", + "![image.png]()\n", + "\n", + "* Choose path value 0 for the source vertex and infinity for all other vertices.\n", + "* If the new calculated path length is less than the previous path length, go to the source vertex's neighboring Edge and relax the path length of the adjacent Vertex.\n", + "* This procedure must be repeated V-1 times, where V is the number of vertices in total. This happened because, in the worst-case scenario, any vertex's path length can be changed N times to an even shorter path length.\n", + "* As a result, after V-1 iterations, you find your new path lengths and can determine in case the graph has a negative cycle or not.\n", + "\n", + "\n", + "**Pseudocode:**\n", + "\n", + " function bellmanFord(G, S)\n", + " for each vertex V in G\n", + " distance[V] <- infinite\n", + " previous[V] <- NULL\n", + " distance[S] <- 0\n", + "\n", + " for each vertex V in G\n", + " for each edge (U,V) in G\n", + " tempDistance <- distance[U] + edge_weight(U, V)\n", + " if tempDistance < distance[V]\n", + " distance[V] <- tempDistance\n", + " previous[V] <- U\n", + "\n", + " for each edge (U,V) in G\n", + " If distance[U] + edge_weight(U, V) < distance[V}\n", + " Error: Negative Cycle Exists\n", + "\n", + " return distance[], previous[]\n", + "\n", + "*Reflection:*\n", + "\n", + "ChatGPT was helpful in clarifying the Bellman-Ford algorithm's handling of negative weights and negative cycles. It provided insights into potential edge cases involving negative weights, enhancing the problem's depth and complexity.\n", + "\n", + "This task highlighted the importance of accounting for real-world scenarios, such as negative edge weights, in algorithmic problem design. It emphasized the significance of clear problem constraints and the need to address potential edge cases to create well-rounded problems.\n" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "rb0R_Z-wlujf" + }, + "source": [ + "#### 3.\n", + "**Problem Statement:**\n", + "\n", + "You are given a directed graph G = (V, E), with a positive integer\n", + "capacity ce on each edge e, a designated source s ~ V, and a designated sink\n", + "t ~ V. You are also given an integer maximum s-t flow in G, defined by a flow\n", + "value fe on each edge e. Now suppose we pick a specific edge e ~ E and increase its capacity by one unit. Show how to find a maximum flow in the resulting capacitated graph in time O(m + n), where m is the number of edges in G and n is the number of nodes.\n", + "\n", + "**Input:**\n", + "\n", + "* G=(V,E): The original graph with edge capacities.\n", + "\n", + "* s: The source node.\n", + "\n", + "* t: The sink node.\n", + "\n", + "* e: The specific edge whose capacity is increased.\n", + "\n", + "**Output:**\n", + "* Maximum flow in the updated graph G'\n", + "\n", + "**Constraints:**\n", + "\n", + "* 1 ≤ Number of nodes ∣V∣≤1000\n", + "* 1 ≤ Number of edges ∣E∣≤1000\n", + "* 1 ≤ Edge capacity ce≤100\n", + "\n", + "**Solution and Justitification:**\n", + "\n", + "* **Step 1: Create the Residual Graph**\n", + "\n", + " * Construct the residual graph Gr from the original graph G, considering both forward and backward edges. For each edge e(u,v) in G with capacity ce, add two edges to Gr:e(u,v) with residual capacity ce−fe and e(v,u) with residual capacity fe (where fe is the current flow on edge e).\n", + "\n", + "* **Step 2: Modify the Residual Graph**\n", + "\n", + " * If e(u,v) is the specific edge whose capacity is increased, set its capacity in the residual graph Gr to ce−fe+1. Otherwise, keep the capacities of all other edges in Gr the same.\n", + "\n", + "* **Step 3: Find Maximum Flow in Residual Graph**\n", + "\n", + " * Use Breadth-First Search (BFS) or Depth-First Search (DFS) to find the maximum flow in the modified residual graph Gr. Start the search from the source node s and continue until no augmenting path can be found.\n", + "\n", + "* **Step 4: Output Result**\n", + "\n", + " * The maximum flow found in step 3 represents the maximum flow in the updated capacitated graph G'after increasing the capacity of edge e.\n", + "\n", + "**Proof Of Correctness:**\n", + "\n", + "* Residual Graph Construction: The construction of the residual graph is based on standard rules, ensuring that residual capacities are non-negative and satisfy flow conservation.\n", + "\n", + "* Modification of Residual Capacities: Increasing the capacity of edge e by one unit ensures that the augmenting path can utilize this additional capacity.\n", + "\n", + "* Finding Augmenting Paths: The BFS or DFS algorithm finds augmenting paths by exploring edges with positive residual capacities, ensuring that the flow can be increased along these paths.\n", + "\n", + "* Termination: The algorithm terminates when no augmenting path can be found, indicating that the maximum flow in the updated graph has been achieved.\n", + "\n", + "*Reflection:*\n", + "\n", + "ChatGPT assisted in understanding the intricacies of graph algorithms, specifically in augmenting flows.\n", + "\n", + "One of the main challenges was maintaining the essence of the example problem while adding a unique twist. Ensuring that the modification (increasing the capacity of a specific edge) did not alter the fundamental principles of maximum flow algorithms was critical.\n", + "\n", + "I learned that even a seemingly minor change, like increasing the capacity of an edge, can lead to profound algorithmic implications" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "re1tvh8rkw79" + }, + "source": [ + "##### 4. \n", + "Given a flow network, let f be any flow and let (A,B) be any cut. Then, the net flow across (A,B) is __________ the value of f.\n", + "\n", + "\n", + "1. Lessthan\n", + "2. Greaterthan\n", + "3. Equal To\n", + "4. Not related to\n", + "\n", + "**Answer:** 3\n", + "\n", + "**Explanation:**\n", + "\n", + "* In a flow network, a cut (A,B) refers to a partition of the vertices into two disjoint subsets A and B. The net flow across the cut (A,B) is calculated by summing up the flow values of the edges going from set A to B and and subtracting the flow values of the edges going from set B to A.\n", + "\n", + "* Regardless of the specific flow f, the net flow across the cut (A,B) is equal to the total flow entering A and leaving B. This net flow is always equal to the value of the flow f in the network, as flow conservation ensures that the amount of flow entering a set of vertices is equal to the amount of flow leaving that set. Therefore, the net flow across the cut (A,B) is always equal to the value of f.\n", + "\n", + "*Reflection:*\n", + "\n", + "ChatGPT assisted in the formulation of a complex multiple-choice question on flow networks and minimum cuts. By discussing key concepts, I clarified the question's requirements and received guidance on crafting plausible yet incorrect answer choices, adding depth to the problem.\n", + "\n", + "The challenge was in achieving a balance between complexity and clarity, ensuring the question tested deep understanding without becoming convoluted. This task highlighted the importance of precision and unambiguous language in problem design. It underscored the need for questions that challenge students' core knowledge, fostering a profound understanding of algorithms rather than mere memorization. The experience emphasized the value of tools like ChatGPT in creating effective and insightful algorithmic problems.\n", + "\n", + "\n" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "vqz6A23wn-P2" + }, + "source": [ + "#### 5.\n", + "**Problem Statement:** \n", + " Given a weighted directed graph, where each edge has a capacity and starting node S and ending node T, find the maximum flow from node S to T using the Push-Relabel algorithm.\n", + "\n", + "**Input:**\n", + "* A weighted graph with starting node S and ending node T.\n", + "\n", + "**Output:**\n", + "* The maximum flow from S to T.\n", + "\n", + "*Sample Input:*\n", + "\n", + "![image.png]()\n", + "\n", + "*Sample Output:*\n", + "\n", + "![image.png]()\n", + "\n", + "**Constraints:**\n", + "* Edge capacities are positive integers (1 <= capacity <= 100).\n", + "* Nodes are represented by uppercase letters (S to T).\n", + "* The graph may not be connected, and there might be isolated nodes.\n", + "\n", + "**Solution and Justification:**\n", + "\n", + "Push-Relabel algorithm is used to find the maximum flow from node S to T in the given weighted directed graph. We Implement the algorithm using techniques like vertex relabeling and edge pushing to achieve the maximum flow.\n", + "\n", + "Initially the Flow=0. Let's take an augmenting path S-A-D-T and the capacity=8 So, we have to update the values of this augmenting path\n", + "\n", + "\n", + "![image.png]()\n", + "\n", + "\n", + "Now, take an augmenting path S-C-D-T, and the capacity=2\n", + "\n", + "![image.png]()\n", + "\n", + "\n", + "Take an augmenting path S-C-D-A-B-T, capacity=4\n", + "\n", + "![image.png]()\n", + "\n", + "Take an augmenting path S-A-D-B-T,capacity=2\n", + "\n", + "![image.png]()\n", + "\n", + "Take an augmenting path S-C-D-B-T, capacity=3\n", + "\n", + "![image.png]()\n", + "\n", + "Hence, the maximum flow = 8+2+4+2+3 = 19\n", + "\n", + "*Reflection:*\n", + "\n", + "\n", + "ChatGPT played a crucial role in clarifying the Push-Relabel algorithm's intricacies. It provided valuable insights into optimizing the algorithm for efficiency and accuracy.\n", + "\n", + "The main challenge was ensuring that the problem remained non-trivial and reflected the essence of the Push-Relabel method.\n", + "\n", + "\n" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "5u8e6sISDzrJ" + }, + "source": [ + "#### 6.\n", + "** Problem Statement:**\n", + "Let G = (V, E) be a directed graph, with source s ~ V, sink t ~ V, and\n", + "nonnegative edge capacities {ce}. Give a polynomial-time algorithm to\n", + "decide whether G has a unique minimum s-t cut (i.e., an s-t of capacity\n", + "strictly less than that of all other s-t cuts).\n", + "\n", + "**Input:**\n", + "\n", + "* Directed graph represented as an adjacency list G=(V,E).\n", + "* Source node s and sink node t within the graph.\n", + "* Edge capacities {c e} indicating the capacities of edges E.\n", + "\n", + "**Output:**\n", + "\n", + "* True if there exists a unique minimum s-t cut, False otherwise.\n", + "\n", + "**Constraints:**\n", + "\n", + "* The graph G is a directed graph with positive edge capacities.\n", + "* The source s and sink t nodes are valid nodes in the graph.\n", + "* Edge capacities ce are nonnegative integers.\n", + "\n", + "**Solution and Justification:**\n", + "\n", + "* **Step 1:** Compute the maximum flow in the graph G using any max-flow algorithm (such as Ford-Fulkerson or Edmonds-Karp) from source s to sink t. Let the maximum flow value be F.\n", + "\n", + "* **Step 2:** Modify the capacities of all edges in the graph by subtracting a small positive value ϵ from each capacity, where ϵ is a very small positive number (close to zero). This ensures that no edge has its capacity reduced to zero.\n", + "\n", + "* **Step 3:** Recompute the maximum flow in the modified graph with adjusted capacities. Let the modified maximum flow value be F'.\n", + "\n", + "* **Step 4:** If F' < F, return True. There exists a unique minimum s-t cut in the graph. If F'=F, return False. There are multiple s-t cuts with the same minimum capacity.\n", + "\n", + "**Proof of Correctness:**\n", + "\n", + "By reducing the capacities slightly in Step 2, we are effectively creating a situation where the original minimum s-t cut is the only s-t cut with reduced capacity. If F'< F, it means the original minimum s-t cut is unique. If F'=F, multiple s-t cuts have the same minimum capacity, indicating that the original s-t cut is not unique.\n", + "\n", + "*Reflection:*\n", + "\n", + "\n", + "ChatGPT helped me to grasp the keyinsights of this problem and develop a unique algorithm to solve it.\n", + "\n", + "This problem highlighted the relationship between flow values and capacities in graph theory.\n", + "\n", + "\n" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "1CAJbqCXXn1z" + }, + "source": [ + "#### 7.\n", + "\n", + "\n", + "**Problem Statement:** \n", + " Given a weighted directed graph, where each edge has a capacity and starting node S and ending node T, find the maximum flow and the shortest path from node S to T using the Ford-Fulkerson algorithm.\n", + "\n", + "**Input:**\n", + "* A weighted graph with starting node S and ending node T.\n", + "\n", + "**Output:**\n", + "* The maximum flow from S to T.\n", + "* The shortest path from S to T.\n", + "* If there is no path from S to T, output \"No path exists.\"\n", + "\n", + "*Sample Input:*\n", + "\n", + "![image.png]()\n", + "\n", + "*Sample Output:*\n", + "\n", + "![image.png]()\n", + "\n", + "**Constraints:**\n", + "\n", + "* Edge capacities are positive integers (1 <= capacity <= 100).\n", + "* Nodes are represented by uppercase letters (S to T).\n", + "* The graph may not be connected, and there might be isolated nodes.\n", + "\n", + "**Solution and Justification:**\n", + "\n", + "Ford-Fulkerson algorithm is used to find the maximum flow and shortest path from node S to T in the given weighted directed graph. We implement the algorithm using techniques like residual graphs and augmenting paths while ensuring the alogorithm has the maximum flow while finding the shortest path.\n", + "\n", + "\n", + "Select any arbitrary path from S to T. In this step, we have selected path S-A-B-T\n", + "\n", + "1. Select any arbitrary path from S to T. In this step, we have selected path S-A-B-T.\n", + "2. The minimum capacity among the three edges is 2 (B-T). Based on this, update the flow/capacity for each path.\n", + "\n", + "![image.png]()\n", + "\n", + "3. Select another path S-D-C-T. The minimum capacity among these edges is 3 (S-D) and update the capacities.\n", + "\n", + "![image.png]()\n", + "\n", + "4. Now, let us consider the reverse-path B-D as well. Selecting path S-A-B-D-C-T. The minimum residual capacity among the edges is 1 (D-C) and update the capacities.\n", + "5. The capacity for forward and reverse paths are considered separately.\n", + "6. Adding all the flows = 2 + 3 + 1 = 6, which is the maximum possible flow on the flow network.\n", + "\n", + "*Reflection:*\n", + "\n", + "ChatGPT helped in clarifying the Ford-Fulkerson algorithm's handling of negative weights and negative cycles. It provided insights into potential edge cases involving negative weights, enhancing the problem's depth and complexity.\n", + "\n", + "The main challenge was integrating the concepts of maximum flow and shortest path in the problem. This problem illustrated the significance of clear problem statements and the need to validate problem complexities effectively.\n" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "0L5Z_BA0KQjP" + }, + "source": [ + "**References:**\n", + "\n", + "[1] https://www.sanfoundry.com/merge-sort-multiple-choice-questions-answers-mcqs/\n", + "\n", + "[2] https://www.codingninjas.com/studio/library/implementation-of-push-relabel-algorithm\n", + "\n", + "[3] https://www.simplilearn.com/tutorials/data-structure-tutorial/bellman-ford-algorithm\n", + "\n", + "[4] https://www.inf.ufpr.br/andre/textos-CI1238-INFO7056/Algorithm%20Design%20-%20Jon%20Kleinberg%2C%20%C3%89va%20Tardos.pdf" + ] + } + ], + "metadata": { + "colab": { + "provenance": [] + }, + "kernelspec": { + "display_name": "Python 3", + "name": "python3" + }, + "language_info": { + "name": "python" + } + }, + "nbformat": 4, + "nbformat_minor": 0 +} diff --git a/Submissions/002709710_HemaSree_Bojanapally/Assignment4.README.md b/Submissions/002709710_HemaSree_Bojanapally/Assignment4.README.md new file mode 100644 index 0000000..5d3c055 --- /dev/null +++ b/Submissions/002709710_HemaSree_Bojanapally/Assignment4.README.md @@ -0,0 +1,7 @@ +Name: Hema Sree Bojanapally +NUID: 002709710 +email: bojanapally.h@northeastern.edu + +##Assignment 4 + +This assignment's main goal was to provide with a solid understanding of P vs NP. diff --git a/Submissions/002709710_HemaSree_Bojanapally/Assignment4_002709710.ipynb b/Submissions/002709710_HemaSree_Bojanapally/Assignment4_002709710.ipynb new file mode 100644 index 0000000..e9f8c70 --- /dev/null +++ b/Submissions/002709710_HemaSree_Bojanapally/Assignment4_002709710.ipynb @@ -0,0 +1,451 @@ +{ + "nbformat": 4, + "nbformat_minor": 0, + "metadata": { + "colab": { + "provenance": [] + }, + "kernelspec": { + "name": "python3", + "display_name": "Python 3" + }, + "language_info": { + "name": "python" + } + }, + "cells": [ + { + "cell_type": "markdown", + "source": [ + "#### 1.\n", + "\n", + "**Problem Statement:**\n", + "\n", + "You are in charge of organizing a series of educational workshops. Each workshop will cover one of the n different topics, such as software development, robotics, digital marketing, project management, etc. You have received applications from m potential workshop leaders. For each of the\n", + "n topics, there is a subset of potential leaders who are experts in that area. The challenge is: For a given number k≤m), is it possible to select at most k leaders such that all n workshop topics are covered?\n", + "\n", + "**Input:**\n", + "\n", + "* An integer n, the number of workshop topics.\n", + "* An integer m, the number of potential workshop leaders.\n", + "* For each topic1≤i≤n), a list of potential leaders qualified for that topic.\n", + "* An integer k, the maximum number of leaders to be selected.\n", + "\n", + "**Output:**\n", + "* Return True if it's possible to select at most k leaders who can cover all workshop topics; otherwise, return False\n", + "\n", + "*Sample Input:*\n", + "\n", + "Number of topics: 3\n", + "\n", + "Number of leaders: 5\n", + "\n", + "Topics:\n", + "[1, 2]\n", + "[2, 3, 5]\n", + "[1, 4]\n", + "\n", + "Maximum leaders to select: 2\n", + "\n", + "*Sample Output:*\n", + "\n", + "True\n", + "\n", + "**Constraints:**\n", + "* The number of topics n and potential leaders m are positive integers.\n", + "* The list of leaders for each topic is a subset of the potential leaders.\n", + "* The maximum number of leaders k is less than or equal to m.\n", + "\n", + "** Solution and Justification:**\n", + "\n", + "The solution involves a backtracking algorithm that attempts to select a leader for each topic while not exceeding the maximum number of k leaders.\n", + "\n", + "*Pseudocode:*\n", + "\n", + "def optimal_workshop_leader_set(n, m, topics, k):\n", + " leaders_covered = set()\n", + " \n", + " def backtrack(topic_index, selected_leaders):\n", + " nonlocal leaders_covered\n", + " if len(selected_leaders) > k:\n", + " return False\n", + " if topic_index == n:\n", + " leaders_covered.update(selected_leaders)\n", + " return True\n", + " for leader in topics[topic_index]:\n", + " if leader not in selected_leaders:\n", + " if backtrack(topic_index + 1, selected_leaders + [leader]):\n", + " return True\n", + " return False\n", + " \n", + " result = backtrack(0, [])\n", + " return result and len(leaders_covered) <= k\n", + "\n", + "\n", + "*Proof of Correctness:*\n", + "\n", + "The algorithm is correct because it tries all possible combinations of leaders without exceeding the limit k. If it finds a combination where all topics are covered, it returns True. This is analogous to finding a cover for a set system, which confirms that if k subsets exist whose union covers all elements, then k leaders can cover all topics.\n", + "\n", + "\n", + "*Reflection:*\n", + "\n", + "ChatGPT played a critical role in understanding the structure of the NP-complete problem from the example and helped to craft a new but similar problem—Optimal Workshop Leader Set—adapting it to a workshop context. The challenge was to make sure the new problem was neither too simple nor unsolvable while preserving the computational complexity of the original problem. This exercise enhanced the understanding of NP-completeness and the application of algorithmic problem design across various real-world scenarios." + ], + "metadata": { + "id": "c_kj9Tio-yvD" + } + }, + { + "cell_type": "markdown", + "source": [ + "##### 2. \n", + "\n", + "Consider the following version of the Steiner Tree Problem, which we'll\n", + "refer to as Graphical Steiner Tree. You are given an undirected graph\n", + "G = (V, E), a set X _ V of vertices, and a number k. You want to decide\n", + "whether there is a set F ~ E of at most k edges so that in the graph (V, F),\n", + "X belongs to a single connected component.\n", + "Show that Graphical Steiner Tree is NP-complete.\n", + "\n", + "\n", + "**Problem Statement:**\n", + "\n", + "We need to determine whether, for a given undirected graph G=(V,E), a subset of vertices X⊆V, and an integer k, there exists a subset of edges F⊆E with at most k edges such that the subgraph (V,F) has X in a single connected component.\n", + "\n", + "**Input:**\n", + "* A list of vertices V.\n", + "* A list of edges E defining the connections between vertices in V.\n", + "* A subset of vertices X⊆V.\n", + "* An integer k.\n", + "\n", + "\n", + "**Output:**\n", + "* 'Yes' if such a subset F⊆E exists.\n", + "* 'No' otherwise.\n", + "\n", + "*Sample Input:*\n", + "\n", + "* V={1,2,3,4,5}\n", + "\n", + "* E={(1,2),(2,3),(3,4),(4,5),(1,5)}\n", + "\n", + "* X={1,3,5}\n", + "\n", + "* k=2\n", + "\n", + "*Sample Output:*\n", + "\n", + "'Yes' (since edges (2,3) can be chosen)\n", + "\n", + "**Constraints:**\n", + "* The graph G is undirected and does not contain self-loops or multiple edges between the same pair of vertices\n", + "* ∣X∣≤∣V∣ and X≠∅\n", + "\n", + "* k is a non-negative integer\n", + "\n", + "**Solution**\n", + "To show that the Graphical Steiner Tree problem is NP-complete, we will follow a two-step process: first proving that the problem is in NP, and then proving that it is NP-hard. If a problem is both in NP and NP-hard, it is considered NP-complete.\n", + "\n", + "*Proof for NP Completeness:*\n", + "\n", + "* A problem is in NP if a solution to the problem can be verified quickly (in polynomial time). For the Graphical Steiner Tree problem, given a set F of edges, we can verify whether F connects all vertices in X to form a single connected component in polynomial time by using a breadth-first search (BFS) or depth-first search (DFS). Thus, the Graphical Steiner Tree problem is in NP.\n", + "\n", + "*Proof of NP Hard:*\n", + "\n", + " * To prove NP-hardness, we need to show that any instance of a known NP-complete problem can be reduced to an instance of the Graphical Steiner Tree problem in polynomial time.\n", + "\n", + " * We will reduce from the classical Steiner Tree problem, which is known to be NP-complete. The classical Steiner Tree problem is: given a graph G'= (v',E') a subset of vertices R⊆V (the required vertices), and a number k' decide whether there is a subtree of G' that includes all vertices in R and has at most k' edges.\n", + "\n", + "*Here's the reduction:*\n", + "\n", + "* Reduction: Given an instance of the classical Steiner Tree problem (G′,R,k') we construct an instance of the Graphical Steiner Tree problem (G,X,k) by setting G=G′ X=R, and k=k'. This is a direct mapping where the subset of required vertices R in the classical problem becomes the set X in the Graphical Steiner Tree problem, and the bound on the number of edges k'remains the same.\n", + "\n", + "* Correctness of the reduction: If there is a solution to the classical Steiner Tree problem, then there is a subtree in G'that spans all vertices in R with at most k'edges. This subtree is also a connected component in G that connects all vertices in X with at most k edges, forming a valid solution to the Graphical Steiner Tree problem. Conversely, if there is a solution to the Graphical Steiner Tree problem, then the set F of edges forms a subtree in G'that connects all vertices in R, which is a valid solution to the classical Steiner Tree problem.\n", + "\n", + "**Justification and Proof of Correctness:**\n", + "\n", + "* The reduction from the classical Steiner Tree problem to the Graphical Steiner Tree problem is polynomial in time since we are simply renaming the components and not altering the structure or size of the problem. Moreover, because a solution to one problem is also a solution to the other, and vice versa, the reduction is valid. This demonstrates that the Graphical Steiner Tree problem is at least as hard as the classical Steiner Tree problem, which is NP-complete. Therefore, the Graphical Steiner Tree problem is NP-hard.\n", + "\n", + "Since we have shown that the Graphical Steiner Tree problem is both in NP and NP-hard, it follows that the Graphical Steiner Tree problem is NP-complete.\n", + "\n", + "*Reflection:*\n", + "\n", + "ChatGPT tool provided a framework for structuring the NP-completeness proof and assisted in formulating the problem statement and solution strategy.\n", + "\n", + "The main challenge was to ensure that the problem is neither trivial nor unsolvable. It was important to maintain the spirit of the classic Steiner Tree problem while presenting it in a new context.\n", + "\n", + "This problem reinforced the importance of clear problem statements and the intricacy of NP-completeness proofs. It also provided insight into how reductions work to prove the hardness of computational problems.\n", + "\n", + "\n" + ], + "metadata": { + "id": "re1tvh8rkw79" + } + }, + { + "cell_type": "markdown", + "source": [ + "##### 3.\n", + "Given sets W, X, Y, and Z, each of size n, and a collection C of ordered 4-tuples of the form (wi, xj, Yk, z~), do there exist n 4-tuples from C so that no two have an element in common? Prove that 4-Dimensional Matching is NP-complete.\n", + "\n", + "**Problem Statement:** \n", + " Given four sets W, X, Y, and Z, each containing n elements, and a collection C of ordered 4-tuples of the form (wi, xj, yk, zl), the task is to determine whether there exist n 4-tuples from C such that no two tuples share any element in common.\n", + "\n", + "**Input:**\n", + "* Four sets W, X, Y, Z each of size n.\n", + "* A collection C consisting of ordered 4-tuples (wi, xj, yk, zl).\n", + "\n", + "**Output:**\n", + "* \"YES\" if there exist n 4-tuples in C with no common elements; otherwise, \"NO\".\n", + "\n", + "\n", + "*Sample Input:*\n", + "W = {w1, w2, w3}\n", + "X = {x1, x2, x3}\n", + "Y = {y1, y2, y3}\n", + "Z = {z1, z2, z3}\n", + "C = {(w1, x1, y1, z1), (w2, x2, y2, z2), (w3, x3, y3, z3)}\n", + "\n", + "*Sample Output:*\n", + "\n", + "YES\n", + "\n", + "\n", + "**Constraints:**\n", + "* All sets W, X, Y, Z have the same number of elements, n.\n", + "\n", + "\n", + "**Solution and Justification:**\n", + "\n", + "* 4-Dimensional Matching is in NP:\n", + "\n", + " A solution to the problem can be verified in polynomial time. Given n 4-tuples, we can check that no element is repeated in polynomial time.\n", + "\n", + "* NP-hardness (Reduction from 3-Dimensional Matching):\n", + "\n", + " The 3-Dimensional Matching Problem is NP-complete. It involves sets U, V, and W, each with n elements, and a set S of triples (u, v, w), where the task is to find n triples in S with no common elements.\n", + "\n", + " To reduce 3-Dimensional Matching to 4-Dimensional Matching, add an additional set Z with n unique elements and extend each triple (u, v, w) to a 4-tuple (u, v, w, z) where z is a distinct element from Z.\n", + "\n", + " If we can solve the 4-Dimensional Matching, we can also solve the original 3-Dimensional Matching, thus proving that 4-Dimensional Matching is NP-hard.\n", + "\n", + "\n", + "**Pseudocode:**\n", + "\n", + " function is4DMatchingPossible(W, X, Y, Z, C):\n", + " # This function verifies whether a solution exists\n", + " for each combination of n 4-tuples in C:\n", + " if no element is repeated in the tuples:\n", + " return \"YES\"\n", + " return \"NO\"\n", + "\n", + "*Reflection:*\n", + "\n", + "ChatGPT was helpful in understanding the complexity and nuances of NP-completeness. It provided insights into the structure and requirements of proving NP-completeness, specifically in how to reduce a known NP-complete problem (3-Dimensional Matching) to the new problem (4-Dimensional Matching).\n", + "\n", + "The primary challenge was to ensure the 4-Dimensional Matching problem maintained the essence of NP-completeness while being a distinct and meaningful extension of the 3-Dimensional Matching problem.\n", + "This exercise demonstrated the way in which abstract theoretical concepts can be elaborated upon and adapted to more intricate situations, underscoring the imaginative nature of crafting problems within algorithm theory." + ], + "metadata": { + "id": "f9stZL8vuTJ9" + } + }, + { + "cell_type": "markdown", + "source": [ + "#### 4.\n", + "\n", + "You are managing a communication network, modeled by a directed graph G = (V, E). There are c users who are interested in malting use of this network. User i (for each i = t, 2 ..... c) issues a request to reserve a specific path Pi in G on which to transmit data.\n", + "You are interested in accepting as many of these path requests as\n", + "possible, subject to the following restriction: ff you accept both Pi and\n", + "then Pi and Pj cannot share any nodes. Thus, the Path Selection Problem asks: Given a directed graph G = (V, E), a set of requests P1, P2 ..... pc--each of which must be a path in G--and a number k, is it possible to select at least k of the paths so that no two of the selected paths share any nodes?\n", + "Prove that Path Selection is NP-complete.\n", + "\n", + "**Problem Statement:**\n", + "\n", + "Given a communication network modeled as a directed graph G = (V, E), where V represents the set of nodes (vertices) and E represents the set of directed edges, and there are c users, each of whom issues a request to reserve a specific path Pi in G for transmitting data, the goal is to accept as many of these path requests as possible. However, this acceptance is subject to the constraint that if you accept both Pi and Pj, then Pi and Pj cannot share any nodes in their respective paths. Prove that Path Selection is NP-complete.\n", + "\n", + "**Input:**\n", + "\n", + "* A directed graph G = (V, E), where V represents the set of nodes (vertices) and E represents the set of directed edges.\n", + "\n", + "* A set of path requests P1, P2, ..., Pc, where each Pi is a path in G represented as a sequence of nodes.\n", + "\n", + "* An integer k, representing the minimum number of paths to select.\n", + "\n", + "**Output:**\n", + "* A boolean value indicating whether it is possible to select at least k of the paths such that no two selected paths share any nodes.'\n", + "\n", + "*Sample Input:*\n", + "\n", + "Graph G:\n", + "V = {A, B, C, D, E}\n", + "E = {(A, B), (B, C), (C, D), (D, E)}\n", + "\n", + "Path Requests:\n", + "P1 = [A, B, C]\n", + "P2 = [B, C, D]\n", + "P3 = [C, D, E]\n", + "P4 = [A, B, D]\n", + "\n", + "k = 3\n", + "\n", + "*Sample Output:*\n", + "\n", + "True\n", + "\n", + "**Constraints:**\n", + "\n", + "* The number of nodes |V| and the number of edges |E| in the graph are bounded.\n", + "* The number of path requests c is bounded.\n", + "* 1 ≤ k ≤ c ≤ |V| (the minimum number of paths to select is at least 1, and the maximum is the number of nodes in the graph).\n", + "\n", + "**Solution and Justitification:**\n", + "\n", + "* The Path Selection Problem is a variation of the Set Cover Problem, which is known to be NP-complete. To prove that the Path Selection Problem is NP-complete, we can reduce an instance of the Set Cover Problem to an instance of the Path Selection Problem.\n", + "\n", + "* Given an instance of the Set Cover Problem with a universe U and a collection of sets S1, S2, ..., Sm, we can construct a directed graph G as follows:\n", + "\n", + " * Create a node in G for each element in U.\n", + " * For each set Si, create a directed path Pi in G that visits all the elements in Si.\n", + " * Set k = m (the number of sets in the Set Cover instance).\n", + "\n", + "* Now, if there exists a solution to the Set Cover Problem (i.e., a subset of sets that covers all elements in U), there also exists a solution to the Path Selection Problem, where we select the corresponding paths for the chosen sets. Conversely, if there exists a solution to the Path Selection Problem, it implies a solution to the Set Cover Problem.\n", + "\n", + "* Since the Set Cover Problem is known to be NP-complete, the reduction from Set Cover to Path Selection demonstrates that Path Selection is also NP-complete.\n", + "\n", + "\n", + "*Reflection:*\n", + "\n", + "ChatGPT assisted in understanding the problem and brainstorming ideas for formulating it.\n", + "\n", + "This problem design exercise reinforced the concept of reducing one NP-complete problem to another to establish NP-completeness. It also showed how graph-related problems can be used in various ways to illustrate the concept of NP-completeness." + ], + "metadata": { + "id": "rb0R_Z-wlujf" + } + }, + { + "cell_type": "markdown", + "source": [ + "##### 5. \n", + "Suppose you’re helping to organize a summer sports camp, and the\n", + "following problem comes up. The camp is supposed to have at least one counselor who’s skilled at each of the n sports covered by the camp\n", + "(baseball, volleyball, and so on). They have received job applications from\n", + "m potential counselors. For each of the n sports, there is some subset\n", + "of the m applicants qualified in that sport. The question is: For a given\n", + "number k < m, is it possible to hire at most k of the counselors and have\n", + "at least one counselor qualified in each of the n sports? We’ll call this the\n", + "Efficient Recruiting Problem.\n", + "Show that Efficient Recruiting is NP-complete\n", + "\n", + "\n", + "**Answer:**\n", + "\n", + "**Problem Statement:**\n", + "\n", + "Given a set of n sports required for a summer camp and m potential counselors, each with a subset of sports they are qualified to teach, the Efficient Recruiting Problem asks if it is possible to hire at most k counselors (where k < m) such that every sport has at least one qualified counselor. This decision problem aims to determine the feasibility of achieving full sports coverage with a limited number of counselor hires.\n", + "\n", + "**Input:**\n", + "\n", + "* The first line contains three integers: n (the number of sports), m (the number of potential counselors), and k (the maximum number of counselors that can be hired).\n", + "* The next n lines list the qualifications of the counselors for each sport. Each line begins with an integer q_i (the number of counselors qualified for sport i), followed by q_i space-separated integers representing the IDs of the qualified counselors.\n", + "\n", + "**Output:**\n", + "* A single line containing \"YES\" if it is possible to hire at most k counselors to cover all n sports, or \"NO\" if it is not possible.\n", + "\n", + "*Sample Input:*\n", + "\n", + "3 5 2\n", + "2 1 3\n", + "3 1 2 4\n", + "2 4 5\n", + "\n", + "*Sample Output:*\n", + "\n", + "YES\n", + "\n", + "**Constraints:**\n", + "\n", + "1. 1 ≤ n ≤ 50 (The number of sports)\n", + "2. n < m ≤ 100 (The number of potential counselors)\n", + "3. k < m (The cap on the number of hires)\n", + "\n", + "\n", + "**Solution and Justification:**\n", + "\n", + "* To show that the Efficient Recruiting Problem is NP-complete, we need to demonstrate both that it belongs to the class NP and that it is as hard as the hardest problems in NP (NP-hard).\n", + "\n", + "* A problem is in NP if a solution for the problem can be verified in polynomial time. For the Efficient Recruiting Problem, given a proposed solution (a set of k counselors), we can verify whether each of the n sports is covered by at least one of these counselors in polynomial time. This verification involves checking the qualifications of each selected counselor, a process that takes polynomial time relative to the number of counselors and sports.\n", + "\n", + "* To prove NP-hardness, we typically show that an existing NP-hard problem can be reduced to our problem in polynomial time. A standard approach is to use the Set Cover problem, which is a well-known NP-hard problem.\n", + "\n", + "A standard approach is to use the Set Cover problem, which is a well-known NP-hard problem.\n", + "\n", + "*Set Cover Problem:*\n", + "\n", + "1. Given a universe U of elements and a collection S of subsets of U, and an integer k, the goal is to determine if there exist k or fewer subsets in S whose union equals U.\n", + "\n", + "2. Reduction from Set Cover to Efficient Recruiting:\n", + "Mapping to Efficient Recruiting:\n", + "\n", + " * Each sport in the Efficient Recruiting Problem corresponds to an element in the universe U of the Set Cover problem.\n", + " * Each counselor corresponds to a subset in S, where the subset contains the sports that the counselor is qualified in.\n", + " * The task of hiring k counselors to cover all sports parallels choosing k subsets from S whose union covers U.\n", + "\n", + "3. Polynomial-time Reduction:\n", + "\n", + " * This mapping can be done in polynomial time. If you can solve the Efficient Recruiting Problem, you also solve the corresponding Set Cover problem.\n", + "\n", + "Pseudocode for Verification:\n", + "\n", + "Here's a pseudocode to verify a solution for the Efficient Recruiting Problem. This pseudocode assumes that we have a proposed solution set of k counselors and we want to check if all sports are covered.\n", + "\n", + " function isSolutionValid(counselors, sports, selectedCounselors):\n", + " coveredSports = set()\n", + "\n", + " for counselor in selectedCounselors:\n", + " for sport in counselors[counselor]:\n", + " coveredSports.add(sport)\n", + "\n", + " return len(coveredSports) == len(sports)\n", + "\n", + "\n", + "Since we can verify a solution to the Efficient Recruiting Problem in polynomial time, and it can simulate any instance of the Set Cover problem (which is NP-hard), the Efficient Recruiting Problem is NP-complete.\n", + "\n", + "\n", + "\n", + "\n", + "**Proof of Correctness:**\n", + "\n", + "* The pseudocode provided correctly verifies a solution to the Efficient Recruiting Problem by ensuring that all sports are covered by the selected set of counselors. This verification is a requisite for problems in NP, and the mapping from Set Cover ensures the NP-hardness. Therefore, the problem is NP-complete.\n", + "\n", + "\n", + "\n", + "\n", + "**Reflection: **\n", + "\n", + "ChatGPT was pivotal in elucidating the intricate concepts of NP, NP-hard, and NP-complete problems, providing essential understanding of their theoretical foundations and skillfully aiding in the transformation of these abstract concepts into the tangible framework of the Efficient Recruiting Problem.\n", + "\n", + "Creating this problem involved understanding the Set Cover problem and creatively mapping it onto a real-world scenario without losing the computational complexity. The challenge was ensuring that the problem maintained the spirit of NP-completeness while being understandable and relatable. This task highlighted the importance of clear problem definitions, the intricacy of reduction proofs, and the creativity involved in bridging abstract computational concepts with practical applications.\n", + "\n", + "\n", + "\n" + ], + "metadata": { + "id": "9cEVuujzVQMh" + } + }, + { + "cell_type": "markdown", + "source": [ + "**References: **\n", + "\n", + "[1] https://www.inf.ufpr.br/andre/textos-CI1238-INFO7056/Algorithm%20Design%20-%20Jon%20Kleinberg%2C%20%C3%89va%20Tardos.pdf" + ], + "metadata": { + "id": "0L5Z_BA0KQjP" + } + } + ] +} \ No newline at end of file