Skip to main content

CS502 Current Midterm Paper Fall 2013 File 10



by innocent larki on December 25, 2013 at 7:12pm
How we build heap
We build a max heap out of the given array of numbers A[1..n]. We repeatedly extract the maximum item from the heap. Once the max item is removed, we are left with a hole at the root. 
Write Pseudo code for KNAPSACK algorithm?
Solution:
KNAPSACK (n, W)
1 for w=0,W
2 do V[0,w]ß0
3 for i=0,n
4 do V[i,0]ß0
5   for w=0,W
6   do if(wi w&  +V[i-1,w- ]>V[i-1,w])
7                      then V[I,w]ß +V[i-1,w] 8                      else V[i,w]ßV[i-1,w]
The time complexity is clearly O(n,W), it must be cautioned that as n and W get large, both time and space complexity become significant.
Spelling correction in edit distance?
A better way to display this editing process is to place the words above the other:
S  D  I  M  D  M
M A  -   T  H  S
A  -   R  T  -  S
The first word has agap for every insertion (1) and the second word has a gap for every deletion (d). Mathes (m) do not count. The edit transcript is defined as a string over the alphabet m,s,i,d that describes a transformation of one string into other.
Differentiate b/w Bubble sort, insertion sort and selection sort?
Bubble sort: scan the array. Whenever two consecutive items are found that are out of order, swap them. Repeat until all consecutive items are in order.
Insertion sort: assume that A[1…i-1] have already been sorted. Insert A[i] into its proper position in this sub array. Create this position by shifting all larger elements to the right.
Selection sort:
Assume that A[1..i-1] contain the i-1 smallest elements in sorted order. Find the smallest in A[i….n] swap it with A[i].
Write down the steps of dynamic programming strategy?
Developing a dynamic programming algorithm generally involves two separate steps:
1_formulate problem recursively.
                                     Write down a formula for the whole problem as a simple combination of answers to smaller sub problems.
2_ Build solution to recurrence from bottom up: 
                                                                    Write an algorithm that starts with base cases and works its way up to the final solution.
What are the applications of edit distance technique? Name any three 
Solution:
Spelling Correction
Plagiarism Detection
Computational Molecular Biology
Solve: T(n) =  (T(q − 1) + T(2 − q) + 2)
What is the worst case running time for the bucket sort? What simple change is required in the algorithm to preserve its linear expected running time and makes it worst case time Θ(n log n)  
Solution:
The worst case for bucket sort occurs when the all inputs falls into single bucket,
for example. Since we use insertion sort for sorting buckets and insertion sort has a worst case of O(n2). the worst case runtime for bucket sort is O(n2).
By using an algorithm with worst case runtime of O(nlgn) instead of insertion sort for sorting buckets, we can ensure that worst case is O(nlgn) without affecting the average case behavior.
To see that the worst case is still O(nlgn), lets consider a case where n data are distributed among two buckets, a elements in one bucket and na in the other. Since we use O(nlgn sorting algorithm in each bucket, the run time for each sort is, kalg(a) + cand k(n  a)lg(n  a) + c2, where k; c2 are positive constants. The total run time iskalg(a)+k(na)lg(na)+2c2 This quantity attains its maximum value when a = 0 or a = n and the maximum value is knlgn + 2c2. Thus total run time is still O(nlgn). It is clear from this that maximum running cost occurs when data are in single bucket instead of spread in two buckets. Extending this argument, we can see that worst case
for the hash table occurs when all inputs hash into the same bucket. (We also note that the expressions obtained are basically convex combinations of nlgn which is a convex function and then Jensen’s rule can be applied to arrive at the same conclusion).
Given an unordered list of n x0, x1, x2, …, xn and elements is common, if there are atleast n/5 copies of it.We want to identify all the common numbers in our list. Give O(n log n) to solve the problem. 
Solution:
We define R n to be the set of ordered n-tuples of real numbers,
Rn := {(x1, x2 . . . xn) : x1, . . . , xn R}.The elements of Rn are called vectors. Given a vector x = (x1, . . . , xn), the numbers x1, . . . , xn are called the components of x.
You are already quite familiar with Rnfor small values of n. 
Find the out cost A1=5 x 4 A2= 4 x 6  A3= 6×2
Solution:-
For Instance
A1.       =          5          x          4
A2        =          4          x          6
A3        =          6          X          2
A4 =          2          x          7
Hence
A1 x (A2 A3) XA4 = ((5 X 4 X 2 ) + (4 X 6  2)) + 2 X 7 X 5
= 40     +          48        + 70
=  88    +          70
= 158
HERE OPTIMAL SEQUENCE IS A1 (A2 A3) A4. For this cost 158 which is optimal the optimal sequence is a1x (a2 xa3) xa4
How to construct an optimal solution for o/1 knapsack problem ?
Construct an optimal solution from computed information. Let us try this: If items are labelled 1, 2, . . . , n, then a subproblem would be to find an optimal solution for S k = items labelled 1, 2, . . . , k This is a valid subproblem definition. The question is: can we describe the final solution S n in terms of
subproblems S k ? Unfortunately, we cannot do that. Here is why. Consider the optimal solution if we can choose items 1 through 4 only.
Items chosen are 1, 2, 3, 4
Total weight: 2 + 3 + 4 + 5 = 14
Total value: 3 + 4 + 5 + 8 = 20
Now consider the optimal solution when items 1 through 5 are available
Effect of calling max heap
The smallest key is in the root in a min heap; in the max heap, the largest is in the root.
What is heap and what is heap order?
The heap is the section of computer memory where all the variables created or initialized at runtime are stored.  The heap order property: in a
(min) heap, for every node X, the key in the parent is smaller than or equal to the key in X.
 Quick sort such that sort the array in to non-increasing order?
Quick sorting, an array A[1..n] of n  numbers We are to reorder these elements into increasing (or decreasing) order.  More generally, A is an array of objects and we sort them based on one of the attributes – the key value. The key value need not be a number. It can be any object from a totally ordered domain. Totally ordered domain means that for any two elements of the domain, x and y, either x < y, x = y or x > y.
Draw the cost table for chain multiplication problem with initial states
(A1)(A2A3A4 . . .An)
or (A1A2)(A3A4 . . .An)
or (A1A2A3)(A4 . . .An)
. . . . . .
or (A1A2A3A4 . . .An−1)(An)
 QNo.4  we can avoid unnecessary repetitions for recursive calls?
Answer:-
We can avoid these unnecessary repetitions by writing down the results of recursive calls and looking them up again if we need them later. This process is called memorization
 Worst case for edit distance algorithm? What is the simple change that can change the worst case time?
Analysis of DP edits distance
There are  entries in the matrix. Each entry E(i,j) takes  time to compute. The total running is  Recursion clearly leads to the same repetitive call pattern that we saw in Fibonnaci sequence. To avoid this, we will  use the DP approach. We will build the solutionb bottom-up. We will use the base case E(0,j) to fill first row and the base case E(I,0) to fill first column. We will fill the remaining E matrix row by row.
 Describe an efficient algorithm to find the median of a set of 106 integers; it is known that there are fewer than 100 distinct integers in the set
Step1: Start
Step2: Find the 100 distinct numbers among 10^6 integers.
Step3: Sort the 100 distinct numbers
Step4: Count the distinct numbers
Step5: if count is odd, middle number is the median
Step6: if count is even, add the middle two numbers then divide by 2, the result is the median
Step5: End number.
 What is the formula for generating Catalan numbers?
Equation (22) is a recurrence relation.
C_(n+1) = C_n * [2(2n+1)] / (n+2)
we have the values of n in one column and the values of C_n in another, then to put this formula in Excel, on the (n+1)-th row just replace C_n and n with the appropriate cells from the previous row.
 What are Catalan numbers? Give the formula.
 Catalan numbers form a sequence of natural numbers that occur in various counting, often involving recursively defined objects
Formula is C(n) = 2n Cn / (n+1)
 Write a pseudo code Fibonacci With memorization? —
Sol
MEMOFIB(n)
1 if (n < 2)
2 then return n
3 if (F[n] is undefined)
4 then F[n] MEMOFIB(n − 1) + MEMOFIB(n − 2)
5 return F[n] 
Write Down the steps of Dynamic programming
Dynamic programming is essentially recursion without repetition. Developing a dynamic programming
algorithm generally involves two separate steps:
·         Formulate problem recursively. Write down a formula for the whole problem as a simple
combination of answers to smaller sub problems.
·         Build solution to recurrence from bottom up. Write an algorithm that starts with base cases and works its way up to the final solution.
Dynamic programming algorithms need to store the results of intermediate sub problems. This is often but not always done with some kind of table. We will now cover a number of examples of problems in which the solution is based on dynamic programming strategy.

Comments

Popular posts from this blog

CS614 Quiz No.4 Shared by Princess (solved), Spring 2014

  “What means What”. The phrase refers to: Select correct option:  Meta data  External data Transformed data Internal representations Question # 2 of 10 Which of the following is NOT one of the activities of “Maintenance and Growth” phase in Kimball’s DWH development approach? Select correct option: Education Technical Education Program Support  Interface Deployment                 Question # 3 of 10 Horizontally wide data means: Select correct option: Dataset has large no. of attributes Dataset has large no. of records Dataset has attribute skews Dataset has partitioning skews                 Question # 4 of 10 Which of the following is NOT one of the top-10 mistakes that should be avoided during DWH development? Select correct option: Not interacting directly with end ...

CS614 Quiz No.4 Shared by MT Khan (Solved)

Question # 1 of 10 ( Start time: 09:04:39 PM ) Total Marks: 1 A typical cycle of implementing the change in DWH comprises of the sequence: Select correct option: Production -> QA -> Development Development-> QA -> Production(CORRECT) Development -> Production -> QA Production -> Development -> QA Question # 2 of 10 ( Start time: 09:05:16 PM ) Total Marks: 1 Vertically wide data means: Select correct option: Dataset has large no. of attributes Dataset has large no. of records(CORRECT) Dataset has attribute skews Dataset has partitioning skews Question # 3 of 10 ( Start time: 09:05:43 PM ) Total Marks: 1 In ___________ phase of kimballs approach, we identify the components needed now and in future. Select correct option: Requirement definition Architectural design Product development Analytical application development Question # 4 of 10 ( Start time: 09:06:56 PM ) Total Marks: 1 Technical architecture design supports the communicat...

CS614 Quiz No.3 Shared by Students (Solved), Spring 2014

______ index stores first value in each block in the sequential file and a pointer to the block.  Select correct option:   Dense  Sparse  B-Tree  Hash In context of data parallelism, the work done by query processor should be:  Select correct option:  Almost zero  Maximum  Pipelined  Filtered across partitions The optimizer uses a hash join to join two tables if they are joined using an equijoin and  Select correct option:   Outer table has less number of rows  Inner table has less number of rows  Cardinality of tables is equal  Large amount of data needs to be joined Bitmap index is appropriate for:  Select correct option:  Low cardinality data  High cardinality data  Clustered data  Aggregated data If a task takes “T” time units to execute on a single data item, then execution of this task on “N” data items will take __...