In Solution 1: Checking Off of Problem Solving with Algorithms and Data Structures, just beneath the ActiveCode: 1 extract (included at the bottom of this post for reference), it is stated:
each of the n characters in s1 will cause an iteration through up to n characters in the list from s2.
...which to paraphrase (less succinctly), means "each character in s1 could result in all characters in s2 being inspected".
The following sentence states:
Each of the n positions in the list will be visited once to match a character from s1.
Firstly I'm unsure which list is being referred, I think it means the variable alist, i.e. s2. Is this correct?
The number of visits then becomes the sum of the integers from 1 to n.
Secondly (and the real point of this post), if each item in the list (variable alist) is inspected for each character in s1, then why is this not simply O(n) = n^2 as per "nested loops"? If n=5, then then sum(n) = 15, where as n^2 = 25.
I'm fine with simplifying the equation for summing 1 to n integers to get n^2, but I'm confused about how sum(5) = 15, whereas O(n) = n^2.
ActiveCode: 1 Checking Off (active5)
def anagramSolution1(s1,s2): alist = list(s2) pos1 = 0 stillOK = True while pos1 < len(s1) and stillOK: pos2 = 0 found = False while pos2 < len(alist) and not found: if s1[pos1] == alist[pos2]: found = True else: pos2 = pos2 + 1 if found: alist[pos2] = None else: stillOK = False pos1 = pos1 + 1 return stillOK print(anagramSolution1('abcd','dcba') Edit to add
Having written this out it struck me that perhaps dropping the (1/2) and (1/2)n from the formula to sum(n) is the reason for the difference of actually summing n, as opposed to just doing n^2.
To be absolutely clear, I understand the formula for summing n is n^2 when dropping the non-dominant terms. What I don't understand is why we're summing n and not immediately and directly stating it is actually n^2 as a result of the nested loop and this sentence:
each of the n characters in s1 will cause an iteration through up to n characters in the list from s2.
Asked By : Jack
Answered By : unutbu
Let's be concrete. Suppose s1 = 'abcd' and s2 = 'dcba'.
For the a in s1, it will take 4 iterations to find the a in s2. For the b in s1, it will take 3 iterations to find the b in s2. For the c in s1, it will take 2 iterations to find the c in s2. For the d in s1, it will take 1 iteration to find the d in s2.
So in this case, it does take sum(i) for i = 1,...,n iterations.
Now some quiet meditation should convince you than if s2 is an anagram of s1, it will again take exactly the same number of iterations; the summands will just appear in a different order. (After all, whatever is the first character in s2 will only require 1 iteration to be found, the second character requires 2 iterations, etc.)
If s2 is not an anagram of s1, then the algorithm will short-circuit when the first character not in s2 is tested. So then the algorithm will take less than O(n^2) time. But in the worst case, the algorithm takes O(n^2) time.
Best Answer from StackOverflow
Question Source : http://cs.stackexchange.com/questions/33618
0 comments:
Post a Comment
Let us know your responses and feedback