- To proof that
A &sube B, we
can show that:
- In this case, it is easier to
show the reverse:
- A &sube B
⇔
∀ x ∉ B:
x ∉ A
See:
|
- Let a ¬in K
be an arbitrary element that was
not included
in K
at the end of the algorithm
Let fx(a)
be the (actual) number of occurence
of a
in the input stream
(fx(a) > 0,
otherwise the proof is
trivial...)
- Therefore,
a was
delete by the
delete phase
The delete phase will
decrement the
count of
each element
in the current set K
when the |K| > 1/θ
- Therefore,
each occurence of
a was
deleted together with 1/θ -1
other symbols
So the total number of
symbols deleted due to the
removal of a is:
- fx(a)
×
(1
(a)
+ (1/θ - 1)
(the other symbols))
- Or:
fx(a)
×
1/θ
|
- There are at most N
symbols removed, therefore:
- fx(a)
×
1/θ
<
N
- Or:
fx(a)
<
θN
|
- Since
fx(a)
<
θN,
it follows that:
- In other words:
- Therefore: I(x,θ)
&sube K
|