.. (לתיקייה המכילה) | ||
How do we compute Pr(t|Mc)? | |
Consider the collection as a one large document, which is a concatenation of all documents in the collection. Thus Pr(t|Mc) = tf_collection(t) / collection_length tf_collection(t) - is a sum over tf(t) for all documents in the collection, i.e. the total number of occurrences of t in the collection. collection_length - is a sum over all document lengths in the collection, i.e. the total number of terms occurrences in the collection. |
B-Tree | |
Please follow the B-Tree description as in the 'Introduction to Algorithms' book. Note that other resources on the WEB may contain a slightly different version. |