Project

General

Profile

Word index prototype » History » Version 4

Greg Burri, 09/10/2009 10:54 AM

1 1 Greg Burri
h1. Word index prototype
2
3 2 Greg Burri
see here : [[Algorithms#Word-indexing]]
4 3 Greg Burri
5
h2. Measure
6
7
h3. Case 1
8
9
* 8'531 files/folders (some mp3)
10
* Time to index : ~1s (8'000 item/s)
11
* Average size per indexed item : 464 bytes.
12
* Total size in memory : ~3.7 MB
13
* Speed to do a search : < 1 ms
14
15
h3. Case 2
16
17
* 309'269 files/folders
18
* Time to index : ~30s (10'000 item/s)
19
* Average size per indexed item : 140 bytes. (The filenames from case 1 are surely longer and thus own more words).
20
* Total size in memory : ~42 MB
21
* Speed to do a search : < 1 ms
22
23
24
h2. Conclusion
25
26 4 Greg Burri
This algorithm is very time effective for searching or indexing but takes a lot of memory. For the moment it will be used unchanged but some space optimization may be done for the future.