Word index prototype » History » Version 4
Greg Burri, 09/10/2009 10:54 AM
1 | 1 | Greg Burri | h1. Word index prototype |
---|---|---|---|
2 | |||
3 | 2 | Greg Burri | see here : [[Algorithms#Word-indexing]] |
4 | 3 | Greg Burri | |
5 | h2. Measure |
||
6 | |||
7 | h3. Case 1 |
||
8 | |||
9 | * 8'531 files/folders (some mp3) |
||
10 | * Time to index : ~1s (8'000 item/s) |
||
11 | * Average size per indexed item : 464 bytes. |
||
12 | * Total size in memory : ~3.7 MB |
||
13 | * Speed to do a search : < 1 ms |
||
14 | |||
15 | h3. Case 2 |
||
16 | |||
17 | * 309'269 files/folders |
||
18 | * Time to index : ~30s (10'000 item/s) |
||
19 | * Average size per indexed item : 140 bytes. (The filenames from case 1 are surely longer and thus own more words). |
||
20 | * Total size in memory : ~42 MB |
||
21 | * Speed to do a search : < 1 ms |
||
22 | |||
23 | |||
24 | h2. Conclusion |
||
25 | |||
26 | 4 | Greg Burri | This algorithm is very time effective for searching or indexing but takes a lot of memory. For the moment it will be used unchanged but some space optimization may be done for the future. |