Word index prototype » History » Version 5
Greg Burri, 09/10/2009 10:56 AM
h1. Word index prototype

See [[Algorithms#Word-indexing]].

h2. Measure

The name of each file and folder is split into words, and the item is indexed by these words.

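To make the idea concrete, here is a minimal sketch of such a word index. It is not the prototype's actual code: the names @split_words@ and @WordIndex@, the splitting rule (any run of letters or digits is a word, lowercased) and the dictionary-of-sets layout are all assumptions chosen for illustration. The real structure is described in [[Algorithms#Word-indexing]] and may differ.

```python
import re
from collections import defaultdict


def split_words(name):
    """Split a name into lowercase words.

    Assumed rule (hypothetical): any run of letters or digits is a word;
    spaces, punctuation and underscores act as separators.
    """
    return [w.lower() for w in re.findall(r"[^\W_]+", name)]


class WordIndex:
    """Hypothetical in-memory index: word -> set of item paths."""

    def __init__(self):
        self._items_by_word = defaultdict(set)

    def add(self, path):
        # Only the last path component (the file or folder name) is indexed.
        name = path.rstrip("/").rsplit("/", 1)[-1]
        for word in split_words(name):
            self._items_by_word[word].add(path)

    def search(self, query):
        """Return the items whose name contains every word of the query."""
        words = split_words(query)
        if not words:
            return set()
        result = None
        for word in words:
            items = self._items_by_word.get(word, set())
            result = set(items) if result is None else result & items
        return result


index = WordIndex()
index.add("/music/Daft Punk")
index.add("/music/Daft Punk/01 - One More Time.mp3")
index.add("/docs/time_sheet.ods")

print(sorted(index.search("time")))
print(sorted(index.search("daft punk")))
```

In this sketch a search costs one dictionary lookup per query word plus a set intersection, which is one plausible reason for sub-millisecond searches. Storing every word and a reference for every item that contains it is also where the memory goes.
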
h3. Case 1

* 8'531 files/folders (some mp3)
* Time to index: ~1 s (~8'000 items/s)
* Average size per indexed item: 464 bytes
* Total size in memory: ~3.7 MB
* Search time: < 1 ms

h3. Case 2

* 309'269 files/folders (various files)
* Time to index: ~30 s (~10'000 items/s)
* Average size per indexed item: 140 bytes. The filenames in case 1 are probably longer and therefore contain more words.
* Total size in memory: ~42 MB
* Search time: < 1 ms

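The totals and rates in both cases can be cross-checked from the item counts, the average item sizes and the indexing times. The short script below does that arithmetic. It assumes that "MB" means 2^20 bytes and that the timings are rounded, so only approximate agreement should be expected. The helper names are made up for this check.

```python
MIB = 1024 * 1024  # assumption: "MB" in the measurements means 2**20 bytes


def total_mib(items, bytes_per_item):
    """Total index size in MiB from item count and average item size."""
    return items * bytes_per_item / MIB


def items_per_second(items, seconds):
    """Indexing throughput from item count and elapsed time."""
    return items / seconds


# (items, average bytes per item, indexing time in seconds), from the two cases
cases = {
    "Case 1": (8531, 464, 1.0),
    "Case 2": (309269, 140, 30.0),
}

for name, (items, size, seconds) in cases.items():
    print(
        f"{name}: {total_mib(items, size):.1f} MiB, "
        f"{items_per_second(items, seconds):.0f} items/s"
    )
```

This gives about 3.8 MiB and 8'531 items/s for case 1, and about 41.3 MiB and 10'309 items/s for case 2, in line with the rounded figures reported above.
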
h2. Conclusion

This algorithm is very fast for both indexing and searching (roughly 8'000 to 10'000 items/s indexed, searches in under 1 ms), but it uses a lot of memory (140 to 464 bytes per indexed item). For now it will be used unchanged; some space optimization may be done in the future.