Version 23 - History - Algorithms - D-LAN - Redmine

Algorithms » History » Version 23

Greg Burri, 08/11/2009 11:46 PM

-Greg Burri
+h1. Algorithms
 Greg Burri
 h2. Word indexing
 Theses steps are valid independently of the language.
 # Each file is split in word, there is a set of characters which will use as delimiter between the word like [' ', '#', '.', '?',..].
 # Some character are replaced by a another, for example : ['é', 'ë',..] => 'e'. The goal is to remove all accent.
-Greg Burri
+h2. Searching
 Greg Burri
-Greg Burri
+For a functional description see here : [[Functional definition#The-search-window]]
 Greg Burri
-Greg Burri
+The match of a word can be partial from the beginning, for example _train_ will match _training_.
-Greg Burri
+!aybabtu_search.png!
-Greg Burri
+_This schema depicts how the results are sorted from one peer. Each peer result are then merged._
 Greg Burri
-Greg Burri
+h2. Peer ID
-Greg Burri
+Each peer owns a peer id which is unique and generated during the first start. This ID is used to identify a peer, it's better than the previous usage of peer IP, considering this situation :
 Greg Burri
 * _A_ put in queue a file entry _f_ from _B_, _B_ doesn't know the hashes of this file entry.
 * _B_ change his IP address.
 * _A_ want to download _f_, it can ask _B_ for the hashes even _B_'s IP changed.
-Greg Burri
+h2. Core threads
-Greg Burri
+There are three kind of threads in the core in addition to the main thread :
 * [[Protocol_core-core#Downloading-thread|Downloading thread]] : @DownloadManager::ChunkDownloader@
-Greg Burri
+* Uploading thread : @UploadManager::Uploader@
-Greg Burri
+* [[Algorithms#Updating-the-file-cache|Updating file cache thread]] : @FileManager::FileUpdater@
 Greg Burri
-Greg Burri
+h2. Updating the file cache
 Greg Burri
 Here is the algorithm for the thread (@FileManager::FileUpdater@) which will periodically update the file cache and persist it.
-Greg Burri
+There is a prototype for the watcher here : [[Watcher Prototype]]
-Greg Burri
+<pre>
-Greg Burri
+D : The set of shared directories
-Greg Burri
+T : Time during which the hashes are computed (for example 30s)
 F : A set containing file with unknown hashes, initially empty
 W : the directory watcher
 Greg Burri
 // First synchronize (at start)
 For each d in D (recursively) :
-Greg Burri
+   - Add d to W
-Greg Burri
+   - Synchronize physical folders and files with d content
-Greg Burri
+   - Add in F the files which don't have hashes
 Greg Burri
-Greg Burri
+Loop :
-Greg Burri
+   t : Time.now
-Greg Burri
+   For each f in F :
-Greg Burri
+      - Compute the unknown hash of files f
-Greg Burri
+      - Remove f from F
       If (Time.now - t) > T : break
-Greg Burri
+   - Wait for changes for a period of (if F is empty then INFINITE else 0)
       - When a modification occurs synchronize the file/folder
-Greg Burri
+      - Add each new file in F
-Greg Burri
+   - Persist the entire cache in a file (only every ~30min)
-Greg Burri
+</pre>
 Greg Burri
-Greg Burri
+h2. Downloading
 See here : [[Protocol_core-core#Downloading-threads]]

Project

General

Profile

D-LAN

Algorithms » History » Version 23