Hi all, I would like to index the mouse or human genome with an ESA. I need to do this more than once though and would like to store the ESA on disk as it takes some hours to construct. Is this feasible? Is there any way to do this in SeqAn already? Also parallel construction is interesting to me. To quote wikipedia (http://en.wikipedia.org/wiki/Suffix_tree#External_construction): ERA is a recent parallel suffix tree construction method that is significantly faster. ERA can index the entire human genome in 19 minutes on an 8-core desktop computer with 16GB RAM. On a simple Linux cluster with 16 nodes (4GB RAM per node), ERA can index the entire human genome in less than 9 minutesAre there any plans to incorporate the ERA algorithm (http://www.vldb.org/pvldb/vol5/p049_essammansour_vldb2012.pdf) into SeqAn? Thanks, John. |