FU Logo
  • Startseite
  • Kontakt
  • Impressum
  • Home
  • Listenauswahl
  • Anleitungen

Re: [Seqan-dev] Client-server lambda?

<-- thread -->
<-- date -->
  • From: Jose Manuel Duarte <jose.duarte@psi.ch>
  • To: Hannes Hauswedell <hannes.hauswedell@fu-berlin.de>
  • Date: Mon, 11 May 2015 15:11:29 +0200
  • Cc: SeqAn Development <seqan-dev@lists.fu-berlin.de>
  • Reply-to: SeqAn Development <seqan-dev@lists.fu-berlin.de>
  • Subject: Re: [Seqan-dev] Client-server lambda?

Have you tried storing the database file (including lambda's files) in a
shared memory filesystem, e.g. /dev/shm ? If you do this all data will already
be in main memory when the program is started -- however it will still need to
be copied around, so of course its not optimal. Also during program run-time
the sequences will both be in the program's allocated memory and in the shm,
so they will effectively use double the space. But it might still be
worthwhile for you, I can't say without knowing the exact use-case and
hardware available.

I've tried /dev/shm already but it didn't make a difference.

Here are the runtimes from 3 consecutive runs (269 sequences in one file against a database that takes 7.7GB in plain text fasta file).

Reading from disk:

real    1m49.591s
real    1m49.259s
real    1m49.282s

Reading from /dev/shm:

real    1m49.480s
real    1m49.290s
real    1m49.007s


As you say the data still needs to be copied around and that is most likely where most of that time is spent (steps "Loading Subj Sequences" and "Loading Subj Ids" seem to be the slow ones).

My guess is that there's also a lot of disk buffering happening when it's read from disk (the system I'm running has 128GB of memory and not so loaded at the moment, so I'm sure it has enough memory to keep all the files in the buffer cache), that's my explanation as to why there's not much difference between the disk and the /dev/shm runs.

Cheers

Jose



<-- thread -->
<-- date -->
  • References:
    • [Seqan-dev] Client-server lambda?
      • From: Jose Manuel Duarte <jose.duarte@psi.ch>
    • Re: [Seqan-dev] Client-server lambda?
      • From: Hannes Hauswedell <hannes.hauswedell@fu-berlin.de>
  • seqan-dev - May 2015 - Archives indexes sorted by:
    [ thread ] [ subject ] [ author ] [ date ]
  • Complete archive of the seqan-dev mailing list
  • More info on this list...

Hilfe

  • FAQ
  • Dienstbeschreibung
  • ZEDAT Beratung
  • postmaster@lists.fu-berlin.de

Service-Navigation

  • Startseite
  • Listenauswahl

Einrichtung Mailingliste

  • ZEDAT-Portal
  • Mailinglisten Portal