[tahoe-lafs-trac-stream] [tahoe-lafs] #1523: MDMF upload via web-API uses much more memory in the gateway process than expected (was: MDMF upload via web-API uses much more memory in the gateway process than updating the same file via SFTP)

tahoe-lafs trac at tahoe-lafs.org
Fri Sep 2 11:50:57 PDT 2011


#1523: MDMF upload via web-API uses much more memory in the gateway process than
expected
--------------------------------+------------------------------------------
     Reporter:  davidsarah      |      Owner:  davidsarah
         Type:  defect          |     Status:  assigned
     Priority:  major           |  Milestone:  undecided
    Component:  code-frontend-  |    Version:  1.9.0a1
  web                           |   Keywords:  memory tahoe-put performance
   Resolution:                  |
Launchpad Bug:                  |
--------------------------------+------------------------------------------
Description changed by davidsarah:

Old description:

> Split from #113:
>
> The web-API interface does not support streaming (#113, #320), so it is
> expected for the gateway to need to hold the whole file in memory in
> order to upload it. However, when using {{{tahoe put}}} to upload an MDMF
> file, the increase in memory usage of the gateway process seems to be
> more than double the file size. For example, when uploading a 191 MiB
> MDMF file in 1.9alpha using {{{tahoe put --mutable --mutable-
> type=mdmf}}}, the peak RSS of the gateway (which was also a storage
> server in this test) was about 510 MiB greater than when updating the
> same file using SFTP (which streams to, and then from, a temporary file).

New description:

 Split from #113:

 The web-API interface does not support streaming (#113, #320), so it is
 expected for the gateway to need to hold the whole file in memory in order
 to upload it. However, when using {{{tahoe put}}} to upload an MDMF file,
 the increase in memory usage of the gateway process seems to be more than
 double the file size. For example, when uploading a 191 MiB MDMF file in
 1.9alpha using {{{tahoe put --mutable --mutable-type=mdmf}}}, the peak RSS
 of the gateway (which was also a storage server in this test) was over
 1300 MiB. There is also a huge memory leak of more than 700 MiB after the
 upload has finished.

 I originally thought that the memory usage was larger when using the web-
 API than when updating the same file using SFTP, but apparently that was
 wrong (I may have been misled by at first doing the SFTP experiment
 without restarting the nodes).

--

-- 
Ticket URL: <http://tahoe-lafs.org/trac/tahoe-lafs/ticket/1523#comment:7>
tahoe-lafs <http://tahoe-lafs.org>
secure decentralized storage


More information about the tahoe-lafs-trac-stream mailing list