Comments (4)

commented on May 2, 2024

This is actually a really interesting problem. Compression is often the bottleneck, but sometimes the network ends up being the limit.

I think the ideal solution is dynamic compression, most likely with lzo since it's normally not CPU-bound. You can adjust the compression level until the input read speed is highest (input reads effectively slow down whenever the output slows down, so the read rate reflects end-to-end throughput). The best setting might turn out to be no compression or maximum compression.

I haven't seen any utilities that do this, but it should be... not easy, but relatively straightforward. And it should work without caveats.
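
To make the idea concrete, here is a rough sketch of such a feedback loop. It is only an illustration under stated assumptions: zlib levels 0 to 9 stand in for lzo (zlib ships with Python, lzo does not), and `AdaptiveLevel` and `send_adaptively` are hypothetical names, not part of WDT or any existing utility. Each chunk is compressed on its own, so the receiver can decompress chunks without knowing which level was used.

```python
import time
import zlib


class AdaptiveLevel:
    """Hill-climb a compression level toward the highest observed input rate."""

    def __init__(self, level=1, min_level=0, max_level=9):
        self.level = level          # 0 means no compression, 9 is zlib's maximum
        self.min_level = min_level
        self.max_level = max_level
        self.step = 1               # current search direction: +1 or -1
        self.last_rate = None       # input rate measured before the last change

    def update(self, input_rate):
        """Record the rate (bytes/sec) seen at the current level, then pick the next level."""
        if self.last_rate is not None and input_rate < self.last_rate:
            self.step = -self.step  # the last change made things slower, so turn around
        self.last_rate = input_rate
        self.level = max(self.min_level, min(self.max_level, self.level + self.step))
        return self.level


def send_adaptively(chunks, send, controller, clock=time.monotonic):
    """Compress each chunk at the controller's current level and time the whole step."""
    for chunk in chunks:
        start = clock()
        send(zlib.compress(chunk, controller.level))  # send() blocks when the output is slow
        elapsed = max(clock() - start, 1e-9)          # guard against a zero-length interval
        controller.update(len(chunk) / elapsed)
```

This controller keeps probing neighbouring levels, so it hovers around the best level rather than settling on it. A real implementation would probably smooth the rate over several chunks before changing direction.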

from wdt.

nikunjy commented on May 2, 2024

Hey, compression is one of the things we want to do. We have an ongoing effort (not high priority yet), and you will see it in WDT soon. In the meantime:

  1. We use gzip in wcp (https://github.com/facebook/wdt/blob/master/wcp.sh#L87)
  2. In WDT we have done some testing using folly compression (https://github.com/facebook/folly/blob/master/folly/io/Compression.h#L56)
    Look at Snappy and LZ4_VARIANT.

Please feel free to share any experience you have had with compression, and tell us more about what you are using WDT for.

junhuihuang commented on May 2, 2024

Thanks for your reply. I am testing WDT for transferring log files between two machines that use disks, but the speed is not very good, about 18 MB/s. With compression added and the data written using the pwrite function, I think the throughput could be higher.
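
A minimal sketch of what that could look like on the receiving side, assuming each block arrives zlib-compressed together with its absolute file offset. `write_block` is a hypothetical helper for illustration, not WDT code, and `os.pwrite` is only available on Unix-like systems.

```python
import os
import zlib


def write_block(fd, offset, compressed_block):
    """Decompress one block and write it at its absolute offset in the file."""
    data = zlib.decompress(compressed_block)
    written = 0
    while written < len(data):  # pwrite may write fewer bytes than requested
        written += os.pwrite(fd, data[written:], offset + written)
    return written
```

Because pwrite takes the offset explicitly and does not move the file position, blocks can be written in whatever order they arrive.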

uddipta commented on May 2, 2024

Try with -block_size_mbytes=-1 and -num_ports=1.
With disks, a single writer thread generally gives the best throughput. We use multiple threads to optimize for flash.

You can also use -enable_perf_stat_collection to find out which system calls are taking the most time.

Let us know if that improves throughput or not.
