Coder Social home page Coder Social logo

Size limitations about ursadb HOT 2 CLOSED

cert-polska avatar cert-polska commented on May 14, 2024
Size limitations

from ursadb.

Comments (2)

icedevml avatar icedevml commented on May 14, 2024

Hello,

so far we've got information about 2 TB of data (in total) succesfully indexed with mquery, while still keeping reasonable query time of 3-5 seconds.

When indexing a single file, the size shouldn't matter, because it will never have more than 2^24 of unique trigrams.

I don't think if we've got any more usage data so far.

from ursadb.

msm-code avatar msm-code commented on May 14, 2024

Just to expand on this before closing the issue: right now we have much more than 2TB indexed, and query time is still around 5 seconds.

As icedevml has said, there are no limitations for a single file size - but the bigger are your files the less sense it makes for mquery (due to the way it works - at some point it will contain every possible trigram). Hard to say when, but indexing 4GB files is definitely counter-productive, while 10MB files are definitely handled well.

from ursadb.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.