Coder Social home page Coder Social logo

Comments (17)

cldellow avatar cldellow commented on June 24, 2024

Maybe Northwinds DB?

It's under MSPL: https://web.archive.org/web/20170623074454/https://northwinddatabase.codeplex.com/license

Someone has a SQLite port of it: https://github.com/jpwhite3/northwind-SQLite3/issues

from datasette-ui-extras.

cldellow avatar cldellow commented on June 24, 2024

also chinook, a weird mix of ERP and ... music? https://www.sqlitetutorial.net/sqlite-sample-database/

from datasette-ui-extras.

cldellow avatar cldellow commented on June 24, 2024

https://openparliament.ca/data-download/ is a PG dump. We could maybe convert it, but it's 5gb uncompressed, so probably not a great choice for a demo.

OTOH, it'd have lots of "real" data...

from datasette-ui-extras.

cldellow avatar cldellow commented on June 24, 2024

almost certainly a bad idea: could we pull extracts of wikipedia, eg using https://github.com/spencermountain/wtf_wikipedia ?

Could we instead write some SPARQL queries against wikidata? SPARQL looks like it has a vertical learning curve...

from datasette-ui-extras.

cldellow avatar cldellow commented on June 24, 2024

cia factbook: https://github.com/factbook/factbook.sql

from datasette-ui-extras.

cldellow avatar cldellow commented on June 24, 2024

cars: https://github.com/abhionlyone/us-car-models-data/blob/master/1992.csv

very thin data, but many rows, and bonus json tag array

from datasette-ui-extras.

cldellow avatar cldellow commented on June 24, 2024

maybe check r/datasets ?

from datasette-ui-extras.

cldellow avatar cldellow commented on June 24, 2024

Sakila: https://lite.datasette.io/?url=https%3A%2F%2Fraw.githubusercontent.com%2Fbradleygrant%2Fsakila-sqlite3%2Fmain%2Fsakila_master.db%3Fraw%3Dtrue

from datasette-ui-extras.

cldellow avatar cldellow commented on June 24, 2024

USDA food: https://github.com/alyssaq/usda-sqlite

from datasette-ui-extras.

cldellow avatar cldellow commented on June 24, 2024

Early dumps of stack exchange sites: https://tejp.de/files/so/dbdump/

from datasette-ui-extras.

cldellow avatar cldellow commented on June 24, 2024

I think the early dumps of SE sites is an interesting way to go, it has:

  • dates (vote creation table)
  • timestamps
  • images (user gravatar)
  • json arrays (question tags)
  • lots of fkeys
  • an opportunity to make views, to split the posts table into questions/answers
  • full text search (titles/bodies)
  • prefix search (user names)
  • an OK number of rows -- ~16K in the post table

My chief complaint is maybe that it's too small to be a really good test. I think I'd prefer like 50K rows in its biggest table. Ah well, let's see what happens.

from datasette-ui-extras.

cldellow avatar cldellow commented on June 24, 2024

Yes, let's go with Stack Exchange. See https://github.com/cldellow/stackexchange-to-sqlite

from datasette-ui-extras.

cldellow avatar cldellow commented on June 24, 2024

https://dux-demo.fly.dev/

from datasette-ui-extras.

cldellow avatar cldellow commented on June 24, 2024

https://github.com/cldellow/dux-demo, I haven't tried to automate things yet. Dunno if I'll stick with fly, we'll see.

from datasette-ui-extras.

cldellow avatar cldellow commented on June 24, 2024

This does make me want another, bigger database. The cooking DB is pretty zippy, something 10x as big would be a nice proof that things scale

from datasette-ui-extras.

cldellow avatar cldellow commented on June 24, 2024

superuser is ~14x the size... that's maybe a bit too much? Still need room for an FTS index, eg.

OTOH, "annoying" is probably the right qualitative test for a big database.

from datasette-ui-extras.

cldellow avatar cldellow commented on June 24, 2024

Plus, if I eventually script this into a GH action, I'll stop noticing.

from datasette-ui-extras.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.