Comments (17)
Maybe Northwinds DB?
It's under MSPL: https://web.archive.org/web/20170623074454/https://northwinddatabase.codeplex.com/license
Someone has a SQLite port of it: https://github.com/jpwhite3/northwind-SQLite3/issues
from datasette-ui-extras.
also chinook, a weird mix of ERP and ... music? https://www.sqlitetutorial.net/sqlite-sample-database/
from datasette-ui-extras.
https://openparliament.ca/data-download/ is a PG dump. We could maybe convert it, but it's 5gb uncompressed, so probably not a great choice for a demo.
OTOH, it'd have lots of "real" data...
from datasette-ui-extras.
almost certainly a bad idea: could we pull extracts of wikipedia, eg using https://github.com/spencermountain/wtf_wikipedia ?
Could we instead write some SPARQL queries against wikidata? SPARQL looks like it has a vertical learning curve...
from datasette-ui-extras.
cia factbook: https://github.com/factbook/factbook.sql
from datasette-ui-extras.
cars: https://github.com/abhionlyone/us-car-models-data/blob/master/1992.csv
very thin data, but many rows, and bonus json tag array
from datasette-ui-extras.
maybe check r/datasets ?
from datasette-ui-extras.
from datasette-ui-extras.
USDA food: https://github.com/alyssaq/usda-sqlite
from datasette-ui-extras.
Early dumps of stack exchange sites: https://tejp.de/files/so/dbdump/
from datasette-ui-extras.
I think the early dumps of SE sites is an interesting way to go, it has:
- dates (vote creation table)
- timestamps
- images (user gravatar)
- json arrays (question tags)
- lots of fkeys
- an opportunity to make views, to split the posts table into questions/answers
- full text search (titles/bodies)
- prefix search (user names)
- an OK number of rows -- ~16K in the post table
My chief complaint is maybe that it's too small to be a really good test. I think I'd prefer like 50K rows in its biggest table. Ah well, let's see what happens.
from datasette-ui-extras.
Yes, let's go with Stack Exchange. See https://github.com/cldellow/stackexchange-to-sqlite
from datasette-ui-extras.
from datasette-ui-extras.
https://github.com/cldellow/dux-demo, I haven't tried to automate things yet. Dunno if I'll stick with fly, we'll see.
from datasette-ui-extras.
This does make me want another, bigger database. The cooking DB is pretty zippy, something 10x as big would be a nice proof that things scale
from datasette-ui-extras.
superuser is ~14x the size... that's maybe a bit too much? Still need room for an FTS index, eg.
OTOH, "annoying" is probably the right qualitative test for a big database.
from datasette-ui-extras.
Plus, if I eventually script this into a GH action, I'll stop noticing.
from datasette-ui-extras.
Related Issues (20)
- don't enable fkey checks on _internal database
- omnisearch fails to redirect row if compound pkey is used HOT 1
- omnisearch can refuse to do string searches if there are a few integers mixed in
- JSONTagsControl: no way to add an item that doesn't exist in autosuggest
- add row: give option to specify pkey if it's a text field without a default
- Datasette 1.0/main compatibility HOT 1
- python windows charset invalid
- Not working with recent releases
- Add the ability to export a mutable database HOT 1
- Autosuggest doesn't work on mobile
- Consider JSON Schema support HOT 1
- Add `dux` command, rename `dedux` to `undux` HOT 1
- view support for omnisearch
- be able to explicitly specify columns for omnisearch
- CHECK (column IS NULL OR column IN (...)) should work HOT 2
- omnisearch: string fkey to table with no label column is omitted
- pysqlite3-binary does not like setting synchronous = NORMAL HOT 1
- dux_stats_column_values `pks` column got double-encoded somehow HOT 2
- investigate sqlite3 vs pysqlite3-binary difference HOT 4
- `dux_pending_rows` trigger can't handle blobs
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from datasette-ui-extras.