Comments (2)
Sure, would be happy to accept a pull request that implements it.
There's no problem with it. The index format includes a pre-record (per CDX line) version number. So create a new ddcodd
from outbackcdx.
Gah. Sorry. 'close issue' is too near the text box on mobile.
... So create a new Capture.decodeValueV2() method for a version 2 record format that supports the robots field and update Capture.encodeValue() to write the new format. Then the index server will happily read both new and old records and you can even mix them in the one index while incrementally reindexing to fill in the robots field data.
It was marked as todo simply because I didn't have any CDX files on hand with that field populated and wasn't sure what the data format was or what exactly it was used for in Wayback.
from outbackcdx.
Related Issues (20)
- OWB compat: annotate 'closest' capture for BubbleCalendar
- Access rule editor: clearing 'accessed between' fields doesn't work
- CDXJ: Error: no such capture field: method HOT 7
- URL:s ending with an asterisk isn't found when searching for the same URL HOT 3
- Upgrade rocksdb
- Pasting a URL in the query field in the dashboard doesn't work
- Document index upgrades HOT 1
- ArrayIndexOutOfBoundsException, NumberFormatException when loading indexed ARC-files HOT 2
- Releases HOT 1
- API to scan all records HOT 3
- Possible infinite loop upon malformed requests HOT 6
- Handling of `+` characters in queries HOT 1
- CDX11+3 support HOT 2
- Replication secondary applies the last batch over and over HOT 3
- resolving a surt-timestamp collision should not replace a non-revisit record with a revisit record HOT 9
- Signed WARC URL generation
- Handle invalid dates HOT 4
- Report exception to client in WbCdxApi HOT 2
- Possible race condition under load HOT 8
- XML protocol: numreturned and numresults
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from outbackcdx.