dangbert / clippy-kindle
Create Anki flashcards and markdown files from your Kindle notes/highlights.
Home Page: https://www.engbert.me/blog/anki-flashcards-on-kindle/
License: MIT License
Highlights made at the beginning of certain PDF files may record the page number as a Roman numeral, e.g. 'V'. We need to consider how to handle this:
Parsing file: 'My Clippings.txt'
ERROR: unable to parse highlight (in unexpected/unsupported format)
problem section in file (lines 43739 - 43743) >>>
'2015_Book_IntroductionToEvolutionaryComp '
'- Your Highlight on page V-V | Added on Friday, September 9, 2022 1:40:06 AM'
''
'This book has a supporting website at www.evolutionarycomputation.org which ofers additional information'
<<<
Exact section of file causing issue:
2015_Book_IntroductionToEvolutionaryComp
- Your Highlight on page V-V | Added on Friday, September 9, 2022 1:40:06 AM
This book has a supporting website at www.evolutionarycomputation.org which ofers additional information
==========
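One possible way to handle this, sketched below, is to accept either a decimal or a Roman numeral page token and record which kind it was. The function names (roman_to_int, parse_page, parse_page_range) are hypothetical and not taken from clippy.py; the sketch only covers the page value itself (e.g. the "V-V" in the header line above), not the rest of the clipping header.

```python
import re

# Values of the individual Roman numeral symbols.
_ROMAN_VALUES = {"I": 1, "V": 5, "X": 10, "L": 50, "C": 100, "D": 500, "M": 1000}
# Accepts well-formed numerals from I to MMMCMXCIX (1 to 3999).
_ROMAN_RE = re.compile(r"^M{0,3}(CM|CD|D?C{0,3})(XC|XL|L?X{0,3})(IX|IV|V?I{0,3})$")


def roman_to_int(text):
    """Convert a Roman numeral such as 'V' or 'xiv' to an int."""
    s = text.strip().upper()
    if not s or not _ROMAN_RE.match(s):
        raise ValueError(f"not a Roman numeral: {text!r}")
    total = 0
    # Pair each symbol with its successor; a smaller symbol before a
    # larger one is subtracted (IV = 4), otherwise it is added.
    for ch, nxt in zip(s, s[1:] + " "):
        value = _ROMAN_VALUES[ch]
        if nxt in _ROMAN_VALUES and _ROMAN_VALUES[nxt] > value:
            total -= value
        else:
            total += value
    return total


def parse_page(token):
    """Return (page_number, is_roman) for a single page token."""
    token = token.strip()
    if token.isdecimal():
        return int(token), False
    return roman_to_int(token), True


def parse_page_range(text):
    """Parse 'V-V', '12-13' or a single page into (start, end) pairs."""
    start, sep, end = text.partition("-")
    first = parse_page(start)
    last = parse_page(end) if sep else first
    return first, last
```

Keeping the is_roman flag lets later stages sort front-matter pages before page 1 instead of mixing them with the numbered pages; whether that ordering is wanted is a design choice left open here.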
A pip package should make it much easier for people to install and use this repo (e.g. pip install clippy-kindle).
Considerations:
ensure pytest and docs dependencies are not required for installing this pip package. (need a better way to manage dev vs prod dependencies)
use tox for testing/validation of the package in future
An MVP for this is already complete in the anki branch of this repo, but it's hardcoded with the names/settings of my personal Anki data. The solution there needs to be generalized as follows:
{
  // global settings:
  "user_anki_dir": "~/.local/share/Anki2/User 1",
  // settings for a particular book group:
  "spanish": {
    "deck": "My-cool-deck",
    "tags": ["world::lang::es"],
    // ...
  },
}
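As a rough illustration of how such settings could be consumed, the sketch below looks up one book group in an already-loaded settings dict and expands the Anki directory path. The function name resolve_group_settings and the returned key names are assumptions for illustration, not part of the anki branch; loading the file itself is left out, since the comments and trailing commas in the example above are not valid strict JSON.

```python
import os


def resolve_group_settings(config, group):
    """Combine the global settings with those of one book group.

    `config` is a dict shaped like the example above (hypothetical schema).
    """
    settings = config.get(group)
    if not isinstance(settings, dict):
        raise KeyError(f"no settings for book group {group!r}")
    return {
        # Expand "~" so the path can be used directly on disk.
        "anki_dir": os.path.expanduser(config["user_anki_dir"]),
        "deck": settings["deck"],
        "tags": list(settings.get("tags", [])),
    }


example_config = {
    "user_anki_dir": "~/.local/share/Anki2/User 1",
    "spanish": {"deck": "My-cool-deck", "tags": ["world::lang::es"]},
}
resolved = resolve_group_settings(example_config, "spanish")
```

Raising KeyError for an unknown group keeps a misspelled group name from silently falling back to someone else's deck.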
The bottleneck with running clippy.py is the duplicate removal stage.
collection.json), to only consider new clippings since the last parse for duplicate detection...
--no-cache flag to opt out of this behaviour (e.g. in case the user didn't remove duplicates in a previously cached run)
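A minimal sketch of the caching idea, under stated assumptions: each clipping is a dict with hypothetical fields book, loc and content, and the fingerprints of clippings handled in an earlier run are available as a set. How that set would be stored or read back is not shown, and none of these names come from clippy.py.

```python
import hashlib


def clipping_key(clipping):
    """Stable fingerprint of a clipping (field names are assumptions)."""
    raw = "\x1f".join(str(clipping.get(k, "")) for k in ("book", "loc", "content"))
    return hashlib.sha256(raw.encode("utf-8")).hexdigest()


def select_for_dedup(clippings, seen_keys, use_cache=True):
    """Return the clippings that still need duplicate detection.

    With use_cache=False (the analogue of a --no-cache flag) every
    clipping is returned, so the full check runs again.
    """
    if not use_cache:
        return list(clippings)
    return [c for c in clippings if clipping_key(c) not in seen_keys]
```

For example, if seen_keys holds the fingerprints from the previous parse, select_for_dedup(all_clippings, seen_keys) yields only the clippings added since then, which is the subset the duplicate removal stage would need to examine.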