Coder Social home page Coder Social logo

sueszli / notionsnapshot Goto Github PK

View Code? Open in Web Editor NEW
61.0 1.0 9.0 9.11 MB

notion web scraper

License: GNU Affero General Public License v3.0

Python 75.50% CSS 13.06% JavaScript 6.81% Shell 2.59% Dockerfile 2.05%
notion export-to-html notion-automation notion-backup notion-export notion2html

notionsnapshot's People

Contributors

2m avatar aahnik avatar aehernandez avatar akash-sharma-1 avatar berndtuhlig avatar bryanhpchiang avatar dependabot[bot] avatar dosoft avatar douglasjarquin avatar ecolabardini avatar joonatanjak avatar kevindaffaarr avatar kmlbgn avatar leoncvlt avatar leshchenko1979 avatar mjdeligan avatar nanobjorn avatar nijatismayilzada avatar noahsaso avatar safaorhan avatar sathesh95 avatar sueszli avatar sunziebabaq avatar tobiasleibrock avatar vincent-maladiere avatar web3gurung avatar zvedenyuk avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

notionsnapshot's Issues

Breadcrumbs without links

Links () are lost in the breadcrumbs. Can you fix it?
The root link is especially important. Without it, you can't get back to the main page in a simple way.
Thank you!

image

Forbidden to download assets

When I tried to scrape a public notion page, it reports 403 status code when downloading assets.

But if I replaced all "www.notion.so" with "xxx.notion.site", and scrape it again. The 403 status code disappeared.

I suggest to extract the subdomain from the notion page URL instead of using "www.notion.so" to access notion page assets.

Not found Chrome Driver issue

Hi Sueszli,

I got the issue about no Chrome Driver found for 119.0.6045 Version

I have tired in Github CodeSpace, Ubuntu and local windows machine. All faced the Driver issue

Error msg:

ValueError: There is no such driver by url https://chromedriver.storage.googleapis.com/119.0.6045/chromedriver_win32.zip
ValueError: There is no such driver by url https://chromedriver.storage.googleapis.com/119.0.6045/chromedriver_linux64.zip

image

Failed to download chromedrive

When I run test.sh, it failed to download the chromedriver.

I found out the reason is the chromedriver download link has been changed since version M115. Pls refer to https://sites.google.com/chromium.org/driver/ for more information.

My current workaround is manually download chrome and chromedriver, and specify the executable_path in driver.py mannually. But I think there should be a generic solution.

My environment is Ubuntu 20.04.

config.toml ?

As a user of loconotion, I'm using the config.toml extensively, especially for its code injections features. Has this been deprecated in this version ?

feature request: dockerfile

would make sense.

it's hard to track all lib versions and maintain installation instructions on multiple different operation systems.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.