Comments (5)
Hello Marcel,
I was able to get an almost 1:1 exact same page as on the web with that command. Could you please point out what seems to be missing in the saved file?
from monolith.
It was missing the huge table following an H2 tag..
Oh wow, what the heck.. now it works here too 🤔
Unfortunately I had overwritten the file I previously got to diff
with curl
.. that was created with
monolith --no-css --no-images --no-js 'https://distrowatch.com/table.php?distribution=void'
:
distrowatch.com--void-linux.txt
Mh actually that is still missing it.. gotta run now got train to catch
from monolith.
Interesting. Could you please try saving it again and wait for another train? It doesn't look like I'm able to reproduce it on my end.
from monolith.
Looks like it needs to either have JS or CSS to render those tables, or alternatively you can provide this flag: -n
. It'll unwrap NOSCRIPT tags and make it look the way things look in browsers that don't have JS enabled.
from monolith.
Ok coming back to this, a more interesting observation is that output fluctuates... Try this command repeatedly:
monolith 'https://distrowatch.com/table.php?distribution=void' > distrowatch.com--void-linux.$(date +%F.%H%Mh%S).htm
The -n
option actually makes no difference with that..
from monolith.
Related Issues (20)
- Outdated Project Reference in README HOT 2
- The page you need to log in cannot be saved after logging in HOT 1
- whole progress failed caused by get favicon.ico HOT 1
- Unicode mangling
- [Feature request] Simple way to permanently store and use Blacklist of domains HOT 13
- [Bug] Data URLs exceed length limits HOT 8
- Save apple-touch-icon too HOT 3
- "https://mp.weixin.qq.com" web title and CSS switch image on click not work
- How to get just HTML, no <script> HOT 1
- Additionally fetch dynamic content HOT 3
- [proposal] An option to remove alternative sources for media urls HOT 1
- HTML page content partially invisible HOT 2
- Site doesnt work HOT 1
- download path ? HOT 1
- Site doesn't work, redirected towards ct.captcha-delivery.com HOT 1
- What's the default location sites are being saved to? HOT 3
- Relax glibc version requirement HOT 3
- Aggregate multiple html files? HOT 3
- Saving all files separately like IDM. HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from monolith.