chriswren / grunt-link-checker Goto Github PK
View Code? Open in Web Editor NEWRun node-simple-crawler to discover broken links on your website
License: MIT License
Run node-simple-crawler to discover broken links on your website
License: MIT License
Hi,
The crawler goes through a couple of pages without any problems, but then throws this error. Any ideas as to why that might be the case, and how I could fix it? My configuration looks as follows:
linkChecker: {
options: {
maxConcurrency: 10
},
postDeploy: {
site: 'www.radiologen-konstanz.de'
}
}
Kind regards,
Max
Hi
I'm getting a very strange behavior with the link checker. It tries to get the CSS from the link tag and fails with 404 by trying to make the URL relative.
Resource not found linked from http://myIP:myPORT/products-services/healthcare-credit-card.html to http://myIP:myPORT/products-services/%27/styles/main.css
Status code: 404
I removed the IP since this client work.
Have you seen anything like this? The CSS resolves fine in the page and all styles work. I was just wondering why this error would appear. It doesn't appear on all pages either, just 3 of them.
Thanks
Joe
It looks like the CONTRIBUTING.md is copied from other projects. I started trying to fix the links but I guess you'd probably just want to re-write the whole thing and you'd know better than me what should go there.
grunt-link-checker is really powerful, but it can be slow as it steps through all the pages in a site. It would be nice if there was some kind of progress indication (pages checked/remaining) or similar to give an idea of progress. Not essential, just nice to have.
Hey, this looks great! It would be good if there was a plain Node module though. That would make it a lot more accessible. Let's say there would be a new "link-checker" (core) module.
Then this project would have the new "link-checker" module as a dependency, so this would just wrap it for grunt.
Then I could also make a gulp module if that made sense (maybe a Gulp plugin isn't needed; maybe using the plain Node module with gulp on its own would make most sense, I don't know).
Would you be open to that? I had a quick look over the source and it doesn't seem like it would be hard. I'd be up for helping anyway.
It would be nice to be able to whitelist a bunch of URLs which are expected to break for a number of reasons: dynamic-template URLS, hacky graceful fallbacks code etc.
Can i somehow do this with the current implementation?
I'm getting the above error after attempting to set up this plugin. These are what my files look like...
Gruntfile.js
module.exports = function(grunt) {
grunt.initConfig({
pkg: grunt.file.readJSON('package.json'),
'link-checker': {
dev: {
site: 'http://www.website.com',
options: {
maxConcurrency: 20,
}
}
}
});
grunt.loadNpmTasks('grunt-link-checker');
grunt.registerTask('default', ['link-checker']);
};
package.json
{
"name": "Website-Link-Crawler",
"version": "0.0.1",
"devDependencies": {
"grunt": "^0.4.5",
"grunt-link-checker": "0.0.6"
}
}
Probably because the site that I'm crawling has an pretty high amount of resources. Though, I wonder if this isn't preventable? Am I overlooking an option here?
Is it possible to check for redirects too?
For example, it reports a 404 for this URL:
Hi. When attempting to run a basic setup on a local server, I'm getting this error:
$ grunt checklinks
Running "link-checker:dev" (link-checker) task
Fatal error: Cannot read property 'cyan' of undefined
'link-checker': {
dev: {
site: '0.0.0.0:8080'
}
}
My setup:
[email protected]
[email protected]
[email protected]
[email protected]
[email protected]
Any ideas? I guess maybe it's something to do with the colors dependency? This is a fresh install of the package today.
I've got a few anchors on my page, and when links to those anchors are being followed, they're getting marked as 404s. A typical example:
<a href="#fast-service" id="fast-service-link" title="Fast service times">Fast service times</a>
As a link to:
<div class="hiccup-right light-grey how-we-are-here-panel" id="fast-service"></div>
And the error:
Resource not found linked from https://[mydomain]/about-us to https://[mydomain]/about-us#fast-service
Status code: 404
Is this expected behaviour?
When registering the Grunt task, took me a while to figure our that you need to enter 'linkChecker' โ perhaps it might be good to include in the readme? Thanks :-)
This should make things simpler to work with grunt templates.
connect: {
server: {
options: {
port: '<% linkChecker.options.initialPort %>',
base: 'test/fixtures'
}
}
}
A major version bump should be made though.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.