silentrob / apricot Goto Github PK
View Code? Open in Web Editor NEWA simple Hpricot Clone for nodejs.
License: MIT License
A simple Hpricot Clone for nodejs.
License: MIT License
I can require Apricot, but attempting to actually use it fails with a jsdom error. All I've installed on this box is node, npm, and apricot. All should be the newest versions, node and npm were cloned from git today, apricot was then installed via npm install apricot.
$ cat test.js
var apricot = require('apricot').Apricot;
console.log('apricot declared');
apricot.parse('
paragraph
', function() {$ node test.js
apricot declared
node.js:50
throw e;
^
Error: Cannot find module 'jsdom-0.1.2/jsdom/level1/core'
at loadModule (node.js:477:15)
at require (node.js:613:14)
at Object.Apricot (/usr/local/lib/node/.npm/apricot/0.0.4/package/lib/apricot.js:6:13)
at Function.parse (/usr/local/lib/node/.npm/apricot/0.0.4/package/lib/apricot.js:282:12)
at Object. (/home/geekdev/test.js:4:9)
at Module._compile (node.js:664:23)
at Module._loadScriptSync (node.js:671:10)
at Module.loadSync (node.js:540:12)
at Object.runMain (node.js:724:24)
at Array. (node.js:747:12)
$ npm ls installed
npm info it worked if it ends with ok
npm info version 0.2.1
[email protected] =silentrob active installed latest remote
[email protected] =tautologistics active installed latest remote
[email protected] =tmpvar active installed remote
[email protected] =tmpvar active installed remote stable latest
[email protected] =isaacs active installed latest remote
npm ok
I get this error trying to parse an html file. Doc is underfined.
{ _code: 5 }
No other info is given. What does 5 indicate?
i want to parse and extend templates
In the following test, only the link declared outside the IE directive can be found by Apricot:
var Apricot = require('apricot').Apricot,
assert = require('assert');
var html = '<!--[if IE]><link href="http://foo" media="screen, projection" rel="stylesheet" type="text/css" /><![endif]-->' +
'<link href="http://foo" media="screen, projection" rel="stylesheet" type="text/css" />'
Apricot.parse(html, function(err, doc) {
if (err) throw err;
var numLinks = 0;
doc.find('link').each(function(link) {
assert.equal('http://foo', link.href);
numLinks++;
});
assert.equal(2, numLinks);
});
Trying to install Apricot on a Windows machine get this error message:
C:\node\crawler>npm install apricot -g
npm http GET https://registry.npmjs.org/apricot
npm http 304 https://registry.npmjs.org/apricot
npm ERR! Unsupported
npm ERR! Not compatible with your operating system or architecture: apricot@0.0.
5
npm ERR! Valid OS: linux,macos,win
npm ERR! Valid Arch: x86,ppc,x86_64
npm ERR! Actual OS: win32
npm ERR! Actual Arch: ia32
npm ERR!
npm ERR! System Windows_NT 6.1.7601
npm ERR! command "C:\\Program Files\\nodejs\\\\node.exe" "C:\\Program Files\\nod
ejs\\node_modules\\npm\\bin\\npm-cli.js" "install" "apricot" "-g"
npm ERR! cwd C:\node\crawler
npm ERR! node -v v0.6.15
npm ERR! npm -v 1.1.16
npm ERR! code EBADPLATFORM
npm ERR! message Unsupported
npm ERR! errno {}
npm ERR!
npm ERR! Additional logging details can be found in:
npm ERR! C:\node\crawler\npm-debug.log
npm not ok
Specifications:
Windows 7 Professional 32 bits
Service Pack 1
Amd Athlon II x2 270 Processor 3.4Ghz
Im trying to rewrite a bunch of asset urls in an html file.
All the script tags that are outside of an IE conditional are fine and are rewritten correctly when I modify the src but, the problem is anything within:
<!--[if lt IE 9]>
<script src="bower_components/es5-shim/es5-shim.js"></script>
<script src="bower_components/json3/lib/json3.min.js"></script>
<![endif]-->
isn't returned by find.
For a page with something like
<a href='abc.php'>
When I open it with the callback
doc.find("a").each(el) {
console.log(el.href)
}
it shows /home/me/project-path/node-modules/apricot/lib/abc.php. I'd expect just abc.php there, or that you'd resolve it relative to the URL I passed to Apricot.open().
Interestingly, URLs that begin with a "/" are left as they are and no attempt is made to add the domain etc.
apricot claims arm architecture is not supported. Is there any reason for that ?
Line 10 in 0763f9b
var dom = require('jsdom-0.1.2/jsdom/level1/core').dom.level1.core;
should be
var dom = require('jsdom/jsdom/level1/core').dom.level1.core;
or
var dom = require('[email protected]/jsdom/level1/core').dom.level1.core;
Apricot.open("http://audun.ytterdal.net/test.php", function(doc) {
doc.find('title');
doc.each(function(el) {
console.log('found "' + el.innerHTML + '"');
})
})
Produces
$ node examples.js
found "
"
found "Tittelen på siden
"
juuso@node:~/Dropbox/etsi$ npm install apricot
npm WARN package.json [email protected] No README.md file found!
npm http GET https://registry.npmjs.org/apricot
npm http 304 https://registry.npmjs.org/apricot
npm http GET https://registry.npmjs.org/jsdom
npm http GET https://registry.npmjs.org/node-sizzle
npm http 304 https://registry.npmjs.org/jsdom
npm http 304 https://registry.npmjs.org/node-sizzle
npm http GET https://registry.npmjs.org/libxmljs
npm http GET https://registry.npmjs.org/htmlparser
npm http GET https://registry.npmjs.org/request
npm http GET https://registry.npmjs.org/cssom
npm http GET https://registry.npmjs.org/contextify
npm http 304 https://registry.npmjs.org/libxmljs
npm http 304 https://registry.npmjs.org/contextify
npm http 304 https://registry.npmjs.org/htmlparser
npm http GET https://registry.npmjs.org/bindings/1.0.0
npm http 304 https://registry.npmjs.org/request
npm http 304 https://registry.npmjs.org/cssom
npm http GET https://registry.npmjs.org/bindings
npm http 304 https://registry.npmjs.org/bindings/1.0.0
> [email protected] install /home/juuso/Dropbox/etsi/node_modules/apricot/node_modules/node-sizzle/node_modules/libxmljs
> node-gyp rebuild
/bin/sh: 1: xml2-config: not found
Traceback (most recent call last):
File "/home/juuso/.node-gyp/0.8.4/tools/gyp_addon", line 38, in <module>
rc = gyp.main(gyp_args)
File "/home/juuso/.node-gyp/0.8.4/tools/gyp/pylib/gyp/__init__.py", line 471, in main
options.circular_check)
File "/home/juuso/.node-gyp/0.8.4/tools/gyp/pylib/gyp/__init__.py", line 111, in Load
depth, generator_input_info, check, circular_check)
File "/home/juuso/.node-gyp/0.8.4/tools/gyp/pylib/gyp/input.py", line 2378, in Load
depth, check)
File "/home/juuso/.node-gyp/0.8.4/tools/gyp/pylib/gyp/input.py", line 384, in LoadTargetBuildFile
build_file_data, PHASE_EARLY, variables, build_file_path)
File "/home/juuso/.node-gyp/0.8.4/tools/gyp/pylib/gyp/input.py", line 1053, in ProcessVariablesAndConditionsInDict
build_file)
File "/home/juuso/.node-gyp/0.8.4/tools/gyp/pylib/gyp/input.py", line 1068, in ProcessVariablesAndConditionsInList
ProcessVariablesAndConditionsInDict(item, phase, variables, build_file)
File "/home/juuso/.node-gyp/0.8.4/tools/gyp/pylib/gyp/input.py", line 1027, in ProcessVariablesAndConditionsInDict
ProcessConditionsInDict(the_dict, phase, variables, build_file)
File "/home/juuso/.node-gyp/0.8.4/tools/gyp/pylib/gyp/input.py", line 904, in ProcessConditionsInDict
variables, build_file)
File "/home/juuso/.node-gyp/0.8.4/tools/gyp/pylib/gyp/input.py", line 1053, in ProcessVariablesAndConditionsInDict
build_file)
File "/home/juuso/.node-gyp/0.8.4/tools/gyp/pylib/gyp/input.py", line 1072, in ProcessVariablesAndConditionsInList
expanded = ExpandVariables(item, phase, variables, build_file)
File "/home/juuso/.node-gyp/0.8.4/tools/gyp/pylib/gyp/input.py", line 714, in ExpandVariables
(contents, p.returncode))
Exception: Call to 'xml2-config --libs' returned exit status 127. while trying to load binding.gyp
gyp ERR! configure error
gyp ERR! stack Error: `gyp_addon` failed with exit code: 1
gyp ERR! stack at ChildProcess.onCpExit (/usr/lib/nodejs/npm/node_modules/node-gyp/lib/configure.js:290:16)
gyp ERR! stack at ChildProcess.EventEmitter.emit (events.js:91:17)
gyp ERR! stack at Process._handle.onexit (child_process.js:674:10)
gyp ERR! System Linux 3.2.0-27-generic
gyp ERR! command "node" "/usr/lib/nodejs/npm/node_modules/node-gyp/bin/node-gyp.js" "rebuild"
gyp ERR! cwd /home/juuso/Dropbox/etsi/node_modules/apricot/node_modules/node-sizzle/node_modules/libxmljs
gyp ERR! node -v v0.8.4
gyp ERR! node-gyp -v v0.6.1
gyp ERR! not ok
npm ERR! [email protected] install: `node-gyp rebuild`
npm ERR! `sh "-c" "node-gyp rebuild"` failed with 1
npm ERR!
npm ERR! Failed at the [email protected] install script.
npm ERR! This is most likely a problem with the libxmljs package,
npm ERR! not with npm itself.
npm ERR! Tell the author that this fails on your system:
npm ERR! node-gyp rebuild
npm ERR! You can get their info via:
npm ERR! npm owner ls libxmljs
npm ERR! There is likely additional logging output above.
npm ERR! System Linux 3.2.0-27-generic
npm ERR! command "nodejs" "/usr/bin/npm" "install" "apricot"
npm ERR! cwd /home/juuso/Dropbox/etsi
npm ERR! node -v v0.8.4
npm ERR! npm -v 1.1.45
npm ERR! code ELIFECYCLE
npm http 304 https://registry.npmjs.org/bindings
> [email protected] install /home/juuso/Dropbox/etsi/node_modules/apricot/node_modules/jsdom/node_modules/contextify
> node-gyp rebuild
Traceback (most recent call last):
File "/home/juuso/.node-gyp/0.8.4/tools/gyp_addon", line 38, in <module>
rc = gyp.main(gyp_args)
File "/home/juuso/.node-gyp/0.8.4/tools/gyp/pylib/gyp/__init__.py", line 471, in main
options.circular_check)
File "/home/juuso/.node-gyp/0.8.4/tools/gyp/pylib/gyp/__init__.py", line 111, in Load
depth, generator_input_info, check, circular_check)
File "/home/juuso/.node-gyp/0.8.4/tools/gyp/pylib/gyp/input.py", line 2378, in Load
depth, check)
File "/home/juuso/.node-gyp/0.8.4/tools/gyp/pylib/gyp/input.py", line 358, in LoadTargetBuildFile
includes, True, check)
File "/home/juuso/.node-gyp/0.8.4/tools/gyp/pylib/gyp/input.py", line 208, in LoadOneBuildFile
raise Exception("%s not found (cwd: %s)" % (build_file_path, os.getcwd()))
Exception: binding.gyp not found (cwd: /home/juuso/Dropbox/etsi/node_modules/apricot/node_modules/jsdom/node_modules/contextify) while trying to load binding.gyp
gyp ERR! configure error
gyp ERR! stack Error: `gyp_addon` failed with exit code: 1
gyp ERR! stack at ChildProcess.onCpExit (/usr/lib/nodejs/npm/node_modules/node-gyp/lib/configure.js:290:16)
gyp ERR! stack at ChildProcess.EventEmitter.emit (events.js:91:17)
gyp ERR! stack at Process._handle.onexit (child_process.js:674:10)
gyp ERR! System Linux 3.2.0-27-generic
gyp ERR! command "node" "/usr/lib/nodejs/npm/node_modules/node-gyp/bin/node-gyp.js" "rebuild"
gyp ERR! cwd /home/juuso/Dropbox/etsi/node_modules/apricot/node_modules/jsdom/node_modules/contextify
gyp ERR! node -v v0.8.4
gyp ERR! node-gyp -v v0.6.1
gyp ERR! not ok
npm WARN optional dep failed, continuing [email protected]
npm ERR!
npm ERR! Additional logging details can be found in:
npm ERR! /home/juuso/Dropbox/etsi/npm-debug.log
npm ERR! not ok code 0
When I run 'npm install apricot', I get message that x64 is not supported (though x86_64 is listed as supported). I'm guessing this may simply be a reporting issue, and a false positive? Other npm packages I've installed don't seem to have this issue. Are there any workarounds for me to install apricot on this machine?
Not compatible with your operating system or architecture: [email protected]
Valid OS: linux,macos,win
Valid Arch: x86,ppc,x86_64
Actual OS: linux
Actual Arch: x64
Atleast when using npm the documentation says to use
Apricot.open("url", function(err,doc)
while it really only takes function(doc) in the release (fixed in trunk?)
Making it dirty like this http://github.com/tautologistics/node-htmlparser/blob/master/utils_example.js is so much faster! I have rewritten a piece of code from PHP to node and Apricot is really slow compared to that for big and many HTML files.
This happens, I think, because of superfluous package info.
On Win7x64
process.platform : 'win32'
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.