Hi, I just tried to install your awsome project to an local folder (

Thank you, it was indeed a problem, I fixed it in <a class="commit-link" data-hovercar

Hi <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="

Trafilatura can't be loaded after installing it to local folder about trafilatura HOT 5 CLOSED

adbar commented on May 18, 2024

Trafilatura can't be loaded after installing it to local folder

from trafilatura.

Comments (5)

HaIDsIEx commented on May 18, 2024 1

Hey, thanks for the fast response! I just made a new environment and tried to set up everything. However, (even if the package gets downloaded into the "package"-folder) I get following error:

Traceback (most recent call last):
  File "{path}/function.py", line 1, in <module>
    from package import trafilatura
  File "{path}\package\trafilatura\__init__.py", line 16, in <module>
    from .core import extract, process_record
  File "{path}\package\trafilatura\core.py", line 16, in <module>
    from lxml import etree, html
ModuleNotFoundError: No module named 'lxml'

from trafilatura.

HaIDsIEx commented on May 18, 2024 1

Afaik, lxml needs to be compiled and installed for each machine. Therefore, „portable“ compatibility (install it to a folder and copy it anywhere and run it) can’t be achieved with lxml. In my case I would like to push it on AWS Lambda (localstack; can't install things easily there). I guess it won’t work as long as this project builds on lxml. However, I already found some alternatives for now (I expose your project via a REST-Service on my machine for development purposes). Later (on AWS) I should be able to use EC2 to install all necessary packages such as lxml.

from trafilatura.

adbar commented on May 18, 2024

Thank you, it was indeed a problem, I fixed it in 1a57635, could you please confirm by trying the version straight from the repository? (pip install -U git+https://github.com/adbar/trafilatura.git with your --target)

from trafilatura.

adbar commented on May 18, 2024

Hi, the changes I introduced created a bug on some platforms but I don't think it was the issue here. I guess you face a package managing issue, as lxml should have been installed and added to your Python path. It seems that errors linked to target directories with pip are not fully documented: pypa/pip#8725 Maybe this kind of approach could be useful for you: https://stackoverflow.com/questions/24174821/how-to-change-default-install-location-for-pip/24175174#24175174

from trafilatura.

adbar commented on May 18, 2024

Hi @HaIDsIEx, please refer to this answer and this code snippet, both show how to solve the issue with LXML.

from trafilatura.

Recommend Projects

Trafilatura can't be loaded after installing it to local folder about trafilatura HOT 5 CLOSED

Comments (5)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent