If MIGRATION event is received then we should first abort the crawler and then persist

Do this once <a class="issue-link js-issue-link" data-error-text="Failed to load title

Implemented in <a class="issue-link js-issue-link" data-error-text="Failed to load tit

Better shutdown behavior on MIGRATION event about crawlee HOT 6 CLOSED

apify commented on July 28, 2024

Better shutdown behavior on MIGRATION event

from crawlee.

Comments (6)

jancurn commented on July 28, 2024

Also, AutoscaledPool should probably stop launching new tasks. And when all tasks are finished, it could just exit the process with an error exit code, so that the task is migrated to new server.

from crawlee.

jancurn commented on July 28, 2024

BTW if actor doesn't have "Restart on error" flag set, then we shouldn't send any migration event, since the actor won't be able to handle it anyway

from crawlee.

mtrunkat commented on July 28, 2024

I think that those are two different cases. "Restart on error" means that error in actor is something that may occasionally happen as, for example, memory exceeded when doing big long crawl.

Some of the actors for example - actor that aggregates data from a dataset and sends email is something I want to restart on migration but not on error - because error should not happen there.

from crawlee.

jancurn commented on July 28, 2024

You're right, this should probably be other setting - e.g. "Fail on migration"

from crawlee.

mnmkng commented on July 28, 2024

Do this once #195 is done.

from crawlee.

mnmkng commented on July 28, 2024

Implemented in #268

from crawlee.

Better shutdown behavior on MIGRATION event about crawlee HOT 6 CLOSED

Comments (6)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent