Original idea from <a class="issue-link js-issue-link" data-error-text="Failed to load

On Mon, Apr 25, 2016 at 1:43 AM, Jakub Kruszona-Zawadzki < <a href="mailto:noti

Original idea from <a class="issue-link js-issue-link" data-error-text="F

Lazy chunk removal about moosefs HOT 5 OPEN

moosefs commented on May 18, 2024

Lazy chunk removal

from moosefs.

Comments (5)

acid-maker commented on May 18, 2024

I'll think about it. Removal of overgoal chunks and also system rebalance might be suspended for short (configurable) period of time after connecting of new chunkserver (or after adding HDD with chunks). The question is: How long should it be suspended? Hour?

The other question is. Is it really necessary? When disk went offline it is good idea to attach it in "mark for removal" mode because usually this means that such disk is broken and should be replaced asap. In what scenario disk that went offline will be attached again as "normal" disk?

from moosefs.

kiatoa commented on May 18, 2024

On Mon, Apr 25, 2016 at 1:43 AM, Jakub Kruszona-Zawadzki <
[email protected]> wrote:

I'll think about it. Removal of overgoal chunks and also system rebalance
might be suspended for short (configurable) period of time after connecting
of new chunkserver (or after adding HDD with chunks). The question is: How
long should it be suspended? Hour?

The other question is. Is it really necessary? When disk went offline it
is good idea to attach it in "mark for removal" mode because usually this
means that such disk is broken and should be replaced asap. In what
scenario disk that went offline will be attached again as "normal" disk?

I admit my use case is not the norm for Moosefs. I have master + two chunk
servers in the house and a chunkserver + metadata mirror in an outbuilding.
The chunk servers are on cubieboards. The connection to the outbuilding was
via wireless and it was somewhat unreliable. The purpose was to ensure that
if something bad happened to the house (break in, fire etc.) the data would
be saved in the out building. My hope was that using topology file and
setting goal of three on really important files would result in safe
storage with acceptable performance. My goals for performance are very
modest but even so the remote chunkserver over wireless impacted
performance severely. I ran a network cable and performance is now ok.

My goal with Moosefs is cheap, trustworthy storage with low stress and
burden. Moose has excelled in this for me for quite a few years now. When
disks die, replacing them is trivial, mostly automatic and overall a very
low burden to me. When a disk dies in a btrfs or raid based system it is
relatively complicated to recover. This quality of Moosefs is absolutely
wonderful.

I think removing chunks could possibly be driven by a free space parameter.
I have ~5.5T of raw space of which I'm only using 2.5T. In my ideal world
I would set a free space threshold under which Moosefs would not bother to
remove chunks. This threshold would be distributed across all disks. For
example I'd set a free space threshold of 1.5T and each chunkserver would
trigger disk cleanup only if there was less than 500G of space available. I
suspect this might help performance in some situations as I've seen Moosefs
get very slow when re-balancing.

Anyhow, this is a very low priority suggestion that I thought I'd just
mention. I'm very satisfied with Moosefs and grateful for it's being made
available via open source. Thanks much for that.

—
You are receiving this because you are subscribed to this thread.
Reply to this email directly or view it on GitHub
#10 (comment)

from moosefs.

nrm21 commented on May 18, 2024

My goal with Moosefs is cheap, trustworthy storage with low stress and
burden. Moose has excelled in this for me for quite a few years now. When
disks die, replacing them is trivial, mostly automatic and overall a very
low burden to me. When a disk dies in a btrfs or raid based system it is
relatively complicated to recover. This quality of Moosefs is absolutely
wonderful.

I'd just like to reiterate this myself. When I started with MooseFS over a year ago it was a bit of a learning curve and setup. But it was time well spent. Once up and running MooseFS was very hands off and continues to amaze me how it just sits there and works without needing constant attention or a sysadmin to tweak it all day. It truly amazes me how many storage and sysadmins these days still mess with RAID5 RAID6 or RAID anything really. The "era of Ceph storage" seems to finally be here (even if Ceph isn't the software we've all chosen here, it just seens to be the trendy one all the enterprises are talking and writing articles about).

It's just sooo much easier to handle the inevitable failure (that will happen) when it does happen, by letting a filesystem auto move around and rebalance your data without your intervention, than trying to rebuild disks and hope you don't loose another one in the re-silvering process. I too use MooseFS at my home as a backup for just about all my stuff (I also keep an offsite second backup of my truly critical data). I also use MFS as my datastore for my home ESXi box. And I'm testing the possibility of using it at work as well.

Thanks guys for all your hard work! And keep it up. :)

from moosefs.

borkd commented on May 18, 2024

I'll think about it. Removal of overgoal chunks and also system rebalance might be suspended for short (configurable) period of time after connecting of new chunkserver (or after adding HDD with chunks). The question is: How long should it be suspended? Hour?

If there was a way to track the "health" of a chunkserver/disk pairs in terms of current uptime, recent number of disconnects and IO errors in the last 1h/12h/24h (pick any reasonable timeframe here) against admin specified thresholds the decision of which cutoff to pick could be easier to automate?

A preferred alternative to embedding all such logic in mfschunkserver and mfsmaster would be to have an out-of-band "health" checking capability. Periodical poll of a sqlite database, external executable, or web api call that provides the caller with sets of operational parameters for maintenance modes, predicted failures, along with TTLs for how long any of such overrides are to be considered valid would enable automation and customization for a variety of deployments. Value of this would extend beyond deciding whether to remove chunks lazily, rapidly or with default speed.

With all that said, hats off to Moosefs for an already great solution.

from moosefs.

borkd commented on May 18, 2024

Original idea from #8:

I think an option to make removing extra chunks more lazy would be good.
Scenario is that a disk goes offline and gets replicated then comes online and gets removed, then it happens again. All this churn would be reduced by more lazy removal of extra chunks.

Another scenario: entire datacenter/rack worth of chunkservers disappears due to unplanned outage, remaining locations bring replicas to defined levels and everything continues as normal. When datacenter reappears master rapidly deletes known good copies, not knowing that the outage resulted from a cooling failure and that dozens of disks in the originally offlined location will start returning I/O errors next time filesystem activity or scan hits some or any chunks stored on them. This tightly couples with trust model for .chunkdb proposed in #165. This is not a hypothetical scenario, btw.

from moosefs.

Lazy chunk removal about moosefs HOT 5 OPEN

Comments (5)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent