storage: fix ondemand read may cause IO err when enable prefetch #468


Merged
merged 1 commit into dragonflyoss:master on Jun 7, 2022

Conversation

@kevinXYin (Contributor)

For the fscache scenario, when prefetch workers set data chunks to pending,
the on-demand read procedure does not wait for these pending chunks to
be downloaded and persisted before replying to cread.

Make fetch_range_uncompressed() also wait for inflight IO to complete
for fscache.

Signed-off-by: Xin Yin <[email protected]>
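To make the failure mode concrete, here is a minimal sketch of the race, with hypothetical names (the real logic lives around FileCacheEntry::do_fetch_chunks in the storage crate; this is not the actual nydus code):

```rust
// Hypothetical sketch of the buggy flow described above.
//
//   prefetch worker                     on-demand read (replies cread)
//   ---------------                     ------------------------------
//   mark chunks pending                 fetch_range_uncompressed()
//   start backend download ...            -> pending list comes back empty,
//                                            since the prefetch worker owns
//                                            the download
//   ... write to cache file               -> returns Ok(0) immediately;
//                                            cread is answered before the
//                                            data is persisted -> IO error
fn buggy_fetch(pending: Vec<u32>) -> std::io::Result<usize> {
    if pending.is_empty() {
        // BUG: "empty" only means no chunk needs *this* thread to download
        // it; chunks may still be inflight elsewhere and not yet persisted.
        return Ok(0);
    }
    // ... download and persist the chunks in `pending` ...
    Ok(pending.len())
}
```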

@hsiangkao (Contributor) left a comment

I think we could do it like this as a start.

```diff
@@ -355,7 +355,16 @@ impl FileCacheEntry {
             None => return Ok(0),
             Some(v) => {
                 if v.is_empty() {
-                    return Ok(0);
+                    if wait_inflight {
```
```rust
if wait_inflight && !bitmap.wait_for_range_ready(chunk_index, count)? {
    return Err()
} else {
    return Ok(0)
}
```

?

@kevinXYin (Contributor, Author)

Yeah, makes sense, will update soon.

```diff
@@ -355,7 +355,16 @@ impl FileCacheEntry {
             None => return Ok(0),
             Some(v) => {
                 if v.is_empty() {
-                    return Ok(0);
+                    if wait_inflight {
+                        // we get inflight io , wait them
```
@imeoer (Collaborator), Jun 6, 2022

we get inflight io, wait for them

@kevinXYin (Contributor, Author)

Thanks, will fix it.

@changweige (Contributor)

It seems the two read paths into do_fetch_chunks can both read from the remote storage backend, since check_range_ready_and_mark_pending is called simultaneously to check whether the target chunks are ready, ending up with duplicated reads from the backend. The two read paths both have non-empty pending vectors.
So this problem exists not only for fscache usage but also for FUSE.
I suppose there is no need to add the wait_inflight flag to do_fetch_chunks.

For this single patch, I think an empty pending means all chunks are ready in the local cache file, so the wait is needless.
We should check whether chunks are in the process of being read from remote storage and wait there if they are.
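For reference, a minimal sketch of how a check_range_ready_and_mark_pending-style primitive typically behaves (hypothetical types and signature, not the nydus implementation): the first caller to find a chunk neither ready nor pending takes ownership of downloading it, while concurrent callers get an empty list back, which is exactly why an empty pending list does not imply the data is on disk.

```rust
// Hypothetical check-and-mark primitive (assumed names and signature).
use std::collections::HashSet;
use std::sync::Mutex;

struct ChunkState {
    ready: Mutex<HashSet<u32>>,   // chunks persisted in the cache file
    pending: Mutex<HashSet<u32>>, // chunks some thread is downloading
}

impl ChunkState {
    /// Return the chunks the caller must download itself, marking them
    /// pending. Chunks already ready or already pending are filtered out,
    /// so a concurrent caller can receive an empty vector while the data
    /// is still being downloaded by another thread.
    fn check_range_ready_and_mark_pending(&self, chunks: &[u32]) -> Vec<u32> {
        let ready = self.ready.lock().unwrap();
        let mut pending = self.pending.lock().unwrap();
        chunks
            .iter()
            .copied()
            .filter(|c| !ready.contains(c) && pending.insert(*c))
            .collect()
    }
}
```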

@kevinXYin (Contributor, Author)

> It seems the two read paths into do_fetch_chunks can both read from the remote storage backend, since check_range_ready_and_mark_pending is called simultaneously to check whether the target chunks are ready, ending up with duplicated reads from the backend. The two read paths both have non-empty pending vectors. So this problem exists not only for fscache usage but also for FUSE. I suppose there is no need to add the wait_inflight flag to do_fetch_chunks.
>
> For this single patch, I think an empty pending means all chunks are ready in the local cache file, so the wait is needless. We should check whether chunks are in the process of being read from remote storage and wait there if they are.

AFAIK, an empty 'pending' cannot guarantee that all chunks are persisted in the local file if we already have chunks inflight? Please correct me if I am wrong.

@changweige (Contributor)

> AFAIK, an empty 'pending' cannot guarantee that all chunks are persisted in the local file if we already have chunks inflight? Please correct me if I am wrong.

I checked the implementation of IndexedChunkMap::check_range_ready_and_mark_pending; no inflight tracer is working or holding inflight states there. So an empty pending, at least for IndexedChunkMap, has no effect of waiting for inflight IO.

Moreover, I suppose do_fetch_chunks should be a synchronous operation. Maybe we should clarify its semantics.
Then a simpler method might be just removing the return statement when pending is empty.

@changweige (Contributor)

> AFAIK, an empty 'pending' cannot guarantee that all chunks are persisted in the local file if we already have chunks inflight? Please correct me if I am wrong.

I see. There is a new type BlobStateMap wrapping IndexedChunkMap and the other one. Struct BlobStateMap has an inflight_tracer. BlobStateMap, IndexedChunkMap, and the other type of ChunkMap all implement RangeMap.
And only BlobStateMap is instantiated when creating a cache object.

@jiangliu Should we define do_fetch_chunks as a synchronous operation?
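As a rough illustration of the layering described above (type and field names follow the discussion; everything else is assumed, not the real definitions): the wrapper owns the inflight tracer, so waiting on inflight IO is only possible through BlobStateMap, not through the inner map.

```rust
// Hypothetical sketch of the BlobStateMap layering.
use std::collections::HashMap;
use std::sync::{Arc, Condvar, Mutex};

struct Slot {
    done: Mutex<bool>,
    cond: Condvar,
}

struct BlobStateMap<C> {
    inner: C, // e.g. IndexedChunkMap, which only tracks the ready bitmap
    // chunk index -> wait slot owned by the thread downloading that chunk
    inflight_tracer: Mutex<HashMap<u32, Arc<Slot>>>,
}

impl<C> BlobStateMap<C> {
    /// Block until `chunk` is no longer inflight; a no-op if nobody is
    /// downloading it.
    fn wait_for_chunk(&self, chunk: u32) {
        let slot = self.inflight_tracer.lock().unwrap().get(&chunk).cloned();
        if let Some(s) = slot {
            let mut done = s.done.lock().unwrap();
            while !*done {
                done = s.cond.wait(done).unwrap();
            }
        }
    }
}
```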

@jiangliu (Collaborator) commented on Jun 6, 2022

> I see. There is a new type BlobStateMap wrapping IndexedChunkMap and the other one. Struct BlobStateMap has an inflight_tracer. BlobStateMap, IndexedChunkMap, and the other type of ChunkMap all implement RangeMap. And only BlobStateMap is instantiated when creating a cache object.
>
> @jiangliu Should we define do_fetch_chunks as a synchronous operation?

Those fetch_xxx methods defined by the BlobObject trait have no output buffer to receive the data, so all data must be written to the underlying cache files synchronously; otherwise the client may get stale data.
We should make do_fetch_chunks a synchronous operation.
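A sketch of the constraint being stated, with an assumed signature (the actual trait lives in the storage crate and may differ): because no output buffer is passed in, the cache file is the only channel back to the caller, so the method must not return before the data is written there.

```rust
use std::io::Result;

// Assumed shape of the BlobObject fetch methods, for illustration only.
pub trait BlobObject {
    /// Fetch the uncompressed range [offset, offset + size) into the local
    /// cache file. No output buffer: the caller reads the cache file after
    /// this returns, so returning early would expose stale or missing data.
    fn fetch_range_uncompressed(&self, offset: u64, size: u64) -> Result<usize>;
}
```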

@kevinXYin (Contributor, Author)

> Those fetch_xxx methods defined by the BlobObject trait have no output buffer to receive the data, so all data must be written to the underlying cache files synchronously; otherwise the client may get stale data. We should make do_fetch_chunks a synchronous operation.

Referring to the comment for the BlobObject trait, it seems only fetch_range_uncompressed needs to make sure all data ranges are ready. Does that mean the other methods can return before the data is persisted in a local file?

And if do_fetch_chunks is called from a prefetch worker, is it still necessary to wait for all data to be ready? In that case the data does not need to be used immediately.

@jiangliu (Collaborator) commented on Jun 6, 2022

> Referring to the comment for the BlobObject trait, it seems only fetch_range_uncompressed needs to make sure all data ranges are ready. Does that mean the other methods can return before the data is persisted in a local file?
>
> And if do_fetch_chunks is called from a prefetch worker, is it still necessary to wait for all data to be ready? In that case the data does not need to be used immediately.

It seems better to enforce synchronous semantics for all fetch_xxx methods in BlobObject, to avoid possible data corruption issues. If it turns out that prefetch suffers from the synchronous semantics, we may add a new API for it. And we are trying to enable async IO recently, which will help too.

@kevinXYin (Contributor, Author)

> It seems better to enforce synchronous semantics for all fetch_xxx methods in BlobObject, to avoid possible data corruption issues. If it turns out that prefetch suffers from the synchronous semantics, we may add a new API for it. And we are trying to enable async IO recently, which will help too.

OK, so for this patch we just drop wait_inflight, and if 'pending' is empty, call wait_for_range_ready to wait for all chunks to be ready. Does that make sense?

And I still worry that this may cause a performance drop for prefetch workers.

@jiangliu (Collaborator) commented on Jun 7, 2022

> OK, so for this patch we just drop wait_inflight, and if 'pending' is empty, call wait_for_range_ready to wait for all chunks to be ready. Does that make sense?
>
> And I still worry that this may cause a performance drop for prefetch workers.

The prefetch performance issue could be traded off by adding more working threads, and we are working on enabling an async IO framework, which will then fix the issue.
#467

For the fscache scenario, when prefetch workers set data chunks to pending,
the on-demand read procedure does not wait for these pending chunks to
be downloaded and persisted before replying to cread.

Make do_fetch_chunks() also wait for inflight IO to complete.

Signed-off-by: Xin Yin <[email protected]>
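A minimal sketch of the merged behavior, assuming helper names from the review thread (check_range_ready_and_mark_pending, wait_for_range_ready); error handling and the surrounding types are assumptions, not the exact nydus source:

```rust
use std::io::{Error, ErrorKind, Result};

// Assumed chunk-map interface for the sketch.
trait RangeMapSketch {
    /// None: the whole range is ready; Some(v): the chunks in `v` must be
    /// downloaded by the caller (empty `v` means others own the downloads).
    fn check_range_ready_and_mark_pending(&self, index: u32, count: u32) -> Result<Option<Vec<u32>>>;
    /// Block until the range is ready; false on timeout.
    fn wait_for_range_ready(&self, index: u32, count: u32) -> Result<bool>;
}

fn do_fetch_chunks_sketch(map: &dyn RangeMapSketch, index: u32, count: u32) -> Result<usize> {
    match map.check_range_ready_and_mark_pending(index, count)? {
        None => Ok(0), // already ready in the cache file
        Some(v) if v.is_empty() => {
            // Chunks are inflight in another thread (e.g. a prefetch
            // worker): wait until they are downloaded and persisted
            // before reporting success, instead of returning early.
            if map.wait_for_range_ready(index, count)? {
                Ok(0)
            } else {
                Err(Error::new(ErrorKind::TimedOut, "waiting for inflight chunks"))
            }
        }
        Some(v) => download_and_persist(&v),
    }
}

// Backend read + cache-file write are elided in this sketch.
fn download_and_persist(chunks: &[u32]) -> Result<usize> {
    Ok(chunks.len())
}
```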
@kevinXYin (Contributor, Author)

> The prefetch performance issue could be traded off by adding more working threads, and we are working on enabling an async IO framework, which will then fix the issue. #467

Thanks, updated.

@changweige (Contributor) left a comment

LGTM

@changweige (Contributor)

> And I still worry that this may cause a performance drop for prefetch workers.

@kevinXYin
In fact, prefetch can also be an evil thing depending on your production environment, since prefetch can steal network bandwidth from normal user business IO. So prefetch IO is usually limited, especially when many nydusd processes are started on your nodes and cluster-wide.

So I personally don't think prefetch performance is a big concern.

@kevinXYin (Contributor, Author)

Thanks, understood.

@imeoer imeoer merged commit 081f609 into dragonflyoss:master Jun 7, 2022