On 08/02, Chao Yu wrote:
> On 2019/8/2 6:27, Jaegeuk Kim wrote:
> > On 08/01, Chao Yu wrote:
> >> On 2019/8/1 12:14, Jaegeuk Kim wrote:
> >>> On 07/31, Chao Yu wrote:
> >>>> On 2019/7/31 2:02, Jaegeuk Kim wrote:
> >>>>> On 07/29, Chao Yu wrote:
> >>>>>> On 2019/7/29 13:57, Jaegeuk Kim wrote:
> >>>>>>> On 07/23, Chao Yu wrote:
> >>>>>>>> On 2019/7/23 10:36, Jaegeuk Kim wrote:
> >>>>>>>>> On 07/19, Chao Yu wrote:
> >>>>>>>>>> Pinning a file is heavy, because skipping pinned files make GC
> >>>>>>>>>> running with heavy load or no effect.
> >>>>>>>>>
> >>>>>>>>> Pinned file is a part of NOCOW files, so I don't think we can 
> >>>>>>>>> simply drop it
> >>>>>>>>> for backward compatibility.
> >>>>>>>>
> >>>>>>>> Yes,
> >>>>>>>>
> >>>>>>>> But what I concerned is that pin file is too heavy, so in order to 
> >>>>>>>> satisfy below
> >>>>>>>> demand, how about introducing pin_file_2 flag to triggering IPU only 
> >>>>>>>> during
> >>>>>>>> flush/writeback.
> >>>>>>>
> >>>>>>> That can be done by cold files?
> >>>>>>
> >>>>>> Then it may inherit property of cold type file, e.g. a) goes into cold 
> >>>>>> area; b)
> >>>>>> update with very low frequency.
> >>>>>>
> >>>>>> Actually pin_file_2 could be used by db-wal/log file, which are updated
> >>>>>> frequently, and should go to hot/warm area, it does not match above 
> >>>>>> two property.
> >>>>>
> >>>>> How about considering another name like "IPU-only mode"?
> >>>>>
> >>>>>               fallocate         write    Flag         GC
> >>>>> Pin_file:     preallocate       IPU      FS_NOCOW_FL  Not allowed
> >>>>> IPU_file:     Not preallocate   IPU      N/A          Default by 
> >>>>> temperature
> >>>>
> >>>> One question, do we need preallocate physical block address for IPU_file 
> >>>> as
> >>>> Pin_file? since it can enhance db file's sequential read performance, 
> >>>> not sure,
> >>>> db can handle random data in preallocated blocks.
> >>>
> >>> db file will do atomic writes, which can not be used with this. -wal may 
> >>> be able
> >>
> >> Now WAL mode were set by default in Android, so most of db file are -wal 
> >> type now.
> > 
> > Will be back again tho.
> 
> R?

Q.

> 
> > 
> >>
> >>> to preallocate blocks, but it can eat disk space unnecessarily.
> >>
> >> I meant .db-wal file rather than .db.
> >>
> >> Yes, that's ext4 style, that would bring better performance due to less 
> >> holes in
> >> block distribution.
> >>
> >> I don't think we need to worry about space issue for db-wal file. I tracked
> >> .db-wal file's update before:
> >> - there are very frequently truncation and deletion, that means the 
> >> preallocated
> >> blocks won't exist for long time.
> >> - and also there are very frequently append writes, I suspect there almost 
> >> very
> >> few preallocate block are not written.
> >> - total db-wal file number is less.
> > 
> > Sometimes it can be large enough for system.
> 
> For this, it's trade off:
> - lose a few disk space at the very begin of db-wal lifecycle Or
> - face fragment and read performance degradation.
> 
> > If it's from user apps and short lived, why do we need preallocation?
> 
> It triggers sequential read on db-wal file during checkpoint, though it's 
> short
> lived, still it can affect performance.
> 
> What do you think of doing some performance test on WAL file to decide the
> preallocation policy?

Good idea. Can we?

> 
> Thanks,
> 
> > 
> >>
> >>>
> >>>>
> >>>> Other behaviors looks good to me. :)
> >>>>
> >>>> I plan to use last bit in inode.i_inline to store this flag.
> >>>
> >>> Why not using i_flag like FS_NOCOW_FL?
> >>
> >> Oops, as you listed in last email, I can see you don't want to break
> >> FS_NOCOW_FL's semantics for backward compatibility.
> >>
> >>                    Flag
> >> IPU_file           N/A                     
> >>
> >> If we plan to use FS_NOCOW_FL, that's what this patch has already did, you 
> >> can
> >> merge it directly... :P
> >>
> >>>
> >>>>
> >>>>> Cold_file:    Not preallocate   IPU      N/A          Move in cold area
> >>>>> Hot_file:     Not preallocate   IPU/OPU  N/A          Move in hot area
> >>>>
> >>>> Should hot file be gced to hot area? That would mix new hot data with 
> >>>> old 'hot'
> >>>> data which actually become cold.
> >>>
> >>> But, user explicitly specified this is hot.
> >>
> >> With current implementation, GC will migrate data from hot/warm/cold area 
> >> to
> >> cold area.
> >>
> >> Thanks,
> >>
> >>>
> >>>>
> >>>> Thanks,
> >>>>
> >>>>>
> >>>>>>
> >>>>>> Thank,
> >>>>>>
> >>>>>>>
> >>>>>>>>
> >>>>>>>>>
> >>>>>>>>>>
> >>>>>>>>>> So that this patch propose to separate nocow and pinfile semantics:
> >>>>>>>>>> - NOCoW flag can only be set on regular file.
> >>>>>>>>>> - NOCoW file will only trigger IPU at common writeback/flush.
> >>>>>>>>>> - NOCow file will do OPU during GC.
> >>>>>>>>>>
> >>>>>>>>>> For the demand of 1) avoid fragment of file's physical block and
> >>>>>>>>>> 2) userspace don't care about file's specific physical address,
> >>>>>>>>>> tagging file as NOCoW will be cheaper than pinned one.
> >>>>>>>>
> >>>>>>>> ^^^
> >>>>>>>>
> >>>>>>>> Thanks,
> >>>>>>>>
> >>>>>>>>>>
> >>>>>>>>>> Signed-off-by: Chao Yu <yuch...@huawei.com>
> >>>>>>>>>> ---
> >>>>>>>>>> v2:
> >>>>>>>>>> - rebase code to fix compile error.
> >>>>>>>>>>  fs/f2fs/data.c |  3 ++-
> >>>>>>>>>>  fs/f2fs/f2fs.h |  1 +
> >>>>>>>>>>  fs/f2fs/file.c | 22 +++++++++++++++++++---
> >>>>>>>>>>  3 files changed, 22 insertions(+), 4 deletions(-)
> >>>>>>>>>>
> >>>>>>>>>> diff --git a/fs/f2fs/data.c b/fs/f2fs/data.c
> >>>>>>>>>> index a2a28bb269bf..15fb8954c363 100644
> >>>>>>>>>> --- a/fs/f2fs/data.c
> >>>>>>>>>> +++ b/fs/f2fs/data.c
> >>>>>>>>>> @@ -1884,7 +1884,8 @@ static inline bool 
> >>>>>>>>>> check_inplace_update_policy(struct inode *inode,
> >>>>>>>>>>  
> >>>>>>>>>>  bool f2fs_should_update_inplace(struct inode *inode, struct 
> >>>>>>>>>> f2fs_io_info *fio)
> >>>>>>>>>>  {
> >>>>>>>>>> -  if (f2fs_is_pinned_file(inode))
> >>>>>>>>>> +  if (f2fs_is_pinned_file(inode) ||
> >>>>>>>>>> +                  F2FS_I(inode)->i_flags & F2FS_NOCOW_FL)
> >>>>>>>>>>            return true;
> >>>>>>>>>>  
> >>>>>>>>>>    /* if this is cold file, we should overwrite to avoid 
> >>>>>>>>>> fragmentation */
> >>>>>>>>>> diff --git a/fs/f2fs/f2fs.h b/fs/f2fs/f2fs.h
> >>>>>>>>>> index 596ab3e1dd7b..f6c5a3d2e659 100644
> >>>>>>>>>> --- a/fs/f2fs/f2fs.h
> >>>>>>>>>> +++ b/fs/f2fs/f2fs.h
> >>>>>>>>>> @@ -2374,6 +2374,7 @@ static inline void f2fs_change_bit(unsigned 
> >>>>>>>>>> int nr, char *addr)
> >>>>>>>>>>  #define F2FS_NOATIME_FL                   0x00000080 /* do not 
> >>>>>>>>>> update atime */
> >>>>>>>>>>  #define F2FS_INDEX_FL                     0x00001000 /* 
> >>>>>>>>>> hash-indexed directory */
> >>>>>>>>>>  #define F2FS_DIRSYNC_FL                   0x00010000 /* dirsync 
> >>>>>>>>>> behaviour (directories only) */
> >>>>>>>>>> +#define F2FS_NOCOW_FL                     0x00800000 /* Do not 
> >>>>>>>>>> cow file */
> >>>>>>>>>>  #define F2FS_PROJINHERIT_FL               0x20000000 /* Create 
> >>>>>>>>>> with parents projid */
> >>>>>>>>>>  
> >>>>>>>>>>  /* Flags that should be inherited by new inodes from their 
> >>>>>>>>>> parent. */
> >>>>>>>>>> diff --git a/fs/f2fs/file.c b/fs/f2fs/file.c
> >>>>>>>>>> index 7ca545874060..ae0fec54cac6 100644
> >>>>>>>>>> --- a/fs/f2fs/file.c
> >>>>>>>>>> +++ b/fs/f2fs/file.c
> >>>>>>>>>> @@ -1692,6 +1692,7 @@ static const struct {
> >>>>>>>>>>    { F2FS_NOATIME_FL,      FS_NOATIME_FL },
> >>>>>>>>>>    { F2FS_INDEX_FL,        FS_INDEX_FL },
> >>>>>>>>>>    { F2FS_DIRSYNC_FL,      FS_DIRSYNC_FL },
> >>>>>>>>>> +  { F2FS_NOCOW_FL,        FS_NOCOW_FL },
> >>>>>>>>>>    { F2FS_PROJINHERIT_FL,  FS_PROJINHERIT_FL },
> >>>>>>>>>>  };
> >>>>>>>>>>  
> >>>>>>>>>> @@ -1715,7 +1716,8 @@ static const struct {
> >>>>>>>>>>            FS_NODUMP_FL |          \
> >>>>>>>>>>            FS_NOATIME_FL |         \
> >>>>>>>>>>            FS_DIRSYNC_FL |         \
> >>>>>>>>>> -          FS_PROJINHERIT_FL)
> >>>>>>>>>> +          FS_PROJINHERIT_FL |     \
> >>>>>>>>>> +          FS_NOCOW_FL)
> >>>>>>>>>>  
> >>>>>>>>>>  /* Convert f2fs on-disk i_flags to FS_IOC_{GET,SET}FLAGS flags */
> >>>>>>>>>>  static inline u32 f2fs_iflags_to_fsflags(u32 iflags)
> >>>>>>>>>> @@ -1753,8 +1755,6 @@ static int f2fs_ioc_getflags(struct file 
> >>>>>>>>>> *filp, unsigned long arg)
> >>>>>>>>>>            fsflags |= FS_ENCRYPT_FL;
> >>>>>>>>>>    if (f2fs_has_inline_data(inode) || 
> >>>>>>>>>> f2fs_has_inline_dentry(inode))
> >>>>>>>>>>            fsflags |= FS_INLINE_DATA_FL;
> >>>>>>>>>> -  if (is_inode_flag_set(inode, FI_PIN_FILE))
> >>>>>>>>>> -          fsflags |= FS_NOCOW_FL;
> >>>>>>>>>>  
> >>>>>>>>>>    fsflags &= F2FS_GETTABLE_FS_FL;
> >>>>>>>>>>  
> >>>>>>>>>> @@ -1794,6 +1794,22 @@ static int f2fs_ioc_setflags(struct file 
> >>>>>>>>>> *filp, unsigned long arg)
> >>>>>>>>>>    if (ret)
> >>>>>>>>>>            goto out;
> >>>>>>>>>>  
> >>>>>>>>>> +  if ((fsflags ^ old_fsflags) & FS_NOCOW_FL) {
> >>>>>>>>>> +          if (!S_ISREG(inode->i_mode)) {
> >>>>>>>>>> +                  ret = -EINVAL;
> >>>>>>>>>> +                  goto out;
> >>>>>>>>>> +          }
> >>>>>>>>>> +
> >>>>>>>>>> +          if (f2fs_should_update_outplace(inode, NULL)) {
> >>>>>>>>>> +                  ret = -EINVAL;
> >>>>>>>>>> +                  goto out;
> >>>>>>>>>> +          }
> >>>>>>>>>> +
> >>>>>>>>>> +          ret = f2fs_convert_inline_inode(inode);
> >>>>>>>>>> +          if (ret)
> >>>>>>>>>> +                  goto out;
> >>>>>>>>>> +  }
> >>>>>>>>>> +
> >>>>>>>>>>    ret = f2fs_setflags_common(inode, iflags,
> >>>>>>>>>>                    f2fs_fsflags_to_iflags(F2FS_SETTABLE_FS_FL));
> >>>>>>>>>>  out:
> >>>>>>>>>> -- 
> >>>>>>>>>> 2.18.0.rc1
> >>>>>>>>> .
> >>>>>>>>>
> >>>>>>> .
> >>>>>>>
> >>>>> .
> >>>>>
> >>> .
> >>>
> > .
> > 


_______________________________________________
Linux-f2fs-devel mailing list
Linux-f2fs-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/linux-f2fs-devel

Reply via email to