> As we know, the observability is very important for a service. I think > we might need to define and export some metric to let people know the > ovfs daemon service status > > For now, we have some layer out of box in the OpenDAL(like Prometheus, > OpenTelementry, Dtrace). I'm not sure we should add more metric like > cache hit rate and anything else or not.
Hi, I agree that observability is important. However, it seems we're going too far. The feature requests keep extending, making it a never-ending project. I suggest we leave them as points for future expansion. On Wed, Mar 20, 2024, at 23:24, Manjusaka wrote: > On 2024/3/20 22:33, Runjie Yu wrote: >> Got it, thanks for your suggestions, I'll keep this in mind. >> >> I've put the main content of the proposal in a Google Docs, here's the link >> https://docs.google.com/document/d/1huy8vHcoCTf-GausabR3PCwXIXfRJAEckeyTulp2gDU/edit?usp=sharing >> >> I've specified in the deliverables section a description of the target for >> each storage type, S3 for object storage and HDFS for file storage. >> >> ``` >> 1) A code repository that implements the functions described in the project >> details. The services implemented by OVFS in the code repository need to >> meet the following requirements: (1) VirtioFS implementation, well >> integrated with VMs and QEMU, able to correctly handle VMs read and write >> requests to the file system. (2) Supports the use of distributed object >> storage systems and distributed file systems as storage backends, and >> provides complete and correct support for at least one specific storage >> service type for each storage system type. S3 can be used as the target for >> object storage systems, and HDFS can be used as the target for distributed >> file systems. (3) Supports related configurations of various storage >> systems. Users can configure storage system access and use according to >> actual needs. When an error occurs, users can use the configuration file to >> restart services. >> ``` >> >> On Tue, Mar 19, 2024 at 10:41 PM Manjusaka <[email protected]> wrote: >> >>> On 2024/3/19 20:57, 余润杰 wrote: >>>> Thank you for your suggestion. >>>> >>>> For the first point, I will update this in the proposal with an exact >>> goal for each storage system type. For the second point, I assume this >>> cache is shared by all VMs running in the same host OS. >>>> >>>> Regarding cloud documents, I think this is a very good suggestion. Yes, >>> I need to create and maintain a cloud document. This is not only easy to >>> browse, but by updating and maintaining this document during the GSoC >>> cycle, it helps us focus on our goals and demonstrate the phased results of >>> development. >>>> >>>> For now, I will create a Google Docs tomorrow to display the content of >>> the existing proposal. >>>> >>>> Thanks again for your advice! >>>> >>>> Manjusaka <[email protected] <mailto:[email protected]>> >>> 于2024年3月19日周二 17:48写道: >>>> >>>> On 2024/3/19 16:58, 余润杰 wrote: >>>> > Hi, Xuanwo and Manjusaka. >>>> > >>>> > I hope this email didn’t bother you! >>>> > >>>> > Applications for GSoC 2024 contributors opened today, and I hope >>> to join the GSoC project in Apache OpenDAL as a candidate. I have added you >>> to the list of mentors for the ovfsproject proposal and hope to have the >>> opportunity to be mentored by you! >>>> > >>>> > /Project Mentors: Xuanwo ([email protected] <mailto: >>> [email protected]> <mailto:[email protected] <mailto:[email protected]>>), >>> Manjusaka ([email protected] <mailto:[email protected]> <mailto: >>> [email protected] <mailto:[email protected]>>)/ >>>> > >>>> > I have supplemented and modified some of the content based on >>> previous proposal, mainly including the following points: >>>> > >>>> > 1) Based on the discussion in the previous email, the name of the >>> project was changed from ovirtiofs to ovfs. >>>> > >>>> > 2) Added explanation of ovfs design philosophy. >>>> > >>>> > 3) Avoid ovfs persisting any metadata. >>>> > >>>> > 4) Added potential application scenarios. >>>> > >>>> > 5) Added project deliverables. >>>> > >>>> > 6) Added Why Me And Why Do I Wish To Take Part In GSoC 2024 >>> section. >>>> > >>>> > I hope to submit the proposal this week. I'd like to know if there >>> are still areas that need to be revised or discussed before the proposal is >>> formally submitted. >>>> > >>>> > Have a nice day! >>>> >>>> Hi Runjie >>>> >>>> Glad to hear from you! >>>> >>>> Nice proposal! BTW maybe you can upload the document to a website >>> like Google Docs, Gist, so other people can preview the docs online(LOL >>>> >>>> Most LGTM about this version proposal, I may have some >>> issues/suggestions >>>> >>>> 1. We can make our target to implement only one service backend for >>> each category of the service(like S3 for blob, HDFS for file like, KV >>> storage is not in the plan). This will help us to focus on the function, >>> not the corner behavior. >>>> It would also can help us to reach the full-fuction tested target(I >>> think it's important for us) >>>> >>>> 2. About the cache, I would like to ask: Is the cache shared by all >>> the VM? or each VM would have their own cache >>>> >>>> >>>> Thanks for your nice proposal, have a nice day >>>> >>>> Best >>>> >>>> Manjusaka >>>> >>> >>> Sorry, I forget something in the previous email >>> >>> About the cache, I have another suggestion. I think we should split it >>> into two parts: the read cache and the write cache. The people can choose >>> to enable the cache base on their circumstance >>> >>> For example, if the user mount a S3 bucket as backend which is modified in >>> high frequency(modified by other serivce), the people shouldn't enable the >>> read cache. >>> >>> I think this is would good for the production usage. >>> >>> Best >>> >>> Manjusaka >>> >> > > Thanks for your public docs, I would like to say this is the most > extraordinary proposal I have ever seen before. Great Job! > > BTW, I'm not sure the following should be included into the original > proposal, for now, this is just personal idea. > > As we know, the observability is very important for a service. I think > we might need to define and export some metric to let people know the > ovfs daemon service status > > For now, we have some layer out of box in the OpenDAL(like Prometheus, > OpenTelementry, Dtrace). I'm not sure we should add more metric like > cache hit rate and anything else or not. > > WDYT? > > Best > > Manjusaka -- Xuanwo
