[ https://issues.apache.org/jira/browse/HBASE-14439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ben Lau updated HBASE-14439: ---------------------------- Description: Ticket for work in progress on new FileSystem abstractions. Previously, we (Yahoo) submitted a ticket that would add support for humongous (1 million region+) tables via a hierarchical layout (HBASE-13991). However open source is moving in a similar but not identical direction in the future and so the patch will not be merged into open source. We will be working on a different patch now with folks from open source. It will create/add to 2 layers-- a path abstraction layer and a use-oriented abstraction layer. The path abstraction layer is epitomized by classes like FsUtils (and in the patch new classes like AFsLayout). The use oriented abstraction layer is epitomized by existing classes like MasterFileSystem/HRegionFileSystem (and possibly new classes later) that build on the path abstraction layer and focus on 'doing things' (eg creating regions) and less on the gritty details like the paths. This work on abstracting and isolating the paths from the use cases will help Yahoo not diverge too much from open source with its internal 'Humongous' table hierarchical layout, while also helping open source move further towards the eventual goal of redoing the FS layout in a similar (but different) hierarchical layout later that focuses on data directory uniformity (unlike the humongous patch) and storing hierarchy in the meta table instead which enables new optimizations (see HBASE-14090.) Attached to this ticket is some work we've done at Yahoo so far that will be put into an open source HBase branch for further collaboration. The patch is not meant to be complete yet and is a work in progress. (Please wait on patch comments/reviews.) It also includes some Yahoo-specific 'humongous' layout code that will be removed before submission in open source. was: Ticket for work in progress on new FileSystem abstractions. Previously, we (Yahoo) submitted a ticket that would add support for humongous (1 million region+) tables via a hierarchical layout (HBASE-13991). However open source is moving in a similar but not identical direction in the future and so the patch will not be merged into open source. We will be working with Cloudera on a different patch now. It will create/add to 2 layers-- a path abstraction layer and a use-oriented abstraction layer. The path abstraction layer is epitomized by classes like FsUtils (and in the patch new classes like AFsLayout). The use oriented abstraction layer is epitomized by existing classes like MasterFileSystem/HRegionFileSystem (and possibly new classes later) that build on the path abstraction layer and focus on 'doing things' (eg creating regions) and less on the gritty details like the paths. This work on abstracting and isolating the paths from the use cases will help Yahoo not diverge too much from open source with its internal 'Humongous' table hierarchical layout, while also helping open source move further towards the eventual goal of redoing the FS layout in a similar (but different) hierarchical layout later that focuses on data directory uniformity (unlike the humongous patch) and storing hierarchy in the meta table instead which enables new optimizations (see HBASE-14090.) Attached to this ticket is some work we've done at Yahoo so far that will be put into an open source HBase branch for further collaboration. The patch is not meant to be complete yet and is a work in progress. (Please wait on patch comments/reviews.) It also includes some Yahoo-specific 'humongous' layout code that will be removed before submission in open source. > New/Improved Filesystem Abstractions > ------------------------------------ > > Key: HBASE-14439 > URL: https://issues.apache.org/jira/browse/HBASE-14439 > Project: HBase > Issue Type: Sub-task > Reporter: Ben Lau > Assignee: Matteo Bertozzi > Attachments: abstraction.patch > > > Ticket for work in progress on new FileSystem abstractions. Previously, we > (Yahoo) submitted a ticket that would add support for humongous (1 million > region+) tables via a hierarchical layout (HBASE-13991). However open source > is moving in a similar but not identical direction in the future and so the > patch will not be merged into open source. > We will be working on a different patch now with folks from open source. It > will create/add to 2 layers-- a path abstraction layer and a use-oriented > abstraction layer. The path abstraction layer is epitomized by classes like > FsUtils (and in the patch new classes like AFsLayout). The use oriented > abstraction layer is epitomized by existing classes like > MasterFileSystem/HRegionFileSystem (and possibly new classes later) that > build on the path abstraction layer and focus on 'doing things' (eg creating > regions) and less on the gritty details like the paths. > This work on abstracting and isolating the paths from the use cases will help > Yahoo not diverge too much from open source with its internal 'Humongous' > table hierarchical layout, while also helping open source move further > towards the eventual goal of redoing the FS layout in a similar (but > different) hierarchical layout later that focuses on data directory > uniformity (unlike the humongous patch) and storing hierarchy in the meta > table instead which enables new optimizations (see HBASE-14090.) > Attached to this ticket is some work we've done at Yahoo so far that will be > put into an open source HBase branch for further collaboration. The patch is > not meant to be complete yet and is a work in progress. (Please wait on > patch comments/reviews.) It also includes some Yahoo-specific 'humongous' > layout code that will be removed before submission in open source. -- This message was sent by Atlassian JIRA (v6.3.4#6332)