Re: [Pulp-dev] Importers/Exporters

Justin Sherrill Wed, 19 Feb 2020 07:30:14 -0800


On 2/14/20 1:09 PM, David Davis wrote:

Grant and I met today to discuss importers and exporters[0] and we'd like some feedback before we proceed with the design. To sum up this feature briefly: users can export a repository version from one Pulp instance and import it to another.
# Master/Detail vs Core
So one fundamental question is whether we should use a Master/Detail approach or just have core control the flow but call out to plugins to get export formats.
To give some background: we currently define Exporters (i.e. FileSystemExporter) in core as Master models. Plugins extend this model, which allows them to configure or customize the Exporter. This was necessary because some plugins need to export Publications (along with repository metadata) while other plugins that don't have Publications or metadata export RepositoryVersions.
The other option is to have core handle the workflow. The user would call a core endpoint and provide a RepositoryVersion. This would work because for importing/exporting, you wouldn't ever use Publications because metadata won't be used for importing back into Pulp. If needed, core could provide a way for plugin writers to write custom handlers/exporters for content types.
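As a rough illustration of the second option, here is a minimal sketch of core driving the export while plugins register per-content-type handlers. Everything in it is an assumption for illustration only: the registry, the `register_export_handler` decorator, the `export_repository_version` function, and the content type names are hypothetical and are not part of Pulp's API.

```python
import json
from typing import Callable, Dict, List

# Hypothetical registry: content type name -> plugin-supplied serializer.
_HANDLERS: Dict[str, Callable[[dict], dict]] = {}


def register_export_handler(content_type: str):
    """Decorator a plugin could use to customize export of one content type."""
    def decorator(func: Callable[[dict], dict]) -> Callable[[dict], dict]:
        _HANDLERS[content_type] = func
        return func
    return decorator


def default_handler(unit: dict) -> dict:
    """Fallback used by core when no plugin handler is registered."""
    return dict(unit)


def export_repository_version(units: List[dict]) -> str:
    """Core owns the workflow; plugins only decide the per-unit format."""
    records = []
    for unit in units:
        handler = _HANDLERS.get(unit["type"], default_handler)
        records.append(handler(unit))
    return json.dumps(records, sort_keys=True)


# A plugin-side handler (hypothetical type name and fields).
@register_export_handler("rpm.package")
def export_rpm_package(unit: dict) -> dict:
    return {"type": unit["type"], "nevra": unit["nevra"]}
```

With this shape, the user only ever talks to core, and a plugin that needs nothing special registers no handler at all.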
If we go with the second option, the question then becomes whether we should divorce the concept of Exporters and import/export. Or do we also switch Exporters from Master/Detail to core only?
# Foreign Keys
Content can be distributed across multiple tables (e.g. UpdateRecord has UpdateCollection, etc). In our export, we could either use primary keys (UUIDs) or natural keys to relate records. The former assumes that UUIDs are unique across Pulp instances. The safer but more complex alternative is to use natural keys. This would involve storing a set of fields on a record that would be used to identify a related record.
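To make the natural-key alternative concrete, here is a minimal sketch using plain dictionaries in place of database rows. The field names (`advisory_id`, `version`, `update_record_pk`) and the choice of `RECORD_KEY` are assumptions made for the example, not the actual UpdateRecord schema.

```python
from typing import Dict, List, Tuple

# Hypothetical natural key for a parent record: fields that identify it
# on any Pulp instance, independent of its local UUID.
RECORD_KEY: Tuple[str, ...] = ("advisory_id", "version")


def natural_key(record: dict, fields: Tuple[str, ...]) -> Tuple:
    """Build the natural key of a record from the given fields."""
    return tuple(record[f] for f in fields)


def export_collections(records: List[dict], collections: List[dict]) -> List[dict]:
    """Replace each child's parent UUID with the parent's natural key."""
    by_pk: Dict[str, dict] = {r["pk"]: r for r in records}
    exported = []
    for coll in collections:
        parent = by_pk[coll["update_record_pk"]]
        exported.append({
            "name": coll["name"],
            "update_record": list(natural_key(parent, RECORD_KEY)),
        })
    return exported


def import_collections(exported: List[dict], local_records: List[dict]) -> List[dict]:
    """Resolve natural keys back to the importing instance's own UUIDs."""
    index = {natural_key(r, RECORD_KEY): r["pk"] for r in local_records}
    return [
        {"name": e["name"], "update_record_pk": index[tuple(e["update_record"])]}
        for e in exported
    ]
```

The export never carries a UUID, so the same parent record can have different primary keys on the source and destination instances and the relation still resolves.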
# Incremental Exports
There are two big pieces of data contained in an export: the dataset of Content from the database and the artifact files. An incremental export cuts down on the size of an export by only exporting the differences. However, when performing an incremental export, we could still export the complete dataset instead of just a set of differences (additions/removals/updates). This approach would be simpler and it would allow us to ensure that the new repo version matches the exported repo version exactly. It would however increase the export size but not by much I think--probably some number of megabytes at most.

If it's simpler, I would go with that. Saving even ~100-200 MB isn't that big of a deal IMO. The biggest savings is in the RPM content.
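For reference, the "complete dataset, incremental artifacts" approach discussed above could look roughly like the sketch below. It assumes each repository version can be reduced to a mapping from a content identifier to an artifact digest; that representation and the `plan_incremental_export` name are hypothetical.

```python
from typing import Dict, List, Union


def plan_incremental_export(
    previous: Dict[str, str], current: Dict[str, str]
) -> Dict[str, Union[Dict[str, str], List[str]]]:
    """Plan an export of `current` relative to an already-exported `previous`.

    Both arguments map a content identifier to its artifact digest.
    The dataset is always exported in full; only artifact files that the
    previous export did not already ship are included.
    """
    already_shipped = set(previous.values())
    new_artifacts = sorted(set(current.values()) - already_shipped)
    return {"dataset": dict(current), "artifacts": new_artifacts}
```

Because the dataset is complete, the importer can rebuild the repository version exactly, including removals, without replaying a chain of diffs.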


[0] https://pulp.plan.io/issues/6134

David

_______________________________________________
Pulp-dev mailing list
[email protected]
https://www.redhat.com/mailman/listinfo/pulp-dev