[Pulp-dev] Performance while syncing very large repositories

Lubos Mjachky Fri, 28 Feb 2020 02:59:40 -0800

Dear colleagues,

I am currently working on the issue https://pulp.plan.io/issues/6121. It
was reported that syncs of very large repositories take a huge amount of
time compared to Pulp 2.


I profiled the code and realized that that we are repeatedly fetching data
from the database in a loop and manually excluding units which should not
be added to a repository mirrored by Pulp.

Today, I submitted a PR (https://github.com/pulp/pulpcore/pull/565) that
may resolve this issue. Instead of the aforesaid loop, there is now used a
single database call which does the same thing, I suppose. Please, focus on
the last note https://pulp.plan.io/issues/6121#note-11 to better understand
my findings. Also, do not hesitate to review the submitted PR.

Thank you!

_______________________________________________
Pulp-dev mailing list
[email protected]
https://www.redhat.com/mailman/listinfo/pulp-dev

[Pulp-dev] Performance while syncing very large repositories

Reply via email to